How to block offensive / profane Spanish content

Hi
We use Web Marshal and Mail Marshal to control our internet and email gateways respectively.

We now need to block Spanish offensive language from making its way to our users' inboxes, or blocking websites that contain offensive Spanish words. The problem is twofold:
a) Marshal provide no support in terms of custom text censor scripts that we can import
b) We cannot find any comprehensive csv list of offensive Spanish words that we can import into our Marshal suite, with the confidence that it will not produce false positives

Does someone know of a comprehensive csv list that we can utilize for instance?

Thanks
FphcareAdminsAsked:
Who is Participating?

[Product update] Infrastructure Analysis Tool is now available with Business Accounts.Learn More

x
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

jhyieslaCommented:
If Spanish spam is like spam in English a list of offensive words will never work.  Let's say that the word "boob" is offensive.  So you block it. But then there is B**b, b0ob, and on and on. If your current filtering system depends on word lists to do it's job, you have the wrong system. I'm not familiar with the systems you are using so can't comment on them completely.  Over the years we have used such systems as SurfControl for web filtering and now use an appliance from St. Bernard. We also utilize the Postini email filtering system, but I honestly don't know if they deal with Spanish or not.
0
NemskinatorCommented:
many spam applicances offer block filters specifically for languages such as spanish, chineese, russian and several other. it worked for me.
0
NemskinatorCommented:
see attached
untitled.JPG
0
Determine the Perfect Price for Your IT Services

Do you wonder if your IT business is truly profitable or if you should raise your prices? Learn how to calculate your overhead burden with our free interactive tool and use it to determine the right price for your IT services. Download your free eBook now!

GladinatorCommented:
You can swear until you are blue in the face but spammers have gone through several generations of spam since we used to block mails simply by specific words.

They can replace vowels with character symbols, and consonants with others (example: number 1 for letter L,  @ sign for letter O or zero).  The human brain can easily translate these into their proper equivalent and then become offended before a computer analysis will do the same.

The systems used by gateway spam products like you are using should already be more sophisticated than a straightforward "bad" word list.  Usually they have training features that administrators can use to educate the software "types of email we don't want to allow through our network".

Gather a large sample of offensive email (if your gateway system isnt already collecting it for analysis, collect someones home email address spam folder contents [in spanish of course]) and forward each mail to a mail admin account within the company, and use the grading/feedback system of the gateway to train it on what shouldn't get through.
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
jwentingCommented:
and even if you block all those character substitutions, they'll just send a picture containing the text of the message, embedded in an html email.
Or an email containing a link to a website where the real message is to be found, masquerading in the email as something benign.

I've found that the most effective way to block spam is to figure out what TLDs most comes from and block those entirely.
This means blocking all of sub-Saharan Africa, most of South America (Chile seems clean, as does Peru), and parts of Asia (Indonesia, Burma, Vietnam, China, and a few others).
Blocking eastern Europe also helps.
You can always add whitelists to exclude specific addresses and domain names from the general block as needed.
0
FphcareAdminsAuthor Commented:
unfortunately we cannot block TLDs as we are a global company and receive emails etc from all over the globe
0
FphcareAdminsAuthor Commented:
thankyou
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
AntiSpam

From novice to tech pro — start learning today.