Filtering Spam in Foreign Languages

I can't speak for anyone outside the US, but here, we tend to get a lot of spam written exclusively in Chinese, Japanese, and Russian. Outlook seems to block this stuff pretty well, but we're using a web-based mail service (so that spam doesn't get to our server), which forwards mail as necessary (or people use POP3 to pick it up from our email host). In any case, way too much foreign spam gets to the server, which adds time to the download, and is just plain irritating. I don't mean to be xenophobic, but any email that isn't written using a character set I can interpret is spam, as far as I'm concerned.

After some research, we found a way to block much of the spam. This will only work for you if your email host allows you to create filtering rules based on any text within the email. In our case, I added rules that sent mail to the Spam folder if the following text appeared anywhere in the email:

charset="GB2312"

I then created the same kind of rule for each of the following character sets as well:

charset="koi8-r"

charset="iso-2022-jp"

I'm sure there are others I'll need to block over time, but so far, this method has been 100% effective. The flow of Russian spam reaching my inbox has finally halted.
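If your host doesn't offer text-based rules but you can run a script over downloaded mail, the same check is easy to approximate. What follows is only a rough sketch in Python, not what our mail host actually does: the function name is_foreign_spam and the BLOCKED_CHARSETS list are names I made up for illustration, and the script simply looks for a charset declaration anywhere in the raw message text, with or without quotes, ignoring case.

```python
import re

# Character sets named in this post; extend the tuple as new ones show up.
BLOCKED_CHARSETS = ("gb2312", "koi8-r", "iso-2022-jp")

# Finds charset="..." or charset=... anywhere in the raw message text.
CHARSET_RE = re.compile(r'charset\s*=\s*"?([A-Za-z0-9._-]+)"?', re.IGNORECASE)


def is_foreign_spam(raw_message: str) -> bool:
    """Return True if the raw message declares any blocked character set.

    Hypothetical helper: it mimics a "text appears anywhere in the email"
    rule, so it inspects the raw text rather than parsing MIME parts.
    """
    declared = {name.lower() for name in CHARSET_RE.findall(raw_message)}
    return any(charset in declared for charset in BLOCKED_CHARSETS)


if __name__ == "__main__":
    sample = 'Content-Type: text/plain; charset="koi8-r"\n\nHello'
    print(is_foreign_spam(sample))
```

Like the rules above, this flags a message based only on the character set it declares, not on the language it is actually written in, so anything it catches should go to a spam folder for review rather than being deleted.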

Published Thursday, April 24, 2008 8:00 AM by KenG

Comments

Tuesday, May 13, 2008 9:50 PM by Alex Dybenko

# re: Filtering Spam in Foreign Languages

Hi Ken,

Then if somebody from Russia sends you a non-spam email in English, you will lose it. Even if he lives in the US but still sometimes writes emails in Russian, he will set his default charset to koi8-r and you will never get emails from him.

Alex

Thursday, May 15, 2008 9:14 PM by KenG

# re: Filtering Spam in Foreign Languages

Alex:

Luckily, it's really easy to peruse the spam content, and "real" messages pop out, visually. Unfortunately, I get something like 50 Russian spam messages per day, and without these rules they all somehow get through the spam control. It's unfortunate, but true. I'd hate to miss an important email, but I also can't deal with a deluge of spam that consists solely of foreign language characters. With a regular pass through the spam folder, things work out OK. -- Ken