Simple Thoughts Blog - Java and Web Technologies

Simple solutions for complex problems.

 

New techniques of spamming…

July 28th, 2004 by Angsuman Chakraborty

For quite some time, SpamBayes, which is based on a naive Bayesian classifier, filtered my email very accurately, with very few false positives.

Recently, however, I have noticed a few alarming trends in spamming.

  • Database poisoning: Using otherwise innocuous words (ham words) in spam, thereby poisoning the classifier’s database in the long run.
  • Junk tags: Hiding spam words by inserting invalid HTML tags in between words. An HTML renderer ignores tags it doesn’t understand, so the message still displays properly.
  • Invalid words: Spam words such as “mortgage” are masked by inserting special or junk characters in between.
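To make the first point concrete, here is a minimal sketch of how padding a spam message with ham words drags its score toward the middle. It uses a simplified Graham-style combination of per-token probabilities, which is an assumption for illustration and not the scoring SpamBayes itself uses. The token probabilities and the 0.2 and 0.9 cutoffs in the comments are made up as well.

```python
from math import prod

# Hypothetical per-token spam probabilities, as a trained filter might hold them.
TOKEN_SPAM_PROB = {
    "mortgage": 0.99, "refinance": 0.98, "rates": 0.90,   # spammy words
    "meeting": 0.05, "thanks": 0.04, "project": 0.03,     # ham words
}

def combined_spam_probability(token_probs):
    """Combine per-token probabilities, naive Bayes style (simplified)."""
    spam = prod(token_probs)
    ham = prod(1.0 - p for p in token_probs)
    return spam / (spam + ham)

def score(message):
    """Score a message using only the tokens the filter already knows."""
    probs = [TOKEN_SPAM_PROB[w] for w in message.lower().split()
             if w in TOKEN_SPAM_PROB]
    return combined_spam_probability(probs) if probs else 0.5

plain = score("mortgage refinance rates")
padded = score("mortgage refinance rates meeting thanks project")

# With illustrative cutoffs of 0.2 (ham) and 0.9 (spam), the plain message
# is clearly spam while the padded one lands in the unsure band.
print(f"plain={plain:.4f} padded={padded:.4f}")
```

If the padded message is then trained as spam, the ham words in it pick up spam evidence, which is where the long-run poisoning comes from.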

Solutions I could think of:

  • Database poisoning: Most of the database-poisoning emails tend to be classified in the Not Sure category. I suggest that you delete them instead of training them as spam. However, this still requires spending some time on them, which is what I don’t like.
  • Junk tags: Add a filter in front of the Bayesian classifier to eliminate junk tags.
  • Invalid words: Non-exact (fuzzy) matching algorithms, such as those in Lucene, should help.
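The last two ideas can be sketched as a small pre-filter that runs before the classifier sees the text. This is only an illustration under some assumptions: it strips every tag-like span rather than detecting which tags are junk, it uses Python’s standard `difflib` as a stand-in for Lucene’s fuzzy matching, and the `SPAM_WORDS` watch list and the 0.8 threshold are hypothetical.

```python
import re
from difflib import SequenceMatcher

TAG_RE = re.compile(r"<[^>]*>")                  # any tag-like span, valid or junk
JUNK_RE = re.compile(r"(?<=\w)[^\w\s]+(?=\w)")   # symbols wedged between word characters

SPAM_WORDS = ("mortgage", "refinance")           # hypothetical watch list

def normalize(text):
    """Remove tags and in-word junk characters, then lowercase."""
    text = TAG_RE.sub("", text)    # "mort<xyz>gage" -> "mortgage"
    text = JUNK_RE.sub("", text)   # "m*o*r*t*g*a*g*e" -> "mortgage"
    return text.lower()

def fuzzy_hits(text, threshold=0.8):
    """Return the watch-list words that some token approximately matches."""
    hits = set()
    for token in normalize(text).split():
        for word in SPAM_WORDS:
            if SequenceMatcher(None, token, word).ratio() >= threshold:
                hits.add(word)
    return hits

print(fuzzy_hits("Low mort<xyz>gage rates"))
print(fuzzy_hits("Low m*o*r*t*g*a*g*e rates"))
print(fuzzy_hits("Low m0rtgage rates"))
```

The junk-character pattern is deliberately crude: it also removes apostrophes and the dots in domain names, so a real filter would want to feed both the raw and the normalized tokens to the classifier rather than replace one with the other.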

I have recently noticed a significant increase in mortgage spam. It should be easy to tackle it by legal means.

Overall, the game is becoming tougher for spam prevention. A combination of existing techniques is required for any spam filter to remain effective.

Looking forward to hearing your thoughts.


Filed under Spam Watch, Web


2 Responses to “New techniques of spamming…”

  1. Dave M. Says:

    I have tried all the software solutions for thwarting spam. I have yet to see one that works as well as simply owning a domain and creating many email addresses, one for each site I visit, like the one I used here. If I start getting spam at that address, I simply forward it to null@null.net and that’s that. I have about 30 email addresses generating well over 250 spams a day. They are all being forwarded to null@null.net (I sure hope no one ever gets that address).

    I *NEVER* give out my main email address to anyone! All the non-spam addresses get forwarded to my real email account so I can read and respond to the messages. Sure, at that point my real address gets sent out. However, it’s not accidentally published on the web, at least not by posting it on a blog or a web store.

  2. Praveen Says:

    I am facing the same problem. The new genre of spam that I noticed has a bunch of unrelated words pushed in at the end of the e-mail. These are really rare words gathered from different contexts.

    Do you have any suggestions for it?
