Just saw a new spam tactic

David Relson relson at osagesoftware.com
Fri Jan 31 03:23:37 CET 2003


At 09:06 PM 1/30/03, Zack Brown wrote:

>On Wed, Jan 29, 2003 at 11:40:28PM -0800, Max Rible wrote:
> > I just got a piece of spam that's full of bogus HTML tags-- lots
> > of </k> tags inserted in the middle of words.  The tags will be
> > ignored by most HTML renderers, but will break up the text for
> > spam parsing.
>
>Is it really necessary for Bogofilter to do anything about this? Won't
>bogofilter just learn to classify email containing those kinds of tags
>as spam?

Bogofilter currently disregards tokens that are only 1 or 2 
ch<x>ar<x>ac<x>te<x>rs long.  So an appropriately divided message would 
slip past.

Right now our goal is a stable release with mime processing and html 
comment killing.  We're very close to that.  In the last 10 days, a lot of 
bugs have been found and fixed.  After we've achieved a stable release is 
when I'd like to have an open discussion on how to deal with html.






More information about the Bogofilter mailing list