spaced out spam words

David Relson relson at osagesoftware.com
Fri Jun 9 13:05:00 CEST 2006


On Fri, 09 Jun 2006 11:03:47 +0200
Matthias Andree wrote:

> David Relson <relson at osagesoftware.com> writes:
> 
> > Boris 'pi' Piwinger has been running a customized version of
> > bogofilter that includes 1 and 2 character tokens in the wordlist
> > and the calculations.  I've got a patch (somewhere) that allows
> > setting both minimum and maximum token lengths and could likely
> > find it if you're interested.  I've thoughts of that patch being a
> > step that would work together with the ability to make multi-word
> > tokens (with '*' separators).  For*example here*are
> > several*double-word tokens.
> 
> This should however produce:
> For*example
> example*here
> here*are
> are*several
> several*double-word
> double-word*tokens
> 
> or perhaps several*double / double*word / word*tokens.

Correct!  You are showing the result of processing "For example here
are ...".  I was showing some _examples_ of double-word tokens.

Now all I need is time to find my old patches, apply, and test them...



More information about the Bogofilter mailing list