spaced out spam words
David Relson
relson at osagesoftware.com
Fri Jun 9 13:05:00 CEST 2006
On Fri, 09 Jun 2006 11:03:47 +0200
Matthias Andree wrote:
> David Relson <relson at osagesoftware.com> writes:
>
> > Boris 'pi' Piwinger has been running a customized version of
> > bogofilter that includes 1 and 2 character tokens in the wordlist
> > and the calculations. I've got a patch (somewhere) that allows
> > setting both minimum and maximum token lengths and could likely
> > find it if you're interested. I've thoughts of that patch being a
> > step that would work together with the ability to make multi-word
> > tokens (with '*' separators). For*example here*are
> > several*double-word tokens.
>
> This should however produce:
> For*example
> example*here
> here*are
> are*several
> several*double-word
> double-word*tokens
>
> or perhaps several*double / double*word / word*tokens.
Correct! You are showing the result of processing "For example here
are ...". I was showing some _examples_ of double-word tokens.
Now all I need is time to find my old patches, apply, and test them...
More information about the Bogofilter
mailing list