spaced out spam words
    Jason A. Smith 
    jason-bf at jazbo.dyndns.org
       
    Fri Jun  9 13:38:09 CEST 2006
    
    
  
On Fri, 2006-06-09 at 07:05, David Relson wrote:
> Correct!  You are showing the result of processing "For example here
> are ...".  I was showing some _examples_ of double-word tokens.
> 
> Now all I need is time to find my old patches, apply, and test them...
So this patch will handle the often requested multi-word feature and
deal with these spaced out words better.  Will it collapse/replace
multiple white-spaces (spaces, tabs and maybe newlines) with '+' before
adding to/checking the database?  What about html spam that often places
tags, spaces & newlines between single letters, such that when displayed
by an html viewer, still clearly shows the spammer's message?  Will the
bogofilter parser collapse white-spaces, tags and newlines allowing it
to combine the spaced out words in html spam?
~Jason
    
    
More information about the bogofilter
mailing list