filter evasion
    John McCain 
    jmccain at layer3al.com
       
    Thu Nov  6 22:10:05 CET 2003
    
    
  
I'm seeing messages slip through that have the following characteristics:
sp</ham>am
Which tokenize as:
sp
ham
am
Previously, this sort of thing was done with html comments:
sp<!--ham-->am
Which would tokenize (in .15.8) as:
spam
What can be done about this?  I personally don't think it would be a great 
loss to simply ingore all html closing tags.  I can't think of any other HTML 
evil which could be perpetrated to do this any other sort of way without 
seriously disrupting the text.
    
    
More information about the bogofilter
mailing list