lexer/html [was: Robinson algorithm ... ]

David Relson relson at osagesoftware.com
Wed Nov 27 13:16:49 CET 2002


At 12:38 AM 11/27/02, Shane Wegner wrote:

>Hi,
>
>continuum:~$ bogofilter -gvv < msg
>X-Bogosity: Yes, tests=bogofilter, spamicity=1.000000,
>version=0.9.0.1
>         0.010000  garner
>         0.010000  hooked
>         0.010000  stereo
>         0.990000  background-position
>         0.990000  bgproperties
>         0.990000  cellpadding
>         0.990000  cellspacing
>         0.990000  dbaseline
>         0.990000  diso-8859
>         0.990000  ffff80
>         0.990000  font-family
>         0.990000  font-size
>         0.990000  no-repeat
>         0.990000  tbody
>         0.990000  untitled

Looking at lexer.l, cellpadding and cellspacing (for example) are listed 
among the html tags, so the lexer should be discarding them rather than 
returning them as tokens.  That they show up here is unexpected; I'll 
check to see why they're passing through.
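
To illustrate the behaviour expected here, below is a minimal sketch in 
Python of a tokenizer that drops words found on an html ignore list.  It 
is not the code in lexer.l: the names HTML_WORDS, TOKEN_RE and tokens() 
are hypothetical, and the three-word list is only a stand-in for the real 
list of html tags.

```python
import re

# Hypothetical ignore list for illustration only; the real list of html
# tags lives in lexer.l and is not reproduced here.
HTML_WORDS = {"cellpadding", "cellspacing", "tbody"}

# Assumed token shape: starts with a letter or digit, may contain hyphens
# (so "font-size" stays one token).
TOKEN_RE = re.compile(r"[A-Za-z0-9][A-Za-z0-9-]*")


def tokens(text, ignore=HTML_WORDS):
    """Return lower-cased tokens from text, skipping any in the ignore set."""
    out = []
    for match in TOKEN_RE.finditer(text):
        tok = match.group(0).lower()  # fold case before the lookup
        if tok not in ignore:
            out.append(tok)
    return out
```

With a filter like this, tokens("<table cellpadding=0><tbody>stereo") 
would keep "table", "0" and "stereo" and drop the listed html words, which 
is the result one would expect from the lexer for the message above.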
