lexer/html [was: Robinson algorithm ... ]
David Relson
relson at osagesoftware.com
Wed Nov 27 13:16:49 CET 2002
At 12:38 AM 11/27/02, Shane Wegner wrote:
>Hi,
>
>continuum:~$ bogofilter -gvv < msg
>X-Bogosity: Yes, tests=bogofilter, spamicity=1.000000,
>version=0.9.0.1
> 0.010000 garner
> 0.010000 hooked
> 0.010000 stereo
> 0.990000 background-position
> 0.990000 bgproperties
> 0.990000 cellpadding
> 0.990000 cellspacing
> 0.990000 dbaseline
> 0.990000 diso-8859
> 0.990000 ffff80
> 0.990000 font-family
> 0.990000 font-size
> 0.990000 no-repeat
> 0.990000 tbody
> 0.990000 untitled
Looking at lexer.l, cellpadding and cellspacing (for example) are listed
among all the html tags. I'll check to see why they're passing through -
an unexpected result.
More information about the bogofilter-dev
mailing list