some SPAN results

David Relson relson at osagesoftware.com
Tue Jul 19 01:47:00 CEST 2005


Greetings,

With a lexer modified to ignore <span...>...</span>, I scored the 6672
spam I've received so far this month.  19 of them had different scores
(numerically) depending on whether I used standard bogofilter 0.95.2 or
the modified version.  None of the 19 classified differently.

Here's a list of the 19 messages and their old and new scores:

~/Mail/2005-07-Spam/12712   S 0.999997   S 1
~/Mail/2005-07-Spam/12806   S 0.996504   S 0.993623
~/Mail/2005-07-Spam/13016   S 1          S 0.999993
~/Mail/2005-07-Spam/13082   S 0.999696   S 0.999811
~/Mail/2005-07-Spam/13525   U 0.983411   U 0.988497
~/Mail/2005-07-Spam/13677   S 0.998613   S 0.999295
~/Mail/2005-07-Spam/13805   U 0.982878   U 0.989128
~/Mail/2005-07-Spam/14891   S 0.999816   S 0.999114
~/Mail/2005-07-Spam/15189   S 0.999924   S 1
~/Mail/2005-07-Spam/15448   S 0.999729   S 0.999903
~/Mail/2005-07-Spam/16334   S 0.999999   S 1
~/Mail/2005-07-Spam/17251   S 0.999981   S 0.999991
~/Mail/2005-07-Spam/18042   S 0.999989   S 0.999991
~/Mail/2005-07-Spam/18043   U 0.969251   U 0.977572
~/Mail/2005-07-Spam/18073   U 0.940609   U 0.954472
~/Mail/2005-07-Spam/18074   U 0.888287   U 0.909152
~/Mail/2005-07-Spam/18274   S 1          S 0.999999
~/Mail/2005-07-Spam/18325   S 1          S 0.999999
~/Mail/2005-07-Spam/18326   S 1          S 0.999999

It seems that the "span" change doesn't make a useful difference with
my wordlist.

Regards,

David




More information about the Bogofilter mailing list