some SPAN results
David Relson
relson at osagesoftware.com
Tue Jul 19 01:47:00 CEST 2005
Greetings,
With a lexer modified to ignore <span...>...</span>, I scored the 6672
spam I've received so far this month. 19 of them had different scores
(numerically) depending on whether I used standard bogofilter 0.95.2 or
the modified version. None of the 19 classified differently.
Here's a list of the 19 messages and their old and new scores:
~/Mail/2005-07-Spam/12712 S 0.999997 S 1
~/Mail/2005-07-Spam/12806 S 0.996504 S 0.993623
~/Mail/2005-07-Spam/13016 S 1 S 0.999993
~/Mail/2005-07-Spam/13082 S 0.999696 S 0.999811
~/Mail/2005-07-Spam/13525 U 0.983411 U 0.988497
~/Mail/2005-07-Spam/13677 S 0.998613 S 0.999295
~/Mail/2005-07-Spam/13805 U 0.982878 U 0.989128
~/Mail/2005-07-Spam/14891 S 0.999816 S 0.999114
~/Mail/2005-07-Spam/15189 S 0.999924 S 1
~/Mail/2005-07-Spam/15448 S 0.999729 S 0.999903
~/Mail/2005-07-Spam/16334 S 0.999999 S 1
~/Mail/2005-07-Spam/17251 S 0.999981 S 0.999991
~/Mail/2005-07-Spam/18042 S 0.999989 S 0.999991
~/Mail/2005-07-Spam/18043 U 0.969251 U 0.977572
~/Mail/2005-07-Spam/18073 U 0.940609 U 0.954472
~/Mail/2005-07-Spam/18074 U 0.888287 U 0.909152
~/Mail/2005-07-Spam/18274 S 1 S 0.999999
~/Mail/2005-07-Spam/18325 S 1 S 0.999999
~/Mail/2005-07-Spam/18326 S 1 S 0.999999
It seems that the "span" change doesn't make a useful difference with
my wordlist.
Regards,
David
More information about the Bogofilter
mailing list