garbage removal and 'outsiders noise'
Jim Correia
jim.correia at pobox.com
Fri Apr 18 03:59:06 CEST 2003
On Thursday, April 17, 2003, at 07:43 PM, Greg Louis wrote:
> To me, 10% fn would be a big number.
Well, I'd like it to be lower, but that is where it currently stands (no fp,
though).
> I don't run with -u, but train manually: copy all mail to a single mbox
> file, and periodically use bogofilter to break it in 3: spam, nonspam,
> unsure.
Is there a reason you do it this way?
Theoretically, since it "learns" by seeing representative spam and
non-spam messages, shouldn't more training data produce more accurate
results, all other things being equal?
(I know you previously commented about the lopsidedness of my word
lists...)
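
To make the three-way split and the fn figure above concrete, here is a
minimal Python sketch. It is only an illustration, not how bogofilter does
it internally: the two cutoffs, the function names, and the score_fn
callback are all hypothetical, and it assumes each message already has a
spam score between 0 and 1. It also assumes that a known-spam message
landing in "unsure" counts as a false negative.

```python
from typing import Callable, Dict, Iterable, List

# Hypothetical cutoffs, chosen for illustration only (not bogofilter defaults).
HAM_CUTOFF = 0.1
SPAM_CUTOFF = 0.9


def bucket(score: float,
           ham_cutoff: float = HAM_CUTOFF,
           spam_cutoff: float = SPAM_CUTOFF) -> str:
    """Map a score in [0, 1] to 'spam', 'nonspam', or 'unsure'."""
    if score >= spam_cutoff:
        return "spam"
    if score <= ham_cutoff:
        return "nonspam"
    return "unsure"


def split_messages(messages: Iterable[str],
                   score_fn: Callable[[str], float]) -> Dict[str, List[str]]:
    """Break one pile of messages in three, keyed by bucket name."""
    buckets: Dict[str, List[str]] = {"spam": [], "nonspam": [], "unsure": []}
    for msg in messages:
        buckets[bucket(score_fn(msg))].append(msg)
    return buckets


def false_negative_rate(spam_scores: List[float],
                        spam_cutoff: float = SPAM_CUTOFF) -> float:
    """Fraction of known-spam scores that fall below the spam cutoff.

    Assumption: an 'unsure' verdict on real spam is counted as a miss.
    """
    if not spam_scores:
        raise ValueError("need at least one known-spam score")
    missed = sum(1 for s in spam_scores if s < spam_cutoff)
    return missed / len(spam_scores)
```

With ten known-spam scores of which one falls below the spam cutoff,
false_negative_rate returns 0.1, i.e. the 10% fn discussed above. Only the
messages in the "spam" and "nonspam" buckets would be candidates for
training; "unsure" ones would need a manual look first.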
Jim