[bogofilter] spamitarium & block_on_subnets results

Tom Allison tallison at tacocat.net
Fri May 7 08:45:49 EDT 2004


Tom Anderson wrote:
 > On Fri, 2004-05-07 at 07:06, Tom Allison wrote:
 >
 >>Well, I decided to re-run the tests using
 >>robx=0.55
 >>robx=(1.0, 0.1 0.01)
 >
 >
 >   ^^^^
 > Do you mean robs here?
 >
YES

 >
 >>block_on_subnets=(yes no)
 >>and varous implimentations of spamitarium for a total of something like
 >>90 test sets.
 >
 >
 > Wow, that's ambitious!
 >

yeah, well bogofilter in bulk modem (-M) runs these tests pretty nicely,
but the single-mode of spamitarium adds some time.  But this will be
true with any type of "munging script" that we might put in front of
bogofilter to improve the data.  No fault of spamitarium, but a feature 
that will exist with anything that precedes bogofilter.

I'm guessing it will take something under 48 hours.

 > I'd still go lower with robx, but as long as min_dev > 0.05 in this
 > case, then it should be an adequate test.

With robx = 0.55 and min_dev=0.10 it should be sufficient to keep things
out of the test schema until they've been seen a few times.

Try running some tests on just a few words and see how long it takes to
move out of the min_dev arena with various robs values.  In all cases
that I tested it's not exactly 1 or 2 hits that will affect you.

 >
 >
 >>I need a course in remedial statistics.
 >
 >
 > Don't we all?


More information about the Bogofilter mailing list