Bug in tuning.sh?

Greg Louis glouis at dynamicro.on.ca
Fri Jun 13 13:02:24 CEST 2003


On 20030613 (Fri) at 1223:33 +0200, Boris 'pi' Piwinger wrote:
> Greg Louis wrote:
> 
> > Try this:
> > 
> > --- tuning.sh.orig	2003-06-12 12:45:04.000000000 -0400
> > +++ tuning.sh	2003-06-12 12:54:59.000000000 -0400
> 
> Now here is the result:
> 
> > Top 10 results
> > 06/13 11:24:55 1e-1   0.475 fpos.. 8 at cutoff 0.499000, run0..  21  run1..  21  run2..  25   67
> > 06/13 10:57:34 1      0.450 fpos.. 8 at cutoff 0.500000, run0..  42  run1..  38  run2..  47  127
> > 06/13 10:58:20 1      0.475 fpos.. 8 at cutoff 0.500000, run0..  41  run1..  38  run2..  47  126
> > 06/13 11:11:45 3.2e-1 0.450 fpos.. 8 at cutoff 0.500000, run0..  42  run1..  38  run2..  47  127
> > 06/13 11:12:25 3.2e-1 0.475 fpos.. 8 at cutoff 0.500000, run0..  41  run1..  38  run2..  47  126
> > 06/13 11:24:15 1e-1   0.450 fpos.. 8 at cutoff 0.500000, run0..  42  run1..  38  run2..  47  127
> > 06/13 11:36:44 3.2e-2 0.450 fpos.. 8 at cutoff 0.500000, run0..  42  run1..  38  run2..  47  127
> > 06/13 11:37:23 3.2e-2 0.475 fpos.. 8 at cutoff 0.500000, run0..  41  run1..  38  run2..  47  126
> > 06/13 11:49:15 1e-2   0.450 fpos.. 8 at cutoff 0.500000, run0..  42  run1..  38  run2..  47  127
> > 06/13 11:49:54 1e-2   0.475 fpos.. 8 at cutoff 0.500000, run0..  41  run1..  38  run2..  47  126
> 
> Again, I don't believe those results:
> 
> 1) Why do all have 8 false positives? That is a lot. Where
> is this chosen (getco, right?), why?

Ok, this looks much better.  24 is far too high a target for this data
set, which is why all your cutoffs are 0.5.  Try setting a target of 6
(there's a target= line early in the script).  The new bogotune will
evaluate your data and set an appropriate target; tuning.sh requires
that it be done manually.

One runs tuning.sh to seek the best s and min_dev values, not cutoff. 
You need a false-positive rate higher than would be desirable in
production to ensure that the cutoffs don't bottom or top out over the
range of s and min_dev that's being investigated.  The results are only
valid when the cutoff is exactly the target value, so having them all
the same is desirable.  Once you have the right s and min_dev, you
increase the spam cutoff till you have the desired balance (bogotune will
offer some choices) between fp and fn.

> 2) Again, no significance to robs:

There may really be none for your data, but until we get reasonable
cutoffs (above 0.5 and below 0.999999), I wouldn't hasten to conclude
anything.

> What is BTW the reason for making these three runs all the
> time instead of just one bigger one?

Largely historical, but also, if you do get a small effect of s or
min_dev, you can check its statistical significance with two or three
runs, but not with one.

-- 
| G r e g  L o u i s          | gpg public key: finger     |
|   http://www.bgl.nu/~glouis |   glouis at consultronics.com |
| http://wecanstopspam.org in signatures fights junk email |




More information about the Bogofilter mailing list