Understanding tuning results
Boris 'pi' Piwinger
3.14 at logic.univie.ac.at
Thu Jun 5 16:30:16 CEST 2003
Hi!
Using the new scripts for tuning I got some results:
>        r0   r1   r2
> sp.mc 2142 2142 2142
> ns.mc 3713 3714 3713
>
> Top 10 results
> 06/05 14:45:43 1 0.025 fpos...0 at cutoff 0.999999, run0...394 run1...341 run2...377 1112
> 06/05 14:46:29 1 0.050 fpos...0 at cutoff 0.999997, run0...359 run1...315 run2...345 1019
> 06/05 14:47:15 1 0.075 fpos...0 at cutoff 0.999997, run0...347 run1...307 run2...334 988
> 06/05 14:47:58 1 0.100 fpos...0 at cutoff 0.999999, run0...308 run1...282 run2...292 882
> 06/05 14:48:45 1 0.125 fpos...0 at cutoff 0.999998, run0...294 run1...268 run2...279 841
> 06/05 14:49:28 1 0.150 fpos...0 at cutoff 0.999998, run0...281 run1...264 run2...270 815
> 06/05 14:50:14 1 0.175 fpos...0 at cutoff 0.999997, run0...281 run1...262 run2...267 810
> 06/05 14:51:05 1 0.200 fpos...0 at cutoff 0.999991, run0...252 run1...248 run2...249 749
> 06/05 14:51:52 1 0.225 fpos...0 at cutoff 0.999968, run0...238 run1...233 run2...235 706
> 06/05 14:52:37 1 0.250 fpos...0 at cutoff 0.999861, run0...230 run1...225 run2...231 686
I don't really understand those. What is the number (1)
after the time? And what is the next number?
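One thing the rows themselves do show: the last number in each line looks like the sum of the three run counts (run0 + run1 + run2). Here is a small Python sketch to check that on three of the rows quoted above. The helper names run_counts and last_field are made up for illustration and are not part of the tuning scripts, and what the counts actually stand for is my assumption, not something the output states.

```python
import re

# Three of the result lines quoted above, copied verbatim.
rows = [
    "06/05 14:45:43 1 0.025 fpos...0 at cutoff 0.999999, run0...394 run1...341 run2...377 1112",
    "06/05 14:46:29 1 0.050 fpos...0 at cutoff 0.999997, run0...359 run1...315 run2...345 1019",
    "06/05 14:52:37 1 0.250 fpos...0 at cutoff 0.999861, run0...230 run1...225 run2...231 686",
]

def run_counts(line):
    """Return the per-run counts (run0, run1, ...) found in one result line."""
    return [int(n) for n in re.findall(r"run\d+\.\.\.(\d+)", line)]

def last_field(line):
    """Return the final whitespace-separated field of the line as an integer."""
    return int(line.split()[-1])

for line in rows:
    counts = run_counts(line)
    # Print the per-run counts, their sum, and the last column side by side.
    print(counts, sum(counts), last_field(line))
```

If the two numbers printed at the end of each line agree, the last column is just a total and can be used to rank the rows without adding up the runs by hand.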
Do I really have to go by time and look the values up above?
If so, the best would be:
robx = 0.415000 (4.15e-01)
robs = 1.000000 (1.00e+00)
min_dev = 0.100000 (1.00e-01)
cutoff 0.999861
OK, let me do the following. I take the r[0-2].(ns|sp) and
check what happens using my real database:
[The config I use now]
algorithm=fisher
robs=0.0011
min_dev=0.025
ham_cutoff = 0.00
spam_cutoff = 0.53
spamicity_tags = Spam, Ham
spamicity_formats = %0.3f, %0.3f
header_format = %h: %c, spamicity=%p, version=%v/%a
bogofilter_dir=/usr/local/pi/bogolists/.bogofilter
Spam:
6424 test.spam
False negatives:
170
Ham:
11133 test.ham
False positives:
1
[The settings suggested above]
algorithm=fisher
robs=1
robx=0.415
min_dev=0.1
ham_cutoff=0.00
spam_cutoff=0.999861
spamicity_tags = Spam, Ham
spamicity_formats = %0.3f, %0.3f
header_format = %h: %c, spamicity=%p, version=%v/%a
bogofilter_dir=/usr/local/pi/bogolists/.bogofilter
Spam:
6424 test.spam
False negatives:
633
Ham:
11133 test.ham
False positives:
16
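To put the two runs side by side, the counts above can be turned into error rates. This is only a sketch in Python; the names results and rate are mine, and the numbers are the ones listed above (6424 spam and 11133 ham messages in both tests).

```python
# Message totals and error counts as reported above for the two configurations.
SPAM_TOTAL = 6424
HAM_TOTAL = 11133

results = {
    "current config": {"fn": 170, "fp": 1},
    "suggested settings": {"fn": 633, "fp": 16},
}

def rate(errors, total):
    """Return errors as a percentage of total."""
    return 100.0 * errors / total

for name, r in results.items():
    fn_rate = rate(r["fn"], SPAM_TOTAL)  # share of spam that was missed
    fp_rate = rate(r["fp"], HAM_TOTAL)   # share of ham flagged as spam
    print(f"{name}: false negatives {fn_rate:.2f}%, false positives {fp_rate:.3f}%")
```

Both error rates go up with the suggested settings, so this is not a case of trading false negatives for false positives.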
Now that is a real pain. Something is awfully wrong here.
BTW: My settings above have shown no mistake whatsoever
"in production" over the last three days or so.
pi