Bogotune results
Tamer Yousef
tamer.yousef at gmail.com
Wed Aug 21 18:38:22 CEST 2013
I was able to finally get bogotune to run, and here are the results below.
Here are some questions that I have:
1- The warning message indicates that the training set needs to
be-classified?
2- applying the recommendation without the "sp_esf&ns_esf" values is
totally screwing up the spam scores a lot of the text that previously got
scores below .5 is now over .9.
Changing the value of the min_div affects that final results significantly
but the warning message bogotune is outputting is really making me doubt
the while thing that I may need to re-annotate and rebuild my training
set....
and a side note:For applying the sp_esf&ns_esf , the "-E" option is not
supported by recent bogfilter?1
wordlist's ham to spam ratio is 1.2 to 1.0
Warning: test messages include many high scoring nonspam.
You may wish to reclassify them and rerun.
high ham scores:
1 1.000000
2 1.000000
3 1.000000
4 1.000000
5 1.000000
6 1.000000
7 1.000000
8 1.000000
9 1.000000
10 1.000000
low spam scores:
1 0.000013
2 0.000043
3 0.007088
4 0.029865
5 0.040703
6 0.046916
7 0.054538
Minimum found at s 0.3162, md 0.286, x 0.528, spesf 0.004228, nsesf 0.011573
fp 30 (2.8708%), fn 644 (62.2824%)
Performing final scoring:
Spam... Non-Spam...
0.254923 0.941857
0.298609 0.936104
0.307635 0.927185
0.362183 0.899385
0.413051 0.895696
0.466462 0.895564
0.470619 0.892552
0.471655 0.892334
0.472590 0.887125
0.477892 0.884211
Recommendations:
---cut---
db_cachesize=100
robs=0.3162
min_dev=0.286
robx=0.527809
sp_esf=0.004228
ns_esf=0.011573
spam_cutoff=0.936104 # for 0.10% fp (1); expect 99.32% fn (1027).
#spam_cutoff=0.927185 # for 0.20% fp (2); expect 98.94% fn (1023).
ham_cutoff=0.308
---cut---
More information about the Bogofilter
mailing list