extreme wierdness with RF & 0.10.0
Barry Gould
BarryGould at PennySaverUSA.net
Tue Jan 21 03:57:26 CET 2003
I've just put in 0.10.0 using Robinson-Fisher.
I sent my users a message regarding the new bogofilter, and CC'd myself.
Strangely, it got tagged as Unsure with a score of 0.179478
Even stranger is the output of bogofilter -vvv
I get a score of 0.189874 when I run it back through bogofilter (note the
headers are different at this point due to the bogofilter tag and whatever
Eudora added... no problem).
However, when looking for words that would have caused it to be Unsure, I
see many words with probabilities above 1.0!! Shouldn't this be impossible?!?
Even ones with both probabilities above 1.0!!!
example:
n pgood pbad fw
invfwlog fwlog U
"for" 123254 3.213524 2.704072 0.456954
-0.61056 -0.78317 -
"pennysaverusa.net" 85321 2.684474 0.121062 0.043151
-0.04411 -3.14305 +
"from" 138213 3.374996 3.902206 0.536223
-0.76835 -0.62320 -
Other than these weird tokens, I don't see anything that should have caused
bogofilter to be "Unsure"
BTW, meanwhile, I've just changed from min_dev 0.0 to 0.1, but the
wierdness persists.
The message and -vvv results are attached as gzipped text.
Thanks,
Barry
-------------- next part --------------
A non-text attachment was scrubbed...
Name: bogo-unsure.txt.gz
Type: application/octet-stream
Size: 3297 bytes
Desc: not available
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20030120/149fcf11/attachment.obj>
More information about the Bogofilter
mailing list