massive false negatives

David Relson relson at osagesoftware.com
Mon May 5 19:05:46 CEST 2003


At 12:53 PM 5/5/03, Dan Stromberg wrote:

>Ever since I upgraded to 0.11.1.3, I've been getting a Lot of false
>negatives.  In fact, I'm not sure Anything is getting filtered out.
>
>I recently recreated my db from known spam and ham, but that didn't
>appear to help.
>
>I did:
>
>/dcs/packages/bogofilter/bin/bogofilter -v < /tmp/bad
>X-Bogosity: No, tests=bogofilter, spamicity=0.000001, version=0.11.1.3
>
>...with what I consider fairly obvious spam (the "banned cd" thing).  Is
>that a strange spamicity?  The man page didn't appear to say.
>
>I'm attaching -vvv output, which I'm not sure what to do with.

Dan,

I need some more information.  As of bogofilter-0.12, bogofilter
understands a new file format, known as the msg-count format.  A msg-count
file contains the .MSG_COUNT values from spamlist.db and goodlist.db and
all the tokens from the message, along with their spam and ham
counts.  Given your spam message in msg-count (.mc) format, I can run
bogofilter and reproduce your numbers to learn more about what is happening.

Run the attached script to convert the spam message to msg-count format, 
i.e. "bogolex.sh < spam > spam.mc", then gzip the output file and send it 
to me.
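
In shell terms, the steps above could be wrapped up roughly as in the
sketch below.  The function name make_mc and its two arguments are made up
for illustration; it only assumes that bogolex.sh reads the message on
stdin and writes the msg-count data to stdout, as in the command quoted
above, and that gzip is installed.

```shell
# Hypothetical helper, not part of bogofilter.
# $1 = path to the attached bogolex.sh, $2 = path to the saved spam message
make_mc() {
    script=$1
    msg=$2
    # Convert the message to msg-count format (message on stdin, .mc on stdout)
    sh "$script" < "$msg" > "$msg.mc" || return 1
    # Compress the result; this leaves "$msg.mc.gz" to attach to a reply
    gzip -f "$msg.mc"
}
```

For example, "make_mc ./bogolex.sh /tmp/bad" would leave /tmp/bad.mc.gz
ready to mail.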

David
-------------- next part --------------
A non-text attachment was scrubbed...
Name: bogolex.sh
Type: application/octet-stream
Size: 359 bytes
Desc: not available
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20030505/31722f66/attachment.obj>

