massive false negatives

Dan Stromberg strombrg at dcs.nac.uci.edu
Mon May 5 19:16:11 CEST 2003


On Mon, 2003-05-05 at 10:05, David Relson wrote:
> At 12:53 PM 5/5/03, Dan Stromberg wrote:
> 
> >Ever since I upgraded to 0.11.1.3, I've been getting a Lot of false
> >negatives.  In fact, I'm not sure Anything is getting filtered out.
> >
> >I recently recreated my db from known spam and ham, but that didn't
> >appear to help.
> >
> >I did:
> >
> >/dcs/packages/bogofilter/bin/bogofilter -v < /tmp/bad
> >X-Bogosity: No, tests=bogofilter, spamicity=0.000001, version=0.11.1.3
> >
> >...with what I consider fairly obvious spam (the "banned cd" thing).  Is
> >that a strange spamicity.  The man page didn't appear to say.
> >
> >I'm attaching -vvv output, which I'm not sure what to do with.
> 
> Dan,
> 
> I need some more information.  As part of bogofilter-0.12, bogofilter 
> understands a new file format, known as the msg-coutn format.  A msg-count 
> file contains the .MSG_COUNT values from spamlist.db and goodlist.db and 
> all the tokens from the message, along with their spam and ham 
> counts.  Given your spam message in the .mc format, I can run bogofilter 
> and reproduce your numbers to learn more about what is happening.
> 
> Run the attached script to convert the spam message to msg-count format, 
> i.e. "bogolex.sh < spam > spam.mc", then gzip the output file and send it 
> to me.
> 
> David

Wow, thanks for the quick response.

Do I need to upgrade to 0.12 before running bogolex.sh?

I'm getting:

tesuji-strombrg> sh bogolex.sh < bad > bad.mc
Option -w requires an argument.
bogolex.sh: line 12: 11307 Done                    $BOGOLEXER -p $*
     11308 Broken pipe             | sort -u

tesuji-strombrg> cat bad.mc 

tesuji-strombrg> 

-- 
Dan Stromberg DCS/NACS/UCI <strombrg at dcs.nac.uci.edu>

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20030505/0794046b/attachment.sig>


More information about the Bogofilter mailing list