massive false negatives
Dan Stromberg
strombrg at dcs.nac.uci.edu
Mon May 5 19:16:11 CEST 2003
On Mon, 2003-05-05 at 10:05, David Relson wrote:
> At 12:53 PM 5/5/03, Dan Stromberg wrote:
>
> >Ever since I upgraded to 0.11.1.3, I've been getting a Lot of false
> >negatives. In fact, I'm not sure Anything is getting filtered out.
> >
> >I recently recreated my db from known spam and ham, but that didn't
> >appear to help.
> >
> >I did:
> >
> >/dcs/packages/bogofilter/bin/bogofilter -v < /tmp/bad
> >X-Bogosity: No, tests=bogofilter, spamicity=0.000001, version=0.11.1.3
> >
> >...with what I consider fairly obvious spam (the "banned cd" thing). Is
> >that a strange spamicity? The man page didn't appear to say.
> >
> >I'm attaching -vvv output, which I'm not sure what to do with.
>
> Dan,
>
> I need some more information. As part of bogofilter-0.12, bogofilter
> understands a new file format, known as the msg-count format. A msg-count
> file contains the .MSG_COUNT values from spamlist.db and goodlist.db and
> all the tokens from the message, along with their spam and ham
> counts. Given your spam message in the .mc format, I can run bogofilter
> and reproduce your numbers to learn more about what is happening.
>
> Run the attached script to convert the spam message to msg-count format,
> i.e. "bogolex.sh < spam > spam.mc", then gzip the output file and send it
> to me.
>
> David
Wow, thanks for the quick response.
Do I need to upgrade to 0.12 before running bogolex.sh?
I'm getting:
tesuji-strombrg> sh bogolex.sh < bad > bad.mc
Option -w requires an argument.
bogolex.sh: line 12: 11307 Done $BOGOLEXER -p $*
11308 Broken pipe | sort -u
tesuji-strombrg> cat bad.mc
tesuji-strombrg>
--
Dan Stromberg DCS/NACS/UCI <strombrg at dcs.nac.uci.edu>