bogotune problem

Bopolissimus Platypus bopolissimus at sni.ph
Thu Jan 1 22:21:10 CET 2004


hello all,

I'm using (built from source):

bogofilter version 0.15.13
    Database: BerkeleyDB (4.2.50), combined

and, with it, bogotune 0.15.13

i'm trying to run bogotune, with my ham list of 7434 messages and my
spam list of 2353 messages (i have another spam list of 43873 messages).

when i run bogotune with -D, it works and produces settings.  it also
warns about having too few high-scoring nonspams.  i'm not sure why
that is; maybe because it uses half of the messages to build a wordlist
and then the other half for training?

what i *would* like to do is run bogotune with "-d ~/.bogofilter" so that
it can take advantage of the wordlist i've already got.  however,
when i do that, i get:

Reading good
Reading /home/tiger/.bogofilter/wordlist.db
7434 messages 
Reading spam
2353 messages 
    4m:59s for 9787 messages.  avg: 32.7 msg/sec
    7m:34s for 9787 messages.  avg: 21.6 msg/sec
The wordlist contains 36 non-spam and 20 spam messages.
Bogotune must be run with at least 2000 of each.

which doesn't make sense, since clearly it found 7434 ham and 2353 spam.
when i do the same thing, except i specify the 43873 spam mbox, i still get
the same error.
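
one way to cross-check the two sets of numbers, as a rough sketch only:
count the "From " separator lines in each mbox, and ask bogoutil for the
message counts stored in the wordlist.  the function names are made up for
illustration.  the grep count is approximate (it assumes body lines starting
with "From " have been escaped), and the bogoutil call assumes that -w looks
up a token in the given directory and that the counts are kept under the
.MSG_COUNT token.

```shell
# hypothetical helper: approximate number of messages in an mbox,
# counted as lines that begin with the "From " separator
count_mbox_messages() {
    grep -c '^From ' "$1"
}

# hypothetical helper: show the message counts recorded in the wordlist
# (assumes bogoutil is installed, and that "-w dir token" prints the
# spam and non-spam counts stored for the special .MSG_COUNT token)
wordlist_counts() {
    bogoutil -w "$1" .MSG_COUNT
}
```

for example, "count_mbox_messages good" and "wordlist_counts ~/.bogofilter".
if the second one reports 36 and 20, the small numbers are coming from the
wordlist itself rather than from the mboxes.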

the mboxes are just standard mbox files (kmail and evolution).

can anyone tell me what i might be doing wrong?  my full command line is:

bogotune -c ~/.bogofilter.cf -d ~/.bogofilter -s spam -n good -vvv

removing the -c ~/.bogofilter.cf doesn't change anything: i still get the
same error, at least as far as the "wordlist contains N non-spam and M spam"
line is concerned.

with the larger spam mbox (spam.old), the output is:

Reading good
Reading /home/tiger/.bogofilter/wordlist.db
7434 messages 
Reading spam.old
43873 messages
    6m:03s for 51307 messages.  avg: 141.3 msg/sec
    7m:58s for 51307 messages.  avg: 107.3 msg/sec
The wordlist contains 36 non-spam and 20 spam messages.
Bogotune must be run with at least 2000 of each.

can that be affected by .bogofilter.cf settings? i don't think it should be,
but if so, i can post my .bogofilter.cf settings too.

tiger

-- 
Gerald Timothy Quimpo  gquimpo*hotmail.com tiger*sni*ph
http://bopolissimus.sni.ph
Public Key: "gpg --keyserver pgp.mit.edu --recv-keys 672F4C78"

    The first half of our lives is ruined by our parents, and
     the second half by our children.
	                     Clarence Darrow




