bogotune problem

David Relson relson at osagesoftware.com
Thu Jan 1 22:34:48 CET 2004


On Fri, 2 Jan 2004 05:21:10 +0800
Bopolissimus Platypus <bopolissimus at sni.ph> wrote:

> hello all,
> 
> I'm using (built from source):
> 
> bogofilter version 0.15.13
>     Database: BerkeleyDB (4.2.50), combined
> 
> and, with it, bogotune 0.15.13
> 
> i'm trying to run bogotune, with my ham list of 7434 messages and my
> spam list of 2353 messages (i have another spam list of 43873
> messages).
> 
> when i run bogotune with -D, it works and produces settings.  it also 
> warns about having too few high-scoring nonspams.  i'm not sure why
> that is, maybe because it uses half of the messages to build a
> wordlist and then the other half for training?
> 
> what i *would* like to do is run bogotune with "-d $/.bogofilter" so
> that it can take advantage of the wordlist i've already got.  however,
> when i do that, i get:
> 
> Reading good
> Reading /home/tiger/.bogofilter/wordlist.db
> 7434 messages 
> Reading spam
> 2353 messages 
>     4m:59s for 9787 messages.  avg: 32.7 msg/sec
>     7m:34s for 9787 messages.  avg: 21.6 msg/sec
> The wordlist contains 36 non-spam and 20 spam messages.
> Bogotune must be run with at least 2000 of each.
> 
> which doesn't make sense, since clearly it found 7434 ham and 2353
> spam. when i do the same thing, except i specify the 43873 spam mbox,
> i still get the same error.
> 
> the mboxes are just standard mbox file (kmail and evolution).
> 
> can anyone tell me what i might be doing wrong?  my full command line
> is:
> 
> bogotune -c ~/.bogofilter.cf -d ~/.bogofilter -s spam -n good -vvv
> 
> removing the -c ~/.bogofilter doesn't do anything, i still get the
> same error. at any rate, as far as the "wordlist contains N non-spam
> and M spam" line is concerned.
> 
> Reading good
> Reading /home/tiger/.bogofilter/wordlist.db
> 7434 messages 
> Reading spam.old
> 43873 messages
>     6m:03s for 51307 messages.  avg: 141.3 msg/sec
>     7m:58s for 51307 messages.  avg: 107.3 msg/sec
> The wordlist contains 36 non-spam and 20 spam messages.
> Bogotune must be run with at least 2000 of each.
> 
> can that be affected by .bogofilter.cf settings? i don't think it
> should, but if yes, then i could post my .bogofilter.cf settings too.
> 
> tiger
> 
> -- 
> Gerald Timothy Quimpo  gquimpo*hotmail.com tiger*sni*ph
> http://bopolissimus.sni.ph
> Public Key: "gpg --keyserver pgp.mit.edu --recv-keys 672F4C78"
> 
>     The first half of our lives is ruined by our parents, and
>      the second half by our children.
> 	                     Clarence Darrow
> 
> 
> ---------------------------------------------------------------------
> FAQ: http://bogofilter.sourceforge.net/bogofilter-faq.html
> To unsubscribe, e-mail: bogofilter-unsubscribe at aotto.com
> For summary digest subscription: bogofilter-digest-subscribe at aotto.com
> For more commands, e-mail: bogofilter-help at aotto.com


-- 
David Relson                   Osage Software Systems, Inc.
relson at osagesoftware.com       Ann Arbor, MI 48103
www.osagesoftware.com          tel:  734.821.8800




More information about the Bogofilter mailing list