how to bogotune?

Trevor Smith trevor at haligonian.com
Wed Sep 29 17:34:15 CEST 2004


Man, bogotune is very difficult to figure out. The man page is confusing because 
it does not clearly state what it means by "wordlist" and "message files". 
After numerous readings I think I have figured out what it wants. I'm 
assuming that the "wordlist" is just the wordlist.db that I've built over 
months of using bogofilter, and that the "message files" are a set of 
emails that I have already separated into spam and nonspam. Is this close 
to correct?

Next question:

If I have already trained bogofilter with the messages in question, can 
bogotune work on them? Or does that screw it up?

Final question:

The man page appears to say it wants at least 500 spam and 500 ham messages 
as input, plus a wordlist already trained on at least 2000 of each. I certainly 
have more than 2000 of each in my wordlist, and I fed it ~1000 messages of each 
kind, but it complained about "low number / uniformity" of messages and 
produced no useful results (as far as I can tell).

My guess is that the messages I feed in must NOT already be trained, or else 
they will all score 1.0000 or 0.0000 (and that would make sense). Is 
this correct?

-- 
Trevor Smith // trevor at haligonian.com


