Bogofilter Best Practices?

RW rwmaillists at googlemail.com
Tue Dec 8 19:09:32 CET 2009


On Mon, 07 Dec 2009 17:00:05 -0800
"Randy J. Ray" <rjray at blackperl.com> wrote:


>  We get
> good-enough performance and throughput on the actual classification
> of incoming messages. It's the creation of the word-list files from
> our (growing) corpus that is driving me nuts.

From the sound of it you are creating a wordlist from scratch from
historical corpora, why not just learn today's mail into yesterday's
wordlist. 



More information about the Bogofilter mailing list