Bogofilter Best Practices?
RW
rwmaillists at googlemail.com
Tue Dec 8 19:09:32 CET 2009
On Mon, 07 Dec 2009 17:00:05 -0800
"Randy J. Ray" <rjray at blackperl.com> wrote:
> We get
> good-enough performance and throughput on the actual classification
> of incoming messages. It's the creation of the word-list files from
> our (growing) corpus that is driving me nuts.
From the sound of it you are creating a wordlist from scratch from
historical corpora, why not just learn today's mail into yesterday's
wordlist.
More information about the Bogofilter
mailing list