DB won't train, 'Unsure' about everything

Adrian adrian at aeolian.org.uk
Mon Feb 17 16:54:09 CET 2025


Since my previous thread I've narrowed down the problem.  I can now
describe the specific issue in the hope someone can explain it.

I created a wordlist.db from a text dump, and it doesn't work.

$ bogofilter -s -v -B <any file>
# 0 words, 0 messages

$ bogofilter -t -v -B <any file>
<any file> U 0.520000

db_verify says it's OK.

The source text dump looks OK, though it has a lot of non-ASCII such as
AU<C2><F2> 0 1 20230305   (as displayed by less)

I renamed this DB away and ran
$ bogofilter -n -v -B ./INBOX/63957
# 208 words, 1 message
which created a new small DB.  After training with a few ham and spams.
it gives sensible spam scores.

-but I need my old database and its years of training!

Are there any lines in the text dump that might be causing this, that I
can remove?

Should I chuck Berkeley DB and install the Sqlite bogofilter?

And why should a text dump that loads without error result in a DB that
doesn't work??!









More information about the bogofilter mailing list