Clean the database from non-spam mails?
David Relson
relson at osagesoftware.com
Tue Dec 2 23:33:01 CET 2003
On Tue, 2 Dec 2003 23:21:22 +0100
Johannes Klug <derjoi at gmx.net> wrote:
> On Tuesday 02 December 2003 19:56 Chris Wilkes wrote:
> > I did a little experiment with removing all words whose ham counts
> > were higher than their spam counts [...]
>
> That is not exactly what I want to do ...
> With separated wordlists, it was possible to delete the unwanted one,
> now I'd need a possibility to remove all tokens that were registered
> as ham.
>
> But thanks for the answer, anyway.
>
> Greets,
> Johannes
As part of dumping, change the ham count to zero and then load the
tokens with non-zero spam counts.
Something like the following command should do it:

    bogoutil -d old/wordlist.db |
      awk '{print $1 " 0 " $3 " " $4}' |
      grep -v " 0 0 " |
      bogoutil -l new/wordlist.db
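
The awk and grep steps can probably be folded into a single awk pass. The sketch below is only an illustration: strip_ham is a hypothetical helper name, and it assumes, as the command above does, that the second field of each dump line is the ham count and the third is the spam count. Look at a few lines of your own dump first and swap the field numbers if your bogoutil prints them the other way round.

```shell
# Hypothetical helper (not part of bogofilter): reads dump lines of the
# assumed form "token ham spam date" on stdin.
strip_ham() {
    # Keep only tokens whose spam count ($3) is non-zero, and write
    # a ham count of 0 in place of $2.
    awk '$3 + 0 != 0 { print $1, 0, $3, $4 }'
}

# Intended use with a real wordlist (same paths as in the command above):
#   bogoutil -d old/wordlist.db | strip_ham | bogoutil -l new/wordlist.db

# Dry run on two made-up dump lines; only the first has a spam count.
printf '%s\n' 'viagra 2 17 20031130' 'meeting 9 0 20031201' | strip_ham
# prints: viagra 0 17 20031130
```

Running the dry run before touching a real wordlist makes it easy to confirm that the field order matches what your version of bogoutil actually dumps.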