Clean the database from non-spam mails?

David Relson relson at osagesoftware.com
Tue Dec 2 23:33:01 CET 2003


On Tue, 2 Dec 2003 23:21:22 +0100
Johannes Klug <derjoi at gmx.net> wrote:

> On Tuesday 02 December 2003 19:56 Chris Wilkes wrote:
> > I did a little experiment with removing all words whose ham counts
> > were higher than their spam counts [...]
> 
> That is not exactly what I want to do ... 
> With separated wordlists, it was possible to delete the unwanted one,
> now I'd need a possibility to remove all tokens that were registered
> as ham.
> 
> But thanks for the answer, anyway.
> 
> Greets,
> Johannes

As part of dumping, change the ham count to zero and then load the
tokens with non-zero spam counts.  

Something like the following command should do it:

"bogoutil -d old/wordlist.db | awk '{print $1 " 0 " $3 " " $4}' | grep
-v " 0 0 " | bogoutil -l new/wordlist.db"




More information about the Bogofilter mailing list