New (?) idea to optimize database

.rp printer at moveupdate.com
Wed Mar 22 21:08:15 CET 2006


From:           	David Relson <relson at osagesoftware.com>
Copies to:      	bogofilter at bogofilter.org
Date sent:      	Sat, 18 Mar 2006 14:20:46 -0500
Subject:        	Re: New (?) idea to optimize database

> On Sat, 18 Mar 2006 16:04:54 +0100
> Boris 'pi' Piwinger wrote:
> 
> > Hi!
> > 
> > We had lengthy discussions how to optimize (=minimize) the
> > database to get best performance. This is why I created
> > bogominitrain. Now clearly, this will also collect useless
> > tokens. Now here is the idea to improve:
> > 
> > Do bogominitrain, remove all tokens which show up only once
> > in the training body (to do so, full training is needed in
> > a separate body). Also prevent those tokens from being added
> > again and do bogominitrain again. Repeat until is converged.
> 
> Hi pi,
> 
> Interesting idea!
> 
> Clearly the "prevent" part is a problem as it implies a "prevent"
> database and bogofilter lacks such a concept hence couldn't use
> prevent.db even if it existed.
> 
>[...] 

I don't understand what is meant by converge 
and
couldn't the "ignore" be used as a "prevent" for purposes of train?





More information about the Bogofilter mailing list