New (?) idea to optimize database

Boris 'pi' Piwinger 3.14 at piology.org
Wed Mar 22 21:34:31 CET 2006


".rp" <printer at moveupdate.com> wrote:

>> > We had lengthy discussions how to optimize (=minimize) the
>> > database to get best performance. This is why I created
>> > bogominitrain. Now clearly, this will also collect useless
>> > tokens. Now here is the idea to improve:
>> > 
>> > Do bogominitrain, remove all tokens which show up only once
>> > in the training body (to do so, full training is needed in
>> > a separate body). Also prevent those tokens from being added
>> > again and do bogominitrain again. Repeat until it converged.
>> 
>> Clearly the "prevent" part is a problem as it implies a "prevent"
>> database and bogofilter lacks such a concept hence couldn't use
>> prevent.db even if it existed.
>
>I don't understand what is meant by converge 

bogominitrain trains in several rounds. At some point there
are no more changes.

>couldn't the "ignore" be used as a "prevent" for purposes of train?

???

pi



More information about the Bogofilter mailing list