New (?) idea to optimize database
Boris 'pi' Piwinger
3.14 at piology.org
Wed Mar 22 21:34:31 CET 2006
".rp" <printer at moveupdate.com> wrote:
>> > We had lengthy discussions how to optimize (=minimize) the
>> > database to get best performance. This is why I created
>> > bogominitrain. Now clearly, this will also collect useless
>> > tokens. Now here is the idea to improve:
>> >
>> > Do bogominitrain, remove all tokens which show up only once
>> > in the training body (to do so, full training is needed in
>> > a separate body). Also prevent those tokens from being added
>> > again and do bogominitrain again. Repeat until it converged.
>>
>> Clearly the "prevent" part is a problem as it implies a "prevent"
>> database and bogofilter lacks such a concept hence couldn't use
>> prevent.db even if it existed.
>
>I don't understand what is meant by converge
bogominitrain trains in several rounds. At some point there
are no more changes.
>couldn't the "ignore" be used as a "prevent" for purposes of train?
???
pi
More information about the Bogofilter
mailing list