speaking of random words

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Thu Mar 18 15:22:49 CET 2004


Tom Anderson wrote:

> On Wed, 2004-03-17 at 10:02, Boris 'pi' Piwinger wrote:
>> So the problem seems to be your wordlists has a very high
>> inertia. So probably it has seen so much it does not really
>> take up new information (with significance). Kind of
>> information overflow when you get tired after many hours of
>> TV and someone suddenly asks you about details a minute ago.
> 
> I really don't see this "high inertia", as you put it, as a problem.  In
> fact, I like that my wordlist is "stable" and does not swing wildly
> based on a single registration.  However, at the same time, it does
> require a lot of nudging to get ham tokens back toward more neutral
> territory.  

Well, clearly you cannot have both at the same time.

> What this means is that I can probably set my recursion max
> for exhaustive training slightly higher without adverse effects.

Do you really checkt for those effects?

I do, and for me they are very small, only very few old
messages need to be reconsidered.

> The main reason I posted the spam was as an illustration of a spammer
> who "got it right". 

It looked like spam to my wordlist in the first place.

> They chose a group of words which were
> overwhelmingly hammy. 

There is no way to know these words.

pi




More information about the Bogofilter mailing list