speaking of random words
Boris 'pi' Piwinger
3.14 at logic.univie.ac.at
Thu Mar 18 15:22:49 CET 2004
Tom Anderson wrote:
> On Wed, 2004-03-17 at 10:02, Boris 'pi' Piwinger wrote:
>> So the problem seems to be your wordlists has a very high
>> inertia. So probably it has seen so much it does not really
>> take up new information (with significance). Kind of
>> information overflow when you get tired after many hours of
>> TV and someone suddenly asks you about details a minute ago.
>
> I really don't see this "high inertia", as you put it, as a problem. In
> fact, I like that my wordlist is "stable" and does not swing wildly
> based on a single registration. However, at the same time, it does
> require a lot of nudging to get ham tokens back toward more neutral
> territory.
Well, clearly you cannot have both at the same time.
> What this means is that I can probably set my recursion max
> for exhaustive training slightly higher without adverse effects.
Do you really checkt for those effects?
I do, and for me they are very small, only very few old
messages need to be reconsidered.
> The main reason I posted the spam was as an illustration of a spammer
> who "got it right".
It looked like spam to my wordlist in the first place.
> They chose a group of words which were
> overwhelmingly hammy.
There is no way to know these words.
pi
More information about the Bogofilter
mailing list