Hapax survival over time

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Wed Mar 24 08:19:46 CET 2004


David Relson <relson at osagesoftware.com> wrote:

>> This leads me to propose a different study... how many of those
>> hapaxes are outside of your min_dev range?  How many further
>> registrations does it take to move them into an influential scoring
>> range?
>
>Sorry to say, but that study is not very interesting.  A hapax is a
>token that has appeared exactly one.  That means it's score is roughly 0.0 (if
>the once was in ham) or 1.0 (if it was in spam).

That really depends on robx and more importantly robs.

pi




More information about the Bogofilter mailing list