Hapax survival over time
Boris 'pi' Piwinger
3.14 at logic.univie.ac.at
Wed Mar 24 08:19:46 CET 2004
David Relson <relson at osagesoftware.com> wrote:
>> This leads me to propose a different study... how many of those
>> hapaxes are outside of your min_dev range? How many further
>> registrations does it take to move them into an influential scoring
>> range?
>
>Sorry to say, but that study is not very interesting. A hapax is a
>token that has appeared exactly once. That means its score is roughly 0.0 (if
>the one occurrence was in ham) or 1.0 (if it was in spam).
That really depends on robx and, more importantly, robs.
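
To make the dependence concrete, here is a minimal sketch, assuming the score is computed with Robinson's smoothing formula f(w) = (robs*robx + n*p(w)) / (robs + n), where n is the number of times the token has been seen and p(w) is its raw spam probability (1.0 for a spam-only hapax, 0.0 for a ham-only one). The function name robinson_f and the parameter values are illustrative assumptions, not settings taken from this thread.

```python
def robinson_f(p, n, robx, robs):
    """Smoothed token score: pulls the raw probability p toward robx.

    p    -- raw spam probability of the token (0.0 .. 1.0)
    n    -- number of messages the token has been seen in
    robx -- score assumed for a token never seen before
    robs -- strength (weight) given to that assumption
    """
    return (robs * robx + n * p) / (robs + n)


ROBX = 0.415  # illustrative value only (assumption)

# A hapax has n == 1; compare how far it moves from robx for several robs.
for robs in (0.01, 1.0, 10.0):
    spam_hapax = robinson_f(1.0, 1, ROBX, robs)
    ham_hapax = robinson_f(0.0, 1, ROBX, robs)
    print(f"robs={robs:<5} spam hapax={spam_hapax:.4f} ham hapax={ham_hapax:.4f}")
```

With a very small robs the hapax lands close to 0.0 or 1.0, as David describes. As robs grows, a single occurrence moves the score less and less away from robx, so a hapax can end up inside the min_dev exclusion band around 0.5 and need further registrations before it influences scoring.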
pi
More information about the Bogofilter mailing list