[glouis at dynamicro.on.ca: Re: New vs Old]

Pavel Kankovsky peak at argo.troja.mff.cuni.cz
Thu Mar 25 22:44:14 CET 2004


On Thu, 25 Mar 2004, Greg Louis wrote:

> No, in fact it's the spam cutoff that determines that balance. 
> Unknowns are excluded by both sets, and the tiny s values ensure that
> no significant prior weight is given to low-count tokens.

Hmmm...doesn't a tiny value of s ensure the exact opposite?

The lower the value of s is, the weaker is the "pull" towards x, ergo
the lower the value of s is, the more significant low-count tokens are
(assuming x is near 0.5).

--Pavel Kankovsky aka Peak  [ Boycott Microsoft--http://www.vcnet.com/bms ]
"Resistance is futile. Open your source code and prepare for assimilation."





More information about the Bogofilter mailing list