[glouis at dynamicro.on.ca: Re: New vs Old]
Pavel Kankovsky
peak at argo.troja.mff.cuni.cz
Thu Mar 25 22:44:14 CET 2004
On Thu, 25 Mar 2004, Greg Louis wrote:
> No, in fact it's the spam cutoff that determines that balance.
> Unknowns are excluded by both sets, and the tiny s values ensure that
> no significant prior weight is given to low-count tokens.
Hmmm...doesn't a tiny value of s ensure the exact opposite?
The lower the value of s is, the weaker is the "pull" towards x, ergo
the lower the value of s is, the more significant low-count tokens are
(assuming x is near 0.5).
--Pavel Kankovsky aka Peak [ Boycott Microsoft--http://www.vcnet.com/bms ]
"Resistance is futile. Open your source code and prepare for assimilation."
More information about the Bogofilter
mailing list