Radical lexers
michael at optusnet.com.au
michael at optusnet.com.au
Thu Dec 11 06:11:25 CET 2003
michael at optusnet.com.au writes:
Sorry, forgot a point:
> michael at optusnet.com.au writes:
[...]
> The bogofilter
> implementation of the bayes algorithm suffers (as almost all
> implementations do) from quantization noise. That quantization
> noise is worse when the token counts are low.
In my opinion, this is the key reason that train-on-error is a bad
idea. It's leaves the token counts low, and thus maximizes the
quantization error for a given result. I think. :)
Michael.
More information about the Bogofilter
mailing list