Radical lexers

michael at optusnet.com.au michael at optusnet.com.au
Thu Dec 11 06:11:25 CET 2003


michael at optusnet.com.au writes:

Sorry, forgot a point:

> michael at optusnet.com.au writes:
[...]
> The bogofilter
> implementation of the bayes algorithm suffers (as almost all
> implementations do) from quantization noise. That quantization
> noise is worse when the token counts are low.

In my opinion, this is the key reason that train-on-error is a bad
idea. It's leaves the token counts low, and thus maximizes the
quantization error for a given result. I think. :)

Michael.




More information about the Bogofilter mailing list