[PATCH] Better tagging.

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Mon Sep 15 12:51:29 CEST 2003


Matthias Andree wrote:

>>> I'd support removing ALL case folding and case blindness from tokens, as
>>> has been pointed out, the naive Bayes stuff needs as much information as
>>> possible, and we've been proven more than once that case matters even if
>>> we then have to plan for 16 variants of "date" and "dAtE" in the data
>>> base...
>>
>> BTW: How about the limitation abour short tokens? Matthias
>> also gave good arguments about that.
> 
> Do you have a Message-ID at hand?

No, but ...

> IIRC, the token size limitation was primarily a measure against base64
> and "rulers" and stuff.

... it was about short tokens like single letters.

pi





More information about the bogofilter-dev mailing list