[PATCH] Better tagging.

Matthias Andree matthias.andree at gmx.de
Mon Sep 15 10:27:16 CEST 2003


On Mon, 15 Sep 2003, michael at optusnet.com.au wrote:

>                        spam    good  Gra prob  Rob/Fis
> h:DATE                  175      21  0.899074  0.627728
> h:Date                24652   26488  0.498721  0.498303
> 
> I guess my point is that all these items are hints that bogofilter
> currently throws away.  I'm not saying they always make a difference,
> but for my data set they definitely do.

I'd support removing ALL case folding and case blindness from tokens.
As has been pointed out, the naive Bayes stuff needs as much information
as possible, and it has been shown more than once that case matters,
even if we then have to plan for 16 case variants of "date" (such as
"dAtE") in the database...
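
To put rough numbers on that trade-off, here is a minimal Python sketch. It
is not bogofilter code: `case_variants` and `fold_counts` are hypothetical
helpers made up for illustration, and the counts are simply the two rows from
the quoted table. It enumerates the case variants of "date" and shows how
folding merges the rare h:DATE row into the common h:Date row.

```python
from itertools import product

def case_variants(word):
    """Return every upper/lower-case spelling of `word`, sorted."""
    choices = [sorted({ch.lower(), ch.upper()}) for ch in word]
    return sorted("".join(combo) for combo in product(*choices))

# Counts copied from the quoted table: token -> (spam, good).
counts = {
    "h:DATE": (175, 21),
    "h:Date": (24652, 26488),
}

def fold_counts(counts):
    """Merge the counts of tokens that differ only by case.

    Hypothetical helper: it mimics what case folding does to the
    word list, not how bogofilter implements it.
    """
    folded = {}
    for token, (spam, good) in counts.items():
        key = token.lower()
        s, g = folded.get(key, (0, 0))
        folded[key] = (s + spam, g + good)
    return folded

variants = case_variants("date")  # four letters, two cases each
folded = fold_counts(counts)

print(len(variants), "variants of 'date'")
for token, (spam, good) in {**counts, **folded}.items():
    print(f"{token:8s} spam={spam:6d} good={good:6d}")
```

After folding, the 175-to-21 split for h:DATE is swamped by the much larger,
nearly even h:Date counts, which is the kind of hint the quoted message says
gets thrown away.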

-- 
Matthias Andree

Encrypt your mail: my GnuPG key ID is 0x052E7D95



