[PATCH] Better tagging.
Matthias Andree
matthias.andree at gmx.de
Mon Sep 15 10:27:16 CEST 2003
On Mon, 15 Sep 2003, michael at optusnet.com.au wrote:
>          spam   good  Gra prob   Rob/Fis
> h:DATE    175     21  0.899074  0.627728
> h:Date  24652  26488  0.498721  0.498303
>
> I guess my point is that all these items are hints that bogofilter
> currently throws away. I'm not saying they always make a difference,
> but for my data set they definitely do.
I'd support removing ALL case folding and case blindness from tokens. As
has been pointed out, the naive Bayes stuff needs as much information as
possible, and we've been shown more than once that case matters, even if
we then have to plan for 16 variants of "date" and "dAtE" in the
database...
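
To make the trade-off concrete, here is a rough sketch in Python (not
bogofilter code; the helper names case_variants and fold_counts are made
up for illustration). It lists the case spellings of a four-letter token
and shows how folding merges the two rows of the table quoted above into
one, so the skew of the rarer h:DATE spelling is no longer visible.

```python
from itertools import product

def case_variants(word):
    """Return every upper/lower-case spelling of `word`, sorted."""
    choices = [(ch.lower(), ch.upper()) for ch in word]
    return sorted({"".join(combo) for combo in product(*choices)})

def fold_counts(counts):
    """Merge counts of tokens that differ only by case (the effect of folding)."""
    folded = {}
    for token, (spam, good) in counts.items():
        key = token.lower()
        s, g = folded.get(key, (0, 0))
        folded[key] = (s + spam, g + good)
    return folded

# Counts copied from the quoted table: token -> (spam, good)
counts = {"h:DATE": (175, 21), "h:Date": (24652, 26488)}

variants = case_variants("date")
folded = fold_counts(counts)

print(len(variants))   # 2**4 spellings for a four-letter word
print(folded)          # a single merged entry for both header spellings
```

The price of keeping case is that one word can occupy up to 2**n entries
for n letters; whether that is acceptable depends on how many of those
spellings actually show up in real mail.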
--
Matthias Andree
Encrypt your mail: my GnuPG key ID is 0x052E7D95