Effect of 0.17.3 lexer change
Boris 'pi' Piwinger
3.14 at logic.univie.ac.at
Fri Mar 19 13:35:21 CET 2004
David Relson wrote:
>> > Bogofilter-0.17.3, a.k.a. "Code Clean-up Release - Phase 2", has
>> > been released.
>>
>> I did a check on my mail collection about the lexer change.
>> Of 11529 ham messages 1 is now rated incorrectly.
>> Of 14397 spam messages 4 are now rated incorrectly.
>>
>> So to my surprise the effect is very small. YMMV, in
>> particular if you use the block_on_subnets option.
>
> I assume you're referring to the IP address fix?
Yes. Has there been any other lexer change?
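
For scale, the counts quoted above can be turned into percentages. This is only a back-of-the-envelope sketch in Python; the variable and function names are made up for illustration, and the numbers are the ones from my test run:

```python
# Counts from the test run quoted above.
ham_total, ham_changed = 11529, 1
spam_total, spam_changed = 14397, 4


def changed_rate(changed, total):
    """Return the share of messages now rated incorrectly, in percent."""
    return 100.0 * changed / total


ham_pct = changed_rate(ham_changed, ham_total)
spam_pct = changed_rate(spam_changed, spam_total)
overall_pct = changed_rate(ham_changed + spam_changed, ham_total + spam_total)

print(f"ham: {ham_pct:.4f}%  spam: {spam_pct:.4f}%  overall: {overall_pct:.4f}%")
```

All three figures come out well below a tenth of a percent, which is what I mean by "very small".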
> I'm presently experimenting with another lexer change. A while back I
> noticed that MIME header parts (the Content-Type, Content-Disposition,
> etc. lines after a MIME boundary) are tagged as head:Content-Type, etc.,
> and thought it wrong. I'm presently using "mime:" as the tag for these
> lines, which distinguishes them from the same lines in the message's
> header. If you're interested in experimenting/testing, I've attached a
> patch. I'm thinking of including this change in 0.17.4 :-)
I am not really sure the current tagging is all that wrong.
It might be useful to tag the tokens on those lines with the name of their header field, like:
content-type: text
content-type: html
content-type: charset
content-type: utf8
etc.
But maybe the "mime:" tag also does the job.
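
To make the three variants concrete, here is a rough sketch in Python. It is not bogofilter's lexer: the function header_tokens, the scheme names "head", "mime" and "name", and the simple word-splitting rule are all assumptions made for illustration, and it only uses Python's standard email module to find the header lines.

```python
import re
from email import message_from_string

# Assumed tokenization: runs of letters and digits. The real lexer's rules differ.
WORD = re.compile(r"[A-Za-z0-9]+")


def header_tokens(raw, scheme):
    """Tag header tokens under one of three hypothetical schemes.

    "head": every header line, including those after a MIME boundary, gets "head:"
    "mime": header lines after a MIME boundary get "mime:" instead
    "name": tokens are prefixed with the lowercased field name
    """
    msg = message_from_string(raw)
    tokens = []
    for part in msg.walk():
        is_top = part is msg  # the first walked object is the message itself
        for name, value in part.items():
            words = WORD.findall(value)
            if scheme == "name":
                prefix = name.lower()
            else:
                prefix = "mime" if (scheme == "mime" and not is_top) else "head"
                words = [name] + words  # the field name is itself a token
            tokens.extend(f"{prefix}:{word}" for word in words)
    return tokens


# A tiny made-up message with one MIME part.
raw = (
    "Subject: hello\n"
    "MIME-Version: 1.0\n"
    'Content-Type: multipart/mixed; boundary="b"\n'
    "\n"
    "--b\n"
    "Content-Type: text/html; charset=utf-8\n"
    "\n"
    "<p>hi</p>\n"
    "--b--\n"
)

for scheme in ("head", "mime", "name"):
    print(scheme, header_tokens(raw, scheme))
```

With "head", the html token from the part header cannot be told apart from one in the message header. With "mime" it can, and with "name" the token also says which field it came from, at the price of more distinct tokens in the wordlist.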
pi