Effect of 0.17.3 lexer change

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Fri Mar 19 13:35:21 CET 2004


David Relson wrote:

>> > Bogofilter-0.17.3, a.k.a. "Code Clean-up Release - Phase 2", has
>> > been released.
>> 
>> I did a check on my mail collection about the lexer change.
>> Of 11529 ham messages 1 is now rated incorrectly.
>> Of 14397 spam messages 4 are now rated incorrectly.
>> 
>> So to my surprise the effect is very small. YMMV, in
>> particular if you use the block_on_subnets option.
> 
> I assume you're referring to the IP Address fix ???

Yes, has there been anything else?

> I'm presently experimenting with another lexer change.  A while back I
> noticed that mime header parts ( the Content-Type, Content-Disposition,
> etc lines after a mime boundary) are tagged as head:Content-Type, etc
> and thought it wrong.   I'm presently using "mime:" as the tag for these
> lines which distinguishes them from the same lines in the message's
> header.  If you're interested in experimenting/testing I've attached a
> patch.  I'm thinking of including this change in 0.17.4 :-)

I am not really sure if it is all that wrong as it is now.
It might be useful to tag those lines by their name, like:
content-type: text
content-type: html
content-type: charset
content-type: utf8
etc.

But maybe mime also does the job.

pi





More information about the Bogofilter mailing list