Ignoring headers [was: SpamAssassin's header lines]

Mark M. Hoffman mhoffman at lightlink.com
Mon Oct 7 22:17:23 CEST 2002


[David]
> > Perhaps the two of you should put your heads together and see what kind of 
> > solution you can design...

[Eric]
> Well, I don't think that the "ignore-list" idea can (or should) be 
> expanded to be aware of message structure (what's a header, what's not) 
> because the ignore-list support is literally only a few lines of code when 
> we look up the spamicity of a token.
> 
> The idea that bogofilter should be aware of message headers, possibly 
> having the ability to add/remove/modify them, has merit.  I think that for 
> now, anyway, our treatment of headers should be a separate problem, 
> because it has to look at the message early on, before we shred it into a 
> "bag of tokens".
> 
> I don't think that the treatment (keep or discard; examine or 
> don't-examine) of headers like spamassasin adds should be hard-wired into 
> the lexer; I think there will be users who will be dead-set against this, 
> and others that are dead-set for it.

I agree with Eric on all counts above.  I only meant that header-ignores be
spec'ed in the option file... not that it should be integrated with Eric's
ignore-tokens stuff.

> I have a gut feeling that one day, the flex lexer.l won't be good enough.  
> Possible reasons: MIME-encoded messages, specialized treatment of header 
> fields, HTML handling, international charset support...

I (grudgingly) have this feeling also.

Regards,

-- 
Mark M. Hoffman
mhoffman at lightlink.com



More information about the bogofilter-dev mailing list