mass processing with mutt and Fcc

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Tue Apr 1 15:31:33 CEST 2003


David Relson wrote:

>> > Bogofilter looks at nearly all the tokens of a message.  Some stuff is
>> > ignored - for example message IDs, because they tend to be unique, and
>>
>>All of them? The local part could be interesting.

How about this?

> At the present time, when processing html, bogofilter discards html 
> comments, valid html tags (and their innards), and invalid html tags (and 
> their innards).  Basically everything between angle brackets is being 
> ignored at this time.
> 
> The rationale is that many tokens within html tags are not worth 
> scoring as spam indicators.

I see. I thought that the use of html itself would be a useful
indicator (I remember the early versions of bogofilter said so).
Also, web addresses, as in links or img elements, might be useful.
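
To make the difference concrete, here is a rough sketch in Python. It is
not bogofilter's lexer; the function names and the regular expressions
are made up for illustration only. The first function drops everything
between angle brackets, as described in the quoted text. The second
shows what would be kept if the href and src values were pulled out
before the tags are thrown away.

```python
import re

# Hypothetical patterns, for illustration only (not taken from bogofilter).
# Anything from '<' up to the next '>' counts as a tag or comment.
ANGLE_BRACKETS = re.compile(r"<[^>]*>")
# Value of an href= or src= attribute, quoted or unquoted.
URL_ATTR = re.compile(r"""(?:href|src)\s*=\s*["']?([^"'\s>]+)""", re.IGNORECASE)


def strip_angle_brackets(html):
    """Replace everything between angle brackets with a space."""
    return ANGLE_BRACKETS.sub(" ", html)


def link_targets(html):
    """Return the href/src values that plain stripping would discard."""
    return URL_ATTR.findall(html)


sample = (
    '<p>Buy <a href="http://example.com/offer">now</a>'
    '<!-- x --><img src="http://example.com/p.gif"></p>'
)

words = strip_angle_brackets(sample).split()   # only the visible text is left
urls = link_targets(sample)                    # the addresses inside the tags
```

With the sample above, only the words "Buy" and "now" survive the
stripping, while both example.com addresses are lost unless they are
extracted separately.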

pi




