Teching bogofilter by forwarding messages

David Relson relson at osagesoftware.com
Thu Dec 18 22:34:09 CET 2003


On Thu, 18 Dec 2003 22:39:10 +0100
Michal Wieja <mwieja at poczta.onet.pl> wrote:

...[snip]...
 

> 	Yes, but remember that e-mail software also puts 'FW:', 'Fwd:"
> 	into the 
> subject line, to mark forwarded messages, some of them takes all
> subject into brackets '[' ']'. 
> 
> 	Actually, as far as I understand bogofilter is statistical
> 	analysis tool, so 
> in theory FW, Fwd words should occur in both spams and hams, so weight
> of these words shouldn't have much input into final score.
> 
> --
> Mike

Mike,

Your comments about statistical analysis are correct.  Bogofilter gets a
lot of its information from a message's headers.  Forwarding a message
puts a new set of headers in place and it's important to remove those
headers before passing the message to bogofilter for training.

David




More information about the Bogofilter mailing list