Teching bogofilter by forwarding messages
David Relson
relson at osagesoftware.com
Thu Dec 18 22:34:09 CET 2003
On Thu, 18 Dec 2003 22:39:10 +0100
Michal Wieja <mwieja at poczta.onet.pl> wrote:
...[snip]...
> Yes, but remember that e-mail software also puts 'FW:', 'Fwd:"
> into the
> subject line, to mark forwarded messages, some of them takes all
> subject into brackets '[' ']'.
>
> Actually, as far as I understand bogofilter is statistical
> analysis tool, so
> in theory FW, Fwd words should occur in both spams and hams, so weight
> of these words shouldn't have much input into final score.
>
> --
> Mike
Mike,
Your comments about statistical analysis are correct. Bogofilter gets a
lot of its information from a message's headers. Forwarding a message
puts a new set of headers in place and it's important to remove those
headers before passing the message to bogofilter for training.
David
More information about the Bogofilter
mailing list