Do we need an exclusion list or something?
Paul Tomblin
ptomblin at xcski.com
Fri Sep 13 23:34:43 CEST 2002
Quoting Jonathan Buzzard (jonathan at buzzard.org.uk):
> eds at reric.net said:
> > In my opinion this will always be a problem. I spotted this when I
> > fed it a bunch of spam messages from the month of May and then found
> > that the word "may" was being treated as a very strong indicator of
> > spamicity.
>
> I hinted on this at the beginning of the week. There are two problems
> the inclusion of common words, which don't mean anything, and stuff
> getting included from the headers.
The problem I pointed out was with words that are in 100% of the spam
messages *and* 100% of the ham messages. Surely those should have been
filtered out already?
--
Paul Tomblin <ptomblin at xcski.com>, not speaking for anybody
"Tower zero one request clearance for takeoff."
"Cleared runway three contact ground point six three when off the runway."
- Michael Crichton destroys whatever technical credibility he had left.
For summay digest subscription: bogofilter-digest-subscribe at aotto.com
More information about the Bogofilter
mailing list