Do we need an exclusion list or something?

Paul Tomblin ptomblin at xcski.com
Fri Sep 13 23:34:43 CEST 2002


Quoting Jonathan Buzzard (jonathan at buzzard.org.uk):
> eds at reric.net said:
> > In my opinion this will always be a problem.  I spotted this when I
> > fed it  a bunch of spam messages from the month of May and then found
> > that the  word "may" was being treated as a very strong indicator of
> > spamicity. 
> 
> I hinted on this at the beginning of the week. There are two problems
> the inclusion of common words, which don't mean anything, and stuff
> getting included from the headers.

The problem I pointed out was with words that are in 100% of the spam
messages *and* 100% of the ham messages.  Surely those should have been
filtered out already?

-- 
Paul Tomblin <ptomblin at xcski.com>, not speaking for anybody
"Tower zero one request clearance for takeoff."
"Cleared runway three contact ground point six three when off the runway."
  - Michael Crichton destroys whatever technical credibility he had left.

For summay digest subscription: bogofilter-digest-subscribe at aotto.com



More information about the Bogofilter mailing list