Bogofilter w/ SpamAssassin
Devin Nate
devin.nate at bridgecomm.net
Sat Aug 30 08:34:52 CEST 2003
Hi Folks;
I'm a recent bogofilter user, inspired to give it a try after reading a
good article comparing bogofilter with other systems. Prior to that, I
had started work integrating another bayesian system (CRM114) with
SpamAssassin. Since I liked bogofilter so much, I added a similar
interface for Bogofilter. Since I see that the FAQ has a section on
training bogofilter from SpamAssassin, I thought this experimental
feature may be of interest to some people here since it adds this
capability right into SA, plus giving points against messages that
bogofilter thinks are spams. The URL with patches and documentation is
available here:
http://bugzilla.spamassassin.org/show_bug.cgi?id=2301
As a basic run down, SA uses a near identical 'autolearning' feature as
already built into it. The autolearning system will pass emails into
bogofilter that it considers to be ham or spam based on an autolearning
set of thresholds. The manual training via sa-learn is not done yet. SA
keeps track of how many hams/spams learned, and individual message id's
seen so that it won't relearn a message twice. SA also will perform
train-on-error automatically by default (but can be turned off), once
Bogofilter has seen enough to start classifying on its own. In addition
to all of this, I've run a number of statistical tests on a corpus of
about 2700 hams and 1400 spams; bogofilter did excellent. Excellent job!
The patch includes CRM114 stuff too- both CRM114 and Bogofilter can be
dealt with separately, so if you only want to try bogofilter that's
fine. It is a first revision of the patch, highly experimental, etc., so
it's probably best that people familiar with bogofilter and SA use it
and not on a mission critical server. Feed back is welcome!
Thanks,
Devin Nate
--
____________________________________________________________
Devin Nate
Chief Consultant & General Manager
BridgeComm Corporation
http://www.bridgecomm.net/
mailto:devin.nate at bridgecomm.net
____________________________________________________________
More information about the bogofilter-dev
mailing list