Bogofilter seems to not be working
Jesse Meyer
meyer at btinet.net
Wed Mar 26 04:21:37 CET 2003
On Tue, Mar 25, 2003 at 11:49:25AM -0800, daniel wrote:
> I have set up bogofilter with the procmail recipes in the man page:
>
> :0fw
> | bogofilter -u -e -p
               ^^
Note the -u flag; I'll explain my theory in a bit.
>
> [ Snip rest of procmail configuration ]
>
> [ Snip description of spam filter scores approaching 0 over time ]
Here's my (bogofilter-uneducated) theory. If I recall the man page
correctly, the -u flag lets bogofilter keep learning from the mail it
filters: if it thinks a message is spam, it tries to figure out what
new spam rules it can learn from that message. Conversely, if it
considers the message ham, it tries to figure out what new non-spam
rules it can learn from the message.
You originally received scores in the .40's, and they then slowly
approached .00 over time. If I read the documentation correctly, it
mentions .54 as the dividing point between spam and ham.
My theory hinges on the assumption that you are _not_ continually
training bogofilter (you didn't mention doing so). I believe that
bogofilter might have been poorly trained in the beginning, so that
it classified most spam as ham. Then, as new messages were filtered
through procmail and bogofilter, the -u flag made it learn from each
of those fake-ham messages as ham, dropping your scores near 0.
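
To make the feedback loop concrete, here is a toy sketch in Python.
It is _not_ bogofilter's real scoring algorithm: the score() function,
the tokens, and the starting counts are all made up for illustration,
and only the .54 cutoff comes from my reading of the docs. It just
shows how "learn from your own verdict" can drag a badly trained
filter further in the wrong direction.

```python
from collections import Counter

SPAM_CUTOFF = 0.54  # dividing point as recalled from the documentation


def score(tokens, spam_counts, ham_counts):
    """Toy score (an assumption, not bogofilter's math):
    the mean per-token spam fraction; unseen tokens count as 0.5."""
    fractions = []
    for tok in tokens:
        s, h = spam_counts[tok], ham_counts[tok]
        fractions.append(0.5 if s + h == 0 else s / (s + h))
    return sum(fractions) / len(fractions)


def filter_with_autoupdate(tokens, spam_counts, ham_counts):
    """Score a message, then register it under whatever class the
    filter itself just assigned (the behaviour I attribute to -u)."""
    value = score(tokens, spam_counts, ham_counts)
    target = spam_counts if value >= SPAM_CUTOFF else ham_counts
    target.update(tokens)
    return value


# Hypothetical poor training: spammy words seen a bit more often as ham.
spam_counts = Counter({"cheap": 2, "pills": 2, "offer": 2})
ham_counts = Counter({"cheap": 3, "pills": 3, "offer": 3})

# Feed the same (actual) spam through the filter 50 times.
spam_message = ["cheap", "pills", "offer"]
scores = [filter_with_autoupdate(spam_message, spam_counts, ham_counts)
          for _ in range(50)]

print(" ".join("%.2f" % s for s in scores[:5]), "...", "%.2f" % scores[-1])
```

In this toy model the first score is in the .40 range, each wrong
"ham" verdict adds to the ham counts, and the score for the very same
spam keeps shrinking toward 0. The spam counts never grow, because
nothing ever crosses the cutoff.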
If this is the problem, then the solution is simple - remove the
old score files, train bogofilter properly, and continue training it
whenever it produces false positives or false negatives.
Of course, please note that I have been using bogofilter for
roughly 48 hours now, so I could be way off. :)
~ Jesse Meyer
--
icq : 34583382 | === ascii ribbon campaign ===
msn : dasunt at hotmail.com | () - against html mail
yim : tsunad | /\ - against proprietary attachments