Maintenance Best Practices

David Relson relson at osagesoftware.com
Thu Jun 5 21:14:20 CEST 2003


At 02:57 PM 6/5/03, Adam Mercer wrote:
>On Thu, Jun 05, 2003 at 12:05:57PM -0400, Dave Stubbs wrote:
> > I'm wondering what others are doing out there to maintain their database
> > files.  I receive between 100 and 200 emails per day, and send all emails
> > through bogofilter with -f -p -u -l -v -e options.
>
>I'm using the options -p -3 -e, and the following in my .bogofilter.cf
>
>algorithm = fisher
>ham_cutoff = 0.10
>spam_cutoff = 0.95
>spamicity_tags = Spam, Ham, Unsure
>spamicity_formats = %6.2e, %6.2e, %0.6f
>
>I initially train my database with, at the moment, around 4000 known
>spams and 4000 known hams. Then I train only on mistakes. So far this
>seems to be working quite well with only a few false negatives - no
>false positives yet.
>
>Cheers
>
>Adam

Adam,

You're getting excellent results.  Glad to hear that the default values are 
doing so well for you.

Bogofilter _may_ be able to do even better for you.  The new stable
release, bogofilter-0.13.6.1, has revised directions on how to find the
optimum combination of the spam_cutoff, min_dev, and robs parameters for
_your_ email.  If you have your messages in readily available .mbx files,
you may want to experiment with the scripts in bogofilter/tuning.

David