Bogofilter and reclassifying

Stroller Linux.Luser at myrealbox.com
Fri Dec 5 13:45:20 CET 2003


On Dec 5, 2003, at 9:44 am, Nathaniel wrote:
>
> Is this correct?  Most howtos I've seen either recreate a wordlist or 
> just
> mark as spam/ham the entire corpus, but I didn't want to maintain large
> corpuses and wanted something fairly efficient...

Please find attached a shell script I'm currently working on - it scans 
Maildir folders for new messages & calls bogofilter to add their 
contents to the wordlist based on whether they're in a spam 
(~/.Maildir/.Junk.Definite) or ham folder. I hope you might find this 
useful - you could easily add a line to delete old messages once 
they've been read - I intend to tar up the spam as part of the process, 
and to run this as part of a cron job. I'll have mailfilter set to drop 
messages with a high bogosity into a ~/.Maildir/.Junk.Probable folder, 
and so all I'll need to do is peruse them myself & drop them into 
~/.Maildir/.Junk.Definite to confirm their spamicity & train Bogofilter 
further based on their contents.

I think you should find this script self-explanatory, but please feel 
free to ask any questions, or since I'm new to shell programming, any 
suggestions. This script works here, but the usual disclaimers 
(provided as is, no warranty, back-up your data, yadayadayada) apply.

Stroller.


-------------- next part --------------
A non-text attachment was scrubbed...
Name: bogo-update.sh
Type: application/octet-stream
Size: 2634 bytes
Desc: not available
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20031205/5f67dff9/attachment.obj>


More information about the Bogofilter mailing list