Training with mailbox format files

Rob Tanner rtanner+cyrus at linfield.edu
Fri Aug 1 00:58:16 CEST 2003


Thanks!  

--On Thursday, July 31, 2003 06:50:08 PM -0400 David Relson
<relson at osagesoftware.com> wrote:

> At 06:42 PM 7/31/03, Rob Tanner wrote:
>> Hi,
>> 
>> We've been using bogofilter now for several weeks and since we're still
>> fine tuning, we deliver all mail, SPAM or not -- but prepend [SPAM] to
>> the subject header when it's SPAM.  I have a couple mailboxes that
>> folks copy misclassified messages into, and I use those to further
>> train bogofilter.
>> 
>> The easiest way for me to collect the messages is to drag them from the
>> server to a local mailbox on my workstation.  The problem is that the
>> software I use, Mulberry, writes them to a single rfc822 format mailbox
>> file.  Can I train bogofilter by giving it that single file, or must I
>> break the messages up and feed them one at a time?
>> 
>> Thanks,
>> Rob
> 
> Hi Rob,
> 
> Bogofilter has understood mailbox files from the beginning.  Use
> "bogofilter -s < spam.mbx" or "bogofilter -n < ham.mbx" as appropriate.
> In you include the "-v" switch, it will tell you how many words and
> messages were processed.
> 
> David
> 
> 




Rob Tanner
Linfield College
McMinnville, Oregon
rtanner+cyrus at linfield.edu




More information about the Bogofilter mailing list