Training with mailbox format files
Rob Tanner
rtanner+cyrus at linfield.edu
Fri Aug 1 00:58:16 CEST 2003
Thanks!
--On Thursday, July 31, 2003 06:50:08 PM -0400 David Relson
<relson at osagesoftware.com> wrote:
> At 06:42 PM 7/31/03, Rob Tanner wrote:
>> Hi,
>>
>> We've been using bogofilter now for several weeks and since we're still
>> fine tuning, we deliver all mail, SPAM or not -- but prepend [SPAM] to
>> the subject header when it's SPAM. I have a couple mailboxes that
>> folks copy misclassified messages into, and I use those to further
>> train bogofilter.
>>
>> The easiest way for me to collect the messages is to drag them from the
>> server to a local mailbox on my workstation. The problem is that the
>> software I use, Mulberry, writes them to a single rfc822 format mailbox
>> file. Can I train bogofilter by giving it that single file, or must I
>> break the messages up and feed them one at a time?
>>
>> Thanks,
>> Rob
>
> Hi Rob,
>
> Bogofilter has understood mailbox files from the beginning. Use
> "bogofilter -s < spam.mbx" or "bogofilter -n < ham.mbx" as appropriate.
> In you include the "-v" switch, it will tell you how many words and
> messages were processed.
>
> David
>
>
Rob Tanner
Linfield College
McMinnville, Oregon
rtanner+cyrus at linfield.edu
More information about the Bogofilter
mailing list