procmail: Non-zero exitcode (1) from "/usr/bin/bogofilter"
dhottinger at harrisonburg.k12.va.us
Thu Sep 13 13:52:49 CEST 2007
Quoting David Relson <relson at osagesoftware.com>:
> On Thu, 13 Sep 2007 06:35:05 -0400
> dhottinger at harrisonburg.k12.va.us wrote:
>
> ..[snip]...
>> Thanks,
>> Isn't there a way to get the number of spam tokens and ham tokens,
>> some kind of ratio from my wordlist.db? Is it possible that my
>> wordlist just got out of whack?
>
> "bogoutil -p ..../wordlist.db .MSG_COUNT" will show the ratio of
> registered spam to ham messages.
>
> Actually counting spam vs ham tokens is tougher as each token has its
> spam and ham counts stored with it (as the "tail" experiment showed).
>
> "Pure spam" tokens would have "good" counts of 0, etc. Most tokens
> have both "good" and "bad" counts, as the following shows
>
> bogoutil -p ... Dwayne "from:Dwayne " "to:Dwayne "
>
> Cheers!
>
> David
>
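A rough way to get the token-level ratio asked about above is to classify each token by its two counts. The sketch below is only an illustration: it assumes the wordlist has been dumped to text (for example with "bogoutil -d") and that each line has the form "token spamcount goodcount date", which should be checked against the actual dump. The function name classify_tokens is made up for this example.

```python
def classify_tokens(dump_text):
    """Count pure-spam, pure-ham and mixed tokens in a wordlist text dump.

    Assumed line format (verify against your own dump):
        token spamcount goodcount date
    """
    counts = {"pure_spam": 0, "pure_ham": 0, "mixed": 0}
    for line in dump_text.splitlines():
        # Split from the right so the three trailing numeric fields are isolated.
        parts = line.rsplit(None, 3)
        if len(parts) != 4:
            continue
        token, spam_field, good_field, _date = parts
        # Skip meta entries such as .MSG_COUNT, which hold message totals.
        if token.startswith("."):
            continue
        try:
            spam, good = int(spam_field), int(good_field)
        except ValueError:
            continue  # not a data line
        if spam > 0 and good == 0:
            counts["pure_spam"] += 1
        elif good > 0 and spam == 0:
            counts["pure_ham"] += 1
        elif spam > 0 and good > 0:
            counts["mixed"] += 1
    return counts


if __name__ == "__main__":
    import sys
    # Usage (hypothetical file name): python classify.py < wordlist_dump.txt
    print(classify_tokens(sys.stdin.read()))
```

Tokens whose counts are both zero are ignored, and the .MSG_COUNT entry is skipped because it records message totals rather than a token.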
I'm starting to think the problem isn't with my wordlist. I exported
some emails that were flagged as spam before Monday and ran
bogofilter -vv on each one to get its score and see whether bogofilter
would still score it as spammy:
X-Bogosity: No, tests=bogofilter, spamicity=0.520000, version=1.1.5
int cnt prob spamicity histogram
0.00 0 0.000000 0.520000
0.10 0 0.000000 0.520000
0.20 0 0.000000 0.520000
0.30 0 0.000000 0.520000
0.40 0 0.000000 0.520000
0.50 0 0.000000 0.520000
0.60 0 0.000000 0.520000
0.70 0 0.000000 0.520000
0.80 0 0.000000 0.520000
0.90 0 0.000000 0.520000
[root at mail ~]# bogofilter -vv < test3
X-Bogosity: No, tests=bogofilter, spamicity=0.520000, version=1.1.5
int cnt prob spamicity histogram
0.00 0 0.000000 0.520000
0.10 0 0.000000 0.520000
0.20 0 0.000000 0.520000
0.30 0 0.000000 0.520000
0.40 0 0.000000 0.520000
0.50 0 0.000000 0.520000
0.60 0 0.000000 0.520000
0.70 0 0.000000 0.520000
0.80 0 0.000000 0.520000
0.90 0 0.000000 0.520000
[root at mail ~]# bogofilter -vv < score
X-Bogosity: No, tests=bogofilter, spamicity=0.000000, version=1.1.5
int cnt prob spamicity histogram
0.00 0 0.000000 0.000000
0.10 0 0.000000 0.000000
0.20 0 0.000000 0.000000
0.30 0 0.000000 0.000000
0.40 0 0.000000 0.000000
0.50 6 0.000236 0.000236 ######
0.60 0 0.000000 0.000236
0.70 0 0.000000 0.000236
0.80 0 0.000000 0.000236
0.90 0 0.000000 0.000236
Looks OK. The top two emails are very spammy (viagra, etc.); the last
one is a newsletter from Coldwater Creek, which I'm sure someone signed
up to receive. So this looks good. I wonder if permissions got hosed
somehow on my wordlist. I'll check some other things and see what is
up.
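
When comparing several of these -vv outputs, it can help to total the cnt column of each histogram. The sketch below assumes the row layout shown above (interval, cnt, prob, spamicity, optional bar); the function name scored_token_count is made up for this example. For the three outputs above it would give 0, 0 and 6.

```python
import re

# One histogram row: interval, count, prob, spamicity, then an optional bar.
# The layout is taken from the output pasted above and may differ elsewhere.
ROW = re.compile(r"^\s*(\d\.\d{2})\s+(\d+)\s+(\d+\.\d+)\s+(\d+\.\d+)")


def scored_token_count(verbose_output):
    """Sum the cnt column over all histogram rows in 'bogofilter -vv' output."""
    total = 0
    for line in verbose_output.splitlines():
        match = ROW.match(line)
        if match:
            total += int(match.group(2))
    return total


if __name__ == "__main__":
    import sys
    # Usage: bogofilter -vv < message | python count_rows.py
    print(scored_token_count(sys.stdin.read()))
```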
thanks for the info,
--
Dwayne Hottinger
Network Administrator
Harrisonburg City Public Schools