My wordlist doesn't detect spam very well anymore

Jonathan Kamens jik at kamens.us
Sun Feb 16 20:17:49 CET 2020


Here's my current .bogofilter.cf:

db_cachesize=84
robs=0.0100
min_dev=0.394
robx=0.439254
sp_esf=0.154134
ns_esf=0.866025
spam_cutoff=0.997989	# for 0.01% fp (1); expect 5.65% fn (310).
#spam_cutoff=0.000001	# for 0.05% fp (8); expect 4.25% fn (233).
#spam_cutoff=0.000000	# for 0.10% fp (16); expect 4.25% fn (233).
#spam_cutoff=0.000000	# for 0.20% fp (32); expect 4.12% fn (226).
ham_cutoff=0.100	

On 2/15/20 12:13 PM, Teemu Likonen wrote:
> Jonathan Kamens [2020-02-09T07:47:37-05] wrote:
>
>> In my experience, you need to save a big corpus of known, recent spam
>> and ham messages and periodically run bogotune to determine the
>> currently optimal parameters for your .bogofilter.cf file. Personally,
>> I save copies of all spam and ham messages going back for months and
>> run bogotune once per month. My .MSG_COUNT is currently 66,915 ham and
>> 114,479 spam.
> I would like to know what values bogotune gives you with your big
> database. Obviously the values are not useful for anybody else's system
> but I think it would be interesting to see how far the values can travel
> from the default values of robx, robs, min_dev, ham_cutoff, spam_cutoff.
>


More information about the bogofilter mailing list