fisher algorithm

Graham Wilson bob at decoy.wox.org
Mon Nov 25 00:47:58 CET 2002


why is the spam_cutoff so high for the fisher algorithm?

i remember reading that for mails that fisher knows are spam it usually
returns very high or very low spamicity numbers? is that why the cutoff
is so high? that is, to only catch mails it knows are definitely spam?

also, what value for spam_cutoff are other people using with the fisher
algorithm?

i also remember reading [1] that the fisher algorithm has a `middle
ground'. what are the bounds, with regard to spamicity values, and how
should i treat emails in that range? spam? non-spam? so far, most of the
mails that i have received with spamicity greater than 0.0 (using the
fisher method) have been spam, so i am inclined to lean toward spam as
the answer to that question.

[1] in <20021118182239.GA14201 at athame.dynamicro.on.ca>, greg louis'
    email about spambayes and the fisher method.

--
gram




More information about the Bogofilter mailing list