Testing fisher
David Relson
relson at osagesoftware.com
Tue Jan 28 15:30:18 CET 2003
pi,
An interesting table. Your best value of min_dev seems to depend on
whether a non-zero F-P is acceptable to you.
min_dev=0.2 gives 184 F-N and 0 F-P, while min_dev of 0.015, 0.02, and
0.025 give essentially identical results of 89-92 F-N and 1 F-P. (I've
extracted the min_dev lines I mention.)
fisher-2   0.20     0.60         4221   184   15140  0
fisher-2   0.025    0.60         4262   89    15251  1
fisher-2   0.02     0.60         4297   92    15362  1
fisher-2   0.015    0.60         4295   92    15361  1
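Since the test set sizes differ slightly from row to row, it may be easier to compare rates than raw counts. Here is a rough sketch in Python that does this for the four lines I extracted. The helper names (error_rates, best_min_dev) are made up for illustration and are not part of bogofilter; the "allowed F-P" threshold is just my way of expressing the trade-off.

```python
# Rows copied from the four extracted lines:
# (min_dev, spam_total, false_negatives, ham_total, false_positives)
ROWS = [
    (0.20, 4221, 184, 15140, 0),
    (0.025, 4262, 89, 15251, 1),
    (0.02, 4297, 92, 15362, 1),
    (0.015, 4295, 92, 15361, 1),
]


def error_rates(row):
    """Return (F-N rate, F-P rate) as fractions of the spam and ham test sets."""
    _, spam_total, f_n, ham_total, f_p = row
    return f_n / spam_total, f_p / ham_total


def best_min_dev(rows, max_f_p):
    """Hypothetical helper: min_dev with the lowest F-N rate among rows
    whose false-positive count does not exceed max_f_p."""
    allowed = [r for r in rows if r[4] <= max_f_p]
    if not allowed:
        return None
    return min(allowed, key=lambda r: error_rates(r)[0])[0]


if __name__ == "__main__":
    for row in ROWS:
        fn_rate, fp_rate = error_rates(row)
        print(f"min_dev={row[0]:<5} F-N {fn_rate:.2%}  F-P {fp_rate:.4%}")
    print("no F-P allowed: ", best_min_dev(ROWS, 0))
    print("one F-P allowed:", best_min_dev(ROWS, 1))
```

In rate terms, min_dev=0.2 misses about 4.4% of the spam, against about 2.1% for the three small values, so accepting a single F-P roughly halves the F-N rate on this data.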
So, having done your testing, what values are you actually using?
David
P.S. "Total" is the better column name.
At 09:16 AM 1/28/03, Boris 'pi' Piwinger wrote:
>David Relson wrote:
>
>Note: The "total" columns give the total size of the test base, not
>the number of correct detections.
>
>algorithm  min_dev  spam_cutoff  test.spam    test.ham
>                                 total  F-N   total  F-P
>fisher-2   0.10     0.95         4186   364   15140  1
>fisher-2   0.25     0.60         4335   191   15362  0
>fisher-2   0.20     0.60         4221   184   15140  0
>fisher-2   0.15     0.60         4237   170   15251  0
>fisher-2   0.10     0.60         4221   139   15140  1
>fisher-2   0.075    0.60         4237   132   15251  1
>fisher-2   0.05     0.60         4237   116   15251  1
>fisher-2   0.035    0.60         4262   101   15251  1
>fisher-2   0.025    0.60         4262   89    15251  1
>fisher-2   0.02     0.60         4297   92    15362  1
>fisher-2   0.015    0.60         4295   92    15361  1
>fisher-2   0.00     0.60         4221   140   15140  1
>
>pi