more spamitarium results

Tom Allison tallison at tacocat.net
Fri May 14 18:30:33 CEST 2004


Ran some more tests looking at the average score and not the attribute 
counts of ham/spam/unsure evaluations.

The consistency between all the spamitarium tests that have parms -rad 
seems to be related to an observation that the resulting email has a 
diff of typically one or two words (I only saw one, but I'm easy).

Average - Average		Test		
Corpus	Parms	0		1		2
ham	none	0.0049788	0.0062384	0.0059236
	radw	0.0065136	0.0071255	0.0084594
	readw	0.0065136	0.0071255	0.0084594
	sradw	0.0065136	0.0071255	0.0084594
	sreadw	0.0065136	0.0071255	0.0084594
	sw	0.0065162	0.0071285	0.0084625
spam	none	0.9598745	0.9732174	0.9700802
	radw	0.9556415	0.9707304	0.9707012
	readw	0.9556415	0.9707304	0.9707012
	sradw	0.9556416	0.9707307	0.9707013
	sreadw	0.9556416	0.9707307	0.9707013
	sw	0.9556426	0.9707308	0.9707037





More information about the Bogofilter mailing list