Re casefolding

michael at optusnet.com.au michael at optusnet.com.au
Wed May 14 09:57:31 CEST 2003


"Peter Bishop" <pgb at adelard.com> writes:
[..]
> False negative performance
> 
> test 	train	spams	fn	fn(with-caps)
> 2.gz	3.gz	3876	19	14
> 3.gx	2.gz	1907	 9	 5
> 
> I am a bit suspicious about the first result as the
> count of spams (as split up by formail) changed
> from the unkludged version (increased by 6 to 3822)..
> 
> The changes look they are just about significant
> might expect a variation of 19+-4 to and 9+-3 from chance variation

Peter, I suspect it's difficult to see how much your
patch it actually affecting things owning to the small
size of your database.

Could you possibly post the patch for this? I'll run it
again a more sizable corpus I have.

Michael.




More information about the Bogofilter mailing list