Re casefolding
michael at optusnet.com.au
Wed May 14 09:57:31 CEST 2003
"Peter Bishop" <pgb at adelard.com> writes:
[..]
> False negative performance
>
> test  train  spams  fn  fn(with-caps)
> 2.gz  3.gz   3876   19  14
> 3.gx  2.gz   1907    9   5
>
> I am a bit suspicious about the first result as the
> count of spams (as split up by formail) changed
> from the unkludged version (increased by 6 to 3822)..
>
> The changes look like they are just about significant:
> one might expect a variation of 19+-4 and 9+-3 from chance variation.
Peter, I suspect it's difficult to see how much your
patch is actually affecting things, owing to the small
size of your database.
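
As a rough check on the chance-variation figures quoted above, here is a
minimal sketch. It assumes the false-negative counts behave like Poisson
counts, so the one-sigma variation of a count n is about sqrt(n), and it
treats the two runs as independent, which is only an approximation since
both runs score the same messages. The function names are made up for
illustration and are not part of bogofilter.

```python
import math

def poisson_sigma(count):
    """One-sigma chance variation of a count, assuming Poisson statistics."""
    return math.sqrt(count)

def difference_in_sigmas(before, after):
    """Difference between two counts in units of their combined sigma.

    Assumes both counts are independent Poisson counts (an approximation).
    """
    return (before - after) / math.sqrt(before + after)

# False-negative counts quoted above: (without caps, with caps)
for before, after in [(19, 14), (9, 5)]:
    print(
        before,
        after,
        round(poisson_sigma(before), 1),
        round(difference_in_sigmas(before, after), 2),
    )
```

Under these assumptions both differences come out at roughly one sigma,
which is why a larger corpus would help.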
Could you possibly post the patch for this? I'll run it
against a more sizable corpus I have.
Michael.