Fw: Mixed case token handling
Jef Poskanzer
jef at acme.com
Fri May 30 21:25:05 CEST 2003
>This would help migration from a casefolded database as classification
>algorithn would degenerate to the existing lower case method and
>performance would be no worse than before.
I'm not 100% sure I'm following the discussion correctly, but
couldn't you also handle the migration issue with a little script
that dumps the database, duplicates all-lowercase tokens with
capitalized and all-uppercase versions, and makes a new db?
---
Jef
Jef Poskanzer jef at acme.com http://www.acme.com/jef/
More information about the Bogofilter
mailing list