Fw: Mixed case token handling

Jef Poskanzer jef at acme.com
Fri May 30 21:25:05 CEST 2003


>This would help migration from a casefolded database, as the classification
>algorithm would degenerate to the existing lower case method and
>performance would be no worse than before.

I'm not 100% sure I'm following the discussion correctly, but
couldn't you also handle the migration issue with a little script
that dumps the database, duplicates all-lowercase tokens with
capitalized and all-uppercase versions, and makes a new db?
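
A rough sketch of what such a script might look like, in Python. It
assumes the dump is plain text with one record per line, the token
first and the counts (and anything else) after whitespace; that layout
and the name expand_case_variants are assumptions made for
illustration, not the actual dump format or an existing tool. Variants
that already exist in the dump are left alone rather than overwritten.

```python
def expand_case_variants(lines):
    """Return dump records with case variants added for all-lowercase tokens.

    Assumed (hypothetical) record layout: "<token> <rest of record>".
    Each all-lowercase token is copied as Capitalized and UPPERCASE,
    reusing the original record's counts unchanged.
    """
    # Split each non-blank line into the token and the rest of the record.
    records = [line.split(None, 1) for line in lines if line.strip()]
    # Tokens already in the dump; never emit a second record for these.
    existing = {rec[0] for rec in records}

    out = []
    for rec in records:
        token = rec[0]
        rest = rec[1].rstrip() if len(rec) > 1 else ""
        out.append((token, rest))
        # islower() is true only if the token has cased characters
        # and all of them are lowercase.
        if token.islower():
            for variant in (token.capitalize(), token.upper()):
                if variant not in existing:
                    existing.add(variant)
                    out.append((variant, rest))

    return [f"{tok} {rest}".rstrip() for tok, rest in out]
```

The output lines would then be loaded into a fresh database with
whatever tool produced the dump. Copying the counts verbatim means the
new mixed-case database starts out classifying the same way the
casefolded one did.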
---
Jef

         Jef Poskanzer  jef at acme.com  http://www.acme.com/jef/




