suggestions/requests
    Eric S. Raymond 
    esr at thyrsus.com
       
    Thu Jan 23 00:07:24 CET 2003
    
    
  
Dew-Jones, Malcolm MSER:EX <Malcolm.DewJones at gems5.gov.bc.ca>:
> For each message you generate all appropriate pseudo words, and then add one
> 
> to erach pseudo word in the word list.  For example the pseudo word
> "document 
> length in bytes 101-500" would count the number of documents that fell in
> that 
> length range.
This is a good idea, but IMO it doesn't belong in bogofilter itself.
Bogofiter should stick to doing one thing -- Bayesian analysis of 
presented features -- and doing it well.
Your statistics should be gathered by a separate feature extractor which
feeds bogofilter.  I'm working on a framework for such tests now; it's
called `bogometer'.
Let's keep this project small and clean and lightweight, people.
-- 
		<a href="http://www.tuxedo.org/~esr/">Eric S. Raymond</a>
    
    
More information about the bogofilter-dev
mailing list