testing partial wordlists

David Relson relson at osagesoftware.com
Sun Feb 6 01:22:38 CET 2005


On 05 Feb 2005 18:56:31 -0500
Tom Anderson wrote:

> On Sat, 2005-02-05 at 17:49, David Relson wrote:
> > Indeed!!  It'd be interesting to know if there're holiday effects.  I've
> > got no info one way or t'other. 
> > 
> > Now I've got some actual numbers and don't have to imagine to decide
> > what to keep or what to pitch.  Given the numbers I got, I've removed
> > hapaxes and tokens older than a year.
> > 
> > I just need to remain vigilant for a while -- in case I've removed too
> > much!
> 
> What are the commands you would use to trim a wordlist by X months? And
> hapaxes?  It'd be nice to have a FAQ entry for this.  Or better yet, a
> maintenance script which can be run via cron like once a month which
> does this trimming, compacts the database, and makes a backup.
> 
> Tom

Hi Tom,

The options are in the man pages and the help messages.  Hint:  look for
age, date, and count related options.

Given people's varying wishes for maintenance, a useful script seems
unlikely.

David



More information about the Bogofilter mailing list