testing partial wordlists
David Relson
relson at osagesoftware.com
Sun Feb 6 01:22:38 CET 2005
On 05 Feb 2005 18:56:31 -0500
Tom Anderson wrote:
> On Sat, 2005-02-05 at 17:49, David Relson wrote:
> > Indeed!! It'd be interesting to know if there're holiday effects. I've
> > got no info one way or t'other.
> >
> > Now I've got some actual numbers and don't have to imagine to decide
> > what to keep or what to pitch. Given the numbers I got, I've removed
> > hapaxes and tokens older than a year.
> >
> > I just need to remain vigilant for a while -- in case I've removed too
> > much!
>
> What are the commands you would use to trim a wordlist by X months? And
> hapaxes? It'd be nice to have a FAQ entry for this. Or better yet, a
> maintenance script which can be run via cron like once a month which
> does this trimming, compacts the database, and makes a backup.
>
> Tom
Hi Tom,
The options are in the man pages and the help messages. Hint: look for
age, date, and count related options.
Given people's varying wishes for maintenance, a useful script seems
unlikely.
David
More information about the Bogofilter
mailing list