Wordlist Histogram [was: What did I do wrong?]

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Thu Feb 19 14:51:21 CET 2004


David Relson wrote:

[bogoutil -H]
> hapaxes:  ham  375505 (29.72%), spam  443797 (35.12%)
>    pure:  ham  562881 (44.55%), spam  616022 (48.75%)

What does "pure" mean here? Tokens that have been seen
only once in one category, but possibly many times in the
other?

BTW: The -H option is not in the man page.

pi



