multi-user [was: New Release: Bogofilter 1.0.0]
David Relson
relson at osagesoftware.com
Wed Dec 28 14:19:24 CET 2005
On Wed, 28 Dec 2005 07:09:03 -0500
Greg Louis wrote:
...[snip]...
> A large ISP in Australia is using a modified version of
> Bogofilter with a single wordlist to watch 150,000 mailboxes. Over 1
> million messages are processed per day. Bogofilter is believed to be
> around 95% effective in this environment, with no false positives
> reported in 6 months of operation. The wordlist management is
> completely centralized, with no user input whatsoever. Administrators
> keep Bogofilter's training current by manually scanning and training on
> random samplings of 100-300 "unsure" emails per week.
Interesting idea -- setting aside copies of the unsures for review!
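
A rough sketch of that idea, for illustration only and not the ISP's actual setup: assuming messages have been run through bogofilter in passthrough mode, so that each one carries an X-Bogosity header, a small script could pick a random batch of unsures for an administrator to look at. The function names (is_unsure, sample_unsure) and the batch size are made up for this example.

```python
import random
from email import message_from_string


def is_unsure(raw_message: str) -> bool:
    """Return True if the message's X-Bogosity header starts with 'Unsure'."""
    header = message_from_string(raw_message).get("X-Bogosity", "")
    return header.strip().lower().startswith("unsure")


def sample_unsure(raw_messages, k, seed=None):
    """Pick up to k 'unsure' messages at random for manual review.

    raw_messages: iterable of full message texts (headers plus body).
    k: hypothetical weekly review budget, e.g. somewhere in 100-300.
    seed: optional, only to make a run repeatable.
    """
    unsure = [m for m in raw_messages if is_unsure(m)]
    rng = random.Random(seed)
    # random.sample raises ValueError if k exceeds the population size,
    # so cap k at the number of unsure messages actually available.
    return rng.sample(unsure, min(k, len(unsure)))
```

The reviewed batch would then be registered as spam or ham by whatever training procedure the site already uses; that step is left out here on purpose.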
...[snip]...
> I met with the York U team on Sept. 8, 2004. They were still happy and
> enthusiastic, and looking forward to their first major statistical
> summaries. Unfortunately, we didn't follow up on that, so I never saw
> the actual performance figures.
>
> Individual databases for a user community of that scale aren't a
> practical idea; the unexpected and delightful observation was -- and
> everyone who has tried it has seen it -- that one database works
> extremely well even for a widely disparate user population, and even
> with limited (though careful) training.
Greg,
Sounds like time to ping the folks at York and get the "year later"
report!
Regards,
David