multi-user [was: New Release: Bogofilter 1.0.0]

David Relson relson at osagesoftware.com
Wed Dec 28 14:19:24 CET 2005


On Wed, 28 Dec 2005 07:09:03 -0500
Greg Louis wrote:


...[snip]...

>         A large ISP in Australia is using a modified version of
> Bogofilter with a single wordlist to watch 150,000 mailboxes.  Over 1
> million messages are processed per day.  Bogofilter is believed to be
> around 95% effective in this environment, with no false positives
> reported in 6 months of operation.  The wordlist management is
> completely centralized, with no user input whatsoever.  Administrators
> keep Bogofilter's training current by manually scanning and training on
> random samplings of 100-300 "unsure" emails per week.

Interesting idea -- setting aside copies of unsure's for review!

...[snip]...

> I met with the York U team on Sept. 8, 2004.  They were still happy and
> enthusiastic, and looking forward to their first major statistical
> summaries.  Unfortunately, we didn't follow up on that, so I never saw
> the actual performance figures.
> 
> Individual databases for that scale of user community isn't a practical
> idea; the unexpected and delightful observation was -- and we've all
> seen it who've tried it -- that one database works extremely well even
> for a widely disparate user population, and even with limited (though
> careful) training.

Greg,

Sounds like time to ping the folks at York and get the "year later"
report!

Regards,

David




More information about the Bogofilter mailing list