markovian matching & dobly noise reduction

Greg Louis glouis at dynamicro.on.ca
Thu Feb 26 12:06:57 CET 2004


On 20040226 (Thu) at 0208:35 -0500, Tom Anderson wrote:
> http://yro.slashdot.org/article.pl?sid=04/02/24/0025219&mode=nested
> 
> These methods sound useful... has anyone looked into integrating them
> into bogofilter?
> 
Haven't looked at Markov but I've been doing some experiments with
Dobly.  Not overly encouraging so far: in the best test yet, I got
about a 15% reduction in error rate from a version of bogofilter that
was (a) hacked to use token pairs, and (b) running a slightly
simplified Dobly.  Considering the humungous overhead penalty that
these changes impose, I'd need to find a way to turn that 15% into
something like 90% before it could be worth putting in mainline, and
even then, major performance (speed) enhancement would be essential.

-- 
| G r e g  L o u i s         | gpg public key: 0x400B1AA86D9E3E64 |
|  http://www.bgl.nu/~glouis |   (on my website or any keyserver) |
|  http://wecanstopspam.org in signatures helps fight junk email. |




More information about the Bogofilter mailing list