Deb package

Tom Allison tallison at tacocat.net
Tue Jan 28 12:19:37 CET 2003


David Relson wrote:
> At 10:08 AM 1/27/03, tallison at tacocat.net wrote:
> 
>> What version is the latest & greatest deb package?
>>
>> I'm specifically wondering if the .deb package will support the ternary
>> scoring system (yes, no, maybe)
> 
> 
> Tom,
> 
> I'm not sure what the current debian build is.  Clint Adams has been 
> handling that end of things.
> 
> The yes, no, unsure scoring to which you refer is a capability of the 
> Robinson-Fisher algorithm and is in the current stable version of 
> bogofilter, i.e. 0.9.1.2, and in the current version, 0.10.1.1.
> 
> David


bogofilter version 0.10.1

OK!  A few changes in there to the docs....

And now for a few questions/feedback:

From the manpage:

  Since then, Robinson and others have realized that  the  S
        calculation  can  be  further  optimized:  if  a vector of
        length k contains random, uniformly-distributed probabili-
        ties p, then -2 * sum(ln(p)) is distributed as chi-squared
        with 2n degrees of freedom. This is  believed  to  be  the
        most  sensitive  test of the hypothesis that the vector of
        probabilities is, in fact, uniformly distributed. Bogofil-
        ter  now offers the option of applying this test (known as
        Fisher's method) to yield P(spam)  and  P(not  spam),  and
        using the difference as the "spamicity" score.

Is this the Robinson-Fisher method that you reference later on in 
the options?  It's not identified as such here, and there's no 
explanation of why -f would behave differently from -r, or how.
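
For anyone else trying to follow that paragraph, here is a minimal Python
sketch of the test as I read it.  It is not bogofilter's code.  The function
names are made up for illustration, the degrees of freedom are taken as
twice the vector length, and the (S - H + 1) / 2 scaling of the "difference"
is an assumption borrowed from the commonly described chi-squared combining
scheme, not something the manpage states.  Each probability is assumed to lie
strictly between 0 and 1.

```python
import math


def chi2_sf_even(x, dof):
    """Upper-tail probability of a chi-squared variable with an even
    number of degrees of freedom (closed-form series, no SciPy needed)."""
    if dof <= 0 or dof % 2 != 0:
        raise ValueError("dof must be a positive even integer")
    m = x / 2.0
    term = math.exp(-m)
    total = term
    for i in range(1, dof // 2):
        term *= m / i
        total += term
    # Guard against rounding pushing the sum slightly above 1.
    return min(total, 1.0)


def fisher_combine(probs):
    """Fisher's method: -2 * sum(ln(p)) compared against chi-squared
    with 2 * len(probs) degrees of freedom."""
    stat = -2.0 * sum(math.log(p) for p in probs)
    return chi2_sf_even(stat, 2 * len(probs))


def spamicity(word_probs):
    """Hypothetical combination of the two tests into one score in [0, 1].
    word_probs holds per-token spam probabilities, each in (0, 1)."""
    s = 1.0 - fisher_combine([1.0 - p for p in word_probs])  # evidence for spam
    h = 1.0 - fisher_combine(list(word_probs))               # evidence for ham
    return (s - h + 1.0) / 2.0
```

With this sketch, a message whose tokens all sit near 0.99 scores close to
1, one whose tokens sit near 0.01 scores close to 0, and a message with
evenly split evidence lands at 0.5.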



The  -3 option tells bogofilter to use three-state classi-
        fication for the message, i.e.  classify  the  message  as
        ham,  spam,  or  unsure.  This option is effective only if
        ham_cutoff is non-zero.

Besides a default in /etc/bogofilter.rc, it might be nice to 
have a suggested value here:

"...ham_cutoff is non-zero.  (try 0.10)"
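
To make the suggestion concrete, this is roughly how I picture the
three-state decision working.  It is only a sketch in Python: the classify
function is hypothetical, the 0.95 spam_cutoff is a placeholder I picked for
the example, and how scores exactly on a cutoff are treated is my guess
rather than anything the manpage specifies.

```python
def classify(score, ham_cutoff=0.10, spam_cutoff=0.95):
    """Return 'ham', 'spam', or 'unsure' for a spamicity score in [0, 1].

    A ham_cutoff of zero disables the middle band, mirroring the manpage
    note that -3 is effective only if ham_cutoff is non-zero.
    """
    if score >= spam_cutoff:
        return "spam"
    if ham_cutoff <= 0:
        # Two-state behaviour: everything below spam_cutoff is ham.
        return "ham"
    if score <= ham_cutoff:
        return "ham"
    return "unsure"
```

With the suggested 0.10, a score of 0.05 would come back as ham, 0.50 as
unsure, and 0.99 as spam; with ham_cutoff set to 0, that same 0.50 would be
reported as ham.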


I thought that MIME was going to be decoded.  What killed that 
idea?  Performance?  What if I'm stubborn and want to decode MIME 
anyway?  I know there have been various posts about the tools and 
methods people use.  Did anything decisive come of them?

(Sorry, I've been wading around on other lists lately and missed a 
lot here.)

-- 
If your hands are clean and your cause is just and your demands are
reasonable, at least it's a start.




