Statistics? Graphics?

David Relson relson at osagesoftware.com
Sun May 22 04:00:27 CEST 2005


On Sat, 21 May 2005 11:46:40 -0700
David Carmean wrote:

> 
> Here's where I am after a few hours of learning to use Ploticus 
> and writing a couple of perl scripts to parse the log I've been 
> keeping of message spamicity scores:
> 
>     http://www.halibut.com/~dlc/tmp/bogodata.png
> 
> You can see where I upgraded from 0.17.2 to 0.94.11 on May 14.  
> You can also see by the large number of green points the few weeks 
> before that how I was having more and more trouble with unflagged 
> spam. (I had been kind of lazy with keeping up on the training).
> 
> On that day I also purged my wordlist of all tokens more than 
> a year old.
> 
> The points above 1.000 and below 0.000 are Ploticus's point "clustering" 
> feature for scatterplots which makes multiple coincident points more visible 
> by ofsetting them a little.
> 
> Once I get this all dialed in I'll share my perl and Ploticus scripts 
> with all who wish.

Hi David,

That'll be pretty cool.  I've got a variety of statistics gathered
(mostly stats from procmail logs, i.e. number of messages put in
various folders, e.g. spam, relson, ...).  I also have a record of
corrections (FP, FN, etc) fed to bogofilter.

A while back I did a quick/dirty plot of the procmail spam counts.
It's rude and crude and needs cleaning.  For those who're interested,
it's at http://www.osagesoftware.com/bogofilter/counts.png

Regards,

David



More information about the Bogofilter mailing list