bogofilter resistant email

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Thu Feb 12 10:30:25 CET 2004


Tom Anderson wrote:

> The attached email came in as unsure with a spamicity of 0.493192.  I
> registered it as spam, and it increased to 0.500000.  I've registered it
> five more times, and the spamicity remains at 0.500000.  I think perhaps
> the sheer number of common hammish words are the culprit.  Has anyone
> else gotten impossible to filter emails like this?  It differs from the
> "random word" emails because most of the words in this email are common
> whereas the random words are usually unique.  I fear that registering
> this one too much will distort my database overall.  I wonder if giving
> more weight to the header tokens would be a good idea.

I don't see a problem with your mail:
> bogofilter -vv<boss.eml
> X-Bogosity: Spam, spamicity=0.825, version=0.17.1
>    int  cnt   prob  spamicity histogram
>   0.00   10 0.042248 0.015940 ##########
>   0.10   16 0.180391 0.080471 ################
>   0.20    0 0.000000 0.080471
>   0.30    0 0.000000 0.080471
>   0.40    0 0.000000 0.080471
>   0.50    0 0.000000 0.080471
>   0.60    0 0.000000 0.080471
>   0.70    0 0.000000 0.080471
>   0.80    5 0.827035 0.218653 #####
>   0.90   36 0.971457 0.592296 ####################################

Looks all nice. Also -vvv does not show anything particular
interesting. I am really surprised if the message does not
change the value with every time you register. What is you
-vv output? What is bogoutil -w ~/.bogofilter .MSG_COUNT
(replace directory as needed)?

pi




More information about the Bogofilter mailing list