bogofilter resistant email
Boris 'pi' Piwinger
3.14 at logic.univie.ac.at
Thu Feb 12 10:30:25 CET 2004
Tom Anderson wrote:
> The attached email came in as unsure with a spamicity of 0.493192. I
> registered it as spam, and it increased to 0.500000. I've registered it
> five more times, and the spamicity remains at 0.500000. I think perhaps
> the sheer number of common hammish words are the culprit. Has anyone
> else gotten impossible to filter emails like this? It differs from the
> "random word" emails because most of the words in this email are common
> whereas the random words are usually unique. I fear that registering
> this one too much will distort my database overall. I wonder if giving
> more weight to the header tokens would be a good idea.
I don't see a problem with your mail:
> bogofilter -vv<boss.eml
> X-Bogosity: Spam, spamicity=0.825, version=0.17.1
> int cnt prob spamicity histogram
> 0.00 10 0.042248 0.015940 ##########
> 0.10 16 0.180391 0.080471 ################
> 0.20 0 0.000000 0.080471
> 0.30 0 0.000000 0.080471
> 0.40 0 0.000000 0.080471
> 0.50 0 0.000000 0.080471
> 0.60 0 0.000000 0.080471
> 0.70 0 0.000000 0.080471
> 0.80 5 0.827035 0.218653 #####
> 0.90 36 0.971457 0.592296 ####################################
Looks all nice. Also -vvv does not show anything particular
interesting. I am really surprised if the message does not
change the value with every time you register. What is you
-vv output? What is bogoutil -w ~/.bogofilter .MSG_COUNT
(replace directory as needed)?
pi
More information about the Bogofilter
mailing list