[bogofilter] using block_on_subnets
tallison at tacocat.net
tallison at tacocat.net
Fri Apr 30 18:12:59 CEST 2004
> On Fri, 2004-04-30 at 06:33, Tom Allison wrote:
>> Just for grins, I rebuilt my wordlist using this subnets option
>> enabled.
>> My current database is about 50% larger than it was previously.
>> It will be interesting to see how it grows in the coming week.
>
> Using http://www.orderamidchaos.com/bogofilter/spamitarium, I've been
> inserting Autonomous System Numbers (ASNs) in all of my received lines,
> as this achieves the same basic goal as block_on_subnets, without the
> huge wordlist bloat.
>
What is an ASN?
>> I'm pretty certain that these are all invalid URL's.
>> I just surprised at how many of them are also "good"
>
> It's also possible that 0, 0.0, etc., were used in a different context
> than a URL, but bogofilter parsed it wrong. It's also possible that
> some MTAs might print 0.0.0.0 when they fail to obtain a lookup for the
> correct IP. My program throws away received lines without a valid IP,
> particularly IPs in reserved ranges, impossible IPs, and local IPs.
>
>> If it seems reasonable enough to start looking into more feasable
>> studies, then it might make sense to collect bogofilter wordlist
>> information from other peoples ^url: listings to see if there is
>> sufficient and consistent overlap to provide a reliable means of
>> detection.
>
> As I said before, I'm not using the block_on_subnets option due to the
> disk space considerations, but ASNs can achieve the same goal. Here are
> some of my top scorers (>25 seen):
>
I read through some of the description/code of spamitarium and have one
comment. The removal of some X-Headers could be "bad". I love getting
X-List headers so I can filter on my mailing lists. It also is useful to
keep X-Loop headers around to avoid other problems.
More information about the Bogofilter
mailing list