support for multiple wordlists
David Relson
relson at osagesoftware.com
Tue May 18 01:12:13 CEST 2004
On 17 May 2004 18:56:45 -0400
Tom Anderson wrote:
> David,
>
> The value is precisely the same as for using a "system" list, except
> that some people don't have a "system". This way, individual users
> can still have the benefit of failover to a less tailored, more
> general wordlist when a token is not found in their own list. Imagine
> the space savings possible when 50+% of tokens are sufficiently scored
> in a remote list. Thousands of people can use the same 20M list
> without having to laboriously train their own. Corrections can go in
> their own individual list for increased accuracy, but with
> significantly less resources than maintaining the whole thing locally.
>
> Moreover, as I described above, lists tailored for specific kinds of
> people such as programmers, doctors, lawyers, housewives, sports
> enthusiasts, etc., can get new users off to a great start very
> quickly. Similarly ignore lists for doctors might come prepacked with
> terms like"viagra", while ignore lists for chefs might include
> "breast", etc.
>
> Plus, a remote list can be maintained somewhat more "professionally"
> than general users might be able to achieve on their own. And they
> can be trained on spamtrap spams that have just been released into the
> wild before the users of the list have ever received the new spams.
>
> Clearly, these are very valuable reasons for including the capability
> of looking up tokens in wordlists over the internet. Wget and rsync
> are not viable options. You need to think like an end-user... the
> entire process needs to be as transparent as possible. You should be
> able to set a config option once at setup, and everything else should
> happen automatically.
>
> Tom
Tom,
Introducing a client/server architecture is non-trivial. Making it
secure is an additional problem. It may well be of value. I don't
know.
David
More information about the Bogofilter
mailing list