OT: Chunking the cruft - random lettered words
Tom Allison
tallison at tacocat.net
Wed Mar 17 00:08:47 CET 2004
Tig wrote:
> On Mon, 15 Mar 2004 09:07:02 -0500
> "Eric Wood" <eric at interplas.com> wrote:
>
> <snip>
>
>>Solved. Okay, my only other nuscience email comes with lots of
>
> random
>
>>words in it:
>>
>>wogwo gwoehg gjjdjgdd ......
>>
>>I've trained till I'm blue in the face. The procmail list didn't
>>yeild a magic rule to help me with this. Does anyone have a trick for
>>this kind of email?
>>
>>Thanks,
>>-Eric Wood
>>
>
>
> Could you do some kind of test against percentage of known to unknown
> words from a dictionary file (most *nix installs have one, mine is
> /usr/share/dict/words)?
>
> I'm guessing with some time a little bit shell/perl scripting you could
> possibly come up with something.
>
I would try the perl script first and see how it pans out.
pipe it after bogofilter since bogofilter is already 99.9% effective and
the theory we're testing here is that the remaining 0.1% can't spell
worth a d at rn.
More information about the Bogofilter
mailing list