OT: Chunking the cruft - random lettered words

Tom Allison tallison at tacocat.net
Wed Mar 17 00:08:47 CET 2004


Tig wrote:
> On Mon, 15 Mar 2004 09:07:02 -0500
> "Eric Wood" <eric at interplas.com> wrote:
> 
> <snip>
> 
>>Solved.  Okay, my only other nuscience email comes with lots of
> 
> random
> 
>>words in it:
>>
>>wogwo gwoehg gjjdjgdd ......
>>
>>I've trained till I'm blue in the face.  The procmail list didn't
>>yeild a magic rule to help me with this.  Does anyone have a trick for
>>this kind of email?
>>
>>Thanks,
>>-Eric Wood
>>
> 
> 
> Could you do some kind of test against percentage of known to unknown
> words from a dictionary file (most *nix installs have one, mine is
> /usr/share/dict/words)?
> 
> I'm guessing with some time a little bit shell/perl scripting you could
> possibly come up with something.
> 

I would try the perl script first and see how it pans out.
pipe it after bogofilter since bogofilter is already 99.9% effective and 
the theory we're testing here is that the remaining 0.1% can't spell 
worth a d at rn.





More information about the Bogofilter mailing list