html_tokenizer

Barry Gould BarryGould at PennySaverUSA.net
Fri Feb 21 01:45:22 CET 2003


At 03:58 PM 2/20/2003, David Relson wrote:
>Perhaps not relevant, recently I've noticed a lot of garbage strings 
>inside of spam.  It often looks like character sequences straight from the 
>keyboard, i.e. "qwertyuiop", "asdf", etc.  Remebering that I had noticed 
>"asdf" in one spam, I ran "grep -c asdf spam.Feb.2003/*" and found that 85 
>of 3563 spam I've received this month contain that particular string.

I saw this:
wlo jraabeor myw fecz usi doycs
with subject of
fnk gwid

in a spam today.
It was the ONLY text in eyespace; the rest was images with html links.

Another had the subject
durabilisy ffrm
with body
wrecyy tdmv bmx rojog itdlzsq mifk vsgrbgnfgynhedrszqurkahih kokspl twu cqd 
ico

I've received several of these in the last week or two.

I'm concerned that these "tokens" will start filling up the databases soon.
Looks like we're going to have to start doing routine DB maintenance.

Look what else I found in another spam (looks like their "RANDOMIZE" script 
broke :) )
<x-html>
<!-- saved from url=(0022)http://internet.e-mail -->
<HTML>
<BODY BGCOLOR=white><center><font color=#FFFFFF>[RANDOMIZE]-[RANDOMIZE]

<A HREF=http://www.onlinedns.org/x2/><font
color=#FFFFFF>[RANDOMIZE]-[RANDOMIZE]</font></a><Br>
<a href="http://www.onlinedns.org/x2/">
<img src="http://onlinedns.org/3.gif" border=0>
</a><font color=#FFFFFF>
[RANDOMIZE][RANDOMIZE][RANDOMIZE][RANDOMIZE]
</body></HTML>
</x-html>

The format of this message is almost identical to the randomized messages 
above.

Barry





More information about the Bogofilter mailing list