html_tokenizer
Barry Gould
BarryGould at PennySaverUSA.net
Fri Feb 21 01:45:22 CET 2003
At 03:58 PM 2/20/2003, David Relson wrote:
>Perhaps not relevant, recently I've noticed a lot of garbage strings
>inside of spam. It often looks like character sequences straight from the
>keyboard, i.e. "qwertyuiop", "asdf", etc. Remebering that I had noticed
>"asdf" in one spam, I ran "grep -c asdf spam.Feb.2003/*" and found that 85
>of 3563 spam I've received this month contain that particular string.
I saw this:
wlo jraabeor myw fecz usi doycs
with subject of
fnk gwid
in a spam today.
It was the ONLY text in eyespace; the rest was images with html links.
Another had the subject
durabilisy ffrm
with body
wrecyy tdmv bmx rojog itdlzsq mifk vsgrbgnfgynhedrszqurkahih kokspl twu cqd
ico
I've received several of these in the last week or two.
I'm concerned that these "tokens" will start filling up the databases soon.
Looks like we're going to have to start doing routine DB maintenance.
Look what else I found in another spam (looks like their "RANDOMIZE" script
broke :) )
<x-html>
<!-- saved from url=(0022)http://internet.e-mail -->
<HTML>
<BODY BGCOLOR=white><center><font color=#FFFFFF>[RANDOMIZE]-[RANDOMIZE]
<A HREF=http://www.onlinedns.org/x2/><font
color=#FFFFFF>[RANDOMIZE]-[RANDOMIZE]</font></a><Br>
<a href="http://www.onlinedns.org/x2/">
<img src="http://onlinedns.org/3.gif" border=0>
</a><font color=#FFFFFF>
[RANDOMIZE][RANDOMIZE][RANDOMIZE][RANDOMIZE]
</body></HTML>
</x-html>
The format of this message is almost identical to the randomized messages
above.
Barry
More information about the Bogofilter
mailing list