obscured URL not being tokenized
Tom Anderson
tanderso at oac-design.com
Sun Dec 21 02:14:29 CET 2003
On Sat, 2003-12-20 at 16:43, David Relson wrote:
> based on them. Unfortunately, this isn't adequate. Directives are
> nested, for example a table contains table data which can include font
> directives. Proper processing of all this requires a stack for saving
> the previous state and popping the stack as end tags are encountered.
> It all gets more complicated since the html may be improperly formed, as
> in <table><tr><td><font>...</table>, where the end directive pops
> several stack levels.
I don't think bogofilter needs to be interpreting html. Simply
recognizing the tokens such as 'size=-5' and 'color="#fffffe"' ought to
be enough. If there were to be such functionality as html
interpretation and image recognition attempted, it should be in a
preprocessor seperate from bogofilter which perhaps sets its own
"x-bogosity"-type line. This way it could be turned on or off easily.
Let's not bloat the core of bogofilter with such stuff.
Tom
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20031220/3699694f/attachment.sig>
More information about the Bogofilter
mailing list