Parsing stuff like %2E in URLs

Matt Garretson mattg at assembly.state.ny.us
Mon Jan 10 20:44:03 CET 2005


Hello, i've noticed that bogofilter (including 0.93.4) is parsing
% escaped hex values in a way that is unexpected to me.  I'm not
saying it's wrong; just that i'd been expecting different results. :)

A very simplified message is attached as an example, along with the
bogolexer output i'm getting.  Basically "%2Estring" is being
tokenized as "Estring".  However, "%40string" becomes just "string",
as i'd expect.  (The difference appears to be whether the second
hex digit is alpha or numeric.)

Is all this as expected?

Thanks in advance,
-Matt
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mog.tar.gz
Type: application/x-gzip
Size: 496 bytes
Desc: not available
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20050110/4376b6f1/attachment.bin>


More information about the Bogofilter mailing list