Parsing stuff like %2E in URLs
Matt Garretson
mattg at assembly.state.ny.us
Mon Jan 10 20:44:03 CET 2005
Hello, I've noticed that bogofilter (including 0.93.4) is parsing
%-escaped hex values in a way that is unexpected to me. I'm not
saying it's wrong; just that I'd been expecting different results. :)
A very simplified message is attached as an example, along with the
bogolexer output I'm getting. Basically, "%2Estring" is being
tokenized as "Estring". However, "%40string" becomes just "string",
as I'd expect. (The difference appears to be whether the second
hex digit is alpha or numeric.)
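To make the two cases concrete, here is a minimal Python sketch. It is not bogofilter's lexer: `toy_tokens` and its rule (a token starts with a letter and continues with letters or digits) are hypothetical, chosen only because that rule happens to reproduce the two results quoted above. `urllib.parse.unquote` is the standard-library percent-decoder and shows what the escapes stand for.

```python
import re
from urllib.parse import unquote

# Hypothetical rule, NOT bogofilter's real lexer: a token must start
# with a letter and may continue with letters or digits. "%" and any
# leading digits are therefore skipped as delimiters.
TOKEN = re.compile(r"[A-Za-z][A-Za-z0-9]*")


def toy_tokens(text):
    """Return the tokens the hypothetical rule finds in text."""
    return TOKEN.findall(text)


if __name__ == "__main__":
    for raw in ("%2Estring", "%40string"):
        decoded = unquote(raw)  # "%2E" is ".", "%40" is "@"
        print(raw, "raw tokens:", toy_tokens(raw))
        print(raw, "decoded:", decoded, "tokens:", toy_tokens(decoded))
```

Under this assumed rule the raw "%2Estring" yields "Estring" because "E" can start a token, while the raw "%40string" yields "string" because "4" and "0" cannot. Decoding the escapes before tokenizing gives "string" in both cases, which is the result I had been expecting.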
Is all this as expected?
Thanks in advance,
-Matt
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mog.tar.gz
Type: application/x-gzip
Size: 496 bytes
Desc: not available
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20050110/4376b6f1/attachment.bin>