HTML entities

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Wed Apr 2 09:34:38 CEST 2003


David Relson <relson at osagesoftware.com> wrote:

>The code to scan a line for "&[0-9]*;" and convert to characters isn't 
>difficult.  

Not difficult, but we need a large table. This brings us
back to Unicode support (there indeed it would be trivial).
And there are the named entities which make another table.
In any case (w/o Unicode) something has to be done with
those characters not in the used charset (ISO-8859-1?).

pi




More information about the Bogofilter mailing list