What to do for HTML comment processing ???

David Relson relson at osagesoftware.com
Fri Mar 7 04:31:27 CET 2003


At 10:13 PM 3/6/03, Suzanne Skinner wrote:

>On Thu, Mar 06, 2003 at 06:58:01PM -0500, David Relson wrote:
>
> > Mostly, I an interested in what people want bogofilter to do with tokens
> > inside of html tags and comments.  Obvious choices include the following:
> >
> > 1a - discard all tokens inside html tags
> > 1b - discard all tokens inside html comments
> > 2a - score all tokens inside html tags
> > 2b - score all tokens inside html comments
>
>[snip]
>
>My thought would be to do 2b, but add context by prefixing all such tokens
>with "comment:".
>
>Suzanne

Suzanne,

Just to make sure I understand, you want 2b, but not 2a?  2a is where 
href's and urls would live.  2b is the home for javascript, style sheets 
and, also, totally random stuff.

David






More information about the Bogofilter mailing list