[bogofilter - Open Discussion] RE: Sneaky 'invisible' obscuration text

Dan Singletary dvsing at sonicspike.net
Sat Nov 15 00:41:07 CET 2003


Well, when I meant translating the colors, I meant translating "white"
or "red" to their hex equivalents.  The color names have defined color
values that they are associated with-- I think this is defined in the
HTML standards documentation.

a "font: color" type of tag would somewhat affect the classification,
but it still doesn't correct the fact that the majority of the tokens
from this email are contained within the white font area and therefor
not seen by the reader... just put in to throw off the spam filter.
Would bogofilter (at some point) be able to process the html in some way
as to recognize wether certain blocks of text would be visible or not
(either displayed with a micro small font, or with a forground color
equal to that of the background color).

-Dan

SourceForge.net wrote:

> Read and respond to this message at: 
> https://sourceforge.net/forum/message.php?msg_id=2287454
> By: relson
> 
> Dan,
> 
> Bogofilter  is parsing <font ...> and using the resultant tokens in its scoring.
> That helps, a bit.
> 
> A "font:" prefix could be added to such tokens to make them stand out.  
> 
> Identifying colors is much harder.  Saying #FFFFFF is white is easy, but what
> about #FFFFFE , #FFFFFD, and #FFFFF0 ???  How does one define equal?
> 
> If you want to discuss this more, please post to bogofilter at aotto.com
> 
> David
> 
> 
> ______________________________________________________________________
> You are receiving this email because you elected to monitor this forum.
> To stop monitoring this forum, login to SourceForge.net and visit: 
> https://sourceforge.net/forum/unmonitor.php?forum_id=209925







More information about the Bogofilter mailing list