attachments and binary data

David Relson relson at osagesoftware.com
Fri Nov 26 16:10:46 CET 2004


On Fri, 26 Nov 2004 17:42:06 +0300
Evgeny Kotsuba wrote:

...[snip]...

> Seems that now  there is no such problems.
> 
> >>I am looking in bogofilter's src  now after almost 1 year,  with
> >about >of 1Gb in size, 90K  messages.   I   upgraded my port to 
> >0.17.5  >version  and  do some experiments.  I have many  non-spam
> >messages in my >collection  with  attachments  - mainly large  .doc
> >and .pdf files. With >those  attachments word_db  grows  and slows at
> >training.
> >
> >bogofilter doesn't look at non-text attachments, only at the MIME
> >headers of those attachments.
> >  
> >
> Seems that still there are some problems with attachments made by 
> Microsoft Outlook Express
> ======  clinical case 1 =========
> Message-ID: <006a01c401f8$1d7d88e0$6b02a8c0 at blabla.ru>
> From: "AAA" <aaa at blabla.ru>
> To: <bbb at blabla.msk.su>
> Subject: some text
> Date: Thu, 4 Mar 2004 17:51:02 +0300
> X-Priority: 3
> X-MSMail-Priority: Normal
> X-Mailer: Microsoft Outlook Express 6.00.2600.0000
> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
> 
> some text in koi8-r coding
> 
> 
> begin 666 LK2540-7R.pdf
> M)5!$1BTQ+C0-)>+CS],-"C$@,"!O8FH-/#P@#2]4>7!E("]086=E( TO4&%R
> M96YT(#$U(# @4B -+U)E<V]U<F-E<R R(# @4B -+T-O;G1E;G1S(#,@,"!2
> M( TO365D:6%";W@@6R P(# @-#<S(#8V.2!=( TO0W)O<$)O>"!;(# @," T
> ================= endof clinical case 1 =========================

"begin 666" is the start of a uuencode file.  Bogofilter understands
mime attachments identified as "X-UUENCODE", but not inlined uuencoded
attachments like this.

> ======  clinical case 2 =========
> Message-ID: <009701c3e144$0ec14a00$0f02000a at blabla.ru>
> From: "AK" <ak at blabla.ru>
> To: "VS" <vs at bla.bla.ru>
> Subject: =?koi8-r?B?89TB1NjRIMT==?=
> MIME-Version: 1.0
> Date: Fri, 23 Jan 2004 02:44:55 +0300
> Content-Type: message/partial;
>     total=5;
>     id="01C3E141.BD83EF50 at AOWINXPMBL2";
>     number=3
> X-Priority: 3
> X-MSMail-Priority: Normal
> X-Mailer: Microsoft Outlook Express 6.00.2800.1106
> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1106
> 
> OloO7nXAjYZe1FVeE4saqjGe9SoVWgrqHi7qwIpNB268PaO7ILk9oamzdYkuNwoUS9Asf
> VFoNBvg
> Eq2Uq8g53WyKoqgbVtBJhguLg8UlM0kqmtJyrLQfkIKqSahKW9eAGJU0L5KpVq4R+H/+q
> Z+8SFh4
> NOPG2fCN5MY3YJJX1Od5RZ716TkrXvFZP8yKhz969PC1Xapnw5vl1zvv+5NmamTJ15/60
> tlv/eWn================= endof clinical case 2
> =========================

This one doesn't even have "begin 666" to identify it, which makes it
impossible to deal with.

Regards,

David



More information about the bogofilter-dev mailing list