attachments and binary data
David Relson
relson at osagesoftware.com
Fri Nov 26 16:10:46 CET 2004
On Fri, 26 Nov 2004 17:42:06 +0300
Evgeny Kotsuba wrote:
...[snip]...
> Seems that now there is no such problems.
>
> >>I am looking in bogofilter's src now after almost 1 year, with
> >about >of 1Gb in size, 90K messages. I upgraded my port to
> >0.17.5 >version and do some experiments. I have many non-spam
> >messages in my >collection with attachments - mainly large .doc
> >and .pdf files. With >those attachments word_db grows and slows at
> >training.
> >
> >bogofilter doesn't look at non-text attachments, only at the MIME
> >headers of those attachments.
> >
> >
> Seems that still there are some problems with attachments made by
> Microsoft Outlook Express
> ====== clinical case 1 =========
> Message-ID: <006a01c401f8$1d7d88e0$6b02a8c0 at blabla.ru>
> From: "AAA" <aaa at blabla.ru>
> To: <bbb at blabla.msk.su>
> Subject: some text
> Date: Thu, 4 Mar 2004 17:51:02 +0300
> X-Priority: 3
> X-MSMail-Priority: Normal
> X-Mailer: Microsoft Outlook Express 6.00.2600.0000
> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
>
> some text in koi8-r coding
>
>
> begin 666 LK2540-7R.pdf
> M)5!$1BTQ+C0-)>+CS],-"C$@,"!O8FH-/#P@#2]4>7!E("]086=E( TO4&%R
> M96YT(#$U(# @4B -+U)E<V]U<F-E<R R(# @4B -+T-O;G1E;G1S(#,@,"!2
> M( TO365D:6%";W@@6R P(# @-#<S(#8V.2!=( TO0W)O<$)O>"!;(# @," T
> ================= endof clinical case 1 =========================
"begin 666" is the start of a uuencode file. Bogofilter understands
mime attachments identified as "X-UUENCODE", but not inlined uuencoded
attachments like this.
> ====== clinical case 2 =========
> Message-ID: <009701c3e144$0ec14a00$0f02000a at blabla.ru>
> From: "AK" <ak at blabla.ru>
> To: "VS" <vs at bla.bla.ru>
> Subject: =?koi8-r?B?89TB1NjRIMT==?=
> MIME-Version: 1.0
> Date: Fri, 23 Jan 2004 02:44:55 +0300
> Content-Type: message/partial;
> total=5;
> id="01C3E141.BD83EF50 at AOWINXPMBL2";
> number=3
> X-Priority: 3
> X-MSMail-Priority: Normal
> X-Mailer: Microsoft Outlook Express 6.00.2800.1106
> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1106
>
> OloO7nXAjYZe1FVeE4saqjGe9SoVWgrqHi7qwIpNB268PaO7ILk9oamzdYkuNwoUS9Asf
> VFoNBvg
> Eq2Uq8g53WyKoqgbVtBJhguLg8UlM0kqmtJyrLQfkIKqSahKW9eAGJU0L5KpVq4R+H/+q
> Z+8SFh4
> NOPG2fCN5MY3YJJX1Od5RZ716TkrXvFZP8yKhz969PC1Xapnw5vl1zvv+5NmamTJ15/60
> tlv/eWn================= endof clinical case 2
> =========================
This one doesn't even have "begin 666" to identify it, which makes it
impossible to deal with.
Regards,
David
More information about the bogofilter-dev
mailing list