ignore text/plain part of multipart/alternative messages?
David Flanagan
david at davidflanagan.com
Wed Aug 13 01:43:33 CEST 2003
> Can you post the output of "bogofilter -vvv <spam.txt", too?
Matthias,
I'm not really interested in analyzing why this particular spam got
through bogofilter. I'm pretty certain that the reason is that it
included is that the random book excerpt happened to be about George
Bush the first, and I get a lot of legitimate mail about the current
George Bush.
I'm more interested in discussing a general technique for defeating this
type of spam: by ignoring the text/plain part of multipart/alternative
messages. I haven't heard and argument yet that convinces me that this
isn't the right thing to do. I could probably tweak my bogofilter
confuration to catch this variety of spam as is, but I just think that
the right solution is to discard text/plain in this case: it is either
the same as teh text/html part, or it is intended to confuse the filter,
so in either case we can and should discard it.
Here's bogofilter output:
bogofilter -v < /tmp/spam.txt
X-Bogosity: No, tests=bogofilter, spamicity=0.501542, version=0.13.7
If I cut out the text/plain version and just run bogofilter on the
text/html, I get this:
bogofilter -v < /tmp/spam2.txt
X-Bogosity: Yes, tests=bogofilter, spamicity=0.999950, version=0.13.7
Remarkable difference, isn't it?
The -vvv output on the full message, including the text/plain part is
below.
David
X-Bogosity: No, tests=bogofilter, spamicity=0.501542, version=0.13.7
n pgood pbad fw U
"Mail-from" 636 0.496865 0.000180 0.000369 +
"X-Coding-System" 636 0.496865 0.000180 0.000369 +
"undecided-unix" 510 0.396552 0.000360 0.000916 +
"credentials" 4 0.003135 0.000000 0.001035 +
"fate" 4 0.003135 0.000000 0.001035 +
"will!" 4 0.003135 0.000000 0.001035 +
"Baker" 1 0.000784 0.000000 0.004109 +
"Ronald" 1 0.000784 0.000000 0.004109 +
"aired" 1 0.000784 0.000000 0.004109 +
"evenings" 1 0.000784 0.000000 0.004109 +
"hollow" 1 0.000784 0.000000 0.004109 +
"straw" 1 0.000784 0.000000 0.004109 +
"resist" 13 0.009404 0.000090 0.009804 +
"to:Flanagan" 398 0.266458 0.005227 0.019250 +
"Flanagan" 328 0.211599 0.005227 0.024119 +
"Goodman" 5 0.003135 0.000090 0.028718 +
"Bellingham" 91 0.055643 0.001802 0.031419 +
"supporters" 8 0.004702 0.000180 0.037389 +
"reporter" 4 0.002351 0.000090 0.037860 +
"Bush" 93 0.052508 0.002343 0.042759 +
"Aug" 32 0.018025 0.000811 0.043177 +
"nobody" 74 0.041536 0.001893 0.043629 +
"Ted" 7 0.003918 0.000180 0.044505 +
"hoped" 10 0.005486 0.000270 0.047337 +
"inflation" 3 0.001567 0.000090 0.055570 +
"Bush's" 26 0.013323 0.000811 0.057524 +
"precisely" 14 0.007053 0.000451 0.060304 +
"attempted" 11 0.005486 0.000360 0.061981 +
"Together" 8 0.003918 0.000270 0.064982 +
"voters" 5 0.002351 0.000180 0.071892 +
"George" 59 0.027429 0.002163 0.073149 +
"showing" 34 0.015674 0.001262 0.074601 +
"From" 1113 0.499216 0.042898 0.079135 +
"to:David" 653 0.285266 0.026045 0.083669 +
"question" 106 0.046238 0.004236 0.083951 +
"group" 137 0.055643 0.005948 0.096598 +
"president" 33 0.013323 0.001442 0.097758 +
"examples" 57 0.022727 0.002523 0.099990 +
"admitted" 8 0.003135 0.000360 0.103525 +
"attempting" 8 0.003135 0.000360 0.103525 +
"emerged" 6 0.002351 0.000270 0.103655 +
"intellectual" 6 0.002351 0.000270 0.103655 +
"crowds" 4 0.001567 0.000180 0.103914 +
"imperative" 4 0.001567 0.000180 0.103914 +
"pioneers" 4 0.001567 0.000180 0.103914 +
"$1.3" 2 0.000784 0.000090 0.104688 +
"analogy" 2 0.000784 0.000090 0.104688 +
"spent" 45 0.017241 0.002073 0.107389 +
"political" 52 0.019592 0.002433 0.110534 +
"advance" 36 0.013323 0.001712 0.113972 +
"moving" 53 0.019592 0.002523 0.114157 +
"himself" 30 0.010972 0.001442 0.116258 +
"presidential" 11 0.003918 0.000541 0.121529 +
"David" 919 0.322100 0.045782 0.124451 +
"since" 220 0.076803 0.010995 0.125244 +
"Thanks" 369 0.128527 0.018475 0.125687 +
"Thing" 9 0.003135 0.000451 0.126001 +
"idea" 140 0.045455 0.007390 0.139865 +
"scene" 5 0.001567 0.000270 0.147652 +
"early" 73 0.022727 0.003965 0.148594 +
"President" 134 0.041536 0.007300 0.149498 +
"him" 123 0.036834 0.006849 0.156816 +
"James" 43 0.012539 0.002433 0.162577 +
"front" 165 0.047022 0.009463 0.167544 +
"unable" 25 0.007053 0.001442 0.169835 +
"views" 25 0.007053 0.001442 0.169835 +
"rather" 117 0.032915 0.006759 0.170387 +
"hitting" 14 0.003918 0.000811 0.171669 +
"expert" 31 0.008621 0.001802 0.173006 +
"were" 379 0.105016 0.022080 0.173734 +
"creating" 40 0.010972 0.002343 0.176041 +
"story" 78 0.020376 0.004686 0.187016 +
"ocean" 6 0.001567 0.000360 0.187366 +
"climbed" 3 0.000784 0.000180 0.187745 +
"asked" 98 0.025078 0.005948 0.191733 +
"workers" 28 0.007053 0.001712 0.195424 +
"indicated" 25 0.006270 0.001532 0.196466 +
"Reagan" 16 0.003918 0.000991 0.202043 +
"his" 261 0.063480 0.016222 0.203543 +
"strategy" 30 0.007053 0.001893 0.211626 +
"sessions" 10 0.002351 0.000631 0.211762 +
"media" 155 0.036050 0.009823 0.214153 +
"didn't" 146 0.033699 0.009283 0.215981 +
"cases" 48 0.010972 0.003064 0.218349 +
"operation" 31 0.007053 0.001983 0.219485 +
"fantastic" 25 0.005486 0.001622 0.228294 +
"respond" 80 0.017241 0.005227 0.232665 +
"indication" 11 0.002351 0.000721 0.234852 +
"lunch" 11 0.002351 0.000721 0.234852 +
"content-classes" 67 0.014107 0.004416 0.238438 +
"there" 689 0.144201 0.045512 0.239902 +
"knew" 45 0.009404 0.002974 0.240299 +
"prepared" 64 0.013323 0.004236 0.241262 +
"biggest" 34 0.007053 0.002253 0.242150 +
"had" 451 0.090909 0.030191 0.249310 +
"various" 105 0.021160 0.007030 0.249384 +
"policy" 103 0.020376 0.006939 0.254062 +
"starting" 80 0.015674 0.005407 0.256519 +
"invited" 24 0.004702 0.001622 0.256566 +
"factory" 8 0.001567 0.000541 0.256697 +
"inevitable" 8 0.001567 0.000541 0.256697 +
"But" 470 0.090909 0.031903 0.259776 +
"urn" 73 0.014107 0.004957 0.260036 +
"behind" 65 0.012539 0.004416 0.260475 +
"We're" 119 0.022727 0.008111 0.263031 +
"could" 619 0.115987 0.042448 0.267921 +
"campaign" 134 0.025078 0.009193 0.268242 +
"sound" 63 0.011755 0.004326 0.269023 +
"dependent" 17 0.003135 0.001172 0.272143 +
"Thread-Index" 64 0.011755 0.004416 0.273096 +
"city" 96 0.017241 0.006669 0.278933 +
"television" 31 0.005486 0.002163 0.282823 +
"another" 336 0.058777 0.023522 0.285814 +
"Robert" 64 0.010972 0.004506 0.291152 +
"series" 55 0.009404 0.003875 0.291842 +
"met" 42 0.007053 0.002974 0.296622 +
"anything" 285 0.047806 0.020187 0.296909 +
"American" 207 0.034483 0.014690 0.298748 +
"plain" 4025 0.665361 0.286229 0.300791 +
"showed" 43 0.007053 0.003064 0.302885 +
"going" 438 0.071317 0.031273 0.304835 +
"widely" 58 0.009404 0.004146 0.305969 +
"X-MimeOLE" 1185 0.188871 0.085076 0.310556 +
"either" 249 0.039185 0.017934 0.313985 +
"government" 170 0.026646 0.012257 0.315068 +
"produced" 30 0.004702 0.002163 0.315095 +
"Carter" 10 0.001567 0.000721 0.315162 +
"what" 955 0.148119 0.069034 0.317905 +
"unknown" 434 0.066614 0.031453 0.320729 +
"Produced" 1299 0.199060 0.094178 0.321167 +
"also" 946 0.144201 0.068673 0.322602 +
"closing" 26 0.003918 0.001893 0.325719 +
"three" 203 0.030564 0.014780 0.325957 +
"MimeOLE" 1259 0.188871 0.091745 0.326941 +
"why" 333 0.048589 0.024423 0.334509 +
"What" 385 0.055643 0.028298 0.337125 +
"called" 214 0.030564 0.015771 0.340377 +
"before" 632 0.087774 0.046864 0.348073 +
"upon" 130 0.018025 0.009643 0.348532 +
"They" 300 0.041536 0.022260 0.348929 +
"during" 334 0.046238 0.024784 0.348960 +
"tell" 340 0.047022 0.025234 0.349236 +
"won't" 320 0.043103 0.023882 0.356532 +
"small" 257 0.034483 0.019196 0.357612 +
"described" 112 0.014890 0.008381 0.360159 +
"charset" 4546 0.603448 0.340303 0.360585 +
"answer" 154 0.020376 0.011536 0.361489 +
"national" 131 0.017241 0.009823 0.362962 +
"they" 956 0.125392 0.071738 0.363912 +
"fact" 222 0.028997 0.016673 0.365074 +
"Point" 6 0.000784 0.000451 0.365155 +
"candidate" 6 0.000784 0.000451 0.365155 +
"areas" 130 0.016458 0.009823 0.373784 +
"getting" 322 0.040752 0.024333 0.373865 +
"way" 865 0.108934 0.065429 0.375246 +
"used" 431 0.054075 0.032624 0.376293 +
"plane" 25 0.003135 0.001893 0.376469 +
"York" 132 0.016458 0.010004 0.378050 +
"little" 359 0.044671 0.027217 0.378605 +
"designed" 134 0.016458 0.010184 0.382257 +
"variety" 65 0.007837 0.004957 0.387439 +
"week" 344 0.040752 0.026316 0.392375 +
"that" 3514 0.398119 0.270908 0.404929 -
"come" 374 0.042320 0.028839 0.405279 -
"qualifier" 7 0.000784 0.000541 0.408286 -
"people" 832 0.092476 0.064348 0.410317 -
"ideas" 129 0.014107 0.010004 0.414912 -
"AcNZ0N83BfXqElOvTaSKyw" 0 0.000000 0.000000 0.415000 -
"Bush-Baker" 0 0.000000 0.000000 0.415000 -
"Carter's" 0 0.000000 0.000000 0.415000 -
"Chichi" 0 0.000000 0.000000 0.415000 -
"Damia" 0 0.000000 0.000000 0.415000 -
"Jima" 0 0.000000 0.000000 0.415000 -
"KG-!!Pe" 0 0.000000 0.000000 0.415000 -
"Kennebunkport" 0 0.000000 0.000000 0.415000 -
"Lilah" 0 0.000000 0.000000 0.415000 -
"Mosbacher's" 0 0.000000 0.000000 0.415000 -
"Reagan's" 0 0.000000 0.000000 0.415000 -
"UMM!!W" 0 0.000000 0.000000 0.415000 -
"Walker's" 0 0.000000 0.000000 0.415000 -
"accompaniment" 0 0.000000 0.000000 0.415000 -
"announcer" 0 0.000000 0.000000 0.415000 -
"applause" 0 0.000000 0.000000 0.415000 -
"aridity" 0 0.000000 0.000000 0.415000 -
"barbecues" 0 0.000000 0.000000 0.415000 -
"blurted" 0 0.000000 0.000000 0.415000 -
"caucuses" 0 0.000000 0.000000 0.415000 -
"cultivation" 0 0.000000 0.000000 0.415000 -
"cushion" 0 0.000000 0.000000 0.415000 -
"demagogic" 0 0.000000 0.000000 0.415000 -
"discouraging" 0 0.000000 0.000000 0.415000 -
"dyanimc" 0 0.000000 0.000000 0.415000 -
"elitist" 0 0.000000 0.000000 0.415000 -
"eneregtic" 0 0.000000 0.000000 0.415000 -
"fast-moving" 0 0.000000 0.000000 0.415000 -
"fished" 0 0.000000 0.000000 0.415000 -
"from:Damia" 0 0.000000 0.000000 0.415000 -
"from:Lilah" 0 0.000000 0.000000 0.415000 -
"from:stevegkswl" 0 0.000000 0.000000 0.415000 -
"grappled" 0 0.000000 0.000000 0.415000 -
"greasing" 0 0.000000 0.000000 0.415000 -
"inexperience" 0 0.000000 0.000000 0.415000 -
"ingratiate" 0 0.000000 0.000000 0.415000 -
"intoned" 0 0.000000 0.000000 0.415000 -
"kMhA" 0 0.000000 0.000000 0.415000 -
"newsfilmlike" 0 0.000000 0.000000 0.415000 -
"overheard" 0 0.000000 0.000000 0.415000 -
"placards" 0 0.000000 0.000000 0.415000 -
"portraits" 0 0.000000 0.000000 0.415000 -
"ridiculed" 0 0.000000 0.000000 0.415000 -
"shootdown" 0 0.000000 0.000000 0.415000 -
"skids" 0 0.000000 0.000000 0.415000 -
"slogan" 0 0.000000 0.000000 0.415000 -
"television-based" 0 0.000000 0.000000 0.415000 -
"train.'" 0 0.000000 0.000000 0.415000 -
"wretched" 0 0.000000 0.000000 0.415000 -
"yelled" 0 0.000000 0.000000 0.415000 -
"about" 1943 0.211599 0.150775 0.416076 -
"talking" 116 0.012539 0.009012 0.418174 -
"Payment" 51 0.005486 0.003965 0.419560 -
"Hampshire" 22 0.002351 0.001712 0.421397 -
"who" 1087 0.115204 0.084715 0.423748 -
"was" 1642 0.173981 0.127974 0.423818 -
"call" 549 0.057994 0.042808 0.424676 -
"when" 1021 0.107367 0.079668 0.425954 -
"weeks" 261 0.027429 0.020368 0.426127 -
"Kennedy" 15 0.001567 0.001172 0.427738 -
"just" 1658 0.171630 0.129686 0.430399 -
"Group" 129 0.013323 0.010094 0.431049 -
"reason" 373 0.038401 0.029200 0.431942 -
"much" 889 0.090909 0.069665 0.433848 -
"index" 100 0.010188 0.007841 0.434896 -
"these" 1212 0.123041 0.095079 0.435903 -
"V6.00.2800.1106" 641 0.065047 0.050288 0.436018 -
"The" 3601 0.364420 0.282624 0.436793 -
"days" 613 0.061129 0.048216 0.440952 -
"oreilly.com" 4108 0.408307 0.323270 0.441881 -
"american" 8 0.000784 0.000631 0.445937 -
"virtually" 65 0.006270 0.005137 0.450348 -
"not" 3464 0.333072 0.273882 0.451240 -
"been" 1567 0.150470 0.123919 0.451616 -
"Each" 98 0.009404 0.007751 0.451793 -
"property" 139 0.013323 0.010995 0.452133 -
"have" 3286 0.307210 0.260815 0.459161 -
"music" 42 0.003918 0.003335 0.459733 -
"has" 1916 0.176332 0.152397 0.463594 -
"summer" 43 0.003918 0.003425 0.466362 -
"would" 2019 0.181034 0.161139 0.470928 -
"factors" 35 0.003135 0.002794 0.471225 -
"can" 2905 0.257053 0.232246 0.474650 -
"give" 642 0.056426 0.051370 0.476545 -
"Jimmy" 9 0.000784 0.000721 0.479088 -
"house" 162 0.014107 0.012978 0.479155 -
"Content-Class" 55 0.004702 0.004416 0.484294 -
"Thank" 708 0.060345 0.056867 0.485165 -
"their" 1264 0.107367 0.101568 0.486123 -
"time" 1714 0.144984 0.137797 0.487292 -
"almost" 301 0.025078 0.024243 0.491529 -
"with" 12026 0.995298 0.969358 0.493398 -
"from" 11997 0.991379 0.967195 0.493826 -
"state" 304 0.025078 0.024513 0.494301 -
"for" 12097 0.992163 0.976118 0.495924 -
"history" 182 0.014890 0.014690 0.496610 -
"like" 2695 0.220219 0.217556 0.496958 -
"avoid" 125 0.010188 0.010094 0.497667 -
"sent" 1137 0.091693 0.091925 0.500632 -
"all" 2612 0.210031 0.211247 0.501443 -
"david" 12352 0.984326 1.000000 0.503949 -
"Sun" 1800 0.143417 0.145728 0.503996 -
"out" 2076 0.164577 0.168169 0.505397 -
"Maximum" 20 0.001567 0.001622 0.508545 -
"Windows-1252" 220 0.017241 0.017844 0.508587 -
"always" 427 0.032915 0.034697 0.513174 -
"Who" 72 0.005486 0.005858 0.516386 -
"iso-8859-1" 2392 0.181034 0.194755 0.518255 -
"one" 2561 0.193574 0.208544 0.518614 -
"this" 5478 0.405172 0.447098 0.524596 -
"any" 2129 0.154389 0.174117 0.530026 -
"combo" 11 0.000784 0.000901 0.534767 -
"private" 210 0.014890 0.017213 0.536176 -
"are" 4049 0.286050 0.332012 0.537182 -
"most" 1176 0.082288 0.096521 0.539798 -
"valued" 45 0.003135 0.003695 0.540985 -
"want" 1755 0.121473 0.144196 0.542764 -
"brought" 57 0.003918 0.004686 0.544596 -
"and" 7003 0.459248 0.578317 0.557379 -
"Content-Type" 10736 0.702194 0.886806 0.558090 -
"experts" 60 0.003918 0.004957 0.558467 -
"while" 712 0.046238 0.058850 0.560004 -
"approval" 99 0.006270 0.008201 0.566725 -
"the" 7492 0.470219 0.621125 0.569137 -
"extraordinary" 25 0.001567 0.002073 0.569360 -
"First" 450 0.028213 0.037311 0.569418 -
"wanted" 390 0.024295 0.032354 0.571130 -
"through" 1281 0.078370 0.106435 0.575930 -
"Many" 398 0.024295 0.033075 0.576520 -
"man" 257 0.015674 0.021359 0.576750 -
"to:david" 8391 0.500784 0.698630 0.582476 -
"business" 613 0.036050 0.051099 0.586339 -
"more" 2813 0.165361 0.234499 0.586453 -
"Box" 202 0.011755 0.016853 0.589081 -
"One" 642 0.036834 0.053623 0.592799 -
"benefit" 96 0.005486 0.008021 0.593824 -
"midnight" 55 0.003135 0.004596 0.594486 -
"family" 317 0.018025 0.026496 0.595128 -
"text" 11820 0.670063 0.988194 0.595923 -
"costs" 180 0.010188 0.015050 0.596318 -
"Content-type" 1564 0.087774 0.130858 0.598529 -
"experience" 517 0.028997 0.043259 0.598687 -
"residence" 14 0.000784 0.001172 0.599059 -
"name" 1038 0.057994 0.086878 0.599688 -
"you" 7189 0.401254 0.601748 0.599947 -
"list" 2234 0.124608 0.187004 0.600117 +
"subject" 859 0.047806 0.071918 0.600697 +
"primary" 99 0.005486 0.008291 0.601794 +
"period" 143 0.007837 0.011986 0.604644 +
"over" 1542 0.083072 0.129416 0.609049 +
"never" 742 0.039969 0.062275 0.609080 +
"above" 452 0.024295 0.037942 0.609634 +
"your" 6234 0.333072 0.523522 0.611167 +
"true" 356 0.018809 0.029921 0.614011 +
"off" 970 0.050940 0.081561 0.615546 +
"During" 76 0.003918 0.006399 0.620171 +
"telling" 168 0.008621 0.014149 0.621388 +
"Quick" 124 0.006270 0.010454 0.625093 +
"$2" 125 0.006270 0.010544 0.627102 +
"million" 308 0.014890 0.026045 0.636245 +
"Microsoft" 4616 0.222571 0.390411 0.636904 +
"pocket" 49 0.002351 0.004146 0.638065 +
"alone" 214 0.010188 0.018115 0.640021 +
"For" 1626 0.076019 0.137797 0.644465 +
"email" 3470 0.162226 0.294070 0.644472 +
"developed" 221 0.010188 0.018745 0.647870 +
"six" 105 0.004702 0.008922 0.654845 +
"delete" 337 0.014107 0.028749 0.670828 +
"success" 211 0.008621 0.018025 0.676451 +
"You" 3625 0.142633 0.310292 0.685084 +
"to:oreilly.com" 7946 0.311129 0.680335 0.686192 +
"address" 1417 0.054075 0.121485 0.691983 +
"item" 124 0.004702 0.010634 0.693379 +
"Times" 502 0.018809 0.043079 0.696075 +
"This" 7075 0.264890 0.607156 0.696242 +
"Content-Transfer-Encoding" 8196 0.304859 0.703587 0.697694 +
"become" 382 0.014107 0.032805 0.699284 +
"http" 10945 0.397335 0.940699 0.703045 +
"please" 2968 0.107367 0.255137 0.703818 +
"MIME-Version" 8639 0.308777 0.743061 0.706440 +
"our" 4321 0.154389 0.371665 0.706515 +
"format" 5464 0.193574 0.470169 0.708360 +
"payment" 246 0.008621 0.021179 0.710698 +
"track" 489 0.016458 0.042177 0.719314 +
"Now" 1217 0.040752 0.104993 0.720384 +
"monthly" 223 0.007053 0.019286 0.732202 +
"excitement" 25 0.000784 0.002163 0.733909 +
"$100" 225 0.007053 0.019466 0.734022 +
"message" 6202 0.194357 0.536590 0.734102 +
"host" 404 0.012539 0.034968 0.736047 +
"performance" 450 0.013323 0.039023 0.745477 +
"multipart" 6770 0.196708 0.587509 0.749165 +
"Amount" 54 0.001567 0.004686 0.749306 +
"index.html" 600 0.017241 0.052091 0.751317 +
"content" 578 0.016458 0.050198 0.753089 +
"New" 1795 0.050940 0.155912 0.753734 +
"subj:Now" 114 0.003135 0.009913 0.759724 +
"style" 961 0.025862 0.083634 0.763804 +
"subj:David" 355 0.009404 0.030912 0.766726 +
"none" 446 0.011755 0.038843 0.767662 +
"Promotion" 30 0.000784 0.002614 0.769196 +
"Fixed" 33 0.000784 0.002884 0.786207 +
"alternative" 6372 0.151254 0.556867 0.786401 +
"emails" 640 0.014890 0.055966 0.789847 +
"here" 5305 0.123041 0.463951 0.790387 +
"III" 34 0.000784 0.002974 0.791334 +
"$1" 207 0.004702 0.018115 0.793897 +
"$500" 139 0.003135 0.012167 0.795102 +
"money" 1185 0.026646 0.103731 0.795622 +
"quoted-printable" 5453 0.117555 0.477920 0.802586 +
"Here" 2411 0.051724 0.211337 0.803375 +
"$200" 78 0.001567 0.006849 0.813724 +
"capitalize" 41 0.000784 0.003605 0.821325 +
"offer" 973 0.018025 0.085616 0.826078 +
"proven" 224 0.003918 0.019737 0.834332 +
"Illinois" 45 0.000784 0.003965 0.834886 +
"Easy" 232 0.003918 0.020458 0.839232 +
"Quote" 100 0.001567 0.008832 0.849237 +
"click" 3848 0.055643 0.340393 0.859500 +
"nbsp" 4081 0.057994 0.361121 0.861627 +
"rates" 357 0.004702 0.031633 0.870576 +
"receive" 3163 0.041536 0.280281 0.870931 +
"MIME" 5330 0.068182 0.472513 0.873899 +
"decline" 66 0.000784 0.005858 0.881932 +
"blank" 1196 0.013323 0.106255 0.888580 +
"target" 1056 0.011755 0.093818 0.888646 +
"html" 9772 0.107367 0.868331 0.889958 +
"face" 6184 0.061912 0.550198 0.898854 +
"size" 7042 0.069749 0.626622 0.899838 +
"Interest" 321 0.003135 0.028569 0.901107 +
"Arial" 4848 0.047022 0.431507 0.901735 +
"multi-part" 5128 0.047022 0.456741 0.906658 +
"approved" 430 0.003918 0.038302 0.907178 +
"Verdana" 2157 0.018809 0.192231 0.910874 +
"Monthly" 93 0.000784 0.008291 0.913588 +
"index.php" 637 0.003918 0.056957 0.935623 +
"href" 9050 0.054075 0.809391 0.937374 +
"Click" 3420 0.020376 0.305876 0.937543 +
"sans-serif" 3094 0.017241 0.276857 0.941374 +
"subscriber" 285 0.001567 0.025505 0.942084 +
"Helvetica" 3420 0.015674 0.306417 0.951335 +
"mortgage" 355 0.000784 0.031903 0.976008 +
"Baltimore" 1 0.000000 0.000090 0.994208 +
"academia" 1 0.000000 0.000090 0.994208 +
"ape" 1 0.000000 0.000090 0.994208 +
"approving" 1 0.000000 0.000090 0.994208 +
"assiduous" 1 0.000000 0.000090 0.994208 +
"atrophy" 1 0.000000 0.000090 0.994208 +
"blab" 1 0.000000 0.000090 0.994208 +
"conceded" 1 0.000000 0.000090 0.994208 +
"dominant" 1 0.000000 0.000090 0.994208 +
"footage" 1 0.000000 0.000090 0.994208 +
"on-the-job" 1 0.000000 0.000090 0.994208 +
"subj:Up....Refinance" 1 0.000000 0.000090 0.994208 +
"surrounded" 1 0.000000 0.000090 0.994208 +
"X-UIDL" 8201 0.003918 0.738645 0.994722 +
"buys" 2 0.000000 0.000180 0.997090 +
"cocktails" 2 0.000000 0.000180 0.997090 +
"identifiable" 2 0.000000 0.000180 0.997090 +
"shorts" 2 0.000000 0.000180 0.997090 +
"$544.79" 3 0.000000 0.000270 0.998056 +
"studied" 3 0.000000 0.000270 0.998056 +
"upbeat" 3 0.000000 0.000270 0.998056 +
"american_newsletter.html" 5 0.000000 0.000451 0.998832 +
"readily" 5 0.000000 0.000451 0.998832 +
"subj:Going" 5 0.000000 0.000451 0.998832 +
"Pre-Qualified" 6 0.000000 0.000541 0.999027 +
"american.certrewards.com" 6 0.000000 0.000541 0.999027 +
"musical" 6 0.000000 0.000541 0.999027 +
"seminars" 6 0.000000 0.000541 0.999027 +
"cashout" 7 0.000000 0.000631 0.999165 +
"hats" 7 0.000000 0.000631 0.999165 +
"airport" 10 0.000000 0.000901 0.999416 +
"Slow" 11 0.000000 0.000991 0.999469 +
"drink" 13 0.000000 0.001172 0.999550 +
"Iowa" 14 0.000000 0.001262 0.999582 +
"Vision" 23 0.000000 0.002073 0.999746 +
"rndm" 34 0.000000 0.003064 0.999828 +
"advertisements" 36 0.000000 0.003244 0.999838 +
"www.certrewards.com" 38 0.000000 0.003425 0.999846 +
"site_id" 41 0.000000 0.003695 0.999857 +
"skip_project_changes" 44 0.000000 0.003965 0.999867 +
"projectID" 47 0.000000 0.004236 0.999876 +
"tracking_id" 49 0.000000 0.004416 0.999881 +
"fname" 53 0.000000 0.004776 0.999890 +
"lname" 53 0.000000 0.004776 0.999890 +
"mime_used" 53 0.000000 0.004776 0.999890 +
"mortgageopt" 53 0.000000 0.004776 0.999890 +
"zip_code" 53 0.000000 0.004776 0.999890 +
"Minute" 54 0.000000 0.004867 0.999892 +
"list_id" 62 0.000000 0.005588 0.999906 +
"lender" 77 0.000000 0.006939 0.999924 +
"subj:Are" 111 0.000000 0.010004 0.999947 +
"Refinance" 150 0.000000 0.013518 0.999961 +
"subj:Rates" 154 0.000000 0.013879 0.999962 +
N_P_Q_S_s_x_md 312 0.00e+00 3.08e-03 5.02e-01
1.00e-02 4.15e-01 0.100
More information about the Bogofilter
mailing list