Bogofilter migration & tuneup
Robin Bowes
robin-lists at robinbowes.com
Mon Dec 5 18:40:23 CET 2005
Matthias Andree said the following on 05/12/2005 16:54:
> Robin Bowes <robin-lists at robinbowes.com> writes:
>>wordlist.db is 2.1GB and growing (from a 105MB wordlist.db).
>>
>>Does that sound like it's corrupted?
>
>
> Yes. .db and .txt files are roughly the same order of magnitude in
> size. A factor of 10 looks corrupt. You can use db_verify to see what it
> thinks of your wordlist.db (don't run any registration at the same time,
> else you'll see false positives). As db-recover hasn't helped you, and
> db-recover-harder is no longer workable since the log files have been
> removed, you'll probably have to use db_dump with "-r" (or perhaps "-R"
> if necessary, but that may print junk) option to see what you'll get and
> if it's reasonably complete for db_load and a subsequent "bogoutil -d"
> operation.
>
> The detailed salvaging instructions were not shipped with 0.93.5 but
> joined in with some 0.94 versions, please see sections 3.2 with all
> subsections of the newer README.db:
>
> <http://cvs.sourceforge.net/viewcvs.py/bogofilter/bogofilter/doc/README.db?rev=1.36>
>
> In section 3.2.4, use the new bogofilter 1.0.0 version for "bogoutil
> [whatever] -l".
OK, I installed db4-utils and ran the following commands:
bogoutil.0.93 -v --db-verify wordlist.db
bogoutil: database wordlist.db does not verify: DB_VERIFY_BAD: Database
verification failed
cd .bogofilter # empty dir
db_dump -r ../.bogofilter.bak/wordlist.db > wordlist.saved
# wordlist.saved is 98M
db_load wordlist.db < wordlist.saved
db_load: line 1: unexpected format
wordlist.saved looks like this:
[root@batmobile .bogofilter]# head wordlist.saved
2430
2f00000004010000b7f43101
24302e30
0c0000000000000078ca3101
24302e3030
180000001c00000050f43101
24302e303033
0000000001000000c1f03101
24302e303035
000000000100000039f43101
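The lines look like alternating hex-encoded keys and values, with none of the header lines that a normal db_dump writes before the data. As a sanity check on whether the salvaged records are usable, here is a minimal Python sketch that decodes a key/value pair. It rests on two assumptions that are not confirmed anywhere in this thread: that each value is three little-endian unsigned 32-bit integers, and that they are ordered as spam count, good count, date (YYYYMMDD). The function name decode_pair is made up for illustration.

```python
import struct

def decode_pair(key_hex, value_hex):
    """Decode one key/value pair from the raw hex dump.

    Assumption: the value holds three little-endian uint32 fields,
    taken here to be (spam count, good count, date as YYYYMMDD).
    """
    token = bytes.fromhex(key_hex.strip()).decode("latin-1")
    raw = bytes.fromhex(value_hex.strip())
    spam, good, date = struct.unpack("<3I", raw[:12])
    return token, spam, good, date

# The ten lines shown by "head wordlist.saved" above.
sample = [
    "2430", "2f00000004010000b7f43101",
    "24302e30", "0c0000000000000078ca3101",
    "24302e3030", "180000001c00000050f43101",
    "24302e303033", "0000000001000000c1f03101",
    "24302e303035", "000000000100000039f43101",
]

# Keys sit on even positions, values on odd positions.
for key_hex, value_hex in zip(sample[0::2], sample[1::2]):
    print(decode_pair(key_hex, value_hex))
```

Under those assumptions the first pair decodes to the token "$0" with counts 47 and 260 and the number 20051127, which reads as a plausible date, so the records themselves seem intact even though db_load rejects the file.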
Am I doing something wrong, or is my DB hosed?
Cheers,
R.