Bogofilter migration & tuneup

Robin Bowes robin-lists at robinbowes.com
Mon Dec 5 18:40:23 CET 2005


Matthias Andree said the following on 05/12/2005 16:54:
> Robin Bowes <robin-lists at robinbowes.com> writes:
>>wordlist.db is 2.1GB and growing (from a 105MB wordlist.db).
>>
>>Does that sound like it's corrupted?
> 
> 
> Yes. .db and .txt files are roughly the same order of magnitude in
> size. A factor of 10 looks corrupt. You can use db_verify to see what it
> thinks of your wordlist.db (don't run any registration at the same time,
> else you'll see false positives). As db-recover hasn't helped you, and
> db-recover-harder is no longer workable since the log files have been
> removed, you'll probably have to use db_dump with "-r" (or perhaps "-R"
> if necessary, but that may print junk) option to see what you'll get and
> if it's reasonably complete for db_load and a subsequent "bogoutil -d"
> operation.
> 
> The detailed salvaging instructions were not shipped with 0.93.5 but
> joined in with some 0.94 versions, please see sections 3.2 with all
> subsections of the newer README.db:
> 
> <http://cvs.sourceforge.net/viewcvs.py/bogofilter/bogofilter/doc/README.db?rev=1.36>
> 
> In section 3.2.4, use the new bogofilter 1.0.0 version for "bogoutil
> [whatever] -l".

OK, I installed db4-utils and ran the following commands:

bogoutil.0.93 -v --db-verify wordlist.db
bogoutil: database wordlist.db does not verify: DB_VERIFY_BAD: Database verification failed


cd .bogofilter # empty dir
db_dump -r ../.bogofilter.bak/wordlist.db > wordlist.saved
# wordlist.saved is 98M
db_load wordlist.db < wordlist.saved
db_load: line 1: unexpected format
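
The "line 1: unexpected format" complaint is about the very first line of the input. Dumps that db_load accepts normally begin with a header block (VERSION=..., format=..., type=..., closed by HEADER=END) ahead of the key/value lines. A minimal sketch for checking whether a dump file has that header; the function name has_dump_header is hypothetical and not part of bogofilter or Berkeley DB:

```shell
# Hypothetical helper: succeeds if the file's first line looks like the
# start of a db_dump header (a line beginning with "VERSION=").
has_dump_header() {
    head -n 1 "$1" | grep -q '^VERSION='
}

# Report what was found for a given dump file.
check_dump() {
    if has_dump_header "$1"; then
        echo "$1: header present"
    else
        echo "$1: no header on line 1"
    fi
}
```

Running "check_dump wordlist.saved" would show whether the salvage output starts with a header or goes straight into data lines.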

wordlist.saved looks like this:

[root@batmobile .bogofilter]# head wordlist.saved
 2430
 2f00000004010000b7f43101
 24302e30
 0c0000000000000078ca3101
 24302e3030
 180000001c00000050f43101
 24302e303033
 0000000001000000c1f03101
 24302e303035
 000000000100000039f43101

Am I doing something wrong, or is my DB hosed?

Cheers,

R.



