I think it would also be a good to mention that one way to build up the spam database is to setup the MDA (procmail,maildrop) to automatically divert all email to one of those system accounts that noone really uses (like gopher) to bogofilter -s