matthias.andree at gmx.de
Mon Jan 17 19:09:12 EST 2005
David Relson <relson at osagesoftware.com> writes:
> Inlining is fine. Reflecting on your earlier messages, I realize you're
> right that a "compute" function shouldn't be responsible for calling
> "lookup". Likely there _is_ an appropriate higher level place for the
> lookup call. I'll take a look when I have time.
Most of what bogofilter does is transform data sets.
Top-down, we have:
1. transform mail storage to list of messages
2. transform message to a list of tokens
3. transform list of tokens into list of probabilities (unless it is
message-count format input)
4. transform list of probabilities into a single spamicity
How a particular transformation looks in detail depends on what we get
as input, but the output is (ideally) always the same.
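
As a rough illustration of the four stages above, here is a minimal sketch in Python rather than bogofilter's C. Every function name, the blank-line message separator, the default probability of 0.5, and the averaging in the last stage are assumptions made for the example; none of them is bogofilter's actual code or algorithm. The point is only that each stage takes one data set and returns the next, and that the lookup happens in the caller, not inside the compute step.

```python
from typing import Dict, List

# All names below are hypothetical and exist only for this illustration.


def split_messages(storage: str) -> List[str]:
    """Stage 1: mail storage -> list of messages.

    Toy assumption: messages are separated by a blank line.
    """
    return [m for m in storage.split("\n\n") if m.strip()]


def tokenize(message: str) -> List[str]:
    """Stage 2: message -> list of tokens (lower-cased, whitespace split)."""
    return message.lower().split()


def lookup(tokens: List[str], wordlist: Dict[str, float],
           default: float = 0.5) -> List[float]:
    """Stage 3: list of tokens -> list of probabilities.

    Unknown tokens get a neutral default (an assumption of this sketch).
    """
    return [wordlist.get(t, default) for t in tokens]


def compute(probs: List[float]) -> float:
    """Stage 4: list of probabilities -> single spamicity.

    Placeholder: a plain arithmetic mean, NOT bogofilter's real calculation.
    Note that this function never calls lookup().
    """
    return sum(probs) / len(probs) if probs else 0.5


def classify(storage: str, wordlist: Dict[str, float]) -> List[float]:
    """Higher-level driver: chains the stages, one spamicity per message."""
    return [compute(lookup(tokenize(m), wordlist))
            for m in split_messages(storage)]
```

With a two-entry word list, `classify("buy buy\n\nbuy hello", {"buy": 0.75, "hello": 0.25})` yields one score per message. Because `compute` only sees probabilities, a different stage 3 can be swapped in without touching it.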
Bogotune introduced some switches for different data structures, and I
am not at all happy with the fBogotune switch that is buried deep in the
code - it clouds what the functions are doing and why.
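
One way to avoid such a buried switch, sketched here in Python under loudly stated assumptions: the caller chooses the stage-3 transformation once, at the top, and passes it down, so the later stages never test a mode flag. The function names, the `(spam_count, good_count)` record layout standing in for message-count input, and the averaging step are all invented for this example and do not describe bogofilter's or bogotune's real data structures.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Probs = List[float]

# Hypothetical names and record layouts, for illustration only.


def probs_from_wordlist(tokens: Sequence[str],
                        wordlist: Dict[str, float]) -> Probs:
    """Token input: look each token up, neutral 0.5 for unknown tokens."""
    return [wordlist.get(t, 0.5) for t in tokens]


def probs_from_counts(counts: Sequence[Tuple[int, int]]) -> Probs:
    """Count input: assumed (spam_count, good_count) pairs per token."""
    return [s / (s + g) if (s + g) else 0.5 for s, g in counts]


def spamicity(probs: Probs) -> float:
    """Same final stage for every input format (placeholder: mean)."""
    return sum(probs) / len(probs) if probs else 0.5


def score(data, to_probs: Callable[..., Probs]) -> float:
    """The input-specific step is chosen by the caller, not by a flag
    checked deep inside the computation."""
    return spamicity(to_probs(data))
```

A caller working on tokens would write `score(tokens, lambda t: probs_from_wordlist(t, wordlist))`, while one working on counts would write `score(counts, probs_from_counts)`; `spamicity` is identical in both cases and says nothing about where its input came from.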