OpenBayes design

[Kevin Murphy, 30 July 2001.]

Below is a sketch of the proposed overall design of the library. Some possible application areas are on the right hand side. Bold means not yet implemented in BNT, italic bold means only partly implemented in BNT, and regular (Roman) font means more or less fully implemented in BNT. Dotted lines means optional components. Click on the figure for an enlargement.

The main statistical component of the project comes in defining the CPD (Conditional Probability Distribution) classes. These can be developed independently, and not all methods need be implemented at once. In rough order of priority, I would include: tabular (multinomial), conditional linear Gaussian, decision/regression trees, generalized linear models (GLIMs - includes softmax), noisy-or, others. (BNT supports multinomials, Gaussians, softmax etc. It badly needs more Bayesian inference routines (other than just multinomials), and tree-CPDs.)

Explanation of nodes (right to left, top to bottom)