Learn the best hypothesis given data
 $ some domain knowledge
Learn the most probable H given data
 $ domain knowledge
Pr(h|D)
Pr(h|D) = Pr(D|h)*Pr(h) / Pr(D) … Bayes’ rule
Pr(a,b) = Pr(a|b)P(b)
Pr(b,a) = Pr(b|a)P(a)
Bayesian Learning
For each h e H
 calculate Pr(h|D) = P(D|h)P(h)/P(D)
Output:
 h = argmax Pr(h|D)
 h = argmax Pr(D|h)