Bayesian network

From Wikipedia, the free encyclopedia
Jump to: navigation, search

A Bayesian network is a kind of graph which is used to model events that cannot be observed. This can then be used for inference. The graph that is used is directed, and does not contain any cycles. The nodes of the graph represent random variables. If two nodes are connected by an edge, it has an associated probability that it will transmit from one node to the other.

Bayesian networks are mainly used in the field of (unassisted) machine learning. They have been used where information needs to be classified. Examples are image, document, or speech recognition, and information retrieval.

It is based on Reverend Thomas Bayes' discovery in the 1740's called Bayes' theorem.[1]

History[change | change source]

The term "Bayesian networks" was made by Judea Pearl in 1985 to emphasize three aspects:[2]

  1. The often subjective nature of the input information.
  2. The reliance on Bayes's conditioning as the basis for updating information.
  3. The distinction between causal and evidential modes of reasoning, which underscores Thomas Bayes' posthumously published paper of 1763.[3]

In the late 1980's the seminal texts Probabilistic Reasoning in Intelligent Systems[4] and Probabilistic Reasoning in Expert Systems[5] summarized the properties of Bayesian networks and helped to establish Bayesian networks as a field of study.

Informal variants of such networks were first used by legal scholar John Henry Wigmore, in the form of Wigmore charts, to analyse trial evidence in 1913.[6]:66-76 Another variant, called path diagrams, was developed by the geneticist Sewall Wright[7] and used in social and behavioral sciences (mostly with linear parametric models).

References[change | change source]

  1. McGrayne, Sharon Bertsch. (2011). The Theory That Would Not Die, p. 10. at Google Books
  2. Pearl, J. (1985). "Bayesian Networks: A Model of Self-Activated Memory for Evidential Reasoning" (UCLA Technical Report CSD-850017) in Proceedings of the 7th Conference of the Cognitive Science Society, University of California, Irvine, CA. : 329–334. Retrieved on 2009-05-01. 
  3. Bayes, T.; Price, Mr. (1763). "An Essay towards solving a Problem in the Doctrine of Chances.". Philosophical Transactions of the Royal Society of London 53: 370–418. doi:10.1098/rstl.1763.0053.
  4. Pearl, J., Probabilistic Reasoning in Intelligent Systems, Morgan Kaufmann, San Francisco, CA, 1988
  5. Neapolitan, R.E., Probabilistic Reasoning in Expert Systems, Wiley, NY, 1989
  6. J. B. Kadane and D. A. Schum (1996). A Probabilistic Analysis of the Sacco and Vanzetti Evidence. New York: Wiley. ISBN 0-471-14182-8.
  7. Wright, S. (1921). "Correlation and Causation". Journal of Agricultural Research 20 (7): 557–585. http://www.ssc.wisc.edu/soc/class/soc952/Wright/Wright_Correlation%20and%20Causation.pdf.