Markov decision process

From Simple English Wikipedia, the free encyclopedia

A Markov decision process is a method for optimizing decision making over time in a step-by-step manner in situations where the outcomes of the decisions are partially random and partially determined by the decisions.