Markov Decision Process - Alternative Notations

Alternative Notations

The terminology and notation for MDPs are not entirely settled. There are two main streams — one focuses on maximization problems from contexts like economics, using the terms action, reward, value, and calling the discount factor or, while the other focuses on minimization problems from engineering and navigation, using the terms control, cost, cost-to-go, and calling the discount factor . In addition, the notation for the transition probability varies.

in this article alternative comment
action control
reward cost is the negative of
value cost-to-go is the negative of
policy policy
discounting factor discounting factor
transition probability transition probability

In addition, transition probability is sometimes written, or, rarely,

Read more about this topic:  Markov Decision Process

Famous quotes containing the word alternative:

    A mental disease has swept the planet: banalization.... Presented with the alternative of love or a garbage disposal unit, young people of all countries have chosen the garbage disposal unit.
    Ivan Chtcheglov (b. 1934)