Alternative Notations
The terminology and notation for MDPs are not entirely settled. There are two main streams. One focuses on maximization problems from contexts like economics, using the terms action, reward, and value, and calling the discount factor $\beta$ or $\gamma$. The other focuses on minimization problems from engineering and navigation, using the terms control, cost, and cost-to-go, and calling the discount factor $\alpha$. In addition, the notation for the transition probability varies.
in this article | alternative | comment |
---|---|---|
action $a$ | control $u$ | |
reward $R$ | cost $g$ | $g$ is the negative of $R$ |
value $V$ | cost-to-go $J$ | $J$ is the negative of $V$ |
policy $\pi$ | policy $\mu$ | |
discounting factor $\gamma$ | discounting factor $\alpha$ | |
transition probability $P_a(s, s')$ | transition probability $p_{ss'}(a)$ | |
In addition, the transition probability is sometimes written $\Pr(s, a, s')$, $\Pr(s' \mid s, a)$ or, rarely, $p_{s's}(a)$.
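The two conventions describe the same problem with the sign flipped: if the cost is the negative of the reward, then the optimal cost-to-go is the negative of the optimal value. The following is a minimal sketch of that correspondence using value iteration on a made-up two-state, two-action MDP. The transition table `P`, the reward table `R`, the helper `solve`, and all numbers are assumptions chosen for illustration and do not come from any particular source.

```python
# Hypothetical 2-state, 2-action MDP; every number here is made up.
# P[a][s][s2] is the probability of moving from state s to s2 under action a.
P = [
    [[0.9, 0.1], [0.2, 0.8]],  # action 0
    [[0.5, 0.5], [0.6, 0.4]],  # action 1
]
# R[a][s] is the expected immediate reward for taking action a in state s.
R = [
    [1.0, 0.0],
    [0.5, 2.0],
]
GAMMA = 0.9  # discount factor (the same number is called alpha in the cost convention)


def solve(payoff, pick, gamma=GAMMA, sweeps=500):
    """Run value iteration; pick is max for rewards and min for costs."""
    n_actions = len(payoff)
    n_states = len(payoff[0])
    v = [0.0] * n_states
    for _ in range(sweeps):
        v = [
            pick(
                payoff[a][s]
                + gamma * sum(P[a][s][s2] * v[s2] for s2 in range(n_states))
                for a in range(n_actions)
            )
            for s in range(n_states)
        ]
    return v


V = solve(R, max)                      # value, reward-maximization convention
g = [[-r for r in row] for row in R]   # cost defined as the negative of the reward
J = solve(g, min)                      # cost-to-go, cost-minimization convention

for s in range(len(V)):
    print(f"state {s}: V = {V[s]:.4f}, J = {J[s]:.4f}")
```

In each state the printed `J` should be the negative of `V`, which is why results stated in one convention carry over to the other by a change of sign.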