Alternative Notations
The terminology and notation for MDPs are not entirely settled. There are two main streams. One focuses on maximization problems from contexts like economics, using the terms action, reward, and value, and calling the discount factor $\beta$ or $\gamma$. The other focuses on minimization problems from engineering and navigation, using the terms control, cost, and cost-to-go, and calling the discount factor $\alpha$. In addition, the notation for the transition probability varies.
| in this article | alternative | comment |
|---|---|---|
| action $a$ | control $u$ | |
| reward $R$ | cost $g$ | $g$ is the negative of $R$ |
| value $V$ | cost-to-go $J$ | $J$ is the negative of $V$ |
| policy $\pi$ | policy $\mu$ | |
| discount factor $\gamma$ | discount factor $\alpha$ | |
| transition probability $P_a(s,s')$ | transition probability $p_{ss'}(a)$ | |
The transition probability is also sometimes written $\Pr(s,a,s')$, $\Pr(s' \mid s,a)$, or, rarely, $p_{s's}(a)$.
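
The sign relationship between the two conventions can be checked numerically. The following is a minimal sketch, not a reference implementation: the two-state, two-action MDP, its numbers, and the helper name `bellman_iterate` are all hypothetical, and the reward is assumed to be given per state-action pair rather than per transition. It runs value iteration once as a maximization over rewards and once as a minimization over costs equal to the negated rewards, then compares the results.

```python
# Toy MDP with made-up numbers (an assumption for illustration only).
# P[a][s][t] = probability of moving from state s to state t under action a.
P = [
    [[0.9, 0.1], [0.2, 0.8]],  # action/control 0
    [[0.5, 0.5], [0.6, 0.4]],  # action/control 1
]
# R[a][s] = expected reward for taking action a in state s (assumed form).
R = [
    [1.0, 0.0],
    [0.5, 2.0],
]
GAMMA = 0.9  # discount factor; the cost convention would call it alpha


def bellman_iterate(payoff, discount, choose, sweeps=500):
    """Repeated Bellman backups; `choose` is max (rewards) or min (costs)."""
    n_actions = len(payoff)
    n_states = len(payoff[0])
    v = [0.0] * n_states
    for _ in range(sweeps):
        v = [
            choose(
                payoff[a][s]
                + discount * sum(P[a][s][t] * v[t] for t in range(n_states))
                for a in range(n_actions)
            )
            for s in range(n_states)
        ]
    return v


# Reward/value convention: maximize.
V = bellman_iterate(R, GAMMA, max)

# Cost/cost-to-go convention: g is the negative of R, and we minimize.
g = [[-r for r in row] for row in R]
J = bellman_iterate(g, GAMMA, min)

# J should be the negative of V, state by state.
print(all(abs(J[s] + V[s]) < 1e-9 for s in range(len(V))))
```

Because minimizing a negated quantity is the same as maximizing the original, the two runs select the same actions at every step, so switching conventions changes only the sign and the symbols, not the optimal policy.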