Alternative Notations
The terminology and notation for MDPs are not entirely settled. There are two main streams — one focuses on maximization problems from contexts like economics, using the terms action, reward, value, and calling the discount factor or, while the other focuses on minimization problems from engineering and navigation, using the terms control, cost, cost-to-go, and calling the discount factor . In addition, the notation for the transition probability varies.
in this article | alternative | comment |
---|---|---|
action | control | |
reward | cost | is the negative of |
value | cost-to-go | is the negative of |
policy | policy | |
discounting factor | discounting factor | |
transition probability | transition probability |
In addition, transition probability is sometimes written, or, rarely,
Read more about this topic: Markov Decision Process
Famous quotes containing the word alternative:
“If English is spoken in heaven ... God undoubtedly employs Cranmer as his speechwriter. The angels of the lesser ministries probably use the language of the New English Bible and the Alternative Service Book for internal memos.”
—Charles, Prince Of Wales (b. 1948)