Belief Update
An agent needs to update its belief upon taking the action $a$ and observing $o$. Since the state is Markovian, maintaining a belief over the states requires only the previous belief state, the action taken, and the current observation. The operation is denoted $b' = \tau(b, a, o)$. Below we describe how this belief update is computed.
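After the environment reaches state $s'$, the agent observes $o \in \Omega$ with probability $O(o \mid s', a)$. Let $b$ be a probability distribution over the state space $S$, so that $b(s)$ denotes the probability that the environment is in state $s$. Given $b(s)$, then after taking action $a$ and observing $o$,
$$b'(s') = \eta \, O(o \mid s', a) \sum_{s \in S} T(s' \mid s, a) \, b(s),$$
where $T(s' \mid s, a)$ is the probability of moving from $s$ to $s'$ under action $a$, and $\eta = 1 / \Pr(o \mid b, a)$ is a normalizing constant with
$$\Pr(o \mid b, a) = \sum_{s' \in S} O(o \mid s', a) \sum_{s \in S} T(s' \mid s, a) \, b(s).$$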
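For a finite state space, the update can be sketched in a few lines of Python: push the previous belief through the transition model, weight each successor state by the likelihood of the observation, then normalize. This is only a minimal sketch under stated assumptions. The function name `belief_update`, the nested-list layout `T[a][s][s2]` and `O[a][s2][o]`, and the two-state model with its numbers are all hypothetical and chosen for illustration.

```python
def belief_update(b, a, o, T, O):
    """Return the updated belief b' = tau(b, a, o) for a finite POMDP.

    Assumed (hypothetical) layout:
      b[s]        : prior probability of state s
      T[a][s][s2] : probability of moving from s to s2 under action a
      O[a][s2][o] : probability of observing o after reaching s2 via a
    """
    n = len(b)
    # Prediction step: sum over s of T(s' | s, a) * b(s), for every s'.
    predicted = [sum(T[a][s][s2] * b[s] for s in range(n)) for s2 in range(n)]
    # Correction step: weight by the observation likelihood O(o | s', a).
    unnormalized = [O[a][s2][o] * predicted[s2] for s2 in range(n)]
    # Pr(o | b, a); the normalizing constant eta is its reciprocal.
    pr_o = sum(unnormalized)
    if pr_o == 0.0:
        raise ValueError("observation has zero probability under b and a")
    return [u / pr_o for u in unnormalized]


# Hypothetical model: two states, one action, two observations.
T = [[[0.9, 0.1], [0.2, 0.8]]]
O = [[[0.7, 0.3], [0.4, 0.6]]]
b = [0.5, 0.5]
b_next = belief_update(b, a=0, o=0, T=T, O=O)
```

Because the result is divided by $\Pr(o \mid b, a)$, the entries of `b_next` sum to one. An observation that is impossible under the current belief and action makes the update undefined, which the sketch reports as an error rather than dividing by zero.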
