Belief Update
An agent needs to update its belief upon taking the action and observing . Since the state is Markovian, maintaining a belief over the states solely requires knowledge of the previous belief state, the action taken, and the current observation. The operation is denoted . Below we describe how this belief update is computed.
In, the agent observes with probability . Let be a probability distribution over the state space : denotes the probability that the environment is in state . Given, then after taking action and observing ,
where is a normalizing constant with .
Read more about this topic: Partially Observable Markov Decision Process
Famous quotes containing the word belief:
“My belief is that science is to wreck us, and that we are like monkeys monkeying with a loaded shell; we dont in the least know or care where our practically infinite energies come from or will bring us to.”
—Henry Brooks Adams (18381918)