The idea behind state estimation is straightforward. Say you're someone living in the US and you don't get to directly observe events happening in London. So you take "sensor measurements" of London by reading some US-based newspapers. Based on past experience, The* New York Times* is more likely to have an accurate report than a supermarket tabloid, so you trust the different news sources differently. Let's say you're trying to estimate the weather in London. The *Times* tells you that it's raining in London, and you know that it could very well be raining there. The *Times* could be wrong, but it generally doesn't report "rain" when the true state is "no rain." Since a trustworthy source is reporting something that that you already believe is possible, you'd probably be fairly confident that it is in fact raining in London.

Now say that you're trying to figure out if aliens have landed in London. If you read in the supermarket tabloid that an alien spaceship had landed in London you probably wouldn't believe it, because your experience tells you the tabloid is likely to report "aliens" when the actual state is "no aliens," and your previous belief is strongly in favor of "no aliens." However, the *New York Times* is unlikely to report "aliens" when the true state is "no aliens," so if the *Times* reported "aliens" you'd be skeptical still, but it will change your belief more than the tabloid. And if you keep seeing reports of an alien spaceship in London from trustworthy sources, your belief might go from near certainty in "no aliens" to being unsure about "aliens" versus "no aliens." or even certainty in "aliens."

The example above captures the two things we need to quantify in order to update our state estimate using a sensor measurement. We need a way of capturing the fact that we can be unsure about the true state, and we need to specify how much we trust each sensor. We do this by using the mathematics of probability. Instead of having a single value for each state variable, the robot maintains a probability distribution over possible state values. These probabilities represent how strongly we believe each possible value is the true value. This is our belief over the state, and it gives us a way of quantifying uncertainty.

To handle the trust problem, we make a probabilistic model of the sensor. Instead of explicitly stating how much to trust a sensor, the model tells us how likely we are to see a particular sensor reading for a given value of the true state. In other words, for each possible value of the state, how much does our sensor reading support the belief that this state is the true state? Now all we need to know is how to use a sensor measurement to update our belief. The tool that allows us to do this is an equation called Bayes' Rule.

### Bayes' Rule

By itself, Bayes' Rule is a simple equation relating the probabilities of two variables, X and Z:

The equation has three terms that matter. The term on the left side of the equation, P(X = x | Y = y), is called the posterior, and it tells us how likely it is that the variable X takes on the value x given that we observed the variable Z had the value z. If X is a state variable of interest and Z is a sensor measurement that returned z, the posterior probability is our belief that the true value of X is x. In our newspaper example it represents what we believe about the London weather after reading the paper.

The term P(Z = z | X = x) is the probability of seeing Z = z if X = x. This is our model of the sensor: how likely were we to have seen this measurement for a possible value of the state. Like in the newspaper example, it represents our level of trust in each paper: how likely each paper was to report the correct state in London. The third term, P(X = x) is called the prior, and it represents our original belief that X = x before taking the measurement. The term n is a normalizing constant computed to make the sum of the probabilities of all the possible values of X add up to one.

The end conclusion of Bayes' Rule is this: how much we believe a certain state value is the true value is simply how strongly we believed in it before times how much the sensor reading supports that particular value. There are subtleties in how to represent the probabilities of various states, but that's the basic principle. For more details on this I highly recommend reading Sebastian Thrun's textbook, *Probabilistic Robotics*.

The result of using Bayes' Rule is very powerful: it tells us exactly how likely each possible state is given our sensor readings. As long as we have our model for the sensor (matching the probability of the readings we get to the true state), we know exactly how to update our posterior probability for our state for any sensor. Moreover, we can use the posterior from one measurement as the prior for another, performing sequential updates as time passes or combining the measurements from multiple different sensors. Finally and most importantly, Bayes' Rule isn't limited to sensors that make direct measurements. Our sensor probability simply captures the relationship between two variables. This means that we can use pretty much any source of information to update our estimate, and Bayes' Rule tells us how to extract the maximum amount of information from what the source tells us. The specific implementations of how robots interact with the world differ greatly, but Bayesian probability is how nearly all of them "see" the world. This single-line equation underpins much of modern robotics, and is used to answer our big three questions: Where am I, what's around me, and what do I do?

### Navigation

Knowing your own location and orientation is pretty critical to many robots. One way to find out where you are is to use GPS and inertial sensors like accelerometers. If GPS is unavailable, for example in indoor environments, a robot might find its way using cameras and machine vision. In each case, probability and Bayes' Rule are critical to accurate navigation.

#### Fusing GPS and Inertial Sensors

Consider the needs of an autonomous car (or indeed any outdoor robot). We need an estimate of our vehicle's position, so that we can figure out where to drive, and orientation, so that we know where our cameras or other sensors are pointing. GPS is great at giving an accurate absolute position, but it doesn't give us orientation unless we're moving and it's noisy. It's also unavailable if our robotic car drives under an overpass or through dense downtown areas with no clear view of the satellites.

Inertial sensors (gyros and accelerometers) are another way to get position and orientation: if we know where we started we can integrate acceleration to get position and angular rates to get orientation. This will give us smooth positions that don't jump around and don't depend on a satellite signal. However, after a while, small errors add up and eventually our estimated position and orientation will drift farther and farther from the truth. What we would like is some way to utilize both sensors to balance each other's weaknesses.

Instead of figuring out some relative weighting between the positions we get from the inertial sensing and the GPS, we can simply use Bayes' Rule. A probabilistic sensor model allows us to capture the errors inherent in each sensor, and Bayes' Rule tells us how to use each measurement to update our state estimate. We can then perform sequential, independent Bayesian updates. First we use the inertial sensors to get a new position and orientation estimate, then use that as the prior to perform another update with the GPS.

The resultant position estimate we get combines the best of both sensors: the inertial information smoothes out the GPS's noise, and the GPS keeps the final position estimate from drifting off. If we don't get GPS for a while we can just use inertial sensing until the satellites come back. The picture below shows an example of these sequential updates (since it's hard to draw probabilities over all possible locations, we just show the most likely location after each update).

This form of Bayesian sensor fusion for navigation is used very widely: not only on all of the DARPA Grand Challenge vehicles, but also on most modern aircraft, ships, submarines, and pretty much anything else that needs to know where it is. In fact, this kind of Bayesian navigation was invented for the Apollo space program in the '60s, where it was known as a Kalman filter and used to combine the inputs of the Apollo space capsule's gyros with measurements from a star tracker.

You must login or create an account to comment.