What is the significance of a matrix times its transpose?

Question

I am trying to understand how a Kalman Filter works. (This is an algorithm used in robotics for fusing multiple, noisy sensor readings with a linear model of the robot's motion to try to produce an improved state estimation.)

Part of the algorithm involves this equation:

$S = HPH^T + R$

Where $P$ is the covariance matrix representing the noise in the state estimate, and $H$ is a matrix which maps the robot's state onto the sensor measurements which would imply that state. $R$ is the covariance matrix of the noise in the sensor data.

What is the significance of multiplying by both $H$ and $H^T$?

I don't know the specifics about the algorithm you are talking about, but you may want to look at http://math.stackexchange.com/questions/158219/is-a-matrix-multiplied-with-its-transpose-something-special — KSab, May 10 '14 at 22:09
Did you find an answer to this? Similarly, the KF involves computing P'=FPF^T+Q. I don't understand the purpose of multiplying by the transpose of F. — William Grand, May 19 '21 at 19:22

score 2 · Answer 1 · answered Jan 06 '23 at 20:48

The goal of multiplying a covariance matrix by another matrix and its transpose is to apply a linear transformation to the original covariance matrix. (A covariance matrix is a multivariate Gaussian distribution. These slides might be a helpful further read.)

You define the computation of the innovation covariance of a Kalman Filter ($S = H P H^T + R$) very well in your question. In a simple robotics example:

$P$ is typically the uncertainty of the robot pose expressed in the world reference frame.
$R$ is the the modeled sensor noise expressed in the sensor reference frame.

So what this equation does is (i) project the uncertainty on the robot pose to put it in the reference frame of the sensor and (ii) add the modeled sensor noise. Note that $S$ is also referred with respect to the sensor frame.

The prediction step of the Kalman Filter does something very similar: $P_k = F P_{k-1} F^T + Q$. Here, $F$ is the system model (typically a motion model), so this equation just projects the pose uncertainty in the last time stamp $k-1$ to the current $k$.

If you provide an example, we will be able to better visualize what these equations do to the uncertainties.

What is the significance of a matrix times its transpose?

1 Answers1