I am trying to understand how a Kalman Filter works. (This is an algorithm used in robotics for fusing multiple, noisy sensor readings with a linear model of the robot's motion to try to produce an improved state estimation.)
Part of the algorithm involves this equation:
$S = HPH^T + R$
Where $P$ is the covariance matrix representing the noise in the state estimate, and $H$ is a matrix which maps the robot's state onto the sensor measurements which would imply that state. $R$ is the covariance matrix of the noise in the sensor data.
What is the significance of multiplying by both $H$ and $H^T$?