Understanding quaternions

Question

I'm trying to understand quaternions a bit better and get some more intuition, mostly in the context of using them as a way to think about rotations in 3D. My approach to how one might want to think about them in this context:

We consider the problem of describing 3D rotations by "inject a structure into a larger structure and describe it there". Now instead of looking at 3D rotations, we start by looking at rotations in 4D, and specifically we start with those that are induced by choosing a pair of coordinates, rotating it a certain angle, and then rotating the other two remaining coordinates.

Defining a corresponding multiplication operation to this and extending it distributively gives the quaternion structure. What I do wonder about is this:

With $1$ and $i$ it's clear that these two elements in some sense only effect $\langle 1,i \rangle$ - it rotates this plane. Is there a way to clearly see that other pairs like $1, i+j$ also define some sort of plane that $i+j$ rotates via multiplication? Does extending the multiplication operation distributively still preserve the quality of "rotating two separate coordinate pairs" and if yes, how do I see that?

I'm not sure if that answers your question, but $i$ has nothing special. You can replace $i$ by any pure quaternion, and pretty much everything works the same (except that its square is some negative number which may not be $-1$, but this does not matter in general). — Captain Lama, May 27 '20 at 12:06
we start by looking at rotations in 4D, and specifically we start with those that are induced by choosing a pair of coordinates, rotating it a certain angle, and then rotating the other two remaining coordinates. That would almost never produce a 3-d rotation. The eigenvalues of such a transformation will usually be all complex, and a 3-d rotation always has $1$ as an eigenvalue in the direction of its axis. — rschwieb, May 27 '20 at 13:15
How many questions about understanding quaternions have you read on the site? This is something that people are constantly asking about, so there is plenty of material. If you're mainly trying to 'rubber-duck' until you understand it, I would recommend mathematics chat, not posts. — rschwieb, May 27 '20 at 13:17
@rschwieb I didn't mean that that would produce a 3-d rotation - I meant that that's an "interesting" and natural type of 4D rotations to consider. You play around and maybe you end up realizing "hey if I switch the orientation in the other pair, and then conjugate an element by this rotation, it's actually a 3d rotations." - is that true or do I get that wrong?
I haven't read a ton, I was hoping to get some high level understanding before trying to tackle this topic. — John P, May 27 '20 at 20:08
@JohnP I don't see what connection it has to 3-d rotations. I thought the whole point of your question is to understand 3-d rotations with quaternions. — rschwieb, May 27 '20 at 20:14
@rschwieb I want to get some ideas about how one might have discovered quaternions in the first place. The approach of "let's study some more complicated object" seems imaginable to me. You start studying the more complicated object itself, maybe think about its rotations, and consider the basic rotations I've described etc. — John P, May 27 '20 at 20:31
@CaptainLama What I don't get in this approach is why things will still work even for other "rotations" of this type - how would you formalize that in some sense left multiplication by $(i+j)\sqrt{2}/2$ will "rotate" $\langle 1,(i+j)\sqrt{2}/2 \rangle$ and also the "orthogonal complement" of $\langle 1, (i+j)\sqrt{2}/2 \rangle$ ? Perhaps these sort of things will become clear when I study the relationship with SU(2). — John P, May 27 '20 at 20:41
@JohnP Transformations created by left multiplication of the quaternions on themselves have nothing to do with the 3-d rotations (for which the relevant action is conjugation.). So I think studying them in relation with each other is a dead end. — rschwieb, May 27 '20 at 20:41
@JohnP Re-inventing the quaternions is also probably not too rewarding. It would probably be more rewarding to seek out their connection with the exponential map. — rschwieb, May 27 '20 at 21:04
+1. @rschwieb I disagree vehemently that studying left & right quaternion multiplication is a dead-end with nothing to do with 3D rotations. The conjugation $pxp^{-1}$ has two $p$s in it, and it is a rotation by $2\theta$ which has two $\theta$s in it, and each $p$ has a $\theta$ in it, so it's natural to ask if each $p$ in the conjugation somehow contributes a $\theta$ to this rotation, and investigating left/right multiplication as isoclinic rotations is exactly the way to go for that insight, as I say in this answer and many other answers on this site. — anon, Jun 26 '20 at 02:16
@runway44 Frankly, you are disagreeing with something I never said. I said "left multiplication of the quaternions on themselves have nothing to do with 3-d rotations." and "Studying [left multiplications] in relation with 3-d rotations is a dead end." Of course combining left and right multiplications has something to do with 3-d rotations, because that's what conjugation is doing. I'm just saying left and right actions separately isn't the natural setting for 3-d rotations. — rschwieb, Jun 26 '20 at 12:53
@runway44 My argument could be perhaps seen as saying that studying the conjugation action is best done first, and that starting from the viewpoint of 4-d rotations is putting the cart before the horse. — rschwieb, Jun 26 '20 at 12:56

score 1 · Answer 1 · answered Jun 26 '20 at 02:14

Quaternions have real and imaginary parts, or one may call them a scalar and vector part. That is, we can interpret $\mathbb{H}$ (named after Hamilton) as $\mathbb{R}\oplus\mathbb{R}^3$. We already know how to multiply a scalar by a scalar, and a vector by a scalar, so it remains to describe how to multiply two 3D vectors. The scalar and vector parts of the product $\mathbf{uv}$ are the (opposite) dot product $-\mathbf{u}\cdot\mathbf{v}$ and cross product $\mathbf{u}\times\mathbf{v}$ respectively, so

$$ \mathbf{uv}=-\mathbf{u}\cdot\mathbf{v}+\mathbf{u}\times\mathbf{v}. $$

From this we may conclude, for instance:

The square roots of $1$ are $\pm1$, and the square roots of $-1$ are precisely the unit vectors.
Euler's formula $\exp(\theta\mathbf{u})=\cos(\theta)+\sin(\theta)\mathbf{u}$ for unit vectors $\mathbf{u}$.
All quaternions have a polar form $p=r\exp(\theta\mathbf{u})$ with $r=\|p\|$.
Two quaternions commute if and only if their vector parts are parallel.
Two quaternions anticommute iff they are perpendicular vectors.

We consider the problem of describing 3D rotations by "inject a structure into a larger structure and describe it there". Now instead of looking at 3D rotations, we start by looking at rotations in 4D [...]

Exactly!

Given any unit vector $\mathbf{u}$, we may extend it to an oriented orthonormal basis $\{\mathbf{u},\mathbf{v},\mathbf{w}\}$ of $\mathbb{R}^3$, and if we adjoin the scalar $1$ we get an oriented orthonormal basis for $\mathbb{H}$. Define $L_p(x)=px$ and $R_p(x)=xp$. Then $L_{\mathbf{u}}$ has two invariant planes, the spans of $\{1,\mathbf{u}\}$ and $\{\mathbf{v},\mathbf{w}\}$. More to the point, $L_{\mathbf{u}}$ is a right-angle rotation in the $1\mathbf{u}$-plane and the $\mathbf{vw}$-plane. Moreover, the same applies to $R_{\mathbf{u}}$, except it turns the opposite direction in the $\mathbf{vw}$-plane. Just as $\exp(i\theta)$ turns the complex plane by $\theta$, we can show $L_p$ and $R_p$ (where $p=\exp(\theta\mathbf{u})$ turn the $1\mathbf{u}$ and $\mathbf{vw}$-planes by $\theta$, but with opposite directions in the $\mathbf{vw}$-plane.

If you want, you can write the matrices for $L_p$ and $R_p$ WRT the basis $\{1,\mathbf{u},\mathbf{v},\mathbf{w}\}$.

Inverting $L_p$ or $R_p$ alters the direction of rotation in both planes. Consequently, the conjugation $L_p\circ R_p^{-1}$ (i.e. $x\mapsto pxp^{-1}$) rotates by $2\theta$ in the $\mathbf{vw}$-plane and acts trivially in the $1\mathbf{u}$-plane. Restricting to $\mathbb{R}^3$, we can simply say it rotates around the oriented $\mathbf{u}$-axis by $2\theta$. So the answer to this is yes:

[...] we start with those that are induced by choosing a pair of coordinates, rotating it a certain angle, and then rotating the other two remaining coordinates. [...] You play around and maybe you end up realizing "hey if I switch the orientation in the other pair, and then conjugate an element by this rotation, it's actually a 3d rotations." - is that true or do I get that wrong?

On the other hand,

Is there a way to clearly see that other pairs like 1,i+j also define some sort of plane that i+j rotates via multiplication? [...] What I don't get in this approach is why things will still work even for other "rotations" of this type - how would you formalize that in some sense left multiplication by (i+j)2–√/2 will "rotate" ⟨1,(i+j)2–√/2⟩ and also the "orthogonal complement" of ⟨1,(i+j)2–√/2⟩ ?

This follows, I think reasonably directly, from the quaternion product of two vectors formula I mentioned above: with the dot and cross product here, multiplying two orthogonal vectors yields a third orthogonal vector. You can use this to show the $1\mathbf{u}$ and $\mathbf{vw}$-planes are indeed invariant planes WRT $L_p$ and $R_p$, and check the matrix representations of $L_p$ and $R_p$ in the appropriate basis.

It suffices to know what $L_p$ and $R_p$ do on these invariant planes because they are complementary and span the whole of $\mathbb{H}$; you can figure out what $L_p$ and $R_p$ do to any quaternion by splitting that quaternion into components with respect to the invariant planes.

Does extending the multiplication operation distributively still preserve the quality of "rotating two separate coordinate pairs" and if yes, how do I see that?

Adding two unit quaternions generally does not yield a unit quaternion, so the answer is technically no as written, but the answer is yes if you say "rotating two separate planes by the same angle and rescales."

Of course adding two quaternions gives a quaternion, so algebraically this is clear. I don't really think it's clear geometrically, however, and with good reason: this is a very exceptional accident that occurs in precisely four dimensions, and no other dimensions. (I have a related answer on Are left isoclinic rotations a group?.)

I want to get some ideas about how one might have discovered quaternions in the first place.

Finding a number system to describe 3D rotations just as complex numbers describe 2D rotations was indeed the way Hamilton discovered the quaternions. He needed a number system with an inner product corresponding to a multiplicative norm, and some square roots of $-1$ to act as "generators" for the rotations. He first assumed it would a 3D number system with $x=a+b\mathbf{i}+c\mathbf{j}$ and agonized for years over how to get it to work right, in particular what $\mathbf{ij}$ ought to be. Eventually he realized $|x^2|=|x|^2$ forced $\mathbf{i}$ and $\mathbf{j}$ to anticommute, and then he had an infamous flash of bridge-adjacent insight that $\mathbf{ij}$ ought to be independent of $\mathbf{i}$ and $\mathbf{j}$; from there everything else - the full multiplication table - flowed smoothly from the 4D insight and the requirement $|xy|=|x||y|$.

Once you have the number system in place, you can begin investigating it.

This is my best recollection anyway.

rschwieb · Answer 2 · 2020-06-26T20:14:18.870

As mentioned in the comments, multiplication on just one side lacks the appropriate eigenspace behavior to study 3-d rotations. But if you combine both right and left action, you can get the following. I think perhaps what you're looking for is this:

If $\mathbb H_1$ denotes the unit length quaternions, then there is a surjective homomorphism from $\mathbb H_1\times \mathbb H_1\to SO(4)$, where $\mathbb H$ itself is being viewed as a model of $\mathbb R^4$, and the action is $(a,b)(q)=aq\bar{b}$. (This is a good resource for that.)

Of course, you can get 3-d rotations out of this if you study the set of such transformations that fixes one of the coordinates. If the first coordinate in $\mathbb R^4$ represents the real coordinate of the quaternion, then this is asking for $ax\bar{b}=x$ for all real $x$, and in particular for $x=1$ you get $\bar{b}=a^{-1}$, and you've recovered the conjugation action.

I haven't studied 4-d rotations much, since the 3-d case is so practical. I'd say play around with the 3-d case for a while before doing 4-d, but that's just my two cents.

Understanding quaternions

2 Answers2

Linked