Is there some consensus on the dimensions of a Jacobian matrix and of a gradient?

Question

According to Wikipedia, given a differentiable mapping $F: \mathbb{R}^n \to \mathbb{R}^m$, its Jacobian matrix is a $m \times n$ matrix defined as: $$ J_F=\begin{bmatrix} \dfrac{\partial y_1}{\partial x_1} & \cdots & \dfrac{\partial y_1}{\partial x_n} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial y_m}{\partial x_1} & \cdots & \dfrac{\partial y_m}{\partial x_n} \end{bmatrix}. $$ Specially when $m=1$, the Jacobian matrix is also called the gradient $\nabla F$. So when trying to compute a differential, it is $J_F \Delta x$ or $\nabla F \Delta x$.

In real analysis, optimization, ..., some texts agree with Wikipedia's definitions. However, in some others, a Jacobian matrix or a gradient of a differentiable mapping is defined to be the transpose of the Wikipedia definitions.

Moreover, in baby Rudin, $J_F$ is of $m \times n$ dimension, while when $m=1$, $\nabla F$ is of $n \times 1$ dimension.

When it comes to writing my own formulas, I wonder which way is mostly adopted?

Thanks and regards!

Dylan Moreland · Accepted Answer · 2011-11-13T18:48:44.820

4

$\newcommand{\R}{\mathbf{R}}$Rudin's conventions are certainly the ones I use. He isn't contradicting himself, but it takes some work to see that. You've asked about bilinear forms and duality before, I think, so I hope the following makes sense.

If $F\colon \R^n \to \R$ is a smooth function and $x \in \R^n$, then $J_F$ is something which takes in a vector in the tangent space $T_x\R^n$, which is canonically identified with $\R^n$, to something in $T_{F(x)}\R = \R$. With Rudin's conventions, this should correspond to a $1 \times n$ matrix. We can view $J_F$ as an element of the cotangent space $T_x^\vee\R^n$ at $x$.

$\R^n$ comes with an inner product $\langle\phantom{x}, \phantom{x}\rangle$, the "dot product". What this allows us to do is identify $T_x\R^n$ with $T_x^\vee\R^n$. See the page on nondegenerate forms for more details. The upshot is that there has to be a tangent vector $\nabla F \in T_x\R^n$ such that $\langle \nabla F, v\rangle = J_F \cdot v$ for all $v \in T_x\R^n$. Since $\nabla F$ is a tangent vector, we express it as an $n \times 1$ matrix. You can check that the entries work out to be what Rudin says they are.

This causes more confusion later on in life, because a manifold may not come with a metric and then these identifications that one uses so freely in calculus cannot be made. Manifolds that do have an analogue of the dot product are called Riemannian.

edited Nov 13 '11 at 18:48

answered Nov 13 '11 at 18:32

Dylan Moreland

19,819

Somebody is grappling with a specific instance of this observation here. – yasmar Nov 13 '11 at 19:33
I should be more clear: the inner product is giving one on the tangent space. Will rewrite later. – Dylan Moreland Nov 13 '11 at 19:48
@DylanMoreland: Thanks! What is the theorem for the reason of "there has to be a tangent vector ∇F∈TxRn such that ⟨∇F,v⟩=JF⋅v for all v∈TxRn"? Is it Riesz representation theorem? – Tim Nov 16 '11 at 08:58
@Tim That would work, but our space is finite-dimensional, so the full might of Riesz isn't needed here. – Dylan Moreland Nov 16 '11 at 17:50
1

@Tim To elaborate: let's say I have a finite dimensional real vector space $V$ with an inner product, which in particular satisfies $\langle x, x\rangle = 0$ iff $x = 0$. I get a map $V \to V^\vee$ by sending $x \in V$ to the functional $f_x(y) = \langle x, y\rangle$. This is a homomorphism and it's injective, because $f_x = 0$ means that $x = 0$. By counting dimensions, it's an isomorphism. – Dylan Moreland Nov 17 '11 at 17:12

Is there some consensus on the dimensions of a Jacobian matrix and of a gradient?

1 Answers1