For a sufficiently smooth function $f$, Taylor's theorem in one dimension may be written $$f(x+u)=\sum_{k=0}^l\frac1{k!}f^{(k)}(x)u^k +\varepsilon(x,u,l),$$where the “error” term $\varepsilon(x,u,l)$ is treated, for any given $x\in\Bbb R$, $l\in\Bbb N$, and for any sufficiently small $u$, as negligible for the current purpose—and we ignore it henceforth. Now taking $f$ to be a $\Bbb R^m$-valued function on $\Bbb R^n$, Taylor's theorem becomes $$\pmb f(\pmb x+\pmb u)=\sum_{k=0}^l\frac1{k!}\pmb f^{(k)}(\pmb x)\pmb u^k +\pmb\varepsilon(\pmb x,\pmb u,l),$$under suitable interpretation of the “powers” $\pmb f^{(k)}$ and $\pmb u^k$ and their multiplication. Since we are working in $\Bbb R^n$ and $\Bbb R^m$ with their natural column-vector representation, we can think of $\pmb f^{(1)}$ as having values in $\Bbb R^{m\times n}$, the space of $m\times n$ matrices, with $\pmb u^{(1)}:=\pmb u$. Generally, for $k=2,3,.. .$ , we may interpret $\pmb f^{(k)}$ as living in a space of $m\times(n\times(\cdots\times n)\cdots)$ tensors, which can be thought of as highly row-structured $m\times n^k$ matrices, where there are $k$ $n$s and $k$ applications of $\times$, while $\pmb u^{(k)}$ lives in a tensor space with (columnar) dimensional structure $(\cdots(n\otimes n)\otimes \cdots \otimes n)\times 1$, where there are $k$ $n$s and $k-1$ applications of the column-building operation $\otimes$. The multiplication between these scary beasts is relatively simple, as most of the structure collapses through addition, leaving simply an $m\times1$ matrix—namely an $m$-vector. The components of this vector are sums of terms of the form $$a_{\pmb i}(\pmb x) \pmb u^{\pmb i},$$where $a_{\pmb i}(\pmb x)\in\Bbb R$, $\pmb i =(i(1),...,i(m))\in \Bbb N^m$ with $i(1)+\cdots+i(m)=k$, and $\pmb u^{\pmb i}:=u_1^{i(1)}\cdots u_k^{i(k)}$. Here the $a_{\pmb i}(\pmb x)$ are individual differential coefficients of the form$$\frac1{h(1)!\cdots h(n)!}\frac{\partial^k f_j(\pmb x)}{\partial x_1^{h(1)}\cdots\partial x_n^{h(n)}},$$where $h(1),...,h(n)\in \Bbb N$ with $h(1)+\cdots+h(n)=k$. The initial factor accounts for permutations under which the differential coefficient is invariant.
A precise description of the above would require a foray into the generalities of tensor algebra.