A function \(f\colon \R^n\to \R^m\) is differentiable at the point \(\mathbf{x}_0\) if there exists a linear transformation \(L\colon \R^n\to \R^m\) such that
\begin{equation}
\lim_{\mathbf{h}\to 0} \frac{f(\mathbf{x}_0+ \mathbf{h}) - f(\mathbf{x}_0)-
L\mathbf{h}}{\|\mathbf{h}\|} = 0.\tag{4.1}
\end{equation}
If \(L\) exists, it is called the derivative of \(f\) at \(\mathbf{x}_0\) denoted \(Df(\mathbf{x_0})\).
To understand the definition of the derivative, start with the case \(n=m=1\text{.}\) The derivative of \(f\) at \(x_0\) is a number \(f'(x_0)\) such that
\begin{equation*}
f(x_0 + h) -f(x_0) \approx f'(x_0)h
\end{equation*}
for \(h\) near \(0\text{.}\) The meaning of "approximately equals \(\ldots\) for \(h\) near \(0\)" is made precise by using a limit. To generalize to higher dimensions, interpret \(f'(x_0)h\) as the value of a linear transformation that sends \(h\) to \(f'(x_0)h\text{.}\) The derivative \(Df(\mathbf{x}_0)\) of \(f\) at \(\mathbf{x_0}\) is a linear transformation such that
\begin{equation*}
f(\mathbf{x}_0 + \mathbf{h}) -f(\mathbf{x}_0) \approx Df(\mathbf{x}_0)\mathbf{h}
\end{equation*}
for \(\mathbf{h}\) near \(\mathbf{0}\text{.}\) Putting \(\mathbf{h} = t\mathbf{e}_j\text{,}\) this reads
\begin{equation*}
f(\mathbf{x}_0 + t\mathbf{e}_j) -f(\mathbf{x}_0) \approx Df(\mathbf{x}_0)t\mathbf{e}_j
\end{equation*}
for \(t\) near \(0\text{.}\) Dividing both sides by \(t\) and taking a limit, we get an expression for \(Df(\mathbf{x}_0)\mathbf{e}_j\text{.}\)
\begin{equation}
Df(\mathbf{x}_0)\mathbf{e}_j = \lim_{t\to 0}
\frac{f(\mathbf{x}_0 + t\mathbf{e}_j) -f(\mathbf{x}_0)}{t} =
\left(\frac{\partial y_1}{\partial x_j}, \frac{\partial y_2}{\partial
x_j},\ldots, \frac{\partial y_m}{\partial x_j}\right)\tag{4.2}
\end{equation}
where \(\mathbf{y}=(y_1,y_2,\ldots,y_m) = f(x_1,x_2,\ldots,x_n) =
f(\mathbf{x})\text{.}\) From this it follows that \(Df(\mathbf{x}_0)\text{,}\) if it exists, is represented by the matrix \(\left[\frac{\partial
y_i}{\partial x_j}\right]\text{.}\)
\begin{equation}
[Df(\mathbf{x}_0)] = \left[\frac{\partial y_i}{\partial x_j}\right]\tag{4.3}
\end{equation}