You are here

The chain rule

20 January, 2016 - 10:05
media/image8.png
Figure 2.9 Three clowns on seesaws demonstrate the chain rule. 
 

Figure 2.9 shows three clowns on seesaws. If the leftmost clown moves down by a distance dx, the middle one will come up by dy, but this will also cause the one on the right to move down by dz. If we want to predict how much the rightmost clown will move in response to a certain amount of motion by the leftmost one, we have

\frac{dz}{dx}=\frac{dz}{dy}\cdot \frac{dy}{dx}

This is called the chain rule. It says that if a change in x causes y to change, and y then causes z to change, then this chain of changes has a cascading effect. Mathematically, there is no big mystery here. We simply cancel dy on the top and bottom. The only minor subtlety is that we would like to be able to be sloppy by using an expression like dy/dx to mean both the quotient of two infinitesimal numbers and a derivative, which is defined as the standard part of this quotient. This sloppiness turns out to be all right, as proved in Proof of the chain rule.

Example

Jane hikes 3 kilometers in an hour, and hiking burns 70 calories per kilometer. At what rate does she burn calories?

We let x be the number of hours she’s spent hiking so far, y the distance covered, and z the calories spent. Then

\begin{align*} \frac{dz}{dx} &=\left ( \frac{70 \textrm{cal}}{1/\not{\textrm{km}}} \right )\left ( \frac{3\not{\textrm{km}}}{1\textrm{hr}} \right ) \\ &= 210 \textrm{cal/hr} \end{align*}

Example

Figure 2.10 shows a piece of farm equipment containing a train of gears with 13, 21, and 42 teeth. If the smallest gear is driven by a motor, relate the rate of rotation of the biggest gear to the rate of rotation of the motor.
Let x, y, and z be the angular positions of the three gears. Then by the chain rule,
\begin{align*} \frac{dz}{dx} &=\frac{dz}{dy}\cdot \frac{dy}{dx} \\ &=\frac{13}{21}\cdot \frac{21}{42} \\ &= \frac{13}{42} \end{align*}

media/image9.png
Figure 2.10
 
 

The chain rule lets us find the derivative of a function that has been built out of one function stuck inside another.

Example

Find the derivative of the function z(x)=\textrm{sin}(x^2).

Let y(x)=x^2, so that z(x)=\textrm{sin}(y(x)). Then\begin{align*} \frac{dz}{dx} &=\frac{dz}{dy}\cdot \frac{dy}{dx} \\ &= \textrm{cos }(y)\cdot 2x\\ &= 2x \textrm{ cos}(x^2) \end{align*}The way people usually say it is that the chain rule tells you to take the derivative of the outside function, the sine in this case, and then multiply by the derivative of “the inside stuff,” which here is the square. Once you get used to doing it, you don’t need to invent a third, intermediate variable, as we did here with y.

Example

Let’s express the chain rule without the use of the Leibniz notation. Let the function f be defined by f(x)=g(h(x)). Then the derivative of f is given by f'(x)=g'(h(x))\cdot h'(x)

Example

We’ve already proved that the derivative of t^k is kt^{k-1} for k=-1 (Example) and for k=1,2,3,... (Derivatives of polynomials). Use these facts to extend the rule to all integer values of k.

For k<0, the function x=t^k can be written as x=(t^{-1})^{-k} , where -k is positive. Applying the chain rule, we find dx/dt=(-k)(t^{-1})&^{-k-1}(-t^{-2})=kt^{k-1}.