CHAPTER 4: MATHEMATICAL MODELING WITH MATLAB
Lecture 4.1: Finite difference approximations for numerical derivatives
Forward, backward, and central differences for derivatives:
Problem: Given a set of data points near the point (x0,y0):
…; (x-2,y-2);
(x-1,y-1); (x0,y0); (x1,y1);
(x2,y2); …
Suppose
the data values represent values of a function y = f(x). Find a
numerical approximation for derivatives f'(x0), f''(x0),
… of the function y = f(x) at the point (x0,y0).
Example: Linear electrical circuits
consist of resistors, capacitors, inductors, and voltage and current sources.
In a simple resistor-inductor (RL) one-port network driven by a current source,
the voltage V = V(t) develops across the port terminals when a current I = I(t)
is applied to the input port. The output voltage V(t) is the sum of the voltage
drop R I(t) across the resistor and the voltage drop L I'(t) across the inductor.
The derivative I'(t) is to be found from the input current I(t) measured at
different time instants.
Solution:
The first derivative f'(x0) of the function y = f(x) at the point (x0,y0) can be
approximated by the slope of a secant line that passes through two data points
(piecewise linear interpolation). Depending on whether the points are taken to
the right of the point (x0,y0) (future data), to the left of the point (x0,y0)
(past data), or on both sides, the slope of the secant line is called the
forward, backward, or central difference approximation.
Forward difference approximation: The secant line passes through the points (x0,y0) and (x1,y1):
f'(x0) ≈ Dforward(f;x0) = (y1 – y0)/(x1 – x0)
Forward differences are useful in solving initial-value problems for differential
equations by single-step predictor-corrector methods (such as Euler methods).
Given the values f'(x0) and f(x0), the forward difference approximates the value f(x1).
Backward difference approximation: The secant line passes through the points (x-1,y-1) and (x0,y0):
f'(x0) ≈ Dbackward(f;x0) = (y0 – y-1)/(x0 – x-1)
Backward differences are useful for approximating derivatives when data values
are available in the past but not in the future (as in secant methods for root
finding and in control problems). Given the values f(x-1) and f(x0), the backward
difference approximates the derivative f'(x0), which can in turn be used to
estimate f(x1) when it depends on f'(x0).
Central difference approximation: The secant line passes through the points (x-1,y-1) and (x1,y1):
f'(x0) ≈ Dcentral(f;x0) = (y1 – y-1)/(x1 – x-1)
Central differences are useful in solving boundary-value problems for differential
equations by finite difference methods. When f'(x0) occurs in a differential
equation or a boundary condition, the central difference relates the unknown
values f(x-1) and f(x1) by a linear algebraic equation.
h = 0.1; x0 = 1; x_1 = x0-h; x1 = x0+h;   % the three data points are located at equal distance h (the step size)
y0 = exp(10*x0); y_1 = exp(10*x_1); y1 = exp(10*x1); yDexact = 10*exp(10*x0);
yDforward = (y1-y0)/h;      % simple form of forward difference for equally spaced grid
yDbackward = (y0-y_1)/h;    % simple form of backward difference
yDcentral = (y1-y_1)/(2*h); % simple form of central difference
fprintf('Exact = %6.2f\nForward = %6.2f\nBackward = %6.2f\nCentral = %6.2f',yDexact,yDforward,yDbackward,yDcentral);
Exact = 220264.66
Forward = 378476.76
Backward = 139233.82
Central = 258855.29
Errors of numerical differentiation:
Numerical differentiation is an inherently ill-conditioned process. Two factors
determine the error induced when the derivative f'(x0) is replaced by a
difference approximation: the truncation error and the rounding error.
Consider
the equally spaced data points with constant step size: h = x1
– x0 = x0 – x-1. The theory based on the Taylor expansion
method shows the following truncation errors:
·
Forward difference approximation:
f'(x0) – Dforward(f;x0) = –(h/2) f''(ξ), ξ ∈ [x0,x1]
The truncation error of the forward difference approximation is proportional to h,
i.e. it has the order O(h). The error is also proportional to the second derivative
of the function f(x) at an interior point ξ of the forward difference interval.
·
Backward difference approximation:
f'(x0) – Dbackward(f;x0) = (h/2) f''(ξ), ξ ∈ [x-1,x0]
The truncation error of the backward difference approximation is as large as that of
the forward difference approximation: it also has the order O(h) and is proportional
to the second derivative of the function f(x) at an interior point ξ of the backward
difference interval.
·
Central difference approximation:
f'(x0) – Dcentral(f;x0) = –(h^2/6) f'''(ξ), ξ ∈ [x-1,x1]
The truncation error of the central difference approximation is proportional to h^2
rather than h, i.e. it has the order O(h^2). The error is also proportional to the
third derivative of the function f(x) at an interior point ξ of the central
difference interval. The central difference approximation is the average of the
forward and backward differences. It produces a much more accurate approximation of
the derivative for a given small value of h than the forward and backward differences
do. If data values are available both to the right and to the left of the point
(x0,y0), the central difference approximation is preferable.
h = logspace(-12,-1,100);  % vector of step sizes (an assumed range; the plot needs h as a vector)
x0 = 1; x_1 = x0-h; x1 = x0+h;
y0 = exp(10*x0); y_1 = exp(10*x_1); y1 = exp(10*x1); yDe = 10*exp(10*x0);
yDf = (y1-y0)./h; yDb = (y0-y_1)./h; yDc = (y1-y_1)./(2*h);
eDf = abs(yDf-yDe); eDb = abs(yDb-yDe); eDc = abs(yDc-yDe);
loglog(h,eDf,'g:',h,eDb,'b--',h,eDc,'r');
If
the step size h between two points becomes smaller, the truncation
error of the difference approximation decreases. It decreases faster for
central difference approximations and slower for forward and backward
difference approximations. For example, if h is reduced by a
factor of 10, the truncation error of the central difference
approximation is reduced by a factor of 100, while the truncation
errors of the forward and backward differences are reduced only by a factor of 10.
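The reduction factors can be checked directly. A minimal sketch, reusing the test function f(x) = exp(10x) at x0 = 1 from the example above with two assumed step sizes:
f = @(x) exp(10*x); x0 = 1; fD = 10*exp(10*x0);  % test function and its exact derivative
for h = [0.01 0.001]                             % reduce the step size by a factor of 10
    eF = abs((f(x0+h)-f(x0))/h - fD);            % forward difference error, O(h)
    eC = abs((f(x0+h)-f(x0-h))/(2*h) - fD);      % central difference error, O(h^2)
    fprintf('h = %6.4f  forward error = %10.3e  central error = %10.3e\n',h,eF,eC);
end
The forward error drops roughly by a factor of 10, while the central error drops roughly by a factor of 100.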
When h becomes too small, the difference approximations are taken between almost
equal values of f(x) at the two points. Any rounding error in the computation of
f(x) is magnified by a factor of 1/h. As a result, the rounding error grows as h
becomes very small. An optimal step size h = hopt can be computed by minimizing
the sum of the truncation and rounding errors:
·
Forward difference approximation:
eforward = | f'(x0) – Dforward(f;x0) | ≤ (h/2) M2 + 2ε/h
where M2 = max | f''(x) | and ε is the rounding error in the computed values of f(x)
(machine precision). The minimum of the error occurs for h = hopt = 2 (ε/M2)^(1/2),
when eforward = 2 (ε M2)^(1/2).
·
Central difference approximation:
ecentral = | f'(x0) – Dcentral(f;x0) | ≤ (h^2/6) M3 + 2ε/h
where M3 = max | f'''(x) |. The minimum of the error occurs for h = hopt = (6ε/M3)^(1/3),
when ecentral = 3 (ε^2 M3/6)^(1/3).
M2 = 100*exp(10*(x0+0.1));   % M2 = max|f''(x)| for f(x) = exp(10x) on [x0, x0+0.1]
M3 = 1000*exp(10*(x0+0.1));  % M3 = max|f'''(x)|
hoptForward = 2*(eps/M2)^(1/2)
hoptCentral = (6*eps/M3)^(1/3)
eoptForward = 2*(eps*M2)^(1/2)
eoptCentral = 3*(eps^2*M3/6)^(1/3)
hoptCentral = 2.8127e-008
eoptForward = 7.2924e-005
eoptCentral = 2.3683e-008
Example: Two central difference approximations are applied to compute the
derivative of the current I'(t): green pluses are for h = 10 and blue dots are
for h = 5. The exact derivative I'(t) is shown by a red solid curve.
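The current I(t) used in the figure is not reproduced here, so the sketch below substitutes an assumed signal I(t) = 2 sin(0.05 t) purely for illustration; only the structure of the computation (central differences at two step sizes compared with the exact derivative) follows the example:
I  = @(t) 2*sin(0.05*t);           % assumed input current (illustration only)
dI = @(t) 0.1*cos(0.05*t);         % its exact derivative
t10 = 10:10:190; t5 = 5:5:195;     % interior time instants for h = 10 and h = 5
D10 = (I(t10+10)-I(t10-10))/20;    % central differences with h = 10
D5  = (I(t5+5)-I(t5-5))/10;        % central differences with h = 5
tt = 0:200;
plot(t10,D10,'g+',t5,D5,'b.',tt,dI(tt),'r');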
Newton forward difference interpolation:
Problem: Given a set of (n+1) data points:
(x1,y1);
(x2,y2); …; (xn,yn); (xn+1,yn+1)
Find
a Newton polynomial of degree n, y = Pn(x), that passes through all (n+1) data
points. The Newton polynomial is a discrete Taylor polynomial of the form:
y = Pn(x) = c0 + c1 (x – x1) + c2 (x – x1)(x – x2) + … + cn (x – x1)(x – x2)…(x – xn)
where the coefficients cj are
constructed from diagonal divided differences.
Comparison:
Lagrange polynomial interpolation is convenient when the same grid points
[x1,x2,…,xn+1] are repeatedly used in several applications: the data values can be
stored in computer memory to reduce the number of computations. The Lagrange
interpolation is not convenient, however, when data points are added or removed to
improve the appearance of the interpolating curve; the whole construction has to be
recomputed every time the data points change. The Newton polynomial interpolation is
more convenient for adding and deleting data points (see the sketch below). It is
also more convenient for algorithmic evaluation by Horner's nested multiplication.
Of course, the Newton interpolating polynomial coincides with the Vandermonde and
Lagrange interpolating polynomials for a given data set, since the interpolating
polynomial y = Pn(x) of degree n connecting (n+1) data points is unique. The
difference between the Vandermonde, Lagrange, and Newton interpolating polynomials
lies only in computational aspects.
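A minimal sketch of this point, using the data set of the example below together with an assumed extra point (xNew,yNew): when a data point is appended, only one new bottom row of the divided difference table has to be computed, and the previously found coefficients remain unchanged.
x = [-1,-0.75,-0.5,-0.25,0]; y = [-14.58,-6.15,-1.82,-0.23,0];
n = length(x)-1;
D = zeros(n+1); D(:,1) = y';                  % table of divided differences
for k = 1 : n
    D(k+1:n+1,k+1) = (D(k+1:n+1,k)-D(k:n,k))./(x(k+1:n+1)-x(1:n-k+1))';
end
c = diag(D)                                   % Newton coefficients for the 5 points
xNew = 0.25; yNew = 1.0;                      % assumed sixth data point
x = [x, xNew]; D(n+2,1) = yNew;
for k = 1 : n+1                               % only the new bottom row is computed
    D(n+2,k+1) = (D(n+2,k)-D(n+1,k))/(x(n+2)-x(n+2-k));
end
cNew = diag(D)                                % old coefficients reused, one new entry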
Solution:
The coefficients cj are to be found from the conditions Pn(xk) = yk, k = 1,…,n+1,
which result in a linear system Ac = y. The Gauss elimination algorithm or an
induction argument can be used to prove that cj = dj,j, where dj,j are the diagonal
entries in the table of forward divided differences:
grid points | data values | first differences | second differences | third differences | fourth differences
x1 | y1 |  |  |  |
x2 | y2 | f[x1,x2] |  |  |
x3 | y3 | f[x2,x3] | f[x1,x2,x3] |  |
x4 | y4 | f[x3,x4] | f[x2,x3,x4] | f[x1,x2,x3,x4] |
x5 | y5 | f[x4,x5] | f[x3,x4,x5] | f[x2,x3,x4,x5] | f[x1,x2,x3,x4,x5]
Zeroth-order divided differences f[xk] = yk are simply the values of the function
y = f(x) at the points (xk,yk). First divided differences
f[xk,xk+1] = (yk+1 – yk)/(xk+1 – xk)
are forward difference approximations of the derivative of the function y = f(x)
at (xk,yk). Second, third, and higher-order forward divided differences are
constructed by the recursive rule:
f[xk,xk+1,…,xk+m] = ( f[xk+1,…,xk+m] – f[xk,…,xk+m-1] ) / (xk+m – xk)
With the use of the divided differences cj-1 = f[x1,x2,…,xj], the Newton
interpolating polynomial y = Pn(x) has the explicit representation:
Pn(x) = f[x1] + f[x1,x2] (x – x1) + f[x1,x2,x3] (x – x1)(x – x2) + … + f[x1,x2,…,xn+1] (x – x1)(x – x2)…(x – xn)
% example of explicit computation of coefficients of Newton polynomials
x = [-1,-0.75,-0.5,-0.25,-0]; n = length(x)-1;
y = [-14.58,-6.15,-1.82,-0.23,-0.00];
A = ones(n+1,1);   % the coefficient matrix for the linear system Ac = y
for j = 1 : n
    A = [A, A(:,j).*(x'-x(j))];
end
A, c = A\(y'); c = c'
A =
    1.0000         0         0         0         0
    1.0000    0.2500         0         0         0
    1.0000    0.5000    0.1250         0         0
    1.0000    0.7500    0.3750    0.0938         0
    1.0000    1.0000    0.7500    0.3750    0.0938
c =
  -14.5800   33.7200  -32.8000   14.5067    0.2133
function [yi,c] = NewtonInter(x,y,xi)
% Newton interpolation algorithm
% x,y - row-vectors of (n+1) data values (x,y)
% xi - a row-vector of x-values, where interpolation is to be found
% yi - a row-vector of interpolated y-values
% c - coefficients of Newton interpolating polynomial
n = length(x) - 1;  % the degree of the interpolating polynomial
ni = length(xi);    % the number of x-values where interpolation is to be found
D = zeros(n+1,n+1); % the matrix for Newton divided differences
D(:,1) = y';        % zero-order divided differences
for k = 1 : n
    D(k+1:n+1,k+1) = (D(k+1:n+1,k)-D(k:n,k))./(x(k+1:n+1)-x(1:n-k+1))';
end
c = diag(D);
% computation of values of the Newton interpolating polynomial at values of xi
% the algorithm uses Horner's rule for polynomial evaluation
yi = c(n+1)*ones(1,ni); % initialization of yi with the highest-degree coefficient
for k = 1 : n
    yi = c(n+1-k)+yi.*(xi-x(n+1-k)); % nested multiplication
end
x = [-1,-0.75,-0.5,-0.25,-0]; y = [-14.58,-6.15,-1.82,-0.23,-0.00];
xInt = -1 : 0.01 : 0;
[yInt,c] = NewtonInter(x,y,xInt);
c = c', plot(xInt,yInt,'g',x,y,'b*');
c = -14.5800   33.7200  -32.8000   14.5067    0.2133
MATLAB finite differences:
Let the data points x1,x2,…,xn,xn+1 be equally spaced with constant step size
h = x2 – x1. Then, the divided differences can be rewritten as:
f[xk,xk+1,…,xk+m] = Δ^m yk / (m! h^m)
where Δ^m yk is the m-th forward difference of the function y = f(x) at the point
(xk,yk). For example, at the point (x0,y0):
Δ y0 = y1 – y0
Δ^2 y0 = y2 – 2 y1 + y0
Δ^3 y0 = y3 – 3 y2 + 3 y1 – y0
Δ^4 y0 = y4 – 4 y3 + 6 y2 – 4 y1 + y0
Δ^5 y0 = y5 – 5 y4 + 10 y3 – 10 y2 + 5 y1 – y0
Derivatives of the interpolating polynomial y = Pn(x) approximate derivatives of
the function y = f(x). Matching the n-th derivative of the polynomial y = Pn(x)
with f(n)(x0), we find the forward difference approximation for higher-order
derivatives:
f(n)(x0) ≈ Δ^n y0 / h^n
MATLAB has built-in functions for computing finite differences:
·
diff(y): computes the first-order forward difference of a given vector y
·
diff(y,n): computes the n-th order forward difference of a given vector y
·
gradient(u): computes numerical derivatives of a matrix u in the horizontal and vertical directions to approximate the x- and y-derivatives of u(x,y): grad(u) = [ux,uy] (central differences at interior points, one-sided differences at the edges)
·
del2(u): computes a discrete approximation of the Laplacian ∇²u = uxx + uyy of a matrix u (del2 returns (uxx + uyy)/4 for a two-dimensional matrix)
n = 50; x = linspace(0,2*pi,n); h = x(2)-x(1); y = sin(x); % function
y1 = diff(y)/h; y2 = diff(y,2)/h^2; y1ex = cos(x); y2ex = -sin(x); % derivatives
plot(x(1:n-1),y1,'b',x,y1ex,':r',x(1:n-2),y2,'g',x,y2ex,':r');
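A small sketch of gradient and del2 on an assumed test surface u(x,y) = x^2 + y^2, whose exact gradient is [2x, 2y] and whose exact Laplacian is 4:
h = 0.1; [X,Y] = meshgrid(-1:h:1,-1:h:1);
U = X.^2 + Y.^2;                  % sampled surface
[Ux,Uy] = gradient(U,h);          % central differences in the interior, one-sided at the edges
L = 4*del2(U,h);                  % del2 returns (uxx+uyy)/4 in two dimensions
errGrad = max(abs(Ux(:)-2*X(:)))  % compare with the exact x-derivative
errLap  = max(abs(L(:)-4))        % compare with the exact Laplacian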
Hierarchies of higher-order difference approximations:
Newton interpolating polynomials and divided difference tables can also be
constructed with backward differences, since the order of the data points
x1,x2,…,xn,xn+1 is arbitrary. By arranging the data points in descending order,
the Newton polynomial represents the backward differences. It is trickier to
construct tables for central differences. Since central differences are the most
accurate approximations, special algorithms are designed to automate the
derivation of coefficients of central difference approximations for higher-order
derivatives.
·
hierarchy of forward differences:
Let the data points be equally spaced with constant step size h. Fix a point
(x0,y0) and write the forward difference approximation for the n-th derivative as
an inner product with the vector of data values y = [y0,y1,y2,…]':
f(n)(x0) ≈ Dn·y / h^n
where Dn is the row vector of n-th forward difference coefficients.
n = 100; A = diag(ones(n-1,1),1)-diag(ones(n,1)); % forward difference operator: (A*y)k = y(k+1)-y(k)
m = 8; B = A;
for k = 1 : m-1
    D(k,:) = B(1,1:m);   % coefficients of the k-th forward difference
    B = B*A;
end
D = D
D =
    -1     1     0     0     0     0     0     0
     1    -2     1     0     0     0     0     0
    -1     3    -3     1     0     0     0     0
     1    -4     6    -4     1     0     0     0
    -1     5   -10    10    -5     1     0     0
     1    -6    15   -20    15    -6     1     0
    -1     7   -21    35   -35    21    -7     1
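Using the matrix D computed above, the k-th row multiplied by the vector of data values and divided by h^k gives the forward difference approximation of f(k)(x0). A small check with an assumed test function f(x) = sin(x) at x0 = 0:
h = 0.01; x0 = 0; y = sin(x0 + (0:7)*h);   % grid values y0,y1,...,y7
d2 = D(2,:)*y'/h^2     % row [1 -2 1 0 ...], approximates f''(0) = 0
d3 = D(3,:)*y'/h^3     % row [-1 3 -3 1 0 ...], approximates f'''(0) = -1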
·
hierarchy of central differences
Let the data points be equally spaced with constant step size h. Fix a point
(x0,y0) and write the central difference approximations for the odd and even
order derivatives as inner products with the vector of data values
y = [ …,y-2,y-1,y0,y1,y2,…]':
f(2m-1)(x0) ≈ D2m-1·y / (2 h^(2m-1));   f(2m)(x0) ≈ D2m·y / h^(2m)
where D2m-1 and D2m are the row vectors of central difference coefficients.
n = 100; A = diag(ones(n-1,1),1)-diag(ones(n-1,1),-1);           % first central difference operator
A2 = diag(ones(n-1,1),1)+diag(ones(n-1,1),-1)-2*diag(ones(n,1)); % second central difference operator
m = 4; B = A; C = A2; k = 1;
clear D   % discard D left over from the forward-difference example, if any
while (k < (2*m-2) )
    D(k,:) = B(m,1:2*m-1);
    D(k+1,:) = C(m,1:2*m-1);
    B = B*A2; C = C*A2;
    k = k+2;
end
D = D
D =
     0     0    -1     0     1     0     0
     0     0     1    -2     1     0     0
     0    -1     2     0    -2     1     0
     0     1    -4     6    -4     1     0
    -1     4    -5     0     5    -4     1
     1    -6    15   -20    15    -6     1
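Using the matrix D computed above, the rows again act as inner products with the data vector y = [y-3,…,y3]; the odd-order rows need the extra factor 1/2 in the normalization. A small check with an assumed test function f(x) = sin(x) at x0 = pi/4:
h = 0.01; x0 = pi/4; y = sin(x0 + (-3:3)*h);  % grid values y-3,...,y3
d1 = D(1,:)*y'/(2*h)   % row [0 0 -1 0 1 0 0], approximates f'(x0) = cos(pi/4)
d2 = D(2,:)*y'/h^2     % row [0 0 1 -2 1 0 0], approximates f''(x0) = -sin(pi/4)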
Richardson extrapolation for higher-order differences:
Recursive difference formulas for derivatives can be obtained by canceling the
truncation error at each order of numerical approximation. This method is called
Richardson extrapolation. It can be used only if the data values are equally
spaced with constant step size h.
·
Recursive forward differences:
The hierarchy of forward differences for first and higher-order derivatives has a
truncation error of order O(h). Denote the forward difference approximation of a
derivative f(n)(x0) by D1(h). Compute the approximation with two step sizes h and 2h:
f(n)(x0) = D1(h) + α h;   f(n)(x0) = D1(2h) + 2α h
where α is the unknown coefficient of the truncation error. By cancelling the
truncation error of order O(h), we define a new forward difference approximation
for the same derivative:
f(n)(x0) ≈ 2 D1(h) – D1(2h) = D2(h)
The new forward difference approximation D2(h) for the same derivative is more
accurate since the truncation error is of order O(h^2).
For the first derivative, the first-order approximation D1(h) is a two-point
divided difference, while the second-order approximation D2(h) is a three-point
divided difference:
D1(h) = (y1 – y0)/h,   D2(h) = (– 3 y0 + 4 y1 – y2)/(2h)
For the second derivative, the first-order approximation D1(h) is a three-point
divided difference, while the second-order approximation D2(h) is a four-point
divided difference:
D1(h) = (y0 – 2 y1 + y2)/h^2,   D2(h) = (7 y0 – 16 y1 + 10 y2 – y4)/(4h^2)
The process can be continued to find a higher-order forward difference
approximation Dm(h) with truncation error O(h^m). The recursive formula for
Richardson forward difference extrapolation is:
Dm+1(h) = Dm(h) + ( Dm(h) – Dm(2h) ) / (2^m – 1)
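A minimal sketch of this recursion for the first derivative, with an assumed test function f(x) = exp(x) at x0 = 0 (exact derivative 1); the first column holds forward differences with step sizes h, 2h, 4h, 8h, and each further column applies the extrapolation formula:
f = @(x) exp(x); x0 = 0; h = 0.1; m = 4;
R = zeros(m);                              % R(k,j) corresponds to Dj(2^(k-j) h)
for k = 1 : m
    hk = 2^(k-1)*h;                        % step sizes h, 2h, 4h, 8h
    R(k,1) = (f(x0+hk)-f(x0))/hk;          % first-order forward differences D1
end
for j = 2 : m
    for k = j : m
        R(k,j) = R(k-1,j-1) + (R(k-1,j-1)-R(k,j-1))/(2^(j-1)-1);
    end
end
R, err = abs(diag(R)'-1)                   % diagonal entries D1(h),...,Dm(h) and their errors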
·
Recursive central differences:
The hierarchy of central differences for first and higher-order derivatives has a
truncation error of order O(h^2). Denote the central difference approximation of
a derivative f(n)(x0) by D1(h). Compute the approximation with two step sizes h and 2h:
f(n)(x0) = D1(h) + α h^2;   f(n)(x0) = D1(2h) + 4α h^2
where α is the unknown coefficient of the truncation error. By cancelling the
truncation error of order O(h^2), we define a new central difference approximation
for the same derivative:
f(n)(x0) ≈ ( 4 D1(h) – D1(2h) ) / 3 = D2(h)
The new central difference approximation D2(h) for the same derivative is more
accurate since the truncation error is of order O(h^4).
For the first derivative, the first-order approximation D1(h) is a three-point
divided difference, while the second-order approximation D2(h) is a five-point
divided difference:
D1(h) = (y1 – y-1)/(2h),   D2(h) = (– y2 + 8 y1 – 8 y-1 + y-2)/(12h)
For the second derivative, the first-order approximation D1(h) is a three-point
divided difference, while the second-order approximation D2(h) is a five-point
divided difference:
D1(h) = (y1 – 2 y0 + y-1)/h^2,   D2(h) = (– y2 + 16 y1 – 30 y0 + 16 y-1 – y-2)/(12h^2)
The process can be continued to find a higher-order central difference
approximation Dm(h) with truncation error O(h^(2m)). The recursive formula for
Richardson central difference extrapolation is:
Dm+1(h) = Dm(h) + ( Dm(h) – Dm(2h) ) / (4^m – 1)
·
Numerical algorithm
In order to compute the central difference approximation of a derivative f(n)(x0)
up to order m, central difference approximations of lower order are computed with
larger step sizes: h, 2h, 4h, 8h, …, 2^(m-1) h.
These
approximations can be arranged in a table of recursive derivatives:
step size | D1 | D2 | D3 | D4 | D5
h | D1(h) |  |  |  |
2h | D1(2h) | D2(h) |  |  |
4h | D1(4h) | D2(2h) | D3(h) |  |
8h | D1(8h) | D2(4h) | D3(2h) | D4(h) |
16h | D1(16h) | D2(8h) | D3(4h) | D4(2h) | D5(h)
The diagonal entries are the values of higher-order central difference
approximations of the derivative f(n)(x0), where x0 is the central point
surrounded by the points x = x0 – 2^(k-1) h and x = x0 + 2^(k-1) h for
k = 1,2,…,m. The higher-order approximation Dk(h) has the truncation error
O(h^(2k)). If h is small, the truncation error rapidly decreases with larger k.
However, the rounding error grows with larger values of k. An optimal order m
exists at the minimum of the sum of the truncation and rounding errors.
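A minimal sketch of this table for the first derivative, with an assumed test function f(x) = exp(x) at x0 = 0 (exact derivative 1); the first column holds three-point central differences with step sizes h, 2h, …, 16h, and the remaining columns are filled by the central Richardson recursion with the factor 4^m – 1:
f = @(x) exp(x); x0 = 0; h = 0.1; m = 5;
R = zeros(m);                                % R(k,j) corresponds to Dj(2^(k-j) h)
for k = 1 : m
    hk = 2^(k-1)*h;                          % step sizes h, 2h, 4h, 8h, 16h
    R(k,1) = (f(x0+hk)-f(x0-hk))/(2*hk);     % three-point central differences D1
end
for j = 2 : m
    for k = j : m
        R(k,j) = R(k-1,j-1) + (R(k-1,j-1)-R(k,j-1))/(4^(j-1)-1);
    end
end
R, err = abs(diag(R)'-1)                     % diagonal entries D1(h),...,D5(h) and their errors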
Example: The figure below presents the central difference approximations D1(h) and
D2(h) for the derivative of the current I(t) with step size h = 10. Blue circles
are obtained by the five-point central difference D2(h), green pluses by the
three-point central difference D1(h), and the exact derivative I'(t) is shown by a
red solid curve. The five-point difference approximation D2(h) is clearly more
accurate than the three-point difference D1(h).