3.6. The \(LU\) decomposition#
As we have seen, one way to solve a linear system \(A\vect{x}=\vect{b}\) is to row reduce it to echelon form and then use back substitution. See for instance Example 2.1.8.
In this section we will learn how to solve an \(m\times n\) linear system \(A\mathbf{x}=\mathbf{b}\) by decomposing (or factorising) a matrix into a product of two ‘special’ matrices \(L\) and \(U\).
This is called an \(LU\)-decomposition of the matrix \(A\).
Consider the system \(A\mathbf{x} = \mathbf{b}\) where
To start with, the matrix \(A\) can be factorized as
We will see how this helps to solve the system in an efficient way.
With this factorisation we can solve the system in two steps, using both backward and forward substitution. In fact, the matrix \(U\) is an echelon matrix equivalent to \(A\). To solve the equation \(LU\vect{x} = L(U\vect{x}) =\vect{b}\) we can introduce the auxiliary vector \(\vect{y}\) via
We then first solve
and after that we find \(\mathbf{x}\) by solving
for the solution \( \tilde{\vect{y}}\) of system (3.6.1). For this solution \( \tilde{\vect{x}}\) we then have
We illustrate matters with the linear system of Example 3.6.1.
The system \(A\vect{x} = \vect{b}\) of Example 3.6.1 can be written as
By setting
we have
With forward substitution we can find the solution:
Next we solve
with back substitution. This gives the solution
which is the solution to our original system. Since the factorisation was given, we did not have to solve the system from scratch by trying to find an echelon form.
Here is one to try for yourself.
To solve a system \(LU\vect{x} = \vect{b}\).
There are several methods for factorising matrices. The factorisations that we will see in this section use direct methods. Direct methods are methods that produce the exact factorisations in a finite number of steps.
In the next subsection we will address the questions of whether an \(LU\) decomposition always exists and if so, how to construct it. With an extra condition on \(L\) the decomposition, if it exists, will be unique. In the case where an \(LU\) decomposition does not exist we can instead consider the slightly more general \(PLU\) decomposition. In the remainder of the section we will consider the generalization to non-square matrices and we will analyze to which extent the \((P)LU\) decomposition can give an efficiency boost.
3.6.1. \(LU\) decomposition of a square matrix#
Let \(A\) be an \(n\times n\) matrix. An \(LU\) decomposition of \(A\) is a factorization of the type
where \(L\) is an \(n\times n\) lower triangular matrix with \(1\)s on the diagonal, and \(U\) an \(n\times n\) upper triangular matrix. So,
For convenience we have used blanks for zeros.
Note that the condition of \(1\)s on the diagonal implies that the matrix \(L\) is invertible.
The next proposition captures the main interest of the \(LU\) decomposition, as is already illustrated in Example 3.6.2.
Suppose that \(A=LU\), so that the linear system of equations \(A\mathbf{x}=\mathbf{b}\) can be written as \(LU\mathbf{x}=\mathbf{b}\). Then, by setting \(\mathbf{y} = U\mathbf{x}\), we can solve the linear system in two steps.
Solve the system \(L\mathbf{y}=\mathbf{b}\) and find \(\mathbf{y}\).
Note that this system has a unique solution.
Solve the system \(U\mathbf{x}=\mathbf{y}\) to find \(\mathbf{x}\).
The solution of the second system is then a solution of the system \(A\mathbf{x}=\mathbf{b}\).
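For readers who want to see these two steps in code, here is a minimal sketch in Python/NumPy (the function names are ours; \(L\) is assumed to have \(1\)s on its diagonal and \(U\) to be invertible):

```python
import numpy as np

def forward_substitution(L, b):
    """Solve L y = b for a lower triangular L with 1s on the diagonal."""
    n = len(b)
    y = np.zeros(n)
    for i in range(n):
        # subtract the contributions of the y's that are already known
        y[i] = b[i] - L[i, :i] @ y[:i]
    return y

def backward_substitution(U, y):
    """Solve U x = y for an invertible upper triangular U."""
    n = len(y)
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (y[i] - U[i, i+1:] @ x[i+1:]) / U[i, i]
    return x

# Step 1: y = forward_substitution(L, b)
# Step 2: x = backward_substitution(U, y) gives a solution of A x = b
```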
The next proposition tells us which matrices do have an \(LU\) decomposition.
A matrix \(A\) can be written as \(A = LU\), with \(L\) and \(U\) as described in Definition 3.6.1, if and only if it can be row reduced to an echelon matrix using only additions of multiples of rows to rows below them. We will call this a top-down row reduction.
Instead of giving a formal proof, we will illustrate matters first with an example. In this example you will see how an \(LU\) decomposition of a matrix is found via a top-down row reduction that keeps track of the row operations involved.
We consider the matrix
To find an \(LU\) decomposition we will work towards an echelon form in the top-down direction. In addition, we will keep track of the actions we perform. For the first step,
The numbers \(2/3\) and \(1/3\) are called multipliers.
A second row reduction step, involving the multiplier \(1/2\), leads to an echelon/upper triangular matrix.
We can carry out these row operations by means of matrix multiplications.
In the same way the second row reduction step can be described by
The echelon matrix \(A_3\) can act as our upper triangular matrix \(U\), and the above computations can be summarized as
where \(F = F_2F_1\).
We conclude that
which is indeed a product of a lower triangular matrix \(L\) (with 1’s on its diagonal) and an upper triangular matrix \(U\).
The above procedure shows how in principle an \(LU\) decomposition can be computed. However, writing down the explicit elementary-like matrices \(F_i\), computing their product \(F\), and finding the inverse of \(F\) is unnecessary. Look at the matrix \(L = F^{-1}\) we found! The entries below the diagonal are exactly the numbers \(2/3\), \(1/3\) and \(1/2\) that we called the multipliers. The following algorithm describes this ‘shortcut’ to find an \(LU\) decomposition.
Suppose the \(n\times n\) matrix \(A\) can be row reduced top-down to the echelon matrix \(U\). If the numbers \(m_{jk}\) denote the multiples of the \(k\)th row that are subtracted from the rows below it in the \(k\)th step (so \(1 \leq k < j \leq n\)), then
Another look at Example 3.6.3 explains why the detour via the matrices \(F_i\) and the inverse of their product can be skipped, as is expressed in the above algorithm.
For the matrix \(A= \begin{bmatrix} 3 & 1 & -2 \\ 2 & 4 & 1 \\ 1 & 2 & 1 \end{bmatrix}\), the same as in Example 3.6.3, the ‘trick’ is not to work with the matrices \(F_1\) and \(F_2\), but with their inverses.
The first row reduction step
can be rewritten as
Likewise the second row reduction step
can be represented as
Combining the two equations gives
We see the multipliers nicely fall into place!
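The bookkeeping of Algorithm 3.6.2 is easy to automate. Below is a minimal Python sketch (our own naming, not a library routine) that performs the top-down row reduction and stores each multiplier \(m_{jk}\) directly in position \((j,k)\) of \(L\); it assumes that no zero pivot is encountered, i.e., that a top-down reduction is indeed possible. Note that the code uses \(0\)-based indices.

```python
import numpy as np

def lu_no_pivoting(A):
    """Top-down row reduction; assumes all pivots are nonzero.
    Returns L (unit lower triangular) and U (upper triangular) with A = L U."""
    U = np.array(A, dtype=float)
    n = U.shape[0]
    L = np.eye(n)
    for k in range(n - 1):
        for j in range(k + 1, n):
            m = U[j, k] / U[k, k]       # the multiplier m_{jk}
            U[j, k:] -= m * U[k, k:]    # subtract m times row k from row j
            L[j, k] = m                 # ... and store m in position (j, k) of L
    return L, U

A = np.array([[3.0, 1, -2], [2, 4, 1], [1, 2, 1]])   # the matrix of Example 3.6.3
L, U = lu_no_pivoting(A)
print(np.allclose(L @ U, A))                          # True
```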
Here is an example where we apply the algorithm without further ado to a \(4 \times 4\) matrix.
We use the \(LU\)-algorithm for the matrix \(A = \begin{bmatrix} 2 & 1 & -2 & 3 \\ 2 & -3 & -4 & 7 \\ -4 & 0 & -2 & -5 \\ -6 & -1 & 8 & -8 \end{bmatrix}.\)
We row reduce the matrix \(A\) top-down, diligently registering the multipliers. Recall that the multiplier \(m_{jk}\) is defined as the multiple of row \(k\) that is subtracted from row \(j\) in the \(k\)-th step of the reduction process.
Here we go:
Putting every multiplier in its right place gives
For the matrix \(A= \begin{bmatrix} 1 & 2 & 1 \\ 4 & 8 & 6 \\ 2 & 5 & 7 \end{bmatrix}\) the procedure breaks down at the second step.
The next logical step towards an echelon matrix would be a row swap. But then the lower triangular structure of \(L\) will be broken. In Subsection 3.6.2 we will study what we can do in such a situation.
Here’s one to try for yourself.
To compute the \(LU\) decomposition of a 3x3 matrix \(A\).
We collect a few properties of triangular matrices that are of value in their own right, and moreover will be used in the proofs of two properties of the \(LU\) decomposition.
Suppose \(A\) and \(B\) are lower triangular \(n \times n\) matrices with \(1\)s on their diagonals. Then the following properties hold.
\(AB\) is also a lower triangular matrix with \(1\)s on its diagonal.
\(A^{-1}\) is also a lower triangular matrix with \(1\)s on its diagonal.
The same properties hold when each instance of ‘lower’ in the above is replaced by ‘upper’.
There are several ways to prove Proposition 3.6.2. The best would be to think of a proof yourself, but you can also have a peek at the exposition below.
Proof of Proposition 3.6.2
Suppose \(A\) and \(B\) are lower triangular matrices with \(1\)s on their diagonals.
One way to prove that \(AB\) also has these properties is to use the column-row expansion of the product. (Cf. Exercise 3.2.8.)
Let \(A_{(k)}\) be the \(k\)th column of \(A\), and \(B^{(k)}\) the \(k\)th row of \(B\). Then\[\begin{split} A_{(k)} = \begin{bmatrix}0 \\ \vdots \\ 0 \\ a_{kk} \\ \vdots \\ a_{nk} \end{bmatrix} \quad \text{and} \quad B^{(k)} = \begin{bmatrix}b_{k1} & \cdots & b_{kk} & 0 & \cdots & 0 \end{bmatrix}, \end{split}\]where moreover \(a_{kk} = b_{kk} = 1\). So the \(k\)th term in the column-row expansion of \(AB\) becomes
\[\begin{split} \begin{array}{rcl} A_{(k)}B^{(k)} &=& \begin{bmatrix}b_{k1}A_{(k)} & b_{k2}A_{(k)} & \cdots & b_{kk}A_{(k)} & \vect{0} & \cdots & \vect{0}\end{bmatrix} \\ &=& \begin{bmatrix} 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \\ a_{kk}b_{k1} & a_{kk}b_{k2} & \cdots & a_{kk}b_{kk} & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ a_{nk}b_{k1} & a_{nk}b_{k2} & \cdots & a_{nk}b_{kk} & 0 & \cdots & 0 \end{bmatrix} \end{array}. \end{split}\]This is a lower triangular matrix with a \(1\) on position \((k,k)\) and for the rest \(0\)s on the diagonal. Adding these \(n\) column-row products gives a matrix of the required form.
Next suppose that \(A\) is a lower triangular matrix with \(1\)s on the diagonal. If we apply the algorithm of Proposition 3.4.7 to find the inverse using the augmented matrix
\[\begin{split} [\,A | I \,] = \left[\begin{array}{ccccc|ccccc} 1 & 0 & 0 & \cdots & 0 & 1 & 0 & 0 & \cdots & 0 \\ a_{21} & 1 & 0 & \cdots & 0 & 0 & 1 & 0 & \cdots & 0 \\ a_{31} & a_{32} & 1 & \cdots & 0 & 0 & 0 & 1 & \cdots & 0 \\ \vdots &\vdots & & \ddots & \vdots & \vdots & \vdots & & \ddots & \vdots \\[1ex] a_{n1} &a_{n2} & a_{n3} & \cdots & 1 & 0 & 0 & 0 & \cdots & 1 \end{array} \right], \end{split}\]to row reduce \(A\) to \(I\) we only need to subtract multiples of rows from rows below it. Thus in the matrix to the right of the bar, where the identity matrix is transformed into the inverse matrix of \(A\), the triangular structure with \(1\)s on the diagonal remains.
Readers who are not satisfied with two examples to show that Algorithm 3.6.2 works can have a look at the following (informal) proof of Proposition 3.6.1.
Proof of Proposition 3.6.1
Suppose \(A\) is an \(n\times n\) matrix that can be row reduced top-down to an echelon matrix \(U\) (which will then be upper triangular). We row reduce \(A\) from top to bottom, in the same fashion as in Example 3.6.4. For instance, the first two steps are
where \(A_2\) will be of the form
The next step will lead to
where \(A_3\) will be of the form
We thus end up with a product
where each matrix \(L_k\) contains the multipliers of the \(k\)th step in the process.
The crucial thing is that in the product \(L_{n-1}L_{n-2}\cdots L_2L_1\) in this order the multipliers do not ‘interact’. For instance, for \(n = 4\), we would get
The pattern is clear; we skip the technical details of the proof for general \(n\times n\) matrices.
To prove the converse, assume that \(A\) has an \(LU\) decomposition, i.e.,
We then have to show that \(A\) can be top-down row reduced to an echelon matrix.
By Proposition 3.6.2 the inverse of \(L\) has the same structure as \(L\), i.e.,
As in the first half of this proof (cf. (3.6.2)), \(L^{-1}\) can be factorized as
Pre-multiplication of a matrix \(M\) with one of the matrices \(L_k\) amounts to adding multiples of the \(k\)th row of \(M\) to the lower rows. So the product \(L_1L_2L_3\cdots L_{n-1}\) is a series of top-down operations, and
amounts to a top-down row reduction of \(A\) to the upper triangular matrix \(U\).
The uniqueness issue is not of major importance, but for completeness’ sake we phrase it in a proposition, followed by a possible argument to prove it.
If an invertible matrix \(A\) has an \(LU\) decomposition, then this decomposition is unique.
Proof of Proposition 3.6.3
Suppose \(A\) is an invertible matrix with two \(LU\) decompositions
We then have to show that
Since \(A\) is invertible, the matrices \(U_1, U_2\) are also invertible. (The matrices \(L_1\) and \(L_2\) are always invertible, cf. Definition 3.6.1.)
From
it follows that
From Proposition 3.6.2 we know that \(L_2^{-1}\) is a lower triangular matrix with \(1\)s on the diagonal, and that the product \(L_2^{-1}L_1\) is also of this form. At the same time this product equals \(U_2U_1^{-1}\), which is an upper triangular matrix. A matrix that is both lower and upper triangular with \(1\)s on its diagonal must be the identity matrix, so (3.6.3) implies that
from which the identities
immediately follow.
Lastly we give an example to show that the uniqueness may fail for a singular matrix.
For the matrix \(A = \begin{bmatrix} 1 & 1 & 1 \\ 2 & 2 & 2 \\ 3 & 3 & 3 \end{bmatrix}\) it holds that
where the parameter \(a\) can be chosen freely.
3.6.2. Generalization to non-square matrices and \(PLU\) decomposition#
In this section we describe what can be said regarding \(LU\) decompositions for non-square matrices, and present a ‘workaround’ for matrices for which there is no top-down row reduction to echelon form.
We start with an example to show that not much changes if we apply Algorithm 3.6.2 to non-square matrices.
The matrix \(A\) is given by
We can row reduce \(A\) top-down to an echelon matrix:
We then have
The two-step procedure as in Algorithm 3.6.1 to solve a linear system \(LU\vect{x} = \vect{b}\) is also feasible for non-square matrices, as the next example shows.
We will solve the equation
where
First we solve the system
Using forward substitution we find the solution
and then the system
gives the solution(s)
The generalization of Proposition 3.6.1 to non-square matrices is captured in the next proposition.
An \(m\times n\) matrix \(A\), with \(m \leq n\), can be written as \(A = LU\), with
-
\(L = \begin{bmatrix} 1 & \\ \ell_{21} & 1 & \\ \ell_{31} & \ell_{32} & 1 \\ \vdots & \vdots & \vdots & \ddots & \\ \ell_{m1} & \ell_{m2} & \ell_{m3} & \cdots & 1 \end{bmatrix}\),
an \(m\times m\) lower triangular matrix with \(1\)s on its diagonal,
and
-
\(U =\begin{bmatrix} u_{11} & u_{12} & u_{13} & \cdots &u_{1m} & \cdots & u_{1n} \\ & u_{22} & u_{23} & \cdots & u_{2m} & \cdots & u_{2n} \\ & & u_{33} & \cdots & u_{3m} & \cdots & u_{3n} \\ & & & \ddots & \vdots \\ & & & & u_{mm} & \cdots & u_{mn} \end{bmatrix}\) an echelon matrix
if and only if \(A\) can be row reduced top-down to the echelon matrix \(U\).
Moreover, if the rows of \(A\) are linearly independent the matrices \(L\) and \(U\) are unique.
A similar proposition holds for \(m \times n\) matrices with \(m > n\). However, in this case the systems \(A\vect{x} = \vect{b}\) are inconsistent for most vectors \(\vect{b}\), and then other techniques come into play (e.g., see Section 7.4.2).
In the remainder of this subsection we address the next best thing we can do in case a matrix does not have an \(LU\) decomposition.
Let us return to Example 3.6.6 where the matrix \(A\) cannot be written as \(LU\). We copy from there
We need a row swap to bring the last matrix to echelon form. The key idea is to perform all the row exchanges first and then add multiples of rows (top-down) to other rows to obtain an echelon form.
Here the one row exchange, swapping row 2 and row 3, is captured by the matrix
We get
This is what we will call a \(PLU\) decomposition (a permuted \(LU\) decomposition).
A series of row swaps may lead to any reordering of the rows of a matrix. In mathematics the word for reordering is permutation, and the action of reordering can be accomplished via a multiplication with a permutation matrix \(P\). Before we continue with the \(PLU\) decomposition we define this concept and derive some of its properties.
A permutation matrix is an \(n \times n\) matrix \(P\) with entries \(0\) and \(1\) only, in such a way that each row and each column contains exactly one \(1\).
Two \(4\times 4\) permutation matrices are
Note that for an arbitrary \(4 \times 4\) matrix \(A\) we have
and
In general (pre-)multiplication of a matrix \(A\) with any permutation matrix reorders the rows of \(A\). Try to convince yourself why this is so.
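A small NumPy experiment (ours, purely for illustration) confirms this, and also previews the properties listed below:

```python
import numpy as np

# a permutation matrix: row i of P @ A is the row of A selected by the 1 in row i of P
P = np.array([[0, 0, 1],
              [1, 0, 0],
              [0, 1, 0]])
A = np.arange(1, 10).reshape(3, 3)

print(P @ A)       # the rows of A, reordered
print(P.T @ P)     # the identity matrix: the inverse of P is its transpose
```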
The following properties of permutation matrices come in handy.
The product of two \(n\times n\) permutation matrices is again a permutation matrix.
The inverse of a permutation matrix is its transpose.
The following example provides some illustrations. And the interested reader may open the proof below it.
For the matrices \(P_1\) and \(P_2\) from (3.6.4) we have
and
Note that from the second identity it follows that \(P_2^{-1} = P_2^T\).
Proof of Proposition 3.6.5
The product of two \(n\times n\) permutation matrices is again a permutation matrix.
Suppose \(P_1\) and \(P_2\) are permutation matrices. In the product \(P_1P_2\) the rows of \(P_2\) are reordered, which leaves the defining properties (exactly one \(1\) in each row, exactly one \(1\) in each column, all other entries equal to \(0\)) intact.
The inverse of a permutation matrix is its transpose. Thus, \(P^{-1} = P^T\).
In a product \(A^TA\), the entry on position \((i,j)\) is the inner product of the \(i\)th column of \(A\) with the \(j\)th column of \(A\). (Cf. Exercise 3.2.4.) Since the \(n\) columns of a permutation matrix are the \(n\) columns \(\vect{e}_j\) of the identity matrix (in some order), and
\[\begin{split} (\vect{e}_i)^T\vect{e}_j =\vect{e}_i\ip\vect{e}_j = \left\{\begin{array}{l} 1, \,\,\text{if}\,\, i = j\\ 0, \,\,\text{if}\,\, i \neq j, \end{array} \right. \end{split}\]the result immediately follows.
(Existence of a \(PLU\) Decomposition)
Suppose that \(A\) is an \(m\times n\) matrix with real coefficients, and let \(m \leq n\). Then there exist a permutation matrix \(P\), a lower triangular matrix \(L\) and an echelon matrix \(U\) such that
As was mentioned before, the key idea is to perform the row exchanges first. These can be put together in the permutation matrix \(P\). The algorithm to actually find the \(LU\) decomposition of \(PA\) without doing the whole row reduction process for \(PA\) all over again is rather intricate, and in our view belongs in a course on numerical linear algebra. There it will be explained that it may be preferable to work towards a \(PLU\) decomposition instead of an \(LU\) decomposition even in cases where theoretically it is not necessary: for numerical reasons, having to do with the finite accuracy with which real numbers are represented in computers, it may be better to choose the pivots in another order than just top-down.
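Readers who want to experiment with \(PLU\) decompositions can use `scipy.linalg.lu`, which factorizes \(A = PLU\) with the pivot order chosen for numerical stability (so the outcome need not coincide with a decomposition computed by hand). A minimal sketch, applied to the matrix of Example 3.6.6 for which a pure \(LU\) decomposition does not exist:

```python
import numpy as np
from scipy.linalg import lu

A = np.array([[1.0, 2, 1],
              [4, 8, 6],
              [2, 5, 7]])          # the matrix of Example 3.6.6

P, L, U = lu(A)                    # SciPy selects the row exchanges (pivoting)
print(np.allclose(P @ L @ U, A))   # True: A = P L U
print(L)                           # unit lower triangular
print(U)                           # upper triangular / echelon form
```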
In the following example the matrix does have a proper \(LU\) decomposition, but we will row reduce it in another order than top-down to echelon form. It is a tiny example to illustrate that it is possible to deduce a \(PLU\) decomposition when we keep a record of both the multipliers and the positions of the pivots.
We will row reduce the matrix \( A= \begin{bmatrix}2&4&3 \\ 1&2&3\\1&3&2 \end{bmatrix}\) in an order other than top-down and extract a \(PLU\) decomposition from it.
where we have put boxes at the pivot positions.
If we put together the matrices that describe the row operations we get
If we had started from
using the same pivots, we would have arrived at
so
The matrix \(P\) is the inverse of (so also the transpose of) the matrix that describes the positions of the pivots as in Equation (3.6.5). It takes some careful analysis to reconstruct \(L\) from \(\tilde{L}\).
3.6.3. Grasple Exercises#
To identify triangular matrices.
To compute the \(LU\) decomposition of a 2x2 matrix \(A\).
To compute the \(LU\) decomposition of a 3x3 matrix \(A\).
To compute the \(LU\) decomposition of a 3x3 matrix \(A\) and use it to solve \(A\vect{x} = \vect{b}\).
To compute the \(LU\) decomposition of a 3x3 matrix \(A\) and use it to solve \(A\vect{x} = \vect{b}\).
To decide solving \(A\vect{x} = \vect{b}\) via (given) \(A=LU\) or (given) \(A^{-1}\).
To compute \(A^{-1}\) using \(A = LU\).
Explorative exercise about the \(LDU\) decomposition.
To compute the \(LU\) decomposition of 3x4 matrix \(A\) and use it to solve \(A\vect{x} = \vect{b}\).
To compute a \(PLU\) decomposition of a 3x3 matrix.
To solve a system \(A\vect{x} = \vect{b}\) using \( PA = LU\).
3.6.4. Efficiency Issues#
One way to measure the performance of an algorithm is to count the number of arithmetic operations that are necessary for solving a problem. As arithmetic operations we count additions/subtractions, multiplications and divisions.
Let us first compute this number when we solve the (square) linear system \(A\mathbf{x}=\mathbf{b}\) by taking the augmented matrix \([ A | \vect{b}]\), finding an echelon form and then using backward substitution. We suppose that the matrix \(A\) is invertible and possesses an \(LU\) decomposition.
In the worst-case scenario, for a \(3\times 3\) matrix \(A\) (so a \(3\times 4\) augmented matrix), we need the following numbers of arithmetic operations:
-
To convert the components \(a_{21}\) and \(a_{31}\) to a zero value, we need to
compute two multipliers \(m_{21}\) and \(m_{31}\) (2 divisions), then we need to
multiply each component of the first row, starting at \(a_{12}\), by each \(m_{i1}\) (this happens twice, so \(2\times 3=6\) products) and then we need to
subtract the resulting values from the corresponding components in each row (\(2\times 3=6\) subtractions).
Therefore, we need a total of \(14\) arithmetic operations (8 products/divisions and 6 additions/subtractions).
-
To convert the component \(a_{32}\) to a zero value, we need to compute the multiplier \(m_{32}\) (one division), then we need to multiply each component of the second row, starting at \(a_{23}\), by \(m_{32}\) (this asks for 2 products), and then we need to subtract the resulting values from the corresponding components in the third row (2 subtractions). This gives 5 arithmetic operations.
So just to bring the augmented matrix to an echelon form requires 19 arithmetic operations (11 multiplications/divisions and 8 additions/subtractions).
Next, to solve the system we use backward substitution.
-
to find \(x_3\) requires one division,
-
to find \(x_2\) requires one multiplication, one subtraction and one division,
-
to find \(x_1\) requires two multiplications, two subtractions and one division.
So backward substitution takes 9 arithmetic operations, and in total we needed \(19+9 = 28\) arithmetic operations.
Supposing that \(A=LU\) and that \(L\) and \(U\) are given, then, we solve first \(L\mathbf{y}=\mathbf{b}\) and then \(U\mathbf{x}=\mathbf{y}\).
-
For \(L\mathbf{y}=\mathbf{b}\) we use forward substitution. Since the elements on the main diagonal are ones, we need no operations to determine \(y_1\), one multiplication and one subtraction for \(y_2\), and two multiplications and two subtractions for \(y_3\). This totals 6 arithmetic operations.
-
To solve \(U\mathbf{x}=\mathbf{y}\) we use backward substitution, and we have just seen that it requires 9 arithmetic operations.
So when the matrix \(A\) is already \(LU\) factorised, the number of operations required to solve the system is significantly lower: 15 instead of 28 for this \(3\times 3\) system.
Similar computations for a non-singular \(n\times n\) matrix \(A\) lead to the following results.
Suppose \(A\) is an invertible \(n\times n\) matrix and \(\vect{b}\) an arbitrary vector in \(\R^n\).
Row reduction of \(A\) to echelon form requires \(\frac23n^3-\frac12n^2-\frac16n\) (arithmetic) operations.
Row reduction of the augmented matrix \([A | \vect{b}]\) to echelon form requires \(\frac23n^3+\frac12n^2-\frac76n\) operations.
Solving a linear system \(L\vect{y} = \vect{b}\) for an \(n \times n\) lower triangular matrix \(L\) with \(1\)s on the diagonal requires \(n(n-1)\) operations.
Solving a linear system \(U\vect{y} = \vect{b}\) for an \(n \times n\) upper triangular matrix \(U\) requires \(n^2\) operations.
From ii and iv it follows that solving the system \(A\vect{x} = \vect{b}\) by row reduction followed by backward substitution requires
arithmetic operations.
From iii and iv it follows that solving the system \(LU\vect{x} = \vect{b}\) requires
arithmetic operations.
In many applications in engineering, it is required to solve many, say \(m\), linear systems
\(A\vect{x}=\vect{b}_i\), with the same coefficient matrix \(A\), where typically the vectors \(\vect{b}_i\) are not known beforehand.
This is the situation where the \(LU\) decomposition comes in handy. Instead of solving the systems one by one from scratch, we can first find an \(LU\) decomposition of \(A\), and then solve the systems \(LU\vect{x}=\vect{b}_i\).
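This ‘factorize once, solve often’ pattern is exactly what library routines such as `scipy.linalg.lu_factor` and `scipy.linalg.lu_solve` provide. A sketch (the matrix and right-hand sides are arbitrary and only serve as illustration):

```python
import numpy as np
from scipy.linalg import lu_factor, lu_solve

A = np.array([[2.0, 1, -2],
              [2, -3, -4],
              [-4, 0, -2]])
lu_and_piv = lu_factor(A)              # the expensive step, done only once

for i in range(5):                     # many right-hand sides b_i
    b_i = np.random.rand(3)
    x_i = lu_solve(lu_and_piv, b_i)    # cheap: forward + backward substitution
    assert np.allclose(A @ x_i, b_i)
```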
If we compare solving \(m\) linear systems with row reduction (\(RR\)) and with \(LU\) decomposition (\(LU\)), we get
versus
In Table 3.6.1 we can see the comparison in the numbers of operations needed to solve several linear systems when using row reduction and \(LU\) decomposition.
| \(n\) | \(RR\), \(m=5\) | \(LU\), \(m=5\) | \(RR\), \(m=10\) | \(LU\), \(m=10\) | \(RR\), \(m=50\) | \(LU\), \(m=50\) |
|---|---|---|---|---|---|---|
| \(3\) | \(140\) | \(88\) | \(280\) | \(163\) | \(1400\) | \(763\) |
| \(5\) | \(575\) | \(295\) | \(1,150\) | \(520\) | \(5,750\) | \(2,320\) |
| \(10\) | \(4,025\) | \(1,565\) | \(8,050\) | \(2,515\) | \(40,250\) | \(10,115\) |
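The entries of Table 3.6.1 can be reproduced directly from the operation counts listed above, for instance with the following throwaway Python functions (the names are ours):

```python
def ops_row_reduction(n, m):
    # m times: reduce the augmented matrix and do a backward substitution
    return m * (2*n**3/3 + 3*n**2/2 - 7*n/6)

def ops_lu(n, m):
    # factorize A = LU once, then m times a forward and a backward substitution
    return (2*n**3/3 - n**2/2 - n/6) + m * (n*(n - 1) + n**2)

print(ops_row_reduction(3, 5), ops_lu(3, 5))    # 140.0  88.0
```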
We will mention one other advantage the \(LU\) decomposition may have, namely when the coefficient matrix \(A\) is a band matrix. In that case it is much more efficient to work with the \(LU\) decomposition than with the inverse. Such systems \(A\vect{x} = \vect{b}\) appear, for instance, when (partial) differential equations are solved via discretizations. It falls outside the scope of this textbook to go into the details, but we consider the special case of tridiagonal matrices to illustrate once more the usefulness of the \(LU\) decomposition.
A band matrix of width \(d \geq 0\) is a matrix in which only the entries within a distance \(d\) from the diagonal can be nonzero. So, \(a_{ij} = 0\) if \(|i-j|>d\). A tridiagonal matrix \(A\) is a band matrix of width 1.
Three examples of band matrices are
\(A\) is a tridiagonal matrix, matrix \(B\) is a band matrix of width 2, and the third matrix is both upper triangular and tridiagonal.
Suppose \(A\) is a band matrix of width \(d\). Then the matrices \(L\) and \(U\) of an \(LU\) decomposition of \(A\) (insofar as it exists) are band matrices of width \(d\) as well.
Now suppose we need to solve many linear systems \(A\vect{x} =\vect{b}_i\) where \(A\) is an \(n\times n\) tridiagonal matrix. We can compute \(A^{-1}\) and solve the systems by computing the consecutive products \(A^{-1}\vect{b}_i\), or we can find an \(LU\) decomposition and use Algorithm 3.6.1 to solve \(A\vect{x} =\vect{b}_i\).
There are two reasons why it is advantageous to work with the \(LU\) decomposition here.
-
To store \(A^{-1}\) we need to store \(n^2\) numbers, whereas to store \(L\) and \(U\) only \(4n-2\) memory places are needed. In fact, if we use that \(L\) has ones on its diagonal, only \(3n-2\) places suffice. For large \(n\) this is a significant gain.
-
One product \(A^{-1}\vect{b}\) involves \(n^2\) products of numbers and \(n(n-1)\) additions, so the total number of arithmetic operations equals \(2n^2 - n\).
To solve
\[\begin{split} L\vect{y} = \begin{bmatrix} 1 \\ \ell_{21} & 1 \\ 0 &\ell_{32} & 1 \\ 0 & 0 & \ell_{43} & 1 \\[1ex] \vdots & \vdots & & \ddots & \\[1ex] 0 & 0 & 0 & \cdots & \ell_{n,n-1} & 1 \end{bmatrix} \begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \\[1ex] \vdots \\[1ex] y_n \end{bmatrix} = \begin{bmatrix} b_1 \\ b_2 \\ b_3 \\ b_4 \\[1ex] \vdots \\[1ex] b_n \end{bmatrix} \end{split}\]by forward substitution we find one after another
\[\begin{split} \begin{array}{lcl} y_1 &=& b_1 \\ y_2 &=& b_2 - \ell_{21}y_1 \\ y_3 &=& b_3 - \ell_{32}y_2 \\[1ex] \vdots & & \quad \vdots \\[1ex] y_n &=& b_n - \ell_{n,n-1}y_{n-1} \\ \end{array} \end{split}\]For all but the first row we need two operations per row, so in total \(2(n-1)\) operations.
Likewise, for solving \(U\vect{x} = \vect{y}\) we need an extra division for each row, so the number of operations for backward substitution is \(2(n-1)+n\), and the total number for the two phases becomes
\[ 2(n-1) + 2(n-1) + n = 5n-4. \]For large \(n\) this is again much smaller than the \(2n^2-n\) arithmetic operations in the computation of \(A^{-1}\mathbf{b}\).
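To make the tridiagonal case concrete, here is a sketch of the \(O(n)\) procedure described above: the matrix is stored by its three diagonals, its \(LU\) decomposition is computed (assuming no pivoting is needed), and the two substitution phases follow. All names and the storage-by-diagonals convention are ours.

```python
import numpy as np

def tridiagonal_lu(sub, diag, sup):
    """LU decomposition of a tridiagonal matrix given by its three diagonals.
    Returns the sub-diagonal of L and the main diagonal of U;
    L has 1s on its diagonal and the super-diagonal of U equals `sup`."""
    n = len(diag)
    ell = np.zeros(n - 1)
    u = np.array(diag, dtype=float)
    for i in range(1, n):
        ell[i - 1] = sub[i - 1] / u[i - 1]        # multiplier of step i
        u[i] = diag[i] - ell[i - 1] * sup[i - 1]
    return ell, u

def tridiagonal_solve(ell, u, sup, b):
    """Solve L U x = b with the factors from tridiagonal_lu in O(n) operations."""
    n = len(b)
    y = np.zeros(n)
    y[0] = b[0]
    for i in range(1, n):                          # forward substitution
        y[i] = b[i] - ell[i - 1] * y[i - 1]
    x = np.zeros(n)
    x[-1] = y[-1] / u[-1]
    for i in range(n - 2, -1, -1):                 # backward substitution
        x[i] = (y[i] - sup[i] * x[i + 1]) / u[i]
    return x

# Usage: ell, u = tridiagonal_lu(sub, diag, sup); x = tridiagonal_solve(ell, u, sup, b)
```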