Sample Page

In linear algebra, an invertible matrix (non-singular, non-degenerate or regular) is a square matrix that has an inverse. In other words, if a matrix is invertible, it can be multiplied by its inverse matrix to yield the identity matrix. Invertible matrices are the same size as their inverse.

The inverse of a matrix represents the inverse operation, meaning if a matrix is applied to a particular vector, followed by applying the matrix’s inverse, the result is the original vector.

Definition

An n-by-n square matrix A is called invertible if there exists an n-by-n square matrix B such thatwhere In denotes the n-by-n identity matrix and the multiplication used is ordinary matrix multiplication.[1] If this is the case, then the matrix B is uniquely determined by A, and is called the inverse of A, denoted by A−1. Matrix inversion is the process of finding the matrix which when multiplied by the original matrix gives the identity matrix.[2]

Basic idea

A matrix can be viewed as a rule for transforming vectors. For example, a real matrix defines a linear transformation

from the set of -tuples of real numbers to itself. The matrix is invertible when this transformation can be undone by another linear transformation. In that case, there is a matrix such that applying and then , or applying and then , returns every vector to where it started.

Geometrically, an invertible matrix does not collapse space into a lower-dimensional set. It sends distinct vectors to distinct vectors and reaches every vector in the target space. For a real matrix, this is reflected by its determinant: an invertible matrix has nonzero determinant, while a matrix with determinant zero collapses volume to zero and is not invertible.

Algebraically, invertibility means that the linear system

has a unique solution for every vector . Equivalently, the columns of form a basis of the vector space. These and several other equivalent characterizations are summarized by the invertible matrix theorem.

Examples

Consider the following 2-by-2 matrix:

The matrix is invertible, as it has inverse which can be confirmed by computing

To check that it is invertible without finding an inverse, can be computed, which is non-zero.

On the other hand, this is a non-invertible matrix:

This matrix is not invertible, because different vectors and have the same value of , for example, and both give . So it is impossible to reverse the transformation , since different inputs produce the same output. The determinant of is 0, which is a necessary and sufficient condition for a matrix to be non-invertible.

Criteria for invertibility

A matrix (over a field such as the real numbers) is invertible if and only if the only solution to the equation is the zero vector. That is, the nullity of is zero: its nullspace consists only of the zero vector. Equivalently, an matrix is invertible if its rank is , by the rank-nullity theorem. Such a matrix is said to be of full rank. Geometrically, this means that the column space of is all of . The rank and nullity characterizations are related in the following way: having full rank means that the matrix maps onto all of (surjectively), and having nullity zero means that the matrix is one-to-one. That is, it is a bijection, and therefore inveritble.

A more computational criterion for invertibility comes from the standard way of determining the rank and nullity of a matrix. For a square matrix, a matrix is invertible if and only if its row-reduced echelon form is the identity matrix. The reason is that this is the only row-reduced echelon matrix having full rank: all ones down the diagonal and zeros everywhere else.

Another criterion is that a matrix is invertible if and only if its determinant is non-zero.

Methods of matrix inversion

There are many methods to compute the inverse matrix, when it exists. A basic method is to use Gaussian elimination, for example. This proceeds by considering the inverse matrix as a product of elementary matrices, where the are the elementary matrices corresponding to the elementary row operations needed to put into is row-reduced echelon form. This method has the advantage that it will produce a row-reduced echelon form regardless of invertibility, and so gives a criterion to decide if a matrix is invertible which is often more efficient than computing the determinant: the matrix is invertible if and only if, at the end of the process, the echelon form is the identity matrix rather than some matrix of lower rank.

To convert this to a practical method for determining using this method, one uses an augmented matrix whose left side is the matrix to invert and whose right side is the identity matrix. Then, Gaussian elimination is used to convert the left side into the identity matrix, which causes the right side to become the inverse of the input matrix.

For example, take the following matrix:

The first step to compute its inverse is to create the augmented matrix

Call the first row of this matrix and the second row . Then, add row 1 to row 2 This yields

Next, subtract row 2, multiplied by 3, from row 1 which yields

Finally, multiply row 1 by −1 and row 2 by 2 This yields the identity matrix on the left side and the inverse matrix on the right:

Thus, It works because the process of Gaussian elimination can be viewed as a sequence of applying left matrix multiplication using elementary row operations using elementary matrices (), such as

Applying right-multiplication using we get And the right side which is the inverse we want.

Properties

Singularity

Over a field, a square matrix that is not invertible is called singular or degenerate. A square matrix with entries in a field is singular if and only if its determinant is zero.

Invertible matrix theorem

Let A be a square n-by-n matrix over a field K (e.g., the field of real numbers). The following statements are equivalent, i.e., they are either all true or all false for any given matrix:[3]

  • A is invertible, i.e. it has an inverse under matrix multiplication, i.e., there exists a B such that AB = In = BA. (In that statement, “invertible” can equivalently be replaced with “left-invertible” or “right-invertible” in which one-sided inverses are considered.)
  • The linear transformation mapping x to Ax is invertible, i.e., it has an inverse under function composition. (There, again, “invertible” can equivalently be replaced with either “left-invertible” or “right-invertible”.)
  • The transpose AT is an invertible matrix.
  • A is row-equivalent to the n-by-n identity matrix In.
  • A is column-equivalent to the n-by-n identity matrix In.
  • A has n pivot positions.
  • A has full rank: rank A = n.
  • A has a trivial kernel: ker(A) = {0}.
  • The linear transformation mapping x to Ax is bijective; that is, the equation Ax = b has exactly one solution for each b in Kn. (There, “bijective” can equivalently be replaced with “injective” or “surjective“.)
  • The columns of A form a basis of Kn. (In this statement, “basis” can equivalently be replaced with either “linearly independent set” or “spanning set”)
  • The rows of A form a basis of Kn. (Similarly, here, “basis” can equivalently be replaced with either “linearly independent set” or “spanning set”)
  • The determinant of A is nonzero: det A ≠ 0. In general, a square matrix over a commutative ring is invertible if and only if its determinant is a unit (i.e. multiplicatively invertible element) of that ring.
  • The number 0 is not an eigenvalue of A. (More generally, a number is an eigenvalue of A if the matrix is singular, where I is the identity matrix.)
  • The matrix A can be expressed as a finite product of elementary matrices.

Other properties

Furthermore, the following properties hold for an invertible matrix A:

  • for nonzero scalar k
  • if A has orthonormal columns, where + denotes the Moore–Penrose inverse and x is a vector
  • For any invertible n-by-n matrices A and B, More generally, if are invertible n-by-n matrices, then
  • Left and right inverses are equal. That is, if and then .

The rows of the inverse matrix V of a matrix U are orthonormal to the columns of U (and vice versa interchanging rows for columns). To see this, suppose that UV = VU = I where the rows of V are denoted as and the columns of U as for Then clearly, the Euclidean inner product of any two This property can also be useful in constructing the inverse of a square matrix in some instances, where a set of orthogonal vectors (but not necessarily orthonormal vectors) to the columns of U are known. In which case, one can apply the iterative Gram–Schmidt process to this initial set to determine the rows of the inverse V.

A matrix that is its own inverse (i.e., a matrix A such that A = A−1 and consequently A2 = I) is called an involutory matrix.

In relation to its adjugate

The adjugate of a matrix A is a matrix which exists independently of the invertibility of A. It satisfies the identity In particular, if A is invertible, then

In relation to the identity matrix

It follows from the associativity of matrix multiplication that if

for finite square matrices A and B, then also

[4]

This identity does not hold for non-square rectangular matrices, and need not be true for linear operators in infinite dimensions.

Density

Over the field of real numbers, the set of singular n-by-n matrices, considered as a subset of is a null set, that is, has Lebesgue measure zero. That is true because singular matrices are the roots of the determinant function. It is a continuous function because it is a polynomial in the entries of the matrix. Thus in the language of measure theory, almost all n-by-n matrices are invertible.

Furthermore, the set of n-by-n invertible matrices is open and dense in the topological space of all n-by-n matrices. Equivalently, the set of singular matrices is closed and nowhere dense in the space of n-by-n matrices.

In practice, however, non-invertible matrices may be encountered. In numerical calculations, matrices that are invertible but close to a non-invertible matrix may still be problematic and are said to be ill-conditioned.

Derivative of the matrix inverse

Suppose that the invertible matrix A depends on a parameter t. Then the derivative of the inverse of A with respect to t is given by[5]

To derive the above expression for the derivative of the inverse of A, one can differentiate the definition of the matrix inverse using the product rule, and then solve for the derivative of the inverse of A:

Subtracting from both ends of this formula, and multiplying on the right by finishes the derivation.

If is a small number then the derivative formula gives:

Given a positive integer ,

In particular,

Generalizations

Non-square matrices

Non-square matrices, i.e. m-by-n matrices for which mn, do not have an inverse. However, in some cases such a matrix may have a left inverse or right inverse. If A is m-by-n and the rank of A is equal to n, (nm), then A has a left inverse, an n-by-m matrix B such that BA = In. If A has rank m (mn), then it has a right inverse, an n-by-m matrix B such that AB = Im.

Some of the properties of inverse matrices are shared by generalized inverses (such as the Moore–Penrose inverse), which can be defined for any m-by-n matrix.[6]

In Abstract algebra

While the most common case is that of matrices over the real or complex numbers, all of those definitions can be given for matrices over any algebraic structure equipped with addition and multiplication (i.e. rings). However, in the case of a ring being commutative, the condition for a square matrix to be invertible is that its determinant is invertible in the ring, which in general is a stricter requirement than it being nonzero. For a noncommutative ring, the usual determinant is not defined. The conditions for existence of left-inverse or right-inverse are more complicated, since a notion of rank does not exist over rings.

The set of n × n invertible matrices together with the operation of matrix multiplication and entries from ring R form a group, the general linear group of degree n, denoted GLn(R).

Applications

For most practical applications, it is not necessary to invert a matrix to solve a system of linear equations; however, for a unique solution, it is necessary for the matrix involved to be invertible.

Decomposition techniques like LU decomposition are much faster than inversion, and various fast algorithms for special classes of linear systems have also been developed.

Regression/least squares

Although an explicit inverse is not necessary to estimate the vector of unknowns, it is the easiest way to estimate their accuracy and is found in the diagonal of a matrix inverse (the posterior covariance matrix of the vector of unknowns). However, faster algorithms to compute only the diagonal entries of a matrix inverse are known in many cases.[7]

Matrix inverses in real-time simulations

Matrix inversion plays a significant role in computer graphics, particularly in 3D graphics rendering and 3D simulations. Examples include screen-to-world ray casting, world-to-subspace-to-world object transformations, and physical simulations.

Matrix inverses in MIMO wireless communication

Matrix inversion also plays a significant role in the MIMO (Multiple-Input, Multiple-Output) technology in wireless communications. The MIMO system consists of N transmit and M receive antennas. Unique signals, occupying the same frequency band, are sent via N transmit antennas and are received via M receive antennas. The signal arriving at each receive antenna will be a linear combination of the N transmitted signals forming an N × M transmission matrix H. It is crucial for the matrix H to be invertible so that the receiver can figure out the transmitted information.[8]

See also

References

  1. ^ Axler, Sheldon (18 December 2014). Linear Algebra Done Right. Undergraduate Texts in Mathematics (3rd ed.). Springer Publishing (published 2015). p. 296. ISBN 978-3-319-11079-0.
  2. ^ J.-S. Roger Jang (March 2001). “Matrix Inverse in Block Form”.
  3. ^ Weisstein, Eric W. “Invertible Matrix Theorem”. mathworld.wolfram.com. Retrieved 2020-09-08.
  4. ^ Horn, Roger A.; Johnson, Charles R. (1985). Matrix Analysis. Cambridge University Press. p. 14. ISBN 978-0-521-38632-6..
  5. ^ Magnus, Jan R.; Neudecker, Heinz (1999). Matrix Differential Calculus : with Applications in Statistics and Econometrics (Revised ed.). New York: John Wiley & Sons. pp. 151–152. ISBN 0-471-98633-X.
  6. ^ Roman, Stephen (2008), Advanced Linear Algebra, Graduate Texts in Mathematics (Third ed.), Springer, p. 446, ISBN 978-0-387-72828-5.
  7. ^ Lin, Lin; Lu, Jianfeng; Ying, Lexing; Car, Roberto; E, Weinan (2009). “Fast algorithm for extracting the diagonal of the inverse matrix with application to the electronic structure analysis of metallic systems”. Communications in Mathematical Sciences. 7 (3): 755–777. doi:10.4310/CMS.2009.v7.n3.a12.
  8. ^ Albreem, M.; Juntti, M.; Shahabuddin, S. (January 2020). “Efficient initialisation of iterative linear massive MIMO detectors using a stair matrix”. Electronics Letters. 56 (1): 50–52. Bibcode:2020ElL….56…50A. doi:10.1049/el.2019.2938.

Further reading