If n and m are positive integers, an
matrix is a function
from the set of integer ordered pairs (i,j) with
and
to a given set. If this set is a subset of the complex
numbers, the matrix is called a matrix over the complex numbers or a complex
matrix. If this set is a subset of the real numbers, the matrix is called a
matrix over the real numbers or a real matrix. We can have polynomial
matrices, etc. If the matrix is called A, then A(i,j) is called the (i,j)
entry of A. An
array is typically pictured as an array with
m rows and n columns, and A(i,j) is interpreted as the entry in the i'th row
and j'th column.
A
matrix is called em square if m = n. The transpose of
the
matrix A, denoted by At, is the
matrix
with At(i,j) = A(j,i), i = 1,...,n and j = 1,...,m. A real valued matrix A
is called symmetric if A = At. A square matrix A is called diagonal if
implies A(i,j) = 0. A square matrix is called upper triangular if
i < j implies A(i,j) = 0, and is called lower triangular if its transpose is
upper triangular.
If A and B are both
matrices and for each pair (i,j) the quantity
A(i,j) + B(i,j) makes sense (for example if the matrices are polynomial
matrices) we can define the sum of the matrices, A+B, by
(A+B)(i,j) = A(i,j) + B(i,j). This is just the usual definition of the sum of
two functions with a common domain. Similarly, if t is any quantity for
which tA(i,j) makes sense for all ordered pairs (i,j) we may define a matrix
tA by (tA)(i,j) = tA(i,j). This is just the usual definition of the product of
a function by a scalar.
In what follows we will assume that our matrices are complex matrices, and all multipliers t are complex numbers. We may occasionally restrict to real matrices and real multipliers.
If A is an
matrix and B is an
matrix we will define
the matrix product AB to be the matrix with

It is tedious to show that if A, B and C are matrices with appropriate dimensions and t is a complex number then
It is easy enough to construct examples of
matrices A and B where
.
Let I be a
matrix with I(i,i) = 1 and I(i,j) = 0 for
.I is called the identity matrix. It has the property that if A is
then IA = A, and if B is
then BI = B.
A matrix A with A(i,j) = 0 for all i and j is called a zero matrix.
A
matrix is called a column vector and a
matrix is
called a row vector.
If A is a square matrix, t is a complex number, and X is a non-zero column matrix such that AX = tX, then X is called a left eigenvector for A with eigenvalue t. If Y is a non-zero row vector such that YA = tY, then Y is called a right eigenvector for A with eigen value t. If X and Y are real column matrices of the same dimension then X and Y are said to be orthogonal if XtY = 0. X is said to be a unit vector if XtX = 1. A set of column vectors is said to be orthonormal if each vector in the set is a unit vector and any pair of distinct vectors is orthogonal.
To each
matrix A we may associate d column vectors
by the rule A(j)(i,1) = A(i,j). These column
vectors are called the columns of A. The row vectors and rows are
defined in an analogous manner. A real valued square matrix
is called
orthogonal if its columns form an orthonormal set. For any orthogonal
matrix
we have
.
The determinant function of order d,
, is the unique complex
valued function acting on the set of all
matrices with the
properties
If A is a square matrix then A is said to be invertible if there is another square matrix B with AB = BA = I. A can have a most one inverse, and it is denoted by A-1. Three important facts about a square matrix A are
If A is a symmetric matrix then there is an orthogonal matrix
whose
columns are right eigenvectors of A. In this case the matrix
is a diagonal matrix whose with
equal to the
eigenvalue corresponding to the i'th column of
.A symmetric matrix is called positive definite if all of its eigenvalues are
positive. All of the eigenvalues of a symmetric matrix are real numbers.