Introduction to eigenvectors

By Martin McBride, 2024-04-15
Tags: matrix matrix algebra determinant eigenvector eigenvalue unit matrix
Categories: matrices


The product of a square matrix A and a column vector v is a new column vector. The matrix represents a linear transformation, so the new vector will normally point in a different direction from the original. However, certain special vectors keep their original direction when transformed. We say that such a vector is an eigenvector of the matrix A.

In this article, we will look at eigenvectors, eigenvalues, and the characteristic equation of a matrix. We will also see how to calculate the eigenvectors and eigenvalues of two-by-two and three-by-three matrices.

2D example

Consider this matrix, T:

$$T = \begin{pmatrix} 1 & 3 \\ 2 & 2 \end{pmatrix}$$

If we multiply this matrix by the vector (2, 0) we get a new vector (2, 4):

$$\begin{pmatrix} 1 & 3 \\ 2 & 2 \end{pmatrix} \begin{pmatrix} 2 \\ 0 \end{pmatrix} = \begin{pmatrix} 2 \\ 4 \end{pmatrix}$$

This is illustrated below. The left-hand plot shows the original vector (2, 0) in cyan. It shows several other vectors in different colours. The right-hand graph shows the same set of vectors transformed by the matrix T above:

[Figure: a set of vectors (left) and the same vectors transformed by T (right)]

Generally, each transformed vector on the right has a different size and direction compared to its untransformed counterpart on the left.

There are two special vectors that do not change direction when transformed by T. Those vectors are (1, 1) and (-3, 2):

[Figure: the vectors (1, 1) and (-3, 2) before and after transformation by T]

These vectors are called the eigenvectors of T. The cyan vector (1, 1) is transformed into the vector (4, 4). The transformed vector points in the same direction as the original, but it is 4 times longer. We say that the vector (1, 1) is an eigenvector of T with an eigenvalue of 4.

The orange vector (-3, 2) is transformed into the vector (3, -2). It appears to be pointing in the exact opposite direction to the original, but one way to describe that is to say that it has the same direction but with a negative length. Vector (3, -2) is equal to (-3, 2) multiplied by -1, so we say this vector is an eigenvector of T with an eigenvalue of -1.
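
We can verify these products numerically. Here is a minimal sketch in Python, assuming NumPy is available, multiplying T by each of the vectors above:

```python
import numpy as np

T = np.array([[1, 3],
              [2, 2]])

# An ordinary vector changes direction under T...
print(T @ np.array([2, 0]))   # [2 4]

# ...but the eigenvectors keep their direction
print(T @ np.array([1, 1]))   # [4 4], which is  4 * (1, 1)
print(T @ np.array([-3, 2]))  # [3 -2], which is -1 * (-3, 2)
```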

Eigenvector definition

We can define an eigenvector using this equation:

$$A v = \lambda v$$

Here A is a square matrix of order n (the above example was a square matrix of order 2), v is a non-zero vector, also of order n, and λ is a scalar constant value.

We say that v is an eigenvector of A and λ is an eigenvalue of A corresponding to eigenvector v.

Usually, the number of eigenvalues will be equal to the order of the matrix (so in the previous example, there were two eigenvalues because it was a two-by-two matrix). Each eigenvalue will be associated with an eigenvector, but bear in mind that if v is an eigenvector then any scalar multiple of v will also be an eigenvector. It is only the direction of the vector that matters.

Also, there are sometimes degenerate cases. For example, a two-by-two matrix might have just one, repeated, eigenvalue that corresponds to two different eigenvectors that are not collinear.
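
In practice, we rarely grind through the algebra by hand. As a rough sketch, NumPy's np.linalg.eig function returns the eigenvalues and eigenvectors of a matrix in one call. Note that it scales each eigenvector to unit length, which is fine because only the direction matters:

```python
import numpy as np

T = np.array([[1, 3],
              [2, 2]])

# Each column of the second result is the eigenvector for the
# corresponding eigenvalue (the ordering is not guaranteed)
eigenvalues, eigenvectors = np.linalg.eig(T)
print(eigenvalues)         # [ 4. -1.]
print(eigenvectors[:, 0])  # a unit vector along (1, 1)
print(eigenvectors[:, 1])  # a unit vector along (-3, 2), possibly sign-flipped
```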

The characteristic equation

If we take the previous equation for the eigenvector, we can use it to find the eigenvalues. Here is the equation from earlier:

$$A v = \lambda v$$

We will make use of the identity matrix. This is a square matrix where every element of the leading diagonal is 1, and all the other elements are 0. If we multiply any vector v by an identity matrix of the same order, it leaves the vector unchanged:

$$I v = v$$

So we can replace v on the RHS of the original equation with Iv and the equation will still be valid:

$$A v = \lambda I v$$

We can rearrange this by moving both terms to the LHS and taking out a common factor of v. Note that, in the equation below, 0 represents a zero vector rather than the scalar value 0. For example, it would be (0, 0) if v has order 2:

$$(A - \lambda I) v = 0$$

This says that the matrix (A - λI) takes the non-zero vector v to 0. A matrix can only do that if it is singular, which means its determinant must be 0. So:

$$\det(A - \lambda I) = 0$$

This is called the characteristic equation of A. We won't prove it here, but the solutions to this equation give the eigenvalues of A, and from the eigenvalues we can find the eigenvectors.
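
As a sketch of how this can be done symbolically, assuming the SymPy library is available, we can build A - λI for our example matrix and take its determinant directly:

```python
import sympy as sp

lam = sp.symbols('lambda')
A = sp.Matrix([[1, 3],
               [2, 2]])

# The characteristic equation is det(A - lambda*I) = 0
char_poly = (A - lam * sp.eye(2)).det()
print(sp.expand(char_poly))      # lambda**2 - 3*lambda - 4
print(sp.solve(char_poly, lam))  # [-1, 4]
```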

Finding the eigenvalues of a two-by-two matrix

Let's use this to find the eigenvalues of our previous example matrix:

$$T = \begin{pmatrix} 1 & 3 \\ 2 & 2 \end{pmatrix}$$

Here is the characteristic equation using this matrix and the identity matrix of order 2:

$$\det\left(\begin{pmatrix} 1 & 3 \\ 2 & 2 \end{pmatrix} - \lambda \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}\right) = 0$$

We multiply the identity matrix by λ:

$$\det\left(\begin{pmatrix} 1 & 3 \\ 2 & 2 \end{pmatrix} - \begin{pmatrix} \lambda & 0 \\ 0 & \lambda \end{pmatrix}\right) = 0$$

And subtract the two matrices:

$$\det\begin{pmatrix} 1-\lambda & 3 \\ 2 & 2-\lambda \end{pmatrix} = 0$$

Here is the equation for the determinant of a two-by-two matrix:

$$\det\begin{pmatrix} a & b \\ c & d \end{pmatrix} = ad - bc$$

Applying this to our matrix gives:

$$(1-\lambda)(2-\lambda) - 3 \times 2 = 0$$

We will skip the simplification steps, but we end up with:

$$\lambda^2 - 3\lambda - 4 = 0$$

This is a quadratic equation. We can use the quadratic formula (or simply factorise it as (λ - 4)(λ + 1) = 0) to find the following two solutions:

$$\lambda = 4, \quad \lambda = -1$$

These are our eigenvalues.
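
We can confirm the roots numerically; here is a one-line sketch using NumPy's np.roots, where the list holds the coefficients of λ² - 3λ - 4:

```python
import numpy as np

# Roots of lambda^2 - 3*lambda - 4 (the order of the results may vary)
print(np.roots([1, -3, -4]))  # [ 4. -1.]
```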

Finding the eigenvectors from the eigenvalues

To find the eigenvectors, we return to the earlier equation:

$$(A - \lambda I) v = 0$$

We previously found an expression for A - λI:

$$A - \lambda I = \begin{pmatrix} 1-\lambda & 3 \\ 2 & 2-\lambda \end{pmatrix}$$

Substituting this into the previous equation, and representing v as a column vector (x, y), gives:

$$\begin{pmatrix} 1-\lambda & 3 \\ 2 & 2-\lambda \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}$$

We already know that the two eigenvalues are -1 and 4. If we substitute -1 for λ we get:

$$\begin{pmatrix} 2 & 3 \\ 2 & 3 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}$$

Multiplying out the matrix gives two simultaneous equations:

$$\begin{aligned} 2x + 3y &= 0 \\ 2x + 3y &= 0 \end{aligned}$$

Now these two equations are not independent (in fact, they are identical), so they do not have a unique solution. They are solved by any v that satisfies the relationship:

$$y = -\frac{2}{3}x$$

This is the equation of a straight line, passing through the origin, with a slope of -2/3. Our eigenvector is any vector on that line.

Right at the start, we demonstrated graphically that the vector (-3, 2) was an eigenvector, and this equation validates that. But we also saw that any vector with the same slope is also an eigenvector. So for example, (-6, 4) is an eigenvector (and it also satisfies the same relationship). There are infinitely many vectors with different lengths but the same slope. We could choose any vector, but it is common to choose the smallest vector that has integer components (if that vector exists).
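
This is easy to check numerically. Here is a short sketch with NumPy, showing that A - λI (with λ = -1) sends (-3, 2), and any multiple of it, to the zero vector:

```python
import numpy as np

A = np.array([[1, 3],
              [2, 2]])
M = A - (-1) * np.eye(2)  # A - lambda*I with lambda = -1

print(M @ np.array([-3, 2]))  # [0. 0.]
print(M @ np.array([-6, 4]))  # [0. 0.] -- any multiple also works
```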

We can do the same with λ equals 4:

$$\begin{pmatrix} -3 & 3 \\ 2 & -2 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}$$

This gives the following two simultaneous equations:

$$\begin{aligned} -3x + 3y &= 0 \\ 2x - 2y &= 0 \end{aligned}$$

These are solved by any pair of values that satisfy:

$$y = x$$

This again is a straight line, passing through the origin, with a slope of 1. So (1, 1) is an eigenvector, so is (2, 2) etc.
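
Again, a quick sketch with NumPy confirms that any vector with y equal to x is simply scaled by 4:

```python
import numpy as np

A = np.array([[1, 3],
              [2, 2]])

for v in (np.array([1, 1]), np.array([2, 2])):
    print(A @ v, 4 * v)  # the two printed vectors match in each case
```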

Solving a three-by-three matrix

Now let's try a three-by-three matrix. The steps can be quite lengthy, so we will skip the detailed arithmetic in some places. We will use this matrix:

$$A = \begin{pmatrix} 3 & 1 & 1 \\ 1 & 3 & 1 \\ 1 & 1 & 3 \end{pmatrix}$$

Here is the characteristic equation, using this matrix and the identity matrix of order 3:

$$\det\left(\begin{pmatrix} 3 & 1 & 1 \\ 1 & 3 & 1 \\ 1 & 1 & 3 \end{pmatrix} - \lambda \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}\right) = 0$$

Subtracting the two matrices yields:

$$\det\begin{pmatrix} 3-\lambda & 1 & 1 \\ 1 & 3-\lambda & 1 \\ 1 & 1 & 3-\lambda \end{pmatrix} = 0$$

We will use the standard equation for a three-by-three determinant:

$$\det\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} = a(ei - fh) - b(di - fg) + c(dh - eg)$$

Substituting the values from the matrix, after some tedious gathering of terms, gives us:

$$\lambda^3 - 9\lambda^2 + 24\lambda - 20 = 0$$

This has three solutions (which can be verified by substituting each value of λ into the equation above), but two of the solutions are equal to 2:

$$\lambda = 5, \quad \lambda = 2, \quad \lambda = 2$$
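
As before, we can sanity-check these values numerically. Here is a sketch with NumPy, computing the eigenvalues directly from the matrix and also solving the cubic:

```python
import numpy as np

A = np.array([[3, 1, 1],
              [1, 3, 1],
              [1, 1, 3]])

# Both results should be close to [5. 2. 2.] (repeated roots may
# show tiny numerical errors, and the ordering may vary)
print(np.linalg.eigvals(A))
print(np.roots([1, -9, 24, -20]))
```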

Next, we will find the eigenvectors. As before, we substitute the known λ values into the matrix, multiply by v and solve for zero. We will start with λ equal to 5. This gives the following matrix:

$$A - 5I = \begin{pmatrix} -2 & 1 & 1 \\ 1 & -2 & 1 \\ 1 & 1 & -2 \end{pmatrix}$$

Multiplying this by a 3-vector (x, y, z) gives:

$$\begin{pmatrix} -2 & 1 & 1 \\ 1 & -2 & 1 \\ 1 & 1 & -2 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} -2x + y + z \\ x - 2y + z \\ x + y - 2z \end{pmatrix}$$

This is equal to zero when:

$$\begin{pmatrix} -2x + y + z \\ x - 2y + z \\ x + y - 2z \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}$$

This gives us a set of simultaneous equations:

$$\begin{aligned} -2x + y + z &= 0 \\ x - 2y + z &= 0 \\ x + y - 2z &= 0 \end{aligned}$$

Subtracting the second equation from the first gives:

$$-3x + 3y = 0 \quad\Rightarrow\quad y = x$$

Subtracting the third equation from the first gives:

$$-3x + 3z = 0 \quad\Rightarrow\quad z = x$$

For any given x we can find a value for y and z, so these linear equations specify a straight line. The eigenvector is any vector collinear with this line. If we arbitrarily pick 1 for x, then y and z are also 1, so the eigenvector associated with eigenvalue 5 is (1, 1, 1).
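
A quick numerical check of this eigenvector, sketched with NumPy:

```python
import numpy as np

A = np.array([[3, 1, 1],
              [1, 3, 1],
              [1, 1, 3]])
v = np.array([1, 1, 1])

print(A @ v)  # [5 5 5], which is 5 * (1, 1, 1)
```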

Now let's try again with the other λ value, 2:

$$A - 2I = \begin{pmatrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{pmatrix}$$

Multiplying by v:

$$\begin{pmatrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} x + y + z \\ x + y + z \\ x + y + z \end{pmatrix}$$

This, again, is equal to zero when:

$$\begin{pmatrix} x + y + z \\ x + y + z \\ x + y + z \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}$$

All the rows of the matrix are identical, so rather than a set of simultaneous equations, we get the same equation repeated three times:

$$x + y + z = 0$$

We can solve for z:

$$z = -x - y$$

You might recognise this as the equation of a plane. We won't prove it here, but the reason we get a plane in this case is that the λ value of 2 appears twice as an eigenvalue.
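
We can test this with a sketch in NumPy: pick any x and y, set z = -x - y so that the vector lies in the plane, and check that A scales it by 2:

```python
import numpy as np

A = np.array([[3, 1, 1],
              [1, 3, 1],
              [1, 1, 3]])

rng = np.random.default_rng(0)
x, y = rng.random(2)
v = np.array([x, y, -x - y])  # an arbitrary vector in the plane x + y + z = 0

print(np.allclose(A @ v, 2 * v))  # True
```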

To fully specify a plane, we need to know two vectors on the plane. When we had a line, we knew the direction of the vector but we had the freedom to choose its length. But in a plane, we get to choose the direction and the length of our two vectors (both vectors must be on the plane, and they can't be parallel to each other).

Let's arbitrarily select x as 1 and y as 0. According to the formula, this makes z equal to -1. So one of the eigenvectors is (1, 0, -1).

For the second vector, we just need a vector in the plane that is not parallel to the first. One way to do this is to keep x as 1 and choose y as -1. This gives z a value of 0, so the second eigenvector is (1, -1, 0).

We could have set y equal to 1 instead, but then z would have been -2, which is fine but not quite as nice. In fact, we could have used any linear combination of the two vectors (1, 0, -1) and (1, -1, 0); they all exist on the same plane.
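
One final sketch confirms both points: any linear combination of the two chosen vectors is still an eigenvector, and SymPy's nullspace method recovers a basis for the whole plane directly from A - 2I:

```python
import numpy as np
import sympy as sp

A = np.array([[3, 1, 1],
              [1, 3, 1],
              [1, 1, 3]])

# A linear combination of (1, 0, -1) and (1, -1, 0) is still an eigenvector
v = 2 * np.array([1, 0, -1]) + 3 * np.array([1, -1, 0])
print(np.allclose(A @ v, 2 * v))  # True

# SymPy finds a basis for the eigenspace (the plane) from A - 2I
M = sp.Matrix([[3, 1, 1], [1, 3, 1], [1, 1, 3]]) - 2 * sp.eye(3)
print(M.nullspace())  # two basis vectors spanning the plane x + y + z = 0
```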
