The equations in the Cartesian planeR2: define, respectively, an ellipse and a hyperbola. In each case, the x and y axes are the principal axes. This is easily seen, given that there are no cross-terms involving products xy in either expression. However, the situation is more complicated for equations like Here some method is required to determine whether this is an ellipse or a hyperbola. The basic observation is that if, by completing the square, the quadratic expression can be reduced to a sum of two squares then the equation defines an ellipse, whereas if it reduces to a difference of two squares then the equation represents a hyperbola: Thus, in our example expression, the problem is how to absorb the coefficient of the cross-term 8xy into the functions u and v. Formally, this problem is similar to the problem of matrix diagonalization, where one tries to find a suitable coordinate system in which the matrix of a linear transformation is diagonal. The first step is to find a matrix in which the technique of diagonalization can be applied. The trick is to write the quadratic form as where the cross-term has been split into two equal parts. The matrix A in the above decomposition is a symmetric matrix. In particular, by the spectral theorem, it has realeigenvalues and is diagonalizable by an orthogonal matrix. To orthogonally diagonalize A, one must first find its eigenvalues, and then find an orthonormaleigenbasis. Calculation reveals that the eigenvalues of A are with corresponding eigenvectors Dividing these by their respective lengths yields an orthonormal eigenbasis: Now the matrix S = is an orthogonal matrix, since it has orthonormal columns, and A is diagonalized by: This applies to the present problem of "diagonalizing" the quadratic form through the observation that Thus, the equation is that of an ellipse, since the left side can be written as the sum of two squares. It is tempting to simplify this expression by pulling outfactors of 2. However, it is important not to do this. The quantities have a geometrical meaning. They determine an orthonormal coordinate system on R2. In other words, they are obtained from the original coordinates by the application of a rotation. Consequently, one may use the c1 and c2 coordinates to make statements about length and angles, which would otherwise be more difficult in a different choice of coordinates. For example, the maximum distance from the origin on the ellipsec12 + 9c22 = 1 occurs when c2=0, so at the points c1=±1. Similarly, the minimum distance is where c2=±1/3. It is possible now to read off the major and minor axes of this ellipse. These are precisely the individual eigenspaces of the matrix A, since these are where c2 = 0 or c1=0. Symbolically, the principal axes are To summarize:
The equation is for an ellipse, since both eigenvalues are positive.
The principal axes are the lines spanned by the eigenvectors.
The minimum and maximum distances to the origin can be read off the equation in diagonal form.
Using this information, it is possible to attain a clear geometrical picture of the ellipse: to graph it, for instance.
Formal statement
The principal axis theorem concerns quadratic forms in Rn, which are homogeneous polynomials of degree 2. Any quadratic form may be represented as where A is a symmetric matrix. The first part of the theorem is contained in the following statements guaranteed by the spectral theorem:
The eigenvalues of A are real.
A is diagonalizable, and the eigenspaces of A are mutually orthogonal.
In particular, A is orthogonally diagonalizable, since one may take a basis of each eigenspace and apply the Gram-Schmidt process separately within the eigenspace to obtain an orthonormal eigenbasis. For the second part, suppose that the eigenvalues of A are λ1,..., λn and the corresponding orthonormal eigenbasis is u1,...,un. Then
where the ci are the coordinates with respect to the given eigenbasis. Furthermore,
The i-th principal axis is the line determined by the n-1 equations cj = 0, j ≠ i. This axis is the span of the vector ui.