Linear Transformations: Section 3

Section 3: Geometric actions of some linear transformations on the plane

[3.0]In this section some geometric actions can be done by linear transformations. To prove this five steps are executed:
   (i) An intuitive geometric description and/or definition of the action is given.
   (ii) The action carries the special points (1,0) and (0,1) onto images, whose coordinates are calculated;
   (iii) From the transposes of the images of the special points, form a matrix which will become the associated matrix of the linear transformation L in the next step.
   (iv) From those images a linear transformation L is created: L(x,y) = x(image of (1,0)) + y(image of (0,1));
   (v) Show that what the linear transformation L does, is the same as what the action in step (1) does.
Often between steps (i) and (ii) some obvious results of the statement in (i) are listed.

The first and second actions [3.1] and [3.2] introduce these ideas.

_________________________________________________

[3.1] Action: Reflection through the origin

For any point not on the origin there is a line through that point and the origin. Push the point through the origin. On the other side of the origin go and stop when the image point will be the same distance from the origin as the original point.

[3.1a] (Reflection through the origin) The origin is the midpoint of the segment joining any point and its reflection through the origin.
Notation: if P' is the image obtained by reflecting a point P through the origin, then O is the mid point of segment PP'.

[3.1b] Some immediate results of [3.1a]:
(i) Vectors OP' = -- OP
(ii) Only the origin is carried back onto itself (left fixed); for all other points, P' and P are distinct

[3.1c] (Images of the special points and triangle)
  (i) the images of (1,0) and (0,1) are (-1,0) and (0,-1) respectively;
  (ii) the special triangle has vertices O, (1,0), (0,1). Its image has vertices O, (-1,0), (0,-1);
  (c) the orientation of the image triangle is positive (counter-clockwise).

It is obvious that O is the mid point of the segment joining (1,0) and (-1,0)
and the segment joining (0,1) and (0,-1).
The following matrix is formed from the transposes of these images:

[3.1d] (The reflection through the origin is a linear transformation) The linear transformation L defined by

L(x,y) = (-x,-y) is the reflection through the origin. Its associated matrix is given in (@) above.

Since the images of (1,0) and (0,1) are (-1,0) and (0,-1) respectively, L(1,0) = (-1,0) and L(0,1) = (0,-1). The equation for L(x,y) is derived from L(x,y) = xL(1,0) + yL(0,1) = x(-1,0) + y(0,-1) = (-x,-y).
To prove that L is the reflection through the origin, see where the mid point of (x,y) and L(x,y) lies. (x,y) + L(x,y) = (x,y) + (-x,y) = (x + (-x), y + (-y)) = (0,0). Therefore (1/2) [(x,y) + (-x,y)] = (0.0). Hence origin O is the mid point of the segment joining (x,y) and L(x,y).

----------

[3.2] Action: Similitude, a uniform expansion outward from or a uniform contraction toward the origin

Consider a magnifying glass aimed at the origin. As the glass is lifted up, everything expands about the origin, but the origin stays fixed. A point P different from the origin is pushed outward on a straight line to a point P' so that O,P,P' are colllinear. A point Q is pushed outward on a straight line to a point Q' so that O,Q,Q' are collinear.

hspace=1000 Suppose the magnification pushes P and Q to new points P' and Q' twice as far from the origin, so that OP'/OP = 2 = OQ'/OQ From these equal ratios vector equations are produced: OP' = 2OP OQ' = 2OQ Here the number 2 might be called the "coefficient of expansion."

More generally, any positive real number λ can be used in place of 2:

(*) OP' = λOP Again, O,P,P' are collinear and λ = OP'/OP.

[3.2a] (Similitude) The action of obtaining a point P' from an arbitrary point P by the vector equation (*) is called a similitude with center at the origin and with coefficient λ, a positive real number..

[3.2b] For various values of λ the geometric effects are:
   (i) if λ > 1, then image P' is further from the origin than P. This similitude is a niform dilation of the plane about the origin;
   (ii) if λ = 1, then P'=P and the image of any point coincides with tht point. The similitude is the identity function on the plane, and no dilation nor contraction occurs;
   (iii) if λ < 1 (but λ > 0) then image P' is closer to the origin than P. This similitude is a uniform contraction of the plane about the origin;

A similitude is completely determined by its coefficient. In fact the collection of all similitudes is in one-to-one correspondence with all the positive real numbers. That collection of similitudes is actually a group.
   (i) if one similitude is followed by another the result is a similitude whose coefficient is the product of the two similitudes of the component similitudes;
   (ii) the identity function is a similitude with coefficient 1;
   (iii) the inverse of a similitude is a similitude whose coefficient is the inverse of the first similitude.

[3.2c] (Images of the special points) A similitude with coefficient λ carries the points (1,0) and (0,1) onto the points (&lambda, 0) and (0,λ) respectively.

If E₁ and E₂ are the special points (1,0) and (0,1) spectively, and E'₁ and E'₂ are the images by action of the similitude, then (position vectors)

OE'₁ = λOE₁ OE'₂ = λOE₂ by definition [3.2a]. This means that OE'₁ = λ(1,0) = (λ,0) OE'₂ = λ(0,1) = (0,λ)

From the transposes of the images of the special points the following matrix is formed:

[3.2d] (A similitude with center at the origin is a linear transformation) The linear transformation L defined by

L(x,y) = (λx,λy) is a similitude with center at the origin and with coefficient λ, a positive real number. The matrix (**) is the matrix associated with the similitude.

Let P(x,y) and P'(λx,λy). Since (λx,λy) = λ(x,y), OP' = λOP. Therefore, L is a similitude with coefficient λ.

[3.2e] (Parallel image) A similitude carries segments onto parallel segments.
Notation: If P' and Q' are images of points P and Q respectively, then segment P'Q' is parallel to segment PQ.

Vectors OP' = λOP and OQ' = λOQ. Converting these vectors to position vectors

p' = λp q' = λq Subtracting the first equation from the second produces q' -- p' = λ(q' -- p') This is equivalent to P'Q' = λPQ which means that segment P'Q' is parallel to segment PQ.
There is a simple geometric proof of [3.2e] using the similarity of triangles P'OQ' and POQ.

[3.2f] (Preserving angles) A similitude carries intersecting lines onto intersecting lines in such a way that the angle of intersection of the second pair of lines is congruent to the angle of intersection of the first pair.
Notation: If points A', B', C' are images of points A, B, C then angle A'B'C' is congruent to angle ABC.

By [[3.2e] segment A'B' and B'C' are parallel to AB and BC respectively. Therefore angle A'B'C' is congruent to angle ABC.

However, similitudes do not preserve lengths of segments. The images of segments are not necessarily congruent to the segments.

_________________________________________________

Some of the trigonometric functions are used extensively below. The reader may wish to review some trigonometry. Click here to go to the review.

[3.3] Action: Orthogonal reflection of the entire plane about a line through the origin

Given a flat sheet of blank paper there are four ways under consideration to fold it. The folds are indicated by red lines.

Figure Fig 1a shows a right triangle ABC drawn on the right half of a piece of paper. Suppose ink for the triangle has not dried. The paper will be folded along the vertical line which divides the paper into two halves. As a result the ink on the right triangle touches the left half of the paper and leaves a copy of a right triangle there. If the paper is unfolded and made flat again then there are two right triangles. (Fig 1b) The triangle A'B'C' on the left is a reflection of the triangle on the right about the vertical line where the paper had been folded.

Upon comparison of the two triangles, two observations can be made.
(a) the copy on the left side of the paper is congruent to the original triangle on the right side;
(b) the copy has an orientation opposite to the orientation of the original triangle on the right side.
The vertices of the original triangle on the right side are labeled A,B,C in a counter-clockwise direction. But the labels on the copy A',B',C' are given in a clockwise direction. The process of reflecting the original triangle about the vertical line makes a congruent figure but reverses orientation.

This time a not-yet-dried parallelogram ABDC is located in the upper half of another paper, and a horizontal line is drawn below it. The paper is folded along the horizontal line to produce a copy of the parallelogram A'B'C'D' in the lower half of the paper. (Fig 2). A reflection about the horizontal line has occurred. The parallelograms are congruent but have opposite orientation.

In both examples it is the action of producing the copies that is the focus of these discussions. First is a geometric description of the action that reflects the triangle about the vertical line (Fig 1b). If a horizontal line segment is drawn joining points A and A' (vertices of the triangles) then the vertical line is the perpendicular bisector of segment AA'. Similarly, it is the perpendicular bisector of horizontal segments BB' and CC'. In general, if P is any point, then its image P' determines a horizontal segment PP' of which the vertical line is a perpendicular bisector.

Now to derive the formula for the linear transformation that does the reflecting of the triangle. The lintr requires the environment of the coordinate plane where each point has a label, an array of length 2. Place the original right triangle in the first quadrant of the coordinate plane and the vertical line on the y-axis. Since the vertices are points, they have attached labels, say

A(α₁,α₂), B(β₁,β₂), C(γ₁,γ₂) But vertices A',B',C' also have attached labels and they must be A'(-α₁,α₂), B'(-β₁,β₂), C'(-γ₁,γ₂) Comparing these corresponding labels, it is possible to see a pattern. Suppose P(x,y) is any point. If P is reflected about the y-axis to produce the image point P', then P'(-x,y).

[3.3a] (Orthogonal reflection L about the y-axis)
geometric definition L(P) = P' where the y-axis is the perpendicular bisector of segment PP' or P=P' if point P is on the y-axis.
algebraic definition L(x,y) = (-x,y).

The algebraic definition makes L a linear transformation, since the expressions inside (-x,y) are linear expressions.

The derivation of the formula for the lintr that reflections the parallelogram about the horizontal line (Fig 2) is obtained in a similar way. But in this case the horizontal line is the perpendicular bisector of vertical segments AA', BB', CC' DD'. Place the original parallelogram in the first quadrant of the coordinate plane and the horizontal line on the x-axis.

[3.3b] (Orthogonal reflection L about the x-axis)
geometric definition L(P) = P' where the x-axis is the perpendicular bisector of segment PP' or P=P' if point P is on the x-axis.
algebraic definition L(x,y) = (x,-y).

The algebraic definition makes L a linear transformation, since the expressions inside (x,-y) are linear expressions.
Of course, this lintr is different from the lintr that is a reflection about the y-axis described in [3.1a]..

The reflections about the xaxis and the y-axis are linear transformations. Therefore, they have associated matrices. These matrices are obtained from what happens to the special points (1,0) and (0,1). The reflection about the y-axis carries these points onto (-1,0) and (0,1) respectively. Then the transposes of these images become the columns in the associated matrix.

The reflection about the x-axis carries the special points (1,0) and (0,1) onto (1,0) and (0,-1) respectively. Then the transposes of these images become the columns in the associated matrix.

[3.3c] (Associated matrices for the orthogonal reflections about the axes:

The geometric description of an orthogonal reflection about any line is the same outside the coordinate plane. In the adjacent figure a slant line_o is used. It is called the axis of the reflection. Points P,Q,R are reflected about line_o onto points P',Q',R'. Again the axis is the perpendicular bisector of segments PP', QQ', RR'. Point W is on the axis, so W' = W. If a lintr is to be involved, then the axis must pass through the origin O. Incidentally, the word "orthogonal" means perpendicular. There are reflections, not discussed in these sections, of "slant reflections" in which the segments PP'.QQ', RR' are not perpendicular to the axis. Their mid points do lie on the axis, and all the segments are parallel. Only orthogonal reflections are discussed here. Sometimes the word "orthogonal" is omitted, but understood.

It is obvious that the position and location of the axis completely determines the reflection. The axis may be placed as a line_o in the coordinate plane so that it passes through the origin O. Let θ be the inclination angle as shown. If θ is between 0° and 90° then line_o slants from lower left to upper right (Fig 4a). If line_o is between 90° and 180° then line_o slants from lower right to upper left (Fig 4b). There is no need to consider angles between 180° and 360° because θ and θ - 180° determine the same line_o.

Any orthogonal reflection about a line through the origin must reflect the special points (1,0) and (0,1) about that line. In [3.3d] and [3.3e] that follow assume that the line passes through the origin and has an inclination angle θ.

[3.3d] (Reflected images of the special points) A reflection about the line with inclination angle θ carries (1,0) and (0,1) onto the points

(cos 2θ, sin 2θ) and (sin 2θ, - cos 2θ)

Click here to see a discussion supporting this statement.

[3.3e] (Reflection is a linear transformation) The linear transformation L defined by

L(x,y) = (x cos 2θ + y sin 2θ, x sin 2θ -- y cos 2θ) and determined by the images of the special points in [3.3d] is the reflection about the line.

Click here to see a discussion supporting this statement.

Statements [3.3d] and [3.3e] combine to support the following:

[3.3f] (Orthogonal reflection about any line through the origin)
geometric definition If P' is the reflected image of any point P then the line is the perpendicular bisector of segment PP', or P=P' if point P is on the line (Fig 4).
algebraic definition The matrix associated with the orthogonal reflection about a line through the origin and making an inclination angle θ with the positive x-axis is

The associated matrix comes directly from [3.3d].

If θ = 90° then line_o is the y-axis. The associated matrix (6) becomes the first matrix in (3) above.
If θ = 0° or θ = 180° then line_o is the x-axis. The associated matrix (6) becomes the second matrix in (3) above.
If θ = 45° then line_o bisects the angle between the positive x-axis and the positive y-axis. The reader can replace θ in the formula for L in [3.1e] by 45° (dont forget to double this angle to 90°) and obtain: L(x,y) = (y,x). Therefore L interchanges the coordinates. What is the matrix associated with L ?

_________________________________________________

[3.4] Action: Rotation of the entire plane about the origin and through a given angle

Throughout this chapter the following are true:
   (i) All rotations are about the origin O.
   (ii) All rotations are actually functions that carry the plane into (or onto) itself.
   (iii) Two rotation functions are equivalent if they carry each point P onto the same point P'.

A precise geometric definition of a rotation is not given here. It would not be in the spirit of an intuitive introduction to rotations. Instead two equivalent intuitive descriptions [3.4a] and [3.4b] are given below.

[3.4a] (Intuitive geometric description 1) Keep in mind that the rotation function on the plane carries points onto points in a definite manner. The plane does not move, so it will be called the fixed plane. A second plane is invented to carry out the motion of rotation. It is called the plane(See adjacent figure.) It lies above the fixed plane. There is a hole just above the origin in the other fixed plane. An axis is created perpendicular to the fixed plane and runs through the hole.

Suppose this plane is empty. Then each point P in the fixed plane is copied vertically into the movable plane above it to make a point P. After all the points have been copied from the fixed plane, the movable plane is rotated through a given angle. If the angle is positive then the rotation is done counter-clockwise. If the angle is negative then the rotation is clockwise. Each point P is carried along with the rotating movement. After the rotation of the movable plane is completed, then P is above some point P' in the fixed plane. (The new position of the movable plane and point P' are not shown in the figure.)

The rotation function on the fixed plane carries P onto P'.
The phrase "rotation function on a plane" is less common that the phrase "rotation of the plane". The reader may mentally attached the word "function" to the word "rotation" whenever it occurs alone in this section.

The above description places the two parallel planes in three dimentional space. The following is an equivalent description of the rotation function in the two dimentional (fixed) plane, and does not involve any rotating (green) plane. But it does involve an arbitrary point P in the (fixed) plane to be carried onto a point P' in that same plane. It is this description that will lead to a more rigorous algebraic definition of a rotation.

[3.4b] (Intuitive geometric description 2) With one end point fixed on the origin O swing segment origin OP around in a circular motion through angle θ and stop. Then the segment origin OP' is formed. The point P' is the rotated image of point P. If θ is positive, the the motion is counter-clockwise (Fig 1a). If θ is negative then the motion is clockwise (Fig 1b).

[3.4c] There are several immediate results from this description:
   (i) The angle completely determines the rotation.
   (ii) Any point and its rotated image are the same distance from the origin.
   (iii) If θ = 0, there is no motion. Then P' = P. The identity function I is a rotation through an angle of zero.
   (iv) The rotation of any point about the origin through an angle of θ is equivalent to a rotation of that same point through an angle of θ - 360°.
   (v) The rotation through any number of revolutions, positive or negative, carries each point back onto itself.
   (vi) A rotation through an angle -180° is equivalent to a rotation through an angle of 180°.

The proof of the following important statement is almost trivial:

[3.4d] (Combining rotations) If a pair of rotations act on a plane, then the angle of the combined rotations (one acting after the other) is equal to the sum of the angles of the individual rotations.
Notation: If θ₁ and θ₂ are the angles of the two rotations, then their combined action (composition) is a rotation with an angle of θ₁ + θ₂.

The first rotation carries point P onto point P', and the second rotation carries point P' onto P".

Angle PO P' = θ₁ and angle P'OP" = θ₂. Therefore the angle of the combined rotations = angle POP" = angle POP' + angle P'OP" = θ₁ + θ₂

The angles involved with rotations are real numbers, which are commutative. This makes the combinedaction of any two rotations commutative - either rotation may act first.
Since θ + -θ = 0, every rotation has an inverse. Simply rotate the same amount but in the opposite direction.

[3.4e] The collection of all rotations about the origin form a commutative group.

It is time now to derive the formula for a rotation. First the images of the special points (1,0) and (0,1) are calculated.

[3.4f] (Rotated images of the special points) A rotation through angle θ carries points (1,0) and (0,1) onto the points

(cos θ, sin θ) and (-sin θ, cos θ) respectively.

Click here to see the discussion supporting this statement.
From the transpose of coordinates of image points form the matrix

[3.4g] (General formula for rotation) The linear ttransformation L defined by

L(x,y) = (x cos θ + y sin θ, -x sin θ + y cos θ) is the rotation about the origin through angle θ. The associated matrix is given in (2)
Notation: x' = x cos θ + y sin θ and y' = -x sin θ + y cos θ.

Click here to see the discussion supporting this statement.

_________________________________________________

[3.5] Action: Orthogonal projection of the entire plane onto a line through the origin

One of the important constructions in plane geometry is to construct a line through a given point and perpendicular to a given line. (Only a compass and straight edge can be used.)

[3.5a] (Geometric description) Where the line through the given point intersects the given line perpendicularly is the point called the orthogonal projection of the given point upon the given line. (If the given point is already on the given line then the point is also the orthogonal projection upon the given line.) Do this action for every point in the plane.

Clearly the line segment joining the given point and its projected image is perpendicular to the given line.

[3.5b] Some obvious results of [3.5a] (i) The location and position of a given line completely determine the orthogonal projection of the entire plane onto it.
(ii) The image of a line or line segment perpendicular to the given line is a point. (iii) Parallel line segments not perpendicular to the given line are projected onto congruent segments inside the given line. (iv) A projection carries subsets of 2 dimensions (have langth and width) onto subsets of lesser dimension (as usbsets of the given line). (v) There is no inverse action to undo the action of a projection.

Two simple orthogonal projects occur in the coordinate plane if the given lines are the axes. In Fig 3 the arbitrary point P(x,y) is orthogonally projected onto the x-axis, and the arbitrary point Q(x,y) is orthogonally projected onto the y-axis. It is not difficult to determine the image points, namely P'(x,0) and Q'(0,y). (The second coordinate of every point on the x-axis must be 0. The first coordinate of every point on the y-axis must be 0. The projections are vertical and horizontal respectively.)

The vertical projection onto the x-axis of the special points (1,0) and (0,1) is (1,0) = (1,0) (point is already on the x-axis) (0,1) ---> (0,0), (point (0,1) is above the origin). Also the horizontal projection onto the y-axis of (1,0) and (0,1) produces images (0,0) and (0,1).

>b>[3.5c] (Images of the special points) The images of (1,0) and (0,1) projected orthogonally onto the x-axis and y-axis are (1,0), (0,0) and (0,0), (0,1) respectively.

The matrices from the transposes of these images are:

[3.5d] (Formula for projection onto the axes) The linear ttransformations L₁ and L₂ defined by

L₁(x,y) = (x,0)    and    L₂(x,y) = (0,y)) are orthogonal projections onto the x-axis and y-axis respectively. The associated matrices are given in (3)
Notation: (onto the x-axis) x' = x and y' = 0;     (onto the y-axis) x' = 0 and y' = y.

[3.5e] (Projection of the special points) The orthogonal projection of points (1,0) and (0,1) onto a line through the origin and with inclination angle θ (made with the positive x-axis) are the points (cos²θ, sin θ cos θ)     and     (sin θ cos θ, sin²θ) where θ is an angle between 0° and 180°. θ may equal 0° but need not equal 180°.

Click here to see an argument supporting this statement.

From the transposes of the images of the special points the following matrix is obtained:

[3.5f] (General formula for projection) The linear ttransformation L defined by L(x,y) = (x cos²θ + y sin θ cos θ, x sin θ cos θ + y sin²θ) is the projection onto a line through the origin with inclination angle θ.(made with the positive x axis). The associated matrix is given in (4) θ is an angle between 0° and 180°. θ may equal 0° but need not equal 180°.
Notation: x' = x cos²θ + y sin θ cos θ and y' = x sin θ cos θ + y sin²θ

Click here to see an argument supporting this statement.

A projection onto a line is singular because it has no inverse. There is no linear transformation that will undo what a projection has done. This is confirmed by the determinant of (4) which is zero. However, the matrix in (4) has rank 1, making projections onto lines having rank 1. See [2.7] above.