Abstract Algebra:Constructions with Straightedge and Compass

We may not appear to have accomplished anything of significance yet concerning the theory of fields, but we have enough machinery to apply to several significant problems. In this section we will consider the problem of constructing geometrical figures using only a compass and a straightedge. For our purposes, a straightedge is a device which can be used only to draw a line between two points and a plane. It has no measurement capabilities. Such constructions were first considered by the Greek geometers and are probably familiar to the reader from a high school course in geometry. The Greeks posed three very interesting questions concerning constructibility which they were unable to solve:

Problem 1: (Trisecting an Angle) Given an arbitrary angle theta

, construct the angle theta

/3, using only straightedge and compass.

Problem 2: (Duplicating a Cube) Given an arbitrary cube C of volume V, construct a cube having volume 2V, using only straightedge and compass.

Problem 3: (Squaring a Circle) Given an arbitrary circle C of area A, construct a square having area A, using only straightedge and compass.

Our plan in this section is to first discuss the general problem of constructibility. From this discussion will emerge a powerful theory which will provide immediate solutions to Problems 1 and 2 and will point the way to a solution of Problem 3. It will turn out that all three problems have no solution. We will turn our attention to the construction of regular polygons.

Let us first lay down the ground rules for our investigation. Let us restrict ourselves to plane constructions. Further, we will identify the Euclidean plane with the complex numbers, by identifying the point with coordinates (a,b) with the complex number a + bi. Finally, let us assume that in addition to a compass and straightedge, we are given the line segment connecting (0,0) and (1,0), somehow marked off on our plane. (This line segment of length 1 is just used as a comparison device. We could just as well begin our constructions by drawing a line segment of arbitrary length and using it for comparison.) Our problem is to decide which geometric figures can be constructed. Henceforth, when we refer to "construction" we will always mean "construction with straightedge and compass, given the above unit segment as initial data."

Let us now reduce our geometric problem to an algebraic problem. It is clear that each geometric figure which we can construct can be defined by a finite number of points, line segments, and circular arcs. Let us describe a given geometrical figure by means of a family of complex numbers as follows: The finite number of points can be viewed as complex numbers as described above. A line segment is described by the complex numbers corresponding to its endpoints. A circular arc is described by four numbers: two corresponding to the endpoints of the arc, on corresponding to the center of the circle, and one equal to the length of the radius of the circle. It is clear that the numbers described are necessary and sufficient to construct the geometrical figure. Thus, let us henceforth think of a geometrical figure in terms of a collection of complex numbers describing points, lines and arcs in the figure. It is clear that in order to construct a given figure, it is both necessary and sufficient to be able to construct the line segments connecting (0,0) to each of the complex numbers describing the figure. This leads us to the following definition.

Definition 1: A complex number alpha

is said to be constructible if the line segment connecting (0,0) to alpha

is constructible.

Theorem 2: A geometrical figure is constructible if and only if each of the complex numbers describing it is constructible.

Thus, our original question concerning constructibility of geometric figures is reduced to one concerning constructibility of complex numbers. Let C denote the set of all constructible complex numbers. We will give a more-or-less complete description of C.

Before proceeding further, let us recall some of the possible elementary constructions of Euclidean geometry. For the actual constructions refer back to high school geometry.

2. Construct a line perpendicular to a given line segment L at a given point P on L.

3. Construct a line through a given point P which is parallel to a given line L.

4. Given line segments of lengths l and l', construct a line segment of length ll'.

Proof: Our assertion amounts to the following: If alpha

and

are constructible, then so are alpha

, -

, 1/

(

0). Let us consider each case separately. Recall that addition of complex numbers can be accomplished geometrically via the parallelogram law. Therefore, let us construct segment from origin to alpha

and

as in Figure 1. Use basic construction 4 above to construct line segment from beta to gamma

parallel to segment from origin to alpha

measure off a line segment of length segment from origin to alpha

, with one endpoint of the line segment at beta

. Then the other end of the measured segment is alpha

. Thus,

is constructible.

Figure 1: Construction of alpha

In order to construct - beta , construct segment from origin to beta as in Figure 2. Continue the line segment through the origin and on the continued portion measure off a segment of length , having O as one endpoint. Then the other endpoint of the measured segment is - beta .

Figure 2: Construction of - beta

Let us construct alpha · beta . Without loss of generality, we may assume that alpha not equal 0, beta 0. Then alpha and beta can both be written in polar form

= r₁(cos

₁ + i sin

₁).

= r₂(cos

₂ + i sin

₂).

By de Moivre's theorem,

= r₁r₂[cos( theta

₁ +

₂) + i sin( theta

₁ +

₂)]

Thus, the line segment O alpha · beta has length r₁r₂ and makes an angle theta ₁ + theta ₂ with the positive X-axis. This is the clue to the construction of alpha · beta . Refer to Figure 3, where alpha and beta have been constructed. Construct the line segment segment from origin to gamma making and angle theta ₁ + theta ₂ with the positive X-axis. Measure off a segment segment from origin to delta of length r₁r₂ on . This is possible by basic construction 4. Then delta = alpha · beta .

Figure 3: Construction of alpha

Finally, assume that beta not equal 0. Then, since sin² theta ₂ + cos² theta ₂ = 1 and cos(- theta ₂) = cos theta ₂, sin(- theta ₂) = -sin theta ₂, we have

= r₂^-1(cos theta

₂ + i sin

₂)^-1

= r₂^-1[(cos theta

₂ - i sin

₂)/(cos²

₂ + sin²

₂)]

= r₂^-1[cos(- theta

₂) + i sin(- theta

₂)].

Thus, the line segment O beta ^-1 has length r₂^-1 and makes an angle - theta with the positive X-axis. The construction of this segment uses basic construction 5.

Corollary 4: All rational numbers are constructible.

Proof: We have bee given 1 as part as our initial data. Thus, 1 element of C. But since C is a field, this implies that Q subset of C.

Proposition 5: Let alpha element of C be constructible and let square root of alpha denote one of the square roots of alpha . Then is constructible.

Proof: Without loss of generality, assume that alpha not equal 0. By de Moivre's theorem,

= ±

[cos(

₁/2) + i sin( theta

₁/2)].

Let us only consider the positive sign. The reasoning for the negative sign is similar. Then the line segment O square root of alpha is a line segment of length square root of r1 which makes an angle of theta ₁/2 with the positive X-axis. Therefore, can be constructed using basic constructions 1 and 6.

Theorem 6: Let alpha ₁,..., alpha _n be complex numbers such that

₁²

_i²

₁,...,

_i-1) (2 < i < n).

Proof: By Corollary 4, every element of Q is constructible and by Proposition 5 and the assumption alpha ₁² Q, we see that alpha ₁ is constructible. Therefore, since the constructible numbers form a field, every element of Q( square root of alpha ) is constructible. Thus, the theorem is true for n = 1. Let us proceed by induction on n. Let n > 1 and assume the theorem for n - 1. Then every element of Q( alpha ₁,..., alpha _n-1) is constructible. By hypothesis, alpha _n² Q( alpha ₁,..., alpha _n-1) so that alpha _n is a square root of a constructible number and hence is constructible by Proposition 5. But since the constructible numbers form a field, every element of Q( alpha ₁,..., alpha _n-1)( alpha _n) = Q( alpha ₁,..., alpha _n) is constructible.

The amazing fact is that the converse is also true. We have

Theorem 7: Let beta element of C be constructible. Then there exists a set of complex numbers { alpha ₁,..., alpha _n} such that

₁²

_i²

₁,...,

_i-1) (2 < i < n).

and such that beta element of Q( alpha ₁,..., alpha _n). Thus, if beta is constructible, deg(Q( beta )/Q) = 2^k for some k.

Before proceeding with the proof of Theorem 7, let us do some preliminary work. Assume that the points beta ₁, beta ₂,..., beta _m-1 have been constructed. What points can we construct using beta ₁, beta ₂,..., beta _m-1? There are two elementary constructions which we can perform: (a) We can draw a line L connecting beta _i to beta _j (i not equal j). (b) We can draw a circle C with center at beta _i which passes through beta _j (ij). Thus, if beta _m is constructible from beta ₁, beta ₂,..., beta _m-1, then beta _m is either the intersection of two lines L₁ and L₂, the intersection of a line L and circle C, or the intersection of two circles C₁ and C₂. Let

(1)

_j =

_j + i

_j (

_j,

R, 1 < j < m),

(2)

F_j = Q(i, gamma

₁,

₁,...,

_j,

_j) (1 < j < m).

We will consider separately three cases:

Case 1: beta _m is the intersection of lines L₁ and L₂.

The lines L₁ and L₂ have equations

L₁: a₁x + b₁y + c₁ = 0,

(3)

L₂: a₂x + b₂y + c₂ = 0,

where a₁,b₁,c₁,a₂,b₂,c₂ element of F_m-1. The system of equations (3) has the solution x = alpha _m, y = delta _m, by assumption. But the solution of the system (3) can be calculated rationally in terms of a₁,b₁,c₁,a₂,b₂,c₂ and therefore lies in F_m-1. Thus we see that

(4)

F_m-1,

F_m-1

F_m = F_m-1

since F_m = F_m-1( alpha _m, delta _m) if m > 2.

Case 2: beta _m is the intersection of the line L and the circle C.

The line L and the circle C have the equations

L: ax + by + c = 0,

(5)

C: dx² + ey² + fx + gy + h = 0,

where a,b,c,d,e,f,g,h element of F_m-1. On of the solutions of the system of equations (5) is x = alpha _m, y = delta _m, by assumption. On the other hand, the system (5) can be solved by substituting the linear relation into the quadratic equation and then solving for, say, x, from which the corresponding value of y can be computed. Therefore, the solutions of system (5) can be computed in terms of the square root of an element eta , where eta is computable rationally in terms of a,b,c,d,e,f,g,h. In particular, eta element of F_m-1 and alpha _m F_m-1( square root of eta ), delta _m F_m-1(). Further,

(6)

F_m

F_m-1(

since F_m = f_m-1( alpha _m, delta _m) if m > 2.

Case 3: beta _m is the intersection of the circles C₁ and C₂.

The equations of C₁ and C₂ are given by

C₁: x² + y² + a₁x + b₁y + c₁ = 0,

C₂: x² + y² + a₂x + b₂y + c₂ = 0,

where a₁,b₁,c₁,a₂,b₂,c₂ element of F_m-1. The points of intersection of C₁ and C₂ are the same as the points of intersection of C₁ with the line whose equation is

(a₁ - a₂)x + (b₁ - b₂)y + (c₁ - c₂) = 0.

Therefore, by case 2, there exists eta in F_m-1 such that

(7)

F_m

F_m-1(

By comparing the results of cases 1-3, we have

Lemma 8: Suppose that the points beta ₁,..., beta _m-1 (m > 2) have been constructed and that beta _m is constructible from beta ₁,..., beta _m-1. Further, suppose that beta ₁ = alpha _j + i delta _j, where alpha _j and delta _j are real, and let

F_j = Q(i, alpha

₁,

₁,...,

_j,

_j) (1 < j < m.).

Then there exists eta element of F_m-1 such that F_m subset of F_m-1( square root of eta ).

Proof: By (6) and (7) we are done in cases 2 and 3. In case 1 we may set eta = 1 by (4).

Let us now prove Theorem 7. Suppose that beta can be constructed by successively constructing beta ₁, beta ₂,..., beta _n = beta , where beta ₁ is constructed from beta ₀ = 1. For m = 1,...,n let F_m be as in the Lemma, and set F₀ = Q(i). Let us apply Lemma 8 to each of the sets of numbers beta ₀, beta ₁,..., beta _m (1 < m < n) to get that there exists eta _m element of F_m-1 such that

(8)

F_m

F_m-1(

). (1 < m < n).

Define

₀ = i,

_j =

(1 < j < n).

We assert that

(9)

F_m

₀,

₁,...,

_m) (1 < m < n).

This follows trivially by induction from (8). It is clearly true for m = 1 by (8). Assume that m > 1 and that (9) is true for m - 1. Then, by the induction hypothesis and (8),

F_m

F_m-1(

_m)

₀,...,

_m-1)(

_m)

= Q(

₀,...,

_m).

This completes the induction and hence (9) is proved. It is clear that alpha ₀² = -1 element of Q. Also, by (9), for 1 < m < n, we have

_m² =

F_m-1

₀,...,

_m-1).

Finally, by (9) for m = n, and the fact that beta _n = beta , we have

_n + i

F_n

₀,...,

_n).

This completes the proof of the first assertion of Theorem 7. In order to show that deg(Q( beta )/Q) = 2^k for some k, it suffices to show that

deg(Q(

₀,...,

_n)/Q) = 2^r

for some r. But, if we set tilde F _i = Q( alpha ₀,..., alpha _i) (0 < i < n), we have deg( tilde F _i+1/ tilde F _i) = 1 or 2 since alpha _i+1² element of tilde F _i and tilde F _i+1 = tilde F _i( alpha _i+1). Moreover,

deg(Q(

₀,...,

_n)/Q)

= deg(

_n/

_n-1) · deg( tilde F

_n-1/

_n-2)···deg( tilde F

₁/

₀)

= 2^r

for some r.

Let us now illustrate Theorem 7 by applying it to the problem of trisecting an angle. Not only will we show that there is no general procedure for trisecting an angle with compass and straightedge, but there are particular angles which cannot be trisected, For example, let us prove:

Theorem 9: A 60^o angle cannot be trisected using only a compass and a straightedge.

Proof: It is well known that a 60^o angle can be constructed using only a compass and a straightedge. Therefore, a 60^o angle can be trisected if and only if it is possible to construct a 20^o angle using only a compass and a straightedge. But let us see what it means for an angle theta to be constructible. If theta is constructible, then we may place one side of theta on the X-axis and construct the point where the other side of theta intersects the circle of radius 1 with center at the origin (see Figure 4). But by elementary trigonometry, this point is cos theta + i sin theta . Therefore, if angle theta is constructible, the number cos theta + i sin theta is constructible. Conversely, if cos theta + i sin theta is constructible, then angle theta is constructible (see Figure 4). Therefore, a 20^o angle is constructible if and only if

cos(20^o) + i sin(20^o) = zeta

is constructible.

Figure 4: Construction of the Angle theta

Let us assume that zeta is constructible. By Theorem 7, deg(Q( zeta )/Q) = 2^r for some r. However, zeta is a primitive eighteenth root of 1, so by Theorem 1 of examples,

deg(Q(

)/Q) =

(18)

= 6.

Thus, a contradiction is reached and zeta is not constructible. Therefore, an angle of 60^o cannot be trisected using only straightedge and compass.

Let us next turn to the problem of duplicating the cube. For simplicity's sake let us consider a cube C of side 1. We wish to construct a cube of volume 2. This is equivalent to constructing a line segment of length cube root of 2 . Let us show that the number is not constructible. Note that

deg(Q(

)/Q) = 3.

Therefore, by Theorem 7, we see that cube root of 2 is not constructible. Thus, we have

Theorem 10: It is impossible to duplicate the cube of side 1 using only a straightedge and compass.

Let us now turn to the problem of squaring the circle. For the sake of simplicity, let us consider the circle of radius 1. It has area equal to . Thus, we are asked to construct a square of area . This is equivalent to constructing a line segment of length square root of pi . Thus, the question of squaring the circle comes down to: Is constructible. We cannot give a complete proof of the fact that the answer is no. However, let us at least give an indication of the idea involved. If is constructible, then deg(Q()/Q) is a power of 2. In particular, is algebraic over Q by Theorem 5 of the section on algebraic and transcendental numbers. Actually, square root of pi is transcendental over Q. However, the proof of this last statement is very deep and will not be included here.

Let us now take up the problem of the construction of a regular polygon of n sides. Constructing such a polygon is equivalent to constructing such a polygon is equivalent to constructing an angle of 360/n. But as we saw in the proof of Theorem 9, the angle theta can be constructed if and only if cos theta + i sin theta is constructible. Therefore, a regular polygon of n sides can be constructed if and only if

cos(360/n) + i sin(360/n) = zeta

is constructible. But zeta _n is a primitive nth root of 1 and deg(Q( zeta _n)/Q) = phi (n). Therefore, by Theorem 7, if zeta _n is constructible, phi = 2^r for some r. Thus, if a regular polygon of n sides is constructible, then phi (n) = 2^r. Actually the converse is also true. Let us investigate the values of n for which phi (n) is a power of two. By consulting Table 1, we see that this is not always true. For example phi (7) = 6, so that it is impossible to construct a regular 7-gon.

Table 1: Values of phi

(n)

n	(n)	n	(n)
2	1	12	4
3	2	13	12
4	2	14	6
5	4	15	8
6	2	16	8
7	6	17	16
8	4	18	6
9	6	19	18
10	4	20	8
11	10	21	12

Let

n = p₁^r₁· p₂^r₂··· p_t^r_t (r_i > 0)

be the decomposition of n into a product of powers of distinct primes p₁,..., p_t. Then

(10)

(n) =

(p₁^r₁) ··· phi

(p_t^r_t),

so that phi (n) is a power of 2 if and only if phi (p_j^r_j) is a power of 2 (1 < j < t). Note that

(p_j^r_j) = p_j^r_j-1(p_j - 1).

Therefore, if p_j = 2, phi (p_j^r_j is automatically a power of two. Thus, we are reduced to the following question: Let p be an odd prime, r a positive integer. When is p^r-1(p - 1) a power of 2? It is clearly necessary and sufficient that r = 1 and p - 1 = 2^k for some k. However, if p - 1 = 2^k, then k must be a power of 2. For if k is divisible by an odd prime q, we must have k = qu for some positive integer u. But then

p = 2^k + 1

= 2^qu + 1

= (2^u + 1)(2^(q-1)u - 2^(q-2)u + .. + 1),

which contradicts the fact that p is prime. Thus, we have shown that if p^r-1(p - 1) is a power of 2, then r = 1 and p = 2^{2^v} + 1. And conversely, if p = 2^{2^v} + 1, then phi (p) is a power of 2. A prime p of the form

2^{2^v} + 1

is called a Fermat prime. The first few Fermat primes after 3 are

2^2¹ + 1 = 5,

2^2² + 1 = 17,

2^2³ + 1 = 257,

2^2⁴ + 1 = 65,537.

It is not true that every integer of the form 2^{2^v} + 1 is prime. For as Euler showed, 2^2⁵ + 1 is divisible by 641. The net result of the above discussion is

Theorem 11: In order for a regular polygon of n sides to be constructible using only straightedge and compass, it is necessary and sufficient for n to be of the form

n = 2^rp₁p₂...p_w,

where p₁p₂...p_w are distinct Fermat primes.