NUMBER[1] (through Fr. nombre, from Lat. numerus; from a root seen in Gr. νέμειν to distribute), a word generally expressive of quantity, the fundamental meaning of which leads on analysis to some of the most difficult problems of higher mathematics.
1. The most elementary process of thought involves a distinction within an identity-the A and the not-A within the sphere throughout which these terms are intelligible. Again A may be a generic quality found in different modes Aa, Ab, Ac, &c.; for instance, colour in the modes, red, green, blue and so on. Thus the notions of “one,” “two,” and the vague “many” are fundamental, and must have impressed themselves on the human mind at a very early, period: evidence of this is found in the grammatical distinction of singular, dual and plural which occurs in ancient languages of widely different races. A more definite idea of number seems to have been gradually acquired by realizing the equivalence, as regards plurality, of different concrete groups, such as the fingers of the right hand and those of the left. This led to the invention of a set of names which in the first instance did not suggest a numerical system, but denoted certain recognized forms of plurality, just as blue, red, green, &c., denote recognized forms of colour. Eventually the conception of the series of natural numbers became sufficiently clear to lead to a systematic terminology, and the science of arithmetic was thus rendered possible. But it is only in quite recent times that the notion of number has been submitted to a searching critical analysis: it is, in fact, one of the most characteristic results of modern mathematical research that the term number has been made at once more precise and more extensive.
2. Aggregates (also called manifolds or sets).—Let us assume the possibility of constructing or contemplating a permanent system of things such that (1) the system includes all objects to which a certain definite quality belongs; (2) no object without this quality belongs to the system; (3) each object of the system is permanently recognizable as the same thing, and as distinct from all other objects of the system. Such a collection is called an aggregate: the separate objects belonging to it are called its elements. An aggregate may consist of a single element.
It is further assumed that we can select, by a definite process, one or more elements of any aggregate at pleasure: these form another aggregate . If any element of remains unselected, is said to be a part of (in symbols, ): if not, is identical with . Every element of is a part of . If and , then .
When a correspondence can be established between two aggregates and in such a way that to every element of corresponds one and only one element of , and conversely, and are said to be equivalent, or to have the same power (or potency); in symbols, . If and , then . It is possible for an aggregate to be equivalent to a part of itself: the aggregate is then said to be infinite. As an example, the aggregates , &c., and , &c., are equivalent, but the first is only a part of the second.
3. Order.—Suppose that when any two elements of an aggregate are taken there can be established, by a definite criterion, one or other of two alternative relations, symbolized by and , subject to the following conditions:-(1) If , then , and if , then ; (2) If and , then . In this case the criterion is said to arrange the aggregate in order. An aggregate which can be arranged in order may be called ordinable. An ordinable aggregate may, in general, by the application of different criteria, be arranged in order in a variety of ways. According as or we shall speak of a as anterior or posterior to . These terms are chosen merely for convenience, and must not be taken to imply any meaning except what is involved in the definitions of the signs and for the particular criterion in question. The consideration of a succession of events in time will help to show that the assumptions made are not self-contradictory. An aggregate arranged in order by a definite criterion will be called an ordered aggregate. Let be any two elements of an ordered aggregate, and suppose . All the elements (if any) such that are said to fall within the interval . If an element , posterior to , can be found so that no element falls within the interval , then is said to be isolated from all subsequent elements, and is said to be the element next after . So if , and no element falls within the interval , then is isolated from all preceding elements, and is the element next before . As will be seen presently, for any assigned element , either, neither, or both of these cases may occur.
An aggregate is said to be well-ordered (or normally ordered) when, in addition to being ordered, it has the following properties: (1) has a first or lowest element a which is anterior to all the rest; (2) if is any part of , then has a first element. It follows from this that every part of a well-ordered aggregate is itself well-ordered. A well-ordered aggregate may or may not have a last element.
Two ordered aggregates are said to be similar () when a one-one correspondence can be set up between their elements in such a way that if are the elements of B which correspond to any two elements of A, then or according as or . For example, , because we can make the even number correspond to the odd number and conversely.
Similar ordered aggregates are said to have the same order-type. Any definite order-type is said to be the ordinal number of every aggregate arranged according to that type. This somewhat vague definition will become clearer as we proceed.
4. The Natural Scale.—Let be any element of a well-ordered aggregate . Then all the elements posterior to form an aggregate , which is a part of and, by definition, has a first element . This element is different from , and immediately succeeds it in the order of . (It may happen, of course, that does not exist; in this case is the last element of .) Thus in a well-ordered aggregate every element except the last (if there be a last element) is succeeded by a definite next element. The ingenuity of man has developed a symbolism by means of which every symbol is associated with a definite next succeeding symbol, and in this way we have a set of visible or audible signs 1, 2, 3, &c. (or their verbal equivalents), representing an aggregate in which (1) there is a definite order, (2) there is a first term, (3) each term has one next following, and consequently there is no last term. Counting a set of objects means associating them in order with the first and subsequent members of this conventional aggregate. The process of counting may lead to three different results: (1) the set of objects may be finite in number, so that they are associated with a part of the conventional aggregate which has a last term; (2) the set of objects may have the same power as the conventional aggregate; (3) the set of objects may have a higher power than the conventional aggregate. Examples of (2) and (3) will be found further on. The order-type of 1, 2, 3, &c., and of similar aggregates will be denoted by ; this is the first and simplest member of a set of transfinite ordinal numbers to be considered later on. Any finite number such as 3 is used ordinally as representing the order-type of 1, 2, 3 or any similar aggregate, and cardinally as representing the power of 1, 2, 3 or any equivalent aggregate. For reasons that will appear, is only used in an ordinal sense. The aggregate 1, 2, 3, &c., in any of its written or spoken forms, may be called the natural scale, and denoted by . It has already been shown that is infinite: this appears in a more elementary way from the fact that , where each element of is made to correspond with the next following. Any aggregate which is equivalent to the natural scale or a part thereof is said to be countable.
5. Arithmetical Operations.—When the natural scale has once been obtained it is comparatively easy, although it requires a long process of induction, to define the arithmetical operations of addition, multiplication and involution, as applied to natural numbers. It can be proved that these operations are free from ambiguity and obey certain formal laws of commutation, &c., which will not be discussed here. Each of the three direct operations leads to an inverse problem which cannot be solved except under certain implied conditions. Let denote any two assigned natural numbers: then it is required to find natural numbers, such that
respectively. The solutions, when they exist, are perfectly definite, and may be denoted by and ; but they are only possible in the first case when , in the second when is a multiple of , and in the third when a is a perfect th power. It is found to be possible, by the construction of certain elements, called respectively negative, fractional and irrational numbers, and zero, to remove all these restrictions.
6. There are certain properties, common to the aggregates with which we have next to deal, analogous to those possessed by the natural scale, and consequently justifying us in applying the term number to any one of their elements. They are stated here, once for all, to avoid repetition; the verification, in each case, will be, for the most part, left to the reader. Each of the aggregates in question (, suppose) is an ordered aggregate. If are any two elements of , they may be combined by two definite operations, represented by and , so as to produce two definite elements of represented by and (or ); these operations obey the formal laws satisfied by those of addition and multiplication. The aggregate contains one (and only one) element , such that if is any element of ( included), then , and . Thus contains the elements , or, as we may write them, such that and ; also We may express this by saying that contains an image of the natural scale. The element denoted by may be called the ground element of .
7. Negative Numbers.—Let any two natural numbers be selected in a definite order (to be distinguished from , in which the order is reversed). In this way we obtain from an aggregate of symbols which we shall call couples, от more precisely, if necessary, polar couples. This new aggregate may be arranged in order by means of the following rules:—
Two couples are said to be equal if . In other words are then taken to be equivalent symbols for the same thing.
If , we write ; and if we write .
The rules for the addition and multiplication of couples are:
The aggregate thus defined will be denoted by ; it may be called the scale of relative integers.
If denotes or any equivalent couple, and . Hence is the ground element of . By definition, : and hence by induction , where is any natural integer. Conversely every couple in which can be expressed by the symbol . In the same way, every couple in which can be expressed in the form , where .
8. It follows as a formal consequence of the definitions that . It is convenient to denote and its equivalent symbols by , because
; |
hence , and we can represent by the scheme—
in which each element is obtained from the next before it by the addition of . With this notation the rules of operation may be written (, denoting natural numbers)—
with the special rules for zero, that if is any element of ,
.
To each element, , of corresponds a definite element such that ; if , then , but in every other case are different and may be denoted by . The natural number is called the absolute value of and .
9. If are any two elements of , the equation is satisfied by putting . Thus the symbol is always interpretable as , and we may say that within subtraction is always possible; it is easily proved to be also free from ambiguity. On the other hand, is intelligible only if the absolute value of is a multiple of the absolute value of .
The aggregate has no first element and no last element. At the same time it is countable, as we see, for instance, by associating the elements with the natural numbers respectively, thus—
It is usual to write (or simply ) for and for ; that this should be possible without leading to confusion or ambiguity is certainly remarkable.
10. Fractional Numbers.—We will now derive from a different aggregate of couples subject to the following rules:
The symbols are equivalent if . According as is greater or less than we regard as being greater or less than . The formulae for addition and multiplication are
. |
All the couples are equivalent to , and if we denote this by we have , so that is the ground element of the new aggregate.
Again , and by induction . Moreover, if is a multiple of , say , we may denote by .
11. The new aggregate of couples will be denoted by . It differs from and in one very important respect, namely, that when its elements are arranged in order of magnitude (that is to say, by the rule above given) they are not isolated from each other. In fact if , and , the element lies between and ; hence it follows that between any two different elements of we can find as many other elements as we please. This property is expressed by saying that is in close order when its elements are arranged in order of magnitude. Strange as it appears at first sight, is a countable aggregate; a theorem first proved by G. Cantor. To see this, observe that every element of R may be represented by a "reduced" couple , in which are prime to each other. If are any two reduced couples, we will agree that is anterior to if either (1) , or (2) but . This gives a new criterion by which all the elements of R can be arranged in the succession
which is similar to the natural scale.
The aggregate , arranged in order of magnitude, agrees with in having no least and no greatest element; for if denotes any element , then .
12. The division of one element of by another is always possible; for by definition
- ,
and consequently is always interpretable as . As a particular case , so that every element of is expressible in one of the forms . It is usual to omit the symbol altogether, and to represent the element by , whether is a multiple of or not. Moreover, is written , which may be done without confusion, because , and , by the rules given above.
13. Within the aggregate subtraction is not always practicable; but this limitation may be removed by constructing an aggregate related to in the same way as to . This may be done in two ways which lead to equivalent results. We may either form symbols of the type , where denote elements of , and apply the rules of § 7 ; or else form symbols of the type , where denote elements of , and apply the rules of § 10. The final result is that contains a zero element, , а ground element , an element such that , and a set of elements representable by the symbols (. In this notation the rules of operation are
;
;
;
.
Here and denote any two elements of . If , then , and if , then . If , then
14. When is constructed by means of couples taken from , we must put , , and , if is any element of except . The symbols and are inadmissible; the first because it satisfies the definition of equality (§ 10) with every symbol , and is therefore indeterminate; the second because, according to the rule of addition,
- , which is inconsistent with
In the same way, if denotes the zero element of , and any other element, the symbol is indeterminate, and inadmissible, because, by the formal rules of operation, , which conflicts with the definition of the ground element . It is usual to write (or simply ) for , and for . Each of these elements is said to have the absolute value . The criterion for arranging the elements of in order of magnitude is that, if are any two elements of it, when is positive; that is to say, when it can be expressed in the form .
15. The aggregate is very important, because it is the simplest type of a field of rationality, or corpus. An algebraic corpus is an aggregate, such that its elements are representable by symbols , &c., which can be combined according to the laws of ordinary algebra; every algebraic expression obtained by combining a finite number of symbols, by means of a finite chain of rational operations, being capable of interpretation as representing a definite element of the aggregate, with the single exception that division by zero is inadmissible. Since, by the laws of algebra, , and , every algebraic field contains , or, more properly, an aggregate which is an image of .
16. Irrational Numbers.—Let denote any element of ; then and all lesser elements form an aggregate, say; the remaining elements form another aggregate , which we shall call complementary to , and we may write . Now the essence of this separation of into the parts and may be expressed without any reference to as follows:—
I. The aggregates are complementary; that is, their elements, taken together, make up the whole of .
II. Every element of is less than every element of .
III. The aggregate has no least element. (This condition is artificial, but saves a distinction of cases in what follows.)
Every separation which satisfies these conditions is called a cut (or section), and will be denoted by . We have seen that every rational number can be associated with a cut. Conversely, every cut in which has a last element is perfectly definite, and specifies without ambiguity. But there are other cuts in which has no last element. For instance, all the elements () of such that either , or , form an aggregate , while those for which and , form the complementary aggregate . This separation is a cut in which has no last element; because if is any positive element of , the element exceeds , and also belongs to . Every cut of this kind is said to define an irrational number. The justification of this is contained in the following propositions:—
(1) A cut is a definite concept, and the assemblage of cuts is an aggregate according to definition; the generic quality of the aggregate being the separation of into two complementary parts, without altering the order of its elements.
(2) The aggregate of cuts may be arranged in order by the rule that if is a part of .
(3) This criterion of arrangement preserves the order of magnitude of all rational numbers.
(4) Cuts may be combined according to the laws of algebra, and, when the cuts so combined are all rational, the results are in agreement with those derived from the rational theory.
As a partial illustration of proposition (4) let be any two cuts ; and let be the aggregate whose elements are obtained by forming all the values of , where is any element of and is any element of . Then if is the complement of , it can be proved that is a cut; this is said to be the sum of and . The difference, product and quotient of two cuts may be defined in a similar way. If denotes the irrational cut chosen above for purposes of illustration, we shall have where comprises all the numbers obtained by multiplying any two elements, which are rational and positive, and such that . Since it follows that is positive and greater than ; it can be proved conversely that every rational number which is greater than can be expressed in the form . Hence so that the cut actually gives a real arithmetical meaning to the positive root of the equation ; in other words we may say that defines the irrational number . The theory of cuts, in fact, provides a logical basis for the treatment of all finite numerical irrationalities, and enables us to justify all arithmetical operations involving the use of such quantities.
17. Since the aggregate of cuts (N say) has an order of magnitude, we may construct cuts in this aggregate. Thus if a is any element of N, and A is the aggregate which consists of a and all anterior elements of ZZ, We may write N= A+A′, and (A, A′) is a cut in which A has a last element a. It is a remarkable fact that no other kind of cut in N is possible; in other words, every conceivable cut in N is defined by one of its own elements. This is expressed by saying that N is a continuous aggregate, and N itself is referred to as the numerical continuum of real numbers. The property of continuity must be carefully distinguished from that of close order (§ 11); a continuous aggregate is necessarily in closing order, but the converse is not always true. The aggregate N is not countable.
18. Another way of treating irrationals is by means of sequences. A sequence is an unlimited succession of rational numbers
a1, a2, a3 . . . am, am+1 . . .
(in order-type ω) the elements of which can be assigned by a dehnite rule, such that when any rational number ε, however small, has been fixed, it is possible to find an integer m, so that for all positive integral values of n the absolute value of (am+n − am) is less than ε. Under these conditions the sequence may be taken to represent a definite number, which is, in fact, the limit of am when m increases without limit. Every rational number a can be expressed as a sequence in the form (a, a, a, . . .), but this is only one of an infinite variety of such representations, for instance—
(.9, .99, .999, . . .) = (12, 34, 78, . . . 2n−12n . . . )
and so on. The essential thing is that we have a mode of representation which can be applied to rational and irrational numbers alike, and provides a very convenient symbolism to express the results of arithmetical operations. Thus the rules for the sum and product of two sequences are given by the formulae
- (a1, a2, a3, . . .)+(b1, b2, b3, . . .) = (a1+b1, a2+b2, a3+b3, . . .)
- (a1, a2, a3, . . .)×(b1, b2, b3, . . .) = (a1b1, a2b2, a3b3 . . .)
from which the rules for subtraction and division may be at once inferred. It has been proved that the method of sequences is ultimately equivalent to that of cuts. The advantage of the former lies in its convenient notation, that of the latter in giving a clear definition of an irrational number without having recourse to the notion of a limit.
19. Complex Numbers.—If α. is an assigned number, rational or irrational, and n a natural number, it can be proved that there is a real number satisfying the equation x"=a, except when n is even and a is negative: in this case the equation is not satisfied by any real number whatever. To remove the difficulty we construct an aggregate of polar couples {x, y}, where x, y are any two real numbers, and define the addition and multiplication of such couples by the rules
- {x, y}+{x′,y′= x+x′, y+y};
- {x, y}×{x′ y′} = xx′−yy′, xy′+x′y}
We also agree that {x, y} <x′, y′}, if x<x′ or if x=x′ and y<y′. It follows that the aggregate has the ground element {1, 0}, which we may denote by σ; and that, if we write τ for the element (0, 1},
τ2={−1, 0} = σ.
Whenever m, n are rational, {m, n} = mσ+nτ, and we are thus justified in writing, if we like, xσ+yτ for {x, y} in all circumstances. A further simplification is gained by writing x instead of xσ, and regarding τ as a symbol which is such that τ2= −1, but in other respects obeys the ordinary laws of operation. It is usual to write i instead of τ; we thus have an aggregate J of complex numbers x+yi. In this aggregate, which includes the real continuum as part of itself, not only the four rational operations (excluding division by {0, 0}, the zero element), but also the extraction of roots, may be effected without any restriction. Moreover (as first proved by Gauss and Cauchy), if a0, a1 . . . an an are any assigned real or complex numbers, the equation
a0zn−1+. . .an−1z+an =0,
is always satisfied by precisely n real or complex values of z, with a proper convention as to multiple roots. Thus any algebraic function of any finite number of elements of J is also contained in J, which is, in this sense, a closed arithmetical field, just as N is when we restrict ourselves to rational operations. The power of J is the same as that of N.
20. Transfinite Numbers.—The theory of these numbers is quite recent, and mainly due to G. Cantor. The simplest of them, w, has been already defined (§ 4) as the order-type of the natural scale. Now there is no logical difficulty in constructing a scheme
u1, u2, u3 . . .
indicating a well-ordered aggregate of type ω immediately followed by a distinct element v1: for example, we may think of all positive odd integers arranged in ascending order of magnitude and then think of the even number 2. A scheme of this kind is said to be of order-type (ω+1); and it will be convenient to speak of (ω+1) as the index of the scheme. Similarly we may form arrangements corresponding to the indices
ω+2, ω+3 . . . ω+n,
where n is any positive integer. The scheme
u1, u2, u3 . . . | v1, v2, v3 . . .
is associated with ω+ω = 2ω;
u11, u12, u13 . . . | u21, u22, u23 . . . | . . . | un1, un2, . . . | . . .
with ω.ω or ω2; and so on. Thus we may construct arrangements of aggregates corresponding to any index of the form
φ(ω) = aωn+bωn−1+ . . . +kω+l,
where n, a, b, . . . l are all positive integers.
We are thus led to the construction of a scheme of symbols—
I. 1,2,3, . . . n .
ω, ω+1, . . . ω+n . . .
2ω, 2¢.>+1, 2w-|-n,
II. o2+1, a2+2. .w2+n,
@i><<»>, ¢<°=>+1 . .¢<<»>+n, -
af", ww--1, ww-I-n,
III. 2, ¢><»»>, w<1><»»> +1, w¢<~»>+», .
The symbols φ(ω) form a countable aggregate: so that we may, if we like (and in various ways), arrange the rows of block (II.) in a scheme of type co: we thus have each element a succeeded in its row by (a+ 1), and the row containing ¢>(w) succeeded by a definite next row. The same process may- be applied to (III.), and we can form additional blocks (IV), (V), &c., with first elements w4=w"“'°" w5=w°"" &c All the symbols in which w occurs are called transfinite ordinal numbers.
21. The index of a finite set is a definite integer however the set may be arranged; we may take this index as also denoting the power of the set, and call it the number of things in the set. But the index of an infinite or din able set depends upon the Way in which its elements are arranged; for instance, ind. (1, 2, 3, )=w, but ind. (1, 3, 5, | 2, 4, 6, )=2w Or, to take another example, the scheme—
1, 3, 5, . . . (2n−1).
2, 6, 10, . 2(2n-1)-2",
2n. 3, 2'" 5, 2" (zn-1)
where each row is supposed to follow the one above it, gives a permutation of (1, 2, 3, ), by which its index is changed from ω to ω2. It has been proved that there is a permutation of the natural scale, of which the index is φ(ω), any assigned element of (II); and that, if the index of any ordered aggregate is φ(ω), the aggregate is countable. Thus the power of all aggregates which can be associated with indices of the class (II) is the same as that of the natural scale; this power may be denoted by a. Since a is associated with all aggregates of a particular power, independently of the arrangement of their elements, it is analogous to the integers, 1, 2, 3, &c, when used to denote powers of finite aggregates; for this reason it is called the least transfinite cardinal number.
22. There are aggregates which have a power greater than az for instance, the arithmetical continuum of positive real numbers, the power of which is denoted by c. Another one is the aggregate of all those order-types which (like those in II. above) are the indices of aggregates of power a. The power of this aggregate is denoted by “L According to Cantor's theory it is the transfinite cardinal number next superior to a, which for the sake of uniformity is also denoted by No. It has been conjectured that *', =c, but this has neither been verified nor disproved The discussion of the aleph-numbers is still in a controversial stage (November 1907) and the points in debate cannot be entered upon here.
23. Transfinite numbers, both ordinal and cardinal, may be combined by operations which are so far analogous to those of ordinary arithmetic that it is convenient to denote them by the same symbols. But the laws of operation are not entirely the same; for instance, 2w and co2 have different meanings: the first has been explained, the second is the index of the scheme (aibi l agbz I asbg | I a, b, ,| )or any similar arrangement. Again if n is any positive integer, na=a"=a It should also be observed that according to Cantor's principles of construction every ordinal number is succeeded by a definite next one; but that there are dehnite ordinal numbers (e.g w, wf) which have no ordinal immediately preceding them.
24. Theory of Numbers.—The theory of numbers is that branch of mathematics which deals with the properties of the natural numbers. As Dirichlet observed long ago, the whole of the subject would be coextensive with mathematical analysis in general; but it is convenient to restrict it to certain fields where the appropriateness of the above definition is fairly obvious. Even so, the domain of the subject is becoming more and more comprehensive, as the methods of analysis become more systematic and more exact.
The first noteworthy classification of the natural numbers is into those which are prime and those which are composite. A prime number is one which is not exactly divisible by any number except itself and I; all others are composite. The number of primes is infinite (Eucl Elem. ix. 20), and consequently, if n is an assigned number, however large, there is an infinite number (a) of primes greater than n.
If m, n are any two numbers, and m> n, we can always find a definite chain of positive integers (gl, ri), (gg, rg), &c, such that m=q1n+r1, n=g2r1-1-rg, r1=g3r2'+r3, &c
with n>r1>r2>r3 .; the process by which the are calculated will be called residuation Since there is only a finite number of positiye integers less than rt, the process must terminate with two equalities of the form
Th-2 = qm» 1 +fh, fb-1 = Qh-s-xfn.
Hence we infer successively that 11. is a divisor of Th.-1» fh-21. .r1, and finally of m and n. Also ri is the greatest common factor of m, nz because any common factor must divide rl, rg, and so on down to nf: and the highest factor of rn is rn itself. It will be convenient to write f1»=dV (rn, 1t). If rr = I, the numbers m, n are said to be prime to each other, or co-primes.
25. The foregoin theorem of residuation is of the greatest importance; with the help of it we can prove three other fundamental propositions, namely:-
(I) If m, n are any two natural numbers, we can always find two other natural numbers x, y such that
dv(m, n) =xm -yn.
(2) If m, n are prime to each other, and p is a prime factor of mn, then p must be a factor of either m or n.
(3) Every number may be uniquely expressed as a product of prime factors.
Hence if n=p“g5r'Y is the representation of any numbern as the product of powers of different primes, the divisors of n are the terms of the product
<I+1>+1>“+ -, +1>“> (1+q+ ~ +49 <1-H+ . . . +fv) -their number 15 (fl-l-I) (B+I) (~/+I) .; and their sum is Il(p“+'-l) + II(p- I). This includes I and n among the divisors of n.
26. Totients.—By the totient of n, which is denoted, after Euler, by ¢(n), we mean the number of integers prime to n, and not exceeding n. If n=p¢, the numbers not exceeding n and not prime to it are p, zp, (pe-p), ps of which the number is p¢'1; hence ¢(P“) = p“-p“'1 If m, n are prime to each other, 4>(rnn) =4>(rn)4>(n); and hence for the general case, if n=p“g3rY ,4>(n)=lIp'1“1(p-1), where the product applies to all the different prime factors of n. If di, dz, &c., are the different divisors of n, ~ ¢(lli)+d>(d2)+ - . . =rl.
For ¢XamPl€» 15 =¢(1s>+¢><5)+q><.s>+<1><;>=8+4+2+1.
27. Residues and congruences.—It will now be convenient to include in the term “ number ” both zero and negative integers. Two numbers a, b are said to be congruent with respect to the modulus rn, when (a-b) is divisible by m. This is expressed by the notation aab (mod rn), which was invented by Gauss. The fundamental theorems relating to congruences are
If a-Eb and csd (mod m), then a=*=cEb=1=d, and abEcd If haEhb(mod rn) then aab (mod rn/d), where d=dv(h, m). Thus the theory of congruences is very nearly, but not quite, similar to that of algebraic equations. With respect to a given modulus in the scale of relative integers may be distributed into rn classes, any two elements of each class being congruent with respect to m. Among these will be d>(m) classes containing numbers prime to m. By taking any one number from each class we obtain a complete system of residues to the modulus m. Supposing (as we shall always do) that m is positive, the numbers 0, I, 2, (m-I) form a system of least positive residues; according as rn is odd or even, 0, =*=I, ='=2, is (m-1), or o, =f=1, =2, . =sé(m-2), § m form a system of absolutely least residues.
28. The Theorems of Fermat and Wilson.—Let ri, rg, rf where t=d>(m), be a complete set of residues prime to the modulus rn. Then if x is any number prime to rn, the residues xrl, xrg, xn also form a complete set prime to m (§ 27). Consequently xryxrz xr, § r, r, r, , and dividing by r1r2 ri, which is prime to the modulus, we infer that
x¢( ') E I (mod rn).
which is the general statement of Fermat's theorem. If rn is a prime p, it becomes xf"'§ r (mod p).
For a prime modulus p there will be among the set x, 2x, x, (p-1)x just one and no more that is congruent to I: let tlliis be xy. Ifysx, wemusthavex'-I = (x-I) (x+1)Eo, and hencexs *IZ consequently the residues 2, 3, 4, (p-2) can be arranged in % (p-3) pairs (x, y) such that xysr. Multiplying them all together, we conclude that 2.3.4. (p-2) EI and hence, since I (p- I)-E - I, (P"1)!E-I(f110d P).
which is Wilson's theorem. It may be generalized, like that of Fermat, but the result is not very interesting. If in is composite (m- 1) !+I cannot be a multiple of m: because rn will have a prime factor p which is less than m, so that (m-I)!Eo (mod p). Hence Wilson's theorem is invertible: but it does not supply any practical test to decide whether a given number is prime.
29. Exponents, Primitive Roots, Indices.—Let p denote an odd prime, and x any number prime to p. Among the powers x, xl, x', xp" there is certainly one, namely x1"1, which EI (mod p); let x” be the lowest power of x such that x”EI. Then e is said to be the exponent to which x appertains (mod p): it is always a factor of (p-I) and can only be I when scsi. The residues x for which e =p- I are said to be primitive roots of p. They always exist, their number is 4>(p-1), and they can be found by a. methodical, though tedious, process of exhaustion. If g is any one of them, the complete set may be represented by g, g", gb, &c where a, b, &c, are the numbers less than (p-I) and prime to it, other than I. Every number x which is prime to p is congruent, mod p, to gi, where i is one of the numbers I, 2, 3, (p-I); this number i is called the index of x to the base g. Indices are analogous to logarithms: thus
ind, (xy)Eind, x+ind, , y. ind, ,(x )Eh indgx (mod p-I). Consequently tables of primitive roots and indices for different primes are of great value for arithmetical purposes. Jacobi's Canon Arithmeticus gives a primitive root, and a table of numbers and indices for all primes less than 1000.
For moduli of the forms 2p, p'", 2p"' there is an analogous theory (and also for 2 and 4); but for a composite modulus of other forms there are no primitive roots, and the nearest analogy is the representation of prime residues in the form a' B” X' ~, where a, B, 7, are selected prime residues, and x, y, z, are indices of restricted range. For instance, all residues prime to 48 can be exhibited in the form 5” 7” I3', where x=o, I, 2, 3; y=o, 1; z=o, 1; the total number of distinct residues being 4.2.2 = 16 =¢(48), as it should be.
30. Linear Congruences.—The congruence a'xEb' (mod m') has no solution unless dv(a', m') is a factor of b'. If this condition is satisfied, we may replace the given congruence by the equivalent one axab (mod rn), where a is prime to b as well' as to rn. By residuation (§§ 24, 25) we can find integers h, k such that ah-mk = 1, and thence obtain xsbh (mod m) as the complete solution of the given congruence. To the modulus rn' there are m'/m in congruent solutions. For example, I2xE30 (mod 21) reduces to 2x55 (mod 7) whence x56 (mod 7) 56, 13, 20 ~(mod 21). There is a theory of simultaneous linear congruences in any number of variables, first developed with precision by Smith. In any particular case, it is best to replace as many as possible of the given congruences by an equivalent set obtained by successively eliminating the variables x, y, z, . in order. An important problem is to find a number which has given residues with respect to a given set of moduli. When possible, the solution is of the form x≡a (mod m), where in is the least common multiple of the moduli. Supposing that p is a prime, and that we have a corresponding table of indices, the solution of ax§ ;1li mod p) can be found by observing that ind x≡ind b−ind a (mod p−1).
31. Quadratic Residues. Law of Reciprocity.-To an odd prime modulus p, the numbers 1, 4, 9, . . . (p−1)2 are congruent to Hp-1) residues only, because (p-x)2=x'2. Thus for p=5, we have 1, 4, 9, 1621, 4, 4, 1 respectively. There are therefore é(p-1) quadratic residues and %(p-1) quadratic non-residues prime to p; and there is a corresponding division of in congruent classes of integers with respect to p. The product of two residues or of two non residues is a residue; that of a residue and a non-residue is a non residue; and taking any primitive root as base the index of any number is even or odd according as the number is a residue or a non residue. Gauss writes aRp, aNp to denote that a, is a residue or non residue of p respectively.
Given a table of indices, the solution of x2Ea(mod p) when possible, is found from zind xEind a (mod pe1), and the result may be written in the form xi =|=r (mod p). But it is important to discuss the congruence xiao without assuming that we have a table of indices. It is sufficient to consider the case x'sq (mod p), where q is a positive prime less than p; and the question arises whether the quadratic character of q with respect to p can be deduced from that of p with respect to q. The answer is contained in the following theorem, which is called the law of quadratic reciprocity (for real positive odd primes): if p, q are each or one of them of the form 4n+1, then p, q are each of them a residue, or each a non-residue of the other; but if p, q are each of the form 4n+3, then according as p is a residue or non-residue of q we have g a non-residue or a residue of p.
Legendre introduced a symbol which denotes + 1 or −1 according as mRq or mNq being a positive odd prime and m any number prime to q); with its help we may express the law of reciprocity in the form
=(,): <»-1><q-n
This theorem was first stated by Legendre, who only partly proved it; the first complete proof, by induction, was published by Gauss, who also discovered five (or six) other more or 'less independent proofs of it. Many others have since been invented.
There are two supplementary theorems relating to −1 and 2 respectively, which, may be expressed in the form 1 = gp-n (2 = l(P2'I)
(z> l l U ' P) l ')
where p is any positive odd prime.
It follows from the definition that
<ssL-° e <@>“ere>~—fl
9 9 fl
and that = if m =m' (mod q). As a simple application of the law of reciprocity, let it be required to find the quadratic character of II with respect to 1907. We have
(ss)=~¢¥3=*(§)n
because 6N11. Hence 11R1907.
Legendre's symbol was extended by Jacobi in the following manner. Let P be any positive odd number, and let p, p', p", &c. be its (equal or unequal) prime factors, so that P=pp'p” .... Then g Q is any number prime to P, we have a generalized symbol defined
(P) - (P) (if) (fr)
This symbol obeys the law that, if Q is odd and positive, B Q HP-)(Q-)
(Q) (P)-< 1>.
with the supplementary laws
IJ. . § <P-> 3 P=(P)'(')
I~ (P) °(UA( -It
is found convenient to add the conventions that (915) = (%)
when Q and P are both odd; and that the value of the symbol is o when P, Q are not co-primes.
In order that the congruence x”-Ea (mod rn) may have a solution it is necessary and sufficient that a be a residue of each distinct prime factor of m If these conditions are all satisfied, and m=2'<p/of1. ., where p, q, &c., are the distinct odd prime factors of rn, being t in all, the number of in congruent solutions of the given congruence is 2', 2'*1 or 2'+2, according as:<<2, r=2, or l¢>2 respectively. The actual solutions are best found by a process of exhaustion. It should be observed that = 1 is a necessary but not a sufficient condition for the possibility of the congruence.
32. Quadratic forms.—It will be observed that the solution of the linear congruence ax≡b (mod m) leads to all the representations of b in the form ax-}~my, where x, y are integers. Many of the earliest researches in the theory of numbers deal with particular cases of the problem: given four numbers m, a, b, c, it is required to find all the integers x, y (if there be any) which satisfy the equation ax2+bxy+ cy” =m. F ermat, for instance, discovered that every positive prime of the form 4n+1 is uniquely expressible as the sum of two squares. There is a corresponding arithmetical theory for forms of any degree and any number of variables; only those of linear forms and binary quadratics are in any sense complete, as the difficulty of the problem increases very rapidly with the increase of the degree of the form considered or of the number of variables contained in it.
The form ax*-}»bxy+cy2 will be denoted by (ri, b, c) (x, y)' or more simply by (a, b, c) when there is no need of specifying the variables. If k is the greatest common factor of a, b, c, we may write (cz, b, c) = k(a', I/, c') where (a', b', c') is a primitive form, that is, one for which dv (a', b', c') =1. The other form is than said to be derived from (a', b', c') and to have a divisor k. For the present we shall concern ourselves only with primitive forms. Writing D =b2-4ac, the invariant D is called the determinant of (a, b, c), and there is a first classification of forms into definite forms for which D is negative, and indefinite forms for which D is positive. The case D =o or a positive square is rejected, because in that case the form breaks u p into the product of two linear factors. It will be observed that DEG, I (mod 4) according as b is even or odd; and that if ki' is, any odd square factor of D there will be forms of determinant D and divisor k.
If we write x′=ax+By, y' =-yx-l»5y, we have identically (a, b, c) (x′. y')2=(@', b'. v') (x.;v)"
where
zz' =aa2-1-ba-y+ 072
b' = 2aa/3 +b(a.5 +By) -I-2675
c' = 1182 -I-b;36 +52
Hence also
D' = b'2-4a'c' = (ao -5~/)2(b'=-4ac) = (416 ~B»y)2 D. Supposing that a, /3, fy, 6 are integers such that a6-/3'y=n, a number different from zero, (il, b, c) is said to be transformed into (a', b', c') by the substitution (3 of the nth order. If n2=I, the two forms are said to be equivalent, and the equivalence is said to be proper or improper according as ri= I or n = - I. In the case of equivalence, not only are x', y' integers wherever x, y are so, but conversely; hence every number representable by (a, b, c) is representable by (a', b', c') and conversely. For the present We shall deal with proper equivalence only and write f-f' to indicate that the forms f, f' are properly equivalent. Equivalent forms have the same divisor. A complete set of equivalent forms is said to form a class; classes of the same divisor are said to form an order, and of these the most important is the principal order, which consists of the primitive classes. It is a fundamental theorem that for a given determinant the number of classes is finite; this is proved by showing that every class must contain one at least of a certain finite number of so-called reduced forms, which can be found by dehnite rules of calculation.
33. Method of Reduction.—This differs according as D is positive or negative, and will require some preliminary lemmas. Suppose that any complex quantity z=x+yi is represented in the usual way by a point (x, y) referred to rectangular axes. Then by plotting off all the points corresponding to (az~{-5) / (yz-f-6), we obtain a complete set of properly equivalent points. These all lie on the same side of the axis of x, and there is precisely one of them and no more which satisfies the conditions: (i.) that it is not outside the area which is bounded by the lines 2x= ='=1; (ii.) that it is not inside the circle x2+y2=1; (iii.) that it is not on the line 2x=I, or on the arcs of the circle x2~i-y2 ='I intercepted by 2x= I and x=o. This point will be called the reduced point equivalent to z. In the positive half-plane (y>o) the aggregate of all reduced points occupies the interior and half the boundary of an area which will be called the fundamental triangle, because the areas equivalent to it and hnite, are all triangles bounded by circular arcs, and having angles -érr, évr, o and the fundamental trian le may be considered as a special case when one vertex goes to infinity. The aggregate of equivalent triangles forms a kind of mosaic which fills up the whole of the positive half-(plane. It will be convenient to denote the fundamental triangle (with its half-boundary, for which x<o) by V; for a reason which will appear later, the set of equivalent triangles will be said to make up the modular dissection of the positive half-plane.
Now let f'=(a', b', c') be any definite form 'with a' positive and determinant - A. The root of a'z2j-b'z-4-c' =0 which is represented by a point in the positive half-plane is
and this is a reduced point if either
w= A1
(i.) b' <a'<c;
(u.) b' =a', a'<c
(iii.) a' =c', 0<b'§ a'.
Cases (ii.) and (iii.) oply occur when the representative point on Cfhe bounda of V. A orm whose representative point is re uce is said to rliie a reduced form. It follows from the geometrical theory that every form is equivalent to a reduced form, and that there are as many distinct classes of positive forms of determinant-A as there are reduced forms. The total number of reduced forms is limited, because in case (i.) we have A =4ac -b'> 3112, so that b </ %A, while 4a2<¢}ac< A+b2<§ »A; in case (ii.) /;=4ac-a'=>3a2, or else a =b =c =/ 55; in case (iii.) A =4a2-b2> 3b2,4a2 =A-|-bz <§ A, or else a=-b =c=/ § A. With the help of these inequalities a complete set of reduced forms can be found by trial, and the number of classes determined. The latter cannot exceed Q; A; it is in general much less. With an indefinite form (a, b, c) we may associate the representative circle
a(x2+y2) +bx+c=o,
which cuts the axis of x in two real points. The form is said to be reduced if this circle cuts V; the condition for this is a(a =1= 'kb-l-c) <0, which can be expressed in the form 3a2+(a=b)2 <D, and it is hence clear that the absolute values of a, b, and therefore of c, are limited. As before, there are a limited number of reduced forms, but they are not all non-equivalent. In fact they arrange themselves, according to a law which is not very difficult to discover, in cycles or periods, each of which is associated with a particular class. The main result is the same as before: that the number of classes is finite, and that for each class we can find a representative form by a finite process of calculation.
34. Problem of Representation.-It is required to find out whether a given number rn' can be represented by the given form (a', b', c'). One condition is clearly that the divisor of the form must be a factor of m'. Suppose this is the case; and let m, (a, b, c) be the quotients of rn' and (a', b', c') be the divisor in question. Then we have now to discover whether m can be represented by the primitive form (a, b, c). First of all we will consider proper representations m= (11, b, 0) (a, W
where a, 'y are co-primes. Determine integers 13,5 such that a6 -/37 = 1, and apply to (a, b, c) the substituticni < ';' ; the new form will be (m, n, I), where
rt'-4rnl=D=b2-4ac.
Consequently rt'=D (mod 4m), and D must be a quadratic residue of m. Unless this condition is satisfied, there is no proper representation of m by any form of determinant D. Suppose, however, that n2=D (mod 4m) is soluble and that nl, rn, &c. are its roots. Taking any one of these, say ut, we can find out whether (rn, ng, li) and (a, b, c) are equivalent; if they are, there is a substitution Q which converts the latter into the former, and then m =aa2 -I-ba'y+c'y2. As to derived representations, if m= (a, b, c) (tx, ty)2, then rn must have the square factor tl, and rn/F=(a, b, c) (x, y)2; hence everything may be made to depend on proper representation by primitive orms.
35. Automorphs. The Pellian Equation.-A primitive form (a, b, c) is, by definition, equivalent to itself; but it may be so in more ways than one. In order that (a, b, c) may be transformed into itself by the substitution <f;' , it is necessary and sufficient that (at, B) (Ht-l-bu), -cu
-y, 6 au, § (t-bu)
where (t, u) is an integral solution of
F - Du' =4.
If D is negative and -D> 4, the only solutions are t= =1=2, u=0; D= -3 gives (=2, 0), (==1, =1); D= -4 gives (=|=2, o), (0, ii). On the other hand, if D> 0 the number of solutions is infinite and if (11, 141) is the solution for which t, u have their least positive ilalues, all the other positive solutions may be found from 1. n D D "
5i§ L= &?/~ (»=2, s,4.—)-The
substitutions by which (a, b, c) is transformed into itself are called its auto morphs. In the case when D=0 (mod 4) we have t==2T, u=2U, D =4N, and (T, U) any solution of
T2 - NU2 = 1.
This is usually called the Pellian equation, though it should properly be associated with Fermat, who first perceived its importance. The minimum solution can be found by converting / N into a periodic continued fraction.
The form (a, b, c) may be improperly equivalent to itself; in this case all its improper auto morphs can be expressed in the form K, (K-l-b})/2a
(fc-bk)/26, -)
where K2—DA” =4ac. In particular, if bac (mod a) the form (a, b, c) is improperly equivalent to itself. A form improperly equivalent to itself is said to be ambiguous.
36. Characters of a form or class. Genera.—Let (a, b, c) be any primitive form; we have seen above (§ 32) that if a., (3, 'y, 6 are any integers
4(f1<1' -l-i>¢w+cv”)(<l/32 +1255-I-c5') = 5” ~ (015 -Bv)'D where b' =2aaB+b(a6-Q-/3'y) -I-2c'y5. Now the expressions in brackets on the left hand may denote any two numbers rn, rt representable by the form (a, b, c); the formula shows that 4mn is a residue of D, and hence mn is a residue of every odd prime factor of D, and if p is any such factor the symbols and will have the same value. Putting (a, b, c) = f, this common value is denoted by and called a quadratic character (or simply character) of f with respect to p. Since a is representable by f (x = I, y=0) the value is the same as For example, if D = -140, the scheme of characters for the six reduced primitive forms, and therefore for the classes they (Q) (§)
(I, 0. 35) + +
@d=2»9)
(51 0| "
(3, 12, 12)-In
certain cases there are supplementary characters of the type represent, is
<%I> and , and the characters are discriminated according as an odd or even power of p is contained in D; but in every .case there are certain combinations of characters (in number one-half of all possible combinations) which form the total characters of actually existing classes. Classes which have the same total character are said to belong to the same genus. Each genus of the same order contains the same number of classes.
For any determinant D we have a principal primitive class for which all the characters are -I-; this is represented by the principal form (I, 0, -n) or (I, I, -n) according as D is of the form 4n or 4rL-l-I. The corresponding genus is called the principal genus. Thus, when D= -140, it appears from the table above that in the primitive order there are two genera, each containing three classes; and the non-existent total characters are + - and - -i-. 37. Composition.—Considering X, Y as given lineo-linear functions of (x, y), (x', y') defined by the equations X = 1>oxx'+i>i»o;' +1>2x'y 'l-Psyy
Y = qow'+q1xy +q2x'y+qayy
we may have identically, in x, y, x', y', (A, B, C) (X. Y)2=(<1.b, C) (vm/)2><(¢1', b',0') (x', y') and, this being so, the form (A, B, C) is said to be compounded of the two forms (a, b. c), (a', b', c'), the order of composition being indifferent. In order that two forms may admit of composition into a third, it is necessary and sufficient that their determinants be in the ratio of two squares. The most important case is that of two primitive forms ¢>, X of the same determinant; these can be compounded into a form denoted by ox or X¢ which is also primitive and of the same determinant as ¢> or X. If A, B, C are the classes to which ¢>, X, ox respectively belong, then any form of A compounded with any form of B gives rise to a form belonging to C. For this reason we write C=AB =BA, and speak of the multiplication or composition of classes. The principal class is usually denoted by I, because when compounded with any other class A it gives this same class A. The total number of primitive classes being finite, h, say, the series A, A”, A3, &c., must be recurring, and there will be a least exponent e such that A” = I. This exponent is a factor of h, so that every class satisfies A"= I. Composition is associative as well as commutative, that is to say, (AB)C=A(BC); hence the symbols Al, A2, . . .Ai for the h different classes detine an Abelian group (see GROUPS) of order h, which is representable by one or more base-classes Bi, BQ. . . . Bt in such a way that each class A is enumerated once and only once by putting
A=BfB2'/ .B, " (x fm, yén, . zip)
with mn. p == h, and B, "f = BQ" = . = Bti' = I. Moreover, the bases may be so chosen that m is a multiple of n, n of the next correspond mg index, and so on. The same thing may be said with regard to the symbols for the classes contained in the princpal genus, because two forms of that genus compound into one o the same kind. If this latter group is cyclical, that is, if all the classes of the principal genus can be represented in the form 1, A, A2,. . .Av−1, the determinant D is said to be regular; if not, the determinant is irregular. It has been proved that certain specified classes of determinants are always irregular; but no complete criterion has been found, other than working out the whole set of primitive classes, and determining the group of the principal genus, for deciding whether a given determinant is irregular or not.
If A, B are any two classes, the total character of AB is found by compounding the characters of A and B. In particular, the class A2, which is called the duplicate of A, always belongs to the principal genus. Gauss proved, conversely, that every class in the principal genus may be expressed as the duplicate of a class. An ambiguous class satisfies A'= 1, that is, its duplicate is the principal class; and the converse of this is true. Hence if Br, B¢, . .Bi are the base classes for the whole composition-group, and A=B1'BZ”. .BH (as above) A2= I, if 2x==o or m, 2y=o or rt, &c.; hence the number of ambiguous classes is 2'. As an example, when D = -1460, there are four ambiguous classes, represented by
(I. 0, 355)» (2, 2. 183). (5. 0, 73)» (10, 101 39>: hence the composition-group must be dibasic, and in fact, if we put Br, B2 for the classes represented by (lx, 6, 34) and (2, 2, 183), we have B110 =B22= I and the 20 primitive classes are given by B1"B21f (xé. 10, yé 2). In this case the determinant is regular and the classes in the principal genus are I, BF; B14, BN, BF. 38. On account of its historical interest, we may briefly consider the form x'--y2, for which D = -4. If is an odd prime of the form 4n+I, the congruence mza—4(mod 4pi> is soluble (§ 3I); let one of its roots be m, and rn' +4 =4lp. Then (p, m, l) is of determinant -4, and, since there is only one primitive class for this determinant, we must have (p, rn. l)~ (I, 0, I). By known rules we can actually find a substitution <3 which converts the first form into the second; this being so, will transform the second into the first, and we shall have p =72 -1-52, a representation of p as the sum of two squares. This is unique, except that we may put p=(=°=-y)2+(=4=6)'. We also have 2 = I2'i-I2 while no prime 4n +3 admits of such a representation.
The theory of composition for this determinant is expressed by the identity (x2+y”) (x”j-y ) = (xx' iyyf)2+(xy'=¢= yx')'; and by repeated application of this, and the previous theorem, we can show that if N=2“p”q°. ., where p, q, . are odd primes of the form 4n-l-1, we can find solutions of N =x'+y“, and indeed distinct solutions. For instance 65 = 12-l-8*=4“+72, and conversely two distinct representations N =x'+y2 = ui-i-1/2 lead to the conclusion that N is composite. This is a simple example of the application of the theory of forms to the difficult problem of deciding whether a given large number is prime or composite; an application first indicated by Gauss, though, in the present simple case, probably known to Fermat.
39. Number of classes. Class-number Relations.-It appears from Gauss's posthumous papers that he solved the very difficult problem of finding a formula for h(D), the number of properly primitive classes for the determinant D. The first published solution, however, was that of P. G. L. Dirichlet; it depends on the consideration of series of the form E(ax2-i-bxy-1-cy2)'1" where s is a positive quantity, ultimately made very small. L. Kronecker has shown the connexion of Dirichlet's results with the theory of elliptic functions, and obtained more comprehensive formulae by taking (a, b, c) as the standard type of a quadratic fcrm, whereas Gauss, Dirichlet, and most of their successors, took (a, 2b, c) as the standard, calling (bi-ae) its determinant. As a sample of the kind of formulae that are obtained, let p be a prime of the form 411--3; then h(-4p) =>:a-ze. htm wg <»+u~/ z>> =1<»g TI('¢aI1%> where in the first formula Za. means the sum of all quadratic residues of p contained in the series I, 2, 3, . .%(p-I) and Z5 is the sum of the remaining non-residues; while in the second formula (I, u) is the least positive solution of £2-pu2= 1, and the product extends to all values of b in the set 1, 3, 5, . (4p-I) of which p is a non residue. The remarkable fact will be noticed that the second formula gives a solution of the Pellian equation in a trigonometrical form.
Kronecker was the first to discover, in connexion with the complex multiplication of elliptic functions, the simplest instances of a very curious group of arithmetical formulae involving sums of class numbers and other arithmetical functions; the theory of these relations has been greatly extended by A. Hurwitz. The simplest of all these theorems may be stated as follows. Let H (A) represent the number of classes for the determinant -A, with the convention that § and not I is to be reckoned for each class containing a reduced form of the type (a, o, a) and if for each class containing a reduced form (a, a, a); then if n is any positive integer,
E H(4n-K2) =Φ(n)+I/(rt) (-2/né xé 2#7l)
where Φ(n) means the sum of the divisors of n, and I/(rt) means the excess of the sum of those divisors of n which are greater than J n over the sum of those divisors which are less than / n. The formula is obtained by calculating in two different ways the number of reduced values of z which satisfy the modular equation ](nz) =](z), where ](z) is the absolute invariant which, for the elliptic function p(u; gg, gs) is g;»3+ (gf-27g32), and z is the ratio of any two primitive periods taken so that the real part of iz is negative (see below, § 68). It should be added that there is a series of scattered papers by J. Liouville, which implicitly contain Kronecker's class-number relations, obtained by a purely arithmetical process without any use of transcendent.
40. Bilinear Forms.—A bilinear form means an expression of the type Eagkxfyk (i=1, 2, . . .m; k=I, 2, . n); the most important case is when m=n, and only this will be considered here. The invariants of a form are its determinant [am] and the elementary factors thereof. Two bilinear forms are equivalent when each can be transformed into the other b linear integral substitutions x'=Eax, y'=2/Sy. Every bilinear form is equivalent to a reduced Y
form 2e, x;y, , and r =n, unless [a, ,, .] =o. In order that two forms may 1
be equivalent it is necessary and sufficient that their invariants should be the same. Moreover, if a-b and e-d, and if the invariants of the forms a+)c, b+>d are the same for all values of >, we shall have a+)c-b~{-Ad, and the transformation of one form to the other may be effected b a. substitution which does not involve). The theory of bilinear forms practically includes that of quadratic forms, if we suppose xi, yr to be cogredient variables. Kronecker has develo ed the case when n=2, and deduced various class-relations for quadratic forms in a manner resembling that of Liouvil'le. So far as the bilinear forms are concerned, the main result is that the number of classes for the positive determinant alla”-a12a21==A is 12Φ(Δ)+I/(A)}+2e, where e is 1 or 0 according as A is or is not a square, and the symbols <1>, ' have the meaning previously assigned to them (§ 39).
41. Higher Quadratic Forms.-The algebraic theory of quadratics is so complete that considerable advance has been made in the much more complicated arithmetical theory. Among the most important results relating to the general case of n variables are the proof that the class-number is finite; the enumeration of the arithmetical invariants of a form; classification according to orders and genera, and proof that genera with specified characters exist; also the determination of all the rational transformations of a given form into itself. In connexion with a definite form there is the important conception of its weight; this is defined as the reciprocal of the number of its proper auto morphs. Equivalent forms are of the same weight; this is defined to be the weight of their class. The weight of a genus or order is the sum of the weights of the classes contained in it; and expressions for the weight of a given genus have actually been obtained. For binary forms the sum of the weights of all the genera coincides with the expression denoted by H(A) in § 39. The complete discussion of a form requires the consideration of (n-2) associated quadratics; one of these is the contra variant of the given form, each of the others contains more than rt variables. For certain quaternary and senary classes there are formulae analogous to the class-relations for binary forms referred to in § 39 (see Smith, Proc. R.S. xvi., or Collected Papers, i. 510).
Among the most interesting special applications of the theory are certain propositions relating to the representation of numbers as the sum of squares. In order that a number may be expressible as the sum of two squares it is necessary and sufficient for it to be of the form PQ”, where P has no square factor and no prime factor of the form 4n-Q-3. A number is expressible as the sum of three s uares if, and only i, it is of the form m2n with rt: I, =r=2 =+=3 (mod til); when rn = I and 1153 (mod 8), all the squares are odd, and hence follows Fermat's theorem that every number can be expressed as the sum of three triangular numbers (one or two of which may be 0). Another famous theorem of Fermat's is that every number can be expressed as the sum of four squares; this was first proved by jacobi, who also proved that the number of solutions of rt =x' +y' +z2+t” is 84>(n), if rt is odd, while if rt is even it is 24 times the sum of the odd factors of rt. Explicit and finite, though more complicated, formulae have been obtained for the number of representations of rt as the sum of five, six, seven and eight squares respectively. As an example of the outstanding difficulties of this part of the subject may be mentioned the problem of finding all the integral (not merely rational) auto morphs of a given form f. When f is ternary, C. Hermite has shown that the solution depends on finding all the integral solutions of F(x, y, z) +P = I, where F is the contra variant of f.
Thanks to the researches of Gauss, Eisenstein, Smith, Hermite and others, the theory of ternary quadratics is much less incomplete than that of quadratics with four or more variables. Thus methods of reduction (iliave been found both for definite and for indefinite forms; so that it would be possible to draw up a table of representative forms, if the result were worth the labour. One specially important theorem is the solution of ax'+by2 -1-cz' ==o; this is always possible if -bc, -ca, -ab are quadratic residues of a, b, c respectively, and a formula can then be obtained which furnishes all the solutions.
42. Complex Numbers.-One of Gauss's most important and far reaching contributions to arithmetic was his introduction of complex integers a+bi, where a, b are ordinary integers, and, as usual, i2 = −1. In this theory there are four units ±1, ±i; the numbers ih(a+bi) are said to be associated; a-bi is the conjugate of a+bi and we write N(a=f=bi) =a2+b2, the norm of a+bi, its conjugate, and associates. The most fundamental proposition in the theory is that the process of residuation (§ 24) is applicable; namely, if m, ri are any two complex integers and N(m) >N(n), we can always find integers q, r such that m=gn+r with N(r)≤12N(n), This may be proved analytically, but is obvious if we mark complex integers by points in a plane. Hence immediately follow propositions about resolutions into prime factors, greatest common measure, &c., analogous to those in the ordinary theory; it will only be necessary to indicate special points of difference.
We have 2 = −i(1+i)2, so that 2 is associated with a square; a real prime of the form 4n+3 is still a prime but one of the form 4n+1 breaks up into two conjugate prime factors, for example 5 = (1 -2i)(1 +2i) An integer is even, semi-even, or odd according as it is divisible by (I +i)', (I +i) or is prime to (1+i). Among four associated odd integers there is one and only one whichré I (mod 2+ 2i); this is said to be primary; the conjugate of a primary number is primary, and the product of any number of primaries is primary. The conditions that a+bi may be primary are bEo (mod 2) a+b-120 (mod 4). Every complex integer can be uniquely expressed in the form i"'(I+i)"a°bBc1' . ., where 0 5m<4, and a, b, c, . are primary primes.
With respect to a complex modulus m, all complex integers may be distributed into N (m) congruous classes. If m=h(a+bi) where a, b are co-primes, we may take as representatives of these classes the residues x-+-yi where x=0, I, 2, .. .{(a'+b')h-Il; y=o, I, 2, (h-I). Thus when b=o we may take x=o, 1, 2, ...(h-I); y=o, I, 2, . . (h-1), giving the h' residues of the real number h; while if a-|-bi is prime, r, 2, 3, . . .(a'+b2-I) form a complete set of residues.
The number of residues of m that are prime to m is given by ¢<m) =N(m>n (1 -NQB)
where the product extends to all prime factors of m. As an analogue to Fermat s theorem we have, for any integer prime to the modulus,
x¢<"'>E I (mod m), xN°1i=- I (mod p) according as m is composite or prime. There are ¢lN(p) -I) primitive roots of the prime p; a primitive root in the real theory for a real prime 4n+I is also a primitive root in the new theory for each prime factor of (4n+I), but if p=4n.+3 be a prime its primitive roots are necessarily complex. 43. If p, q are any two odd primes, we shall define the symbols zand 'by the congruences p{{N(q)-tl; 2, p1lN(q)'llE= '(m0d Q), it being understood that the symbols stand for absolutely least P residues. It follows that 2 = I or - I according as p is a quadratic residue of q or not; and that '=I only if p is a bi quadratic residue of q. If p, q are primary primes, we have two laws of reciprocity, expressed by the equations p/q.-al q/p=N<»»~1=-=f-To these must be added the supplementary formulae 2=<-l)i{N<»>-Il, 2=(-l)a1<»+b>=-=l, 4=ie, <a-1) '=i}{a+b-<l+b>2}, a+bi being a primary odd prime. In words, the law of bi quadratic reciprocity for two primary odd primes mayfbe expressed by saying that the biquadrate characters of each prime with respect to the other are identical, unless p=qE3-|-2i (mod 4), in which case they are opposite. The law of biqkpadratic reciprocity was discovered by Gauss, who does not seem, owever, to have obtained a complete proof of it. The first published proof is that of Eiseustein, which is very beautiful and simple, but involves the theory of lemniscate functions. A proof on the lines indicated in Gauss's posthumous papers has been developed by Busche; this probably admits of simplification. Other demonstrations, for instance Jacobi's, depend on cyclotomy (see below). 44. Algebraic Numbers.—Thefirst extension of Gauss's complex theory was made by E. E. Kummer, who considered complex numbers represented by rational integral functions of any roots of unity, thus including the ordinary theory and Gauss's as special cases. He was soon faced by the difficulty that, in some cases, the law that an integer can be uniquely expressed as the product of prime factors appeared to break down. To see how this happens take the equation fl' +11 +6 =0, the roots of which are expressible as rational integral functions of 23rd roots of unity, and let 17 be either of the roots. If we define an+b to be an integer, when a, b are natural numbers, the product of any number of such integers is uniquely expressible in the form lf;-I-m. Conversely evei}y integer can be expressed as the product of a finite number o in decomposable integers a+bη, that is, integers which cannot be further resolved into factors of the same type. But this resolution is not necessarily unique: for instance 6=2.3= -η(η+1), where 2, 3, η, η+1 are all indecomposable and essentially distinct. To see the way in which Kummer surmounted the difficulty consider the congruence
u2+u+6≡0(mod p)
where p is any prime, except 23. If -23Rp this has two distinct roots u1, u2; and) we say that aη+b is divisible by the ideal prime factor of p corresponding to ul, if aul+bE0 (mod p). For instance, if p=2 we may put ul=o, ul= I and there will be two ideal factors of 2, say pl and pg such that an+b≡0 (mod pl) if bEo (mod 2) and a1;+bEo (mod pl) if a+bEo (mod 2). If both these congruences are satisfied, a==° bio (mod 2) and af;+b is divisible by 2 in the ordinary sense. Moreover (a1;+b)(c17+d)=(bc+ad-ac)»q+(bd-6ac) and if this product is divisible by pl, bdEo (mod 2), whence either an+b or cv;+d is divisible by pl; while if the product is divisible by pg we have bc+ad+bd-7ac==o (mod 2) which is equivalent to (a-HJ) (c +d)§ o (mod 2), so that again either a17~l-b or 617+d is divisible by pl. Hence we may properly speak of pl and pl as [prime divisors. Similarly the congruence u'+u+620 (mod 3) defines two ideal prime factors of 3, and ar;+b is divisible by one or the other of these according as bEo (mod 3) or 2a+bEo (mod 3); we will call these prime factors pl, p¢. With this notation we have (neglecting unit factors)
2 =P1Pzl 3 '-=PsP4l 17 =P1P3l I +7 =172P4
Real primes of which -23 is a non-quadratic residue are also primes in the field (1l); and the prime factors of any number an-|-b, as well as the degree of their multiplicity, may be found by factorizing (6a'-ab+bz), the norm of (an+b). Finally every integer divisible by pl is expressible in the form =b 2m i (I +17)n where m, n are natural numbers (or zero); it is convenient to denote this fact by writing pl=[2, 1+-n], and calling the aggregate 2711-|~(I-i-'l])?l a compound modulus with the base 2, I +1l. This generalized idea of a modulus is very important and far-reaching; an aggregate is a modulus when, if a, B are any two of its elements, a+H and a-B also belong to it. For arithmetical purposes those moduli are most useful which can be put into the form [al, a2, ...a., ,] which means the aggregate of all the quantities xlal+x2al+...+x, .a, . obtained by assigning to (xl, xl, ...x, ,), independently, the values ol=|=I, =|=2, &c. Compound moduli may be multiplied together, or raised to powers, by rules which will be plain from the following example. We have
pl==[4, 2<1+n). (1 'l'1l)2]=1412+2171-5+1ll=[4l 12, -5+n] =l4l-5+1:] =[4l 3+11]
hence
P2'=P2'-P2=l4» 3-1-11] ><l2l I +nl=[S»4+4nl 6-l~211»3+411+112] =l8, 4+41l. 6+2n, -3+3111=(11-I)[11+2,11-6,3]=(11-I)[I.11l-Hence every integer divisible by pl” is divisible by the actual integer (11-I) and conversely; so that in a certain sense we may regard pz as a cube root. Similarly the cube of any other ideal prime is of the form (a1l+b)[I, 1;]. According to a principle which will be explained further on, all primes here considered may be arranged in three classes; one is that of the real primes, the others each contain ideal primes only. As we shall see presently all these results are intimately connected with the fact that for the determinant -23 there are three primitive classes, represented by (1, 1, 6) (2, 1, 3), (2, -1, 5) respectively.
45. Kummer's definition of ideal primes sufficed for his particular purpose, and completely restored the validity of the fundamental theorems about factors and divisibility. His complex integers were more general than any previously considered and suggested a definition of an algebraic integer in general, whic h is as follows: if al, al, ...a, . are ordinary integers (i.e. elements of N, § 7), and 0 satisfies an equation of the form
6"'i'a10" l'l'a20” 2'i" - - - 'l'an»-16'l'an:ol
is said to be an algebraic integer. We may suppose this equation irreducible; is then said to be of the nth order. The n roots 0, 0', 0”, . .0<""1) are all different, and are said to be conjugate. If the equation began with (1/00" instead of 0", 6 would still be an algebraic number; every algebraic number can be put into the form 6/m, where m is a natural number and 0 an algebraic integer. Associated with 6 we have afield (or corpus) S2 = R(0) consisting of all rational functions of 0 with real rational coefficients; and in like manner we have the conjugate fields S2' = R(»9'), &c. The aggregate of integers contained in S2 is denoted by o.
Every element of Ω can be put into the form
where c0, cl, ...c, , l are real and rational. If these coefficients are all integral, w is an integer; but the converse is not necessarily true. It is possible, however, to find a set of integers ω1, ω2 .... ωn, belonging to Ω, such that every integer in Ω can be uniquely expressed in the form
co-ordinates of with respect to the base . Thus is a modulus (§ 44), and we may write . Having found one base, we can construct any number of equivalent bases by means of equations such as , where the rational integral coefficients are such that the determinant .
If we write
is a rational integer called the discriminant of the field. Its value is the same whatever base is chosen.
If is any integer in , the product of and its conjugates is a rational integer called the norm of , and written . By considering the equation satisfied by we see that where is an integer in . It follows from the definition that if are any two integers in , then ; and that for an ordinary real integer , we have .
46. Ideals.—The extension of Kummer’s results to algebraic
numbers in general was independently made by J. W. R. Dedekind
and Kronecker; their methods differ mainly in matters of notation
and machinery, each having special advantages of its own for
particular purposes. Dedekind’s method is based upon the notion
of an ideal, which is defined by the following properties:—
(i.) An ideal is an aggregate of integers in .
(ii.) This aggregate is a modulus; that is to say, if are any two elements of (the same or different) is contained in . Hence also contains a zero element, and is an element of .
(iii.) If is any element of , and any element of , then is an element of . It is this property that makes the notion of an ideal more specific than that of a modulus.
It is clear that ideals exist; for instance, itself is an ideal. Again, all integers in which are divisible by a given integer (in ) form an ideal; this is called a principal ideal, and is denoted by . Every ideal can be represented by a base (§§ 44, 45), so that we may write , meaning that every element of can be uniquely expressed in the form , where is a rational integer. In other words, every ideal has a base (and therefore, of course, an infinite number of bases). If are any two ideals, and if we form the aggregate of all products obtained by multiplying each element of the first ideal by each element of the second, then this aggregate, together with all sums of such products, is an ideal which is called the product of and and written or . In particular . This law of multiplication is associative as well as commutative. It is clear that every element of is contained in : it can be proved that, conversely, if every element of is contained in , there exists an ideal such that . In particular, if is any element of , there is an ideal such that . A prime ideal is one which has no divisors except itself and . It is a fundamental theorem that every ideal can be resolved into the product of a finite number of prime ideals, and that this resolution is unique. It is the decomposition of a principal ideal into the product of prime ideals that takes the place of the resolution of an integer into its prime factors in the ordinary theory. It may happen that all the ideals in are principal ideals; in this case every resolution of an ideal into factors corresponds to the resolution of an integer into actual integral factors, and the introduction of ideals is unnecessary. But in every other case the introduction of ideals or some equivalent notion, is indispensable. When two ideals have been resolved into their prime factors, their greatest common measure and least common multiple are determined by the ordinary rules. Every ideal may be expressed (in an infinite number of ways) as the greatest common measure of two principal ideals.
47. There is a theory of congruences with respect to an ideal modulus. Thus means that is an element of . With respect to , all the integers in may be arranged in a finite number of incongruent classes. The number of these classes is called the norm of , and written . The norm of a prime ideal is some power of a real prime ; if , is said to be a prime ideal of degree . If are any two ideals, then . If , then , and there is an ideal such that . The norm of a principal ideal is equal to the absolute value of as defined in § 45.
The number of incongruent residues prime to is—
,
where the product extends to all prime factors of . If is any element of prime to ,
.
Associated with a prime modulus for which we have primitive roots, where has the meaning given to it in the ordinary theory. Hence follow the usual results about exponents, indices, solutions of linear congruences, and so on. For any modulus we have , where the sum extends to all the divisors of .
48. Every element of which is not contained in any other ideal is an algebraic unit. If the conjugate fields consist of real and imaginary fields, there is a system of units , where , such that every unit in is expressible in the form where is a root of unity contained in and are natural numbers. This theorem is due to Dirichlet.
The norm of a unit is or ; and the determination of all the units contained in a given field is in fact the same as the solution of a Diophantine equation
.
For a quadratic field the equation is of the form , and the theory of this is complete; but except for certain special cubic corpora little has been done towards solving the important problem of assigning a definite process by which, for a given field, a system of fundamental units may be calculated. The researches of Jacobi, Hermite, and Minkowsky seem to show that a proper extension of the method of continued fractions is necessary.
49. Ideal Classes.—If is any ideal, another ideal can always be
found such that is a principal ideal; for instance, one such
multiplier is . Two ideals are said to be equivalent
() or to belong to the same class, if there is an ideal such
that are both principal ideals. It can be proved that two
ideals each equivalent to a third are equivalent to each other and
that all ideals in may be distributed into a finite number, , of
ideal classes. The class which contains all principal ideals is called
the principal class and denoted by .
If are any two ideals belonging to the classes respectively, then belongs to a definite class which depends only upon and may be denoted by or indifferently. Thus the class-symbols form an Abelian group of order , of which is the unit element; and, mutatis mutandis, the theorems of § 37 about composition of classes still hold good.
The principal theorem with regard to the determination of is the following, which is Dedekind’s generalization of the corresponding one for quadratic fields, first obtained by Dirichlet. Let
where the sum extends to all ideals contained in ; this converges so long as the real quantity is positive and greater than . Then being a certain quantity which can be calculated when a fundamental system of units is known, we shall have
.
The expression for is rather complicated, and very peculiar; it may be written in the form
where means the absolute value of the square root of the discriminant of the field, have the same meaning as in § 48, is the number of roots of unity in , and is a determinant of the form , of order , with elements which are, in a certain special sense, “logarithms” of the fundamental units .
50. The discriminant enjoys some very remarkable properties. Its value is always different from ; there can be only a finite number of fields which have a given discriminant; and the rational prime factors of are precisely those rational primes which, in , are divisible by the square (or some higher power) of a prime ideal. Consequently, every rational prime not contained in is resolvable, in , into the product of distinct primes, each of which occurs only once. The presence of multiple prime factors in the discriminant was the principal difficulty in the way of extending Kummer’s method to all fields, and was overcome by the introduction of compound moduli—for this is the common characteristic of Dedekind’s and Kronecker’s procedure.
51. Normal Fields.—The special properties of a particular field
are closely connected with its relations to the conjugate fields
. The most important case is when each of the
conjugate fields is identical with : the field is then said to be
Galoisian or normal. The aggregate of all
rational functions of and its conjugates is a normal field: hence
every arithmetical field of order is either normal, or contained in a
normal field of a higher order. The roots of an equation
which defines a normal field are associated with a group of substitutions:
if this is Abelian, the field is called Abelian; if it is cyclic,
the field is called cyclic. A cyclotomic field is one the elements of
which are all expressible as rational functions of roots of unity; in
particular the complete cyclotomic field , of order , is the
aggregate of all rational functions of a primitive mth root of unity.
To Kronecker is due the very remarkable theorem that all Abelian
(including cyclic) fields are cyclotomic: the first published proof of
this was given by Weber, and another is due to D. Hubert.
Many important theorems concerning a normal field have been established by Hilbert. He shows that if is a given normal field of order , and any of its prime ideals, there is a finite series of associated fields , of orders , such that , and that if , , a prime ideal in . If is the last of this series, it is called the field of inertia (Trdgheitskorper) for p: next after this comes another field of still lower order called the resolving field (Zerlegungskiirper) for p, and in this field there is a. prime of the first degree, pm, such that pi+1=p'°, where k=m/mi. In the field of inertia pq; remains a prime, but becomes of higher degree; in Sli 1, which is called the branch-field (Verzweigungskorper) it becomes a power of a prime, and by going on in this way from the resolving field to Sl, we obtain (H-2) representations for any prime ideal of the resolving field. By means of these theorems, Hilbert finds an expression for the exact power to which a rational prime p occurs in the discriminant of Q, and in other ways the structure of S2 becomes more evident. It may be observed that whem m is prime the whole series reduces to SZ and the rational field, and we conclude that every prime ideal in S2 is of the first or mth degree: this is the case, for instance, when m=2, and is one of the reasons why quadratic fields are comparatively so simple in character. 52. Quadratic Fields.-Let rn be an ordinary integer different from +I, and not divisible by any square: then if x, y assume all ordinary rational values the expressions x+y/m are the elements of a field which may be called Q(~/ rn). It should be observed that y/rn means one definite root of x2-m=o, it does not matter which: it is convenient, however, to agree that ylm is positive when m is positive, and ii/m is negative when rn is negative. The principal results xgating to SZ will now be stated, and will serve as illustrations of 44'5¥-
In the notation previously used
n=[I, %(I+#711)] or [I, y/rn]
according as mal (mod 4) or not. In the first case A=m, in the second A =4m. The field S2 is normal, and every ideal prime in it is of the first degree.
Let q be any odd prime factor of m; then q=q2, where q is the rime ideal [q, %(q+/m)] when mel (mod 4) and in other cases ii, /m]. An odd prime p of which rn is a quadratic residue is the roduct of two prime ideals p, p', which may be written in the form Fp, § (a+/m)], [p, %(a-/m)] or Ip, a+/ml, [p, a-»/m], according as mal (mod 4) or not: here a is a root of x2Em (mod p), taken so as to be odd in the first of the two cases. All other rational odd primes are primes in Q. For the exceptional prime 2 there are four cases to consider: (i.) if msl (mod 8), then 2 = l2, %(I +/m)]X[2, $(I -/m)]. (ii.) If mE5 (mod 8), then 2 is prime: (iii.) if mE2 (mod 4), 2 = [2, / m]2: (iv.) if rn =3 (mod 54), 2 =[2, I +x/m)2. Illustrations will be found in § 44 for the case rn =23.
53. Normal Residues. Genera.-Hilbert has introduced a very convenient definition, and a corresponding symbol, which is a generalization of Legendre's quadratic character. Let n, m be rational integers, rn notasquare, 'wany rational prime; we write = +I if, to the modulus w, n is congruent to the norm of an integer contained in SZ(/ m); in all other cases we put = -1. This new symbol obeys a set of laws, among which may be especially noted = = and = +I, whenever n, m are prime
to p.
Now let ql, gi, . . qi be the different rational prime factors of the discriminant of SDH m); then with any rational integer a we may associate the l symbols
(a, nz) (11, (zz rn)
T, T2, —gt
and call them the total character of a with respect to SZ. This definition may be extended so as to give a total character for every ideal a in Q, as follows. First let S2 be an imaginary field (rn <o); we put r =t, 1'i=N(a), and call
rf-I m> <”- “>
gl ' ° i ' gr
the total character of a. Secondly, let Sl be a real field; we first determine the t separate characters of - 1, and if they are all positive we put 7z= -l-N(a), r=t, and adopt the r characters just written above as those of a. Suppose, however, that one of the characters of - I is negative; without loss of generality we may take it to be that with reference to qt. We then put r=t-I, i= =f=N(a) taken with such a sign that = +I, and takeas theltotal character of a the symbols for i=I, 2, . .(t-1).
With these definitions it can be proved that all ideals of the same class have the same total character, and hence there is a distribution of classes into genera, each genus containing those classes for which the total character is the same (cf. § 36).
Moreover, we have the fundamental theorem that an assigned set of r units =t I corresponds to an actually existing genus if, and only if, their product is +I, so that the number of actually existing genera is 2"'. This is really equivalent to a theorem about quadratic forms first stated and proved by Gauss; the same may be said about the 8 »-
5 /
next proposition, which, in its natural order, is easily proved by the method of ideals, whereas Gauss had to employ the theory of ternary quadratics.
Every class of the principal genus is the square of a class. An ambiguous ideal in S2 is defined as one which is unaltered by the change of / rn to -w/ rn (that is, it is the same as its conj ugate) and not divisible by any rational integer except =*=I. The only ambiguous prime ideals in S2 are those which are factors of its discriminant. PuttingA=q12 of. . .q,2, there are in S2 exactly 2t ambiguous ideals: namely, those factors of A, including n, which are not divisible by any square. It is a fundamental theorem, first proved by Gauss, that the number of ambiguous classes is equal to the number of genera.
54. Class-Number.-The number of ideal classes in the field (ZH m) may be expressed in the following forms:-
(i.) m <0:
h=i2 (n=1, 2.., ., -A);
(ii.) m>o= "
II sin E
h ;1 A
"2 log e Og avr
II sin Z-
In the first of these formulae -r is the number of units contained in Sl; thus r=6 for A= -3, r=4 for A= -4, -r=2 in other cases. In the second formula, e is the fundamental unit, and the products are taken for all the numbers of the set (I, 2, . . .A) for which = +I, =-'I respectively. In the ideal theory the' only way in which these formulae have been obtainediis by a modification of Dirichlet's method; to prove them without the use of transcendental analysis would be a substantial advance in the theory.
55. Suppose that any ideal in S2 is expressed in the form [wh wg]-; then any element of it is expressible as xw1+yw2, where x, y are rational integers, and we shall have N (xw1+;vw2) =ax2+bxy+cy', where a, b, c are rational numbers contained in the ideal. If we put x=a.x'+By', y='yx'--6y', where a, 6, 'y, 5 are rational numbers such that o.5-/3'y= il, we shall have simultaneously (a, b, c) (x, y)2 = (a', b', c') (x', y')2 as in § 52 and also V (af, ff, 6') (x', y')2 = N{x'(v.w1+'yw2) 'i'y'(/3w1+5w2)} =N(x'w'i-I-y'w'2), where [w'1, w'2l is the same ideal as before. Thus all equivalent forms are associated with the same ideal, and the numbers representable by forms of a particular class are precisely those which are norms of numbers belonging to the associated ideal. Hencethe class-number for ideals in SZ is also the class-number for a set of quadratic forms; and it can be shown that all these forms have the same determinant A. Conversely, every class of forms of determinant A can be associated 'with a definite class of ideals in S'l(~/ rn), where m=A or +A as the case may be. Composition of form-classes exactly corresponds to the multiplication of ideals: hence the complete analogy between the two theories, so long as they are really in contact. There is a corresponding theory of forms in Connexion with a field of order rz: the forms are of the order n, but are only very special forms of that order, because they are algebraically resolvable into the product of linear factors.
56. Complex Quadratic Forms.-Dirichlet, Smith and others, have discussed forms (a, b, c) in which the coefficients are complex integers of the form m-l-ni; and Hermite has considered bilinear forms oxx' +bxy'+b'x'y+cyy', where x', y', b' are the conjugates of x, y, b and a, e, are real. Ultimately these theories are connected with fields of the fourth order; and of course in the same way we might consider forms (a, b, c) with integral coefficients belonging to any given field of order n: the theory would then be ultimately connected with a field of order 2n.
57. Kronecker's Method.—In practice it is found convenient to combine the method of Dedekind with that of Kroneckcr, the main principles of which are as follows. Let F(x, y, 2, . . .) be a polynomial in any number of indeterminate (umbrae, as Sylvester calls them) with ordinary integral coefficients; if n is the greatest common measure of the coefficients, we have F =r1E, where E is a primary or unit form. The positive integer n is called the divisor of F; and the divisor of the product of two forms is equal to the product of the divisors of the factors. Next suppose that the coefficients of F are integers in a field S2 of order n. Denoting the conjugate forms by F', F", . . . F<"">, the product FF'F” . . . F<"'U=fE, where f is a real positive integer, and E a unit form with real integral coefficients. The natural numberf is called the norm of F. If F, G are any two forms (in SZ) we have N(FG)=N(F)N(G). Let the coefficients of F be o.1, ag, &c., those of G 61, BQ, &c., and those of FG Yu 72, &c.; and let p be any prime ideal in Q. Then if p"' is the highest power of p Contained in each of the coefficients a¢, and p” the highest power of p contained in each of the coefficients Bi, p'"+" is the highest ower of p contained by the whole set of coefficients vi. Vi/riting dvfinu, a2, . . .) for the highest ideal divisor of ai, az, &c., this is called the content of F; and we have the theorem that the product of the contents of two forms is equal to the content of the product of the forms. Every form is associated with a definite ideal nl, and we have N(F) =N(m) if m is the content of F, and N(m) has the meaning already assigned to it. On the other hand, to a given ideal correspond an indefinite number of forms of which it is the content; for instance (§ 46, end) we can find forms ax-{-By of which any given ideal is the content.
58. Now let w1, w1, w, . be a basis of o; u1, 141, . un a set of indeterminate; and
E=w1ui+wm+ - - - +w»u, .:
5 is called the fundamental form of 9. It satisfies the equation N (x-5) =O, or
F(x) =x"+U1x""+ . . -l-U, .=o
where U1, U2, . Un are rational Eolynomials in 141, ug, .un with rational integral coefficients. T is is called the fundamental equation.
Suppose now that p is a rational prime, and that p=p“q"r°. . where p, q, r, . &c., are the different ideal prime factors of p, then if F (x) is the left-hand side of the fundamental equation there is an ide ntical congruence
F(x) ={P(x)}“{Q(x)}”{R(x)l°- - -(m0d P)
where P(x), Q(x), &c., are prime functions with respect to p. The meaning of this is that if we expand the expression on the right-hand side of the congruence, the coefficient of every term x'u1"'. un will be congruent, mod p, to the corresponding coefficient in'F (x). Iff, g, h, &c., are the degrees of p, q, r, &c. (§ 47), thenf, g, h, are the dimensions in x, u1, u2, . . u.. of the forms of P, Q, R, respectively. For every prime p, which is not a factor of A, a=b=c=. =I and F(x) is congruent to the product of a set of different prime factors, as many in number as there are different ideal prime factors of p. In particular, if p is a prime in SZ, F(x) is a prime function (mod p) and conversely.
It generally happens that rational integral values a1, aa, an can be assigned to u1, ug, . un such that Un, the last term in the fundamental equation, then has a value which is prime to p. Supposing that this condition is satisfied, let a1w1+rL1w1+ .+a, .w,1=a; and let P1(a) be the result of putting x=a, u¢=a; in P(x). Then the ideal p is completely determined as the greatest common divisor of p and P1(a); and similarly for the other prime factors of p. There are, however. exceptional cases when the condition above stated is not satisfied.
59. Cyclotomy.-It follows from de Moivre's theorem that the arithmetical solution of the equation x”'- I =0 corresponds to the division of the circumference of a circle into m equal parts. The case when m is composite is easily made to depend on that where m is a power of a prime; if m is a power of 2, the solution is effected by a chain of quadratic equations, and it only remains to consider the case when m =q'<, a power of an odd prime. It will be convenient to writeu=¢(m)=g'<'I(q-I); if we also put r=e2ffi/M, the primitive roots of x"'= I will be, u in number, and represented by r, r", fb, &c. where I, a, b, &c., form a complete set of prime residues to the modulus m. These will be the roots of an irreducible equation f(x) =o of degreep; the symbol f(x) denoting (x"'-1)-:~(x"'/'1—I). There are primitive roots of the congruence x»'=I (mod m); let g be any one of these. Then if we put r°h=r;, , we obtain all the roots of f(x) =o in a definite cyclical order (rr, rg .r, t); and the change of r into r” produces a cyclical permutation of the roots. It follows from this that every cyclic polynomial in 71, rg, .rn with rational coefficients is equal to a rational number. Thus if we write l+ag +bg'+.+kgu'1 =n, we have, in virtue of r;, =r”'°, r1“r1°...r, .~-1"1, ».'=r", and, if we use S to denote cyclical summation, S(r1“r1". JM) = r"+r""+. . +r”°"", the sum of the nth powers of all the roots of f(x) =o, and this is a rational integer or zero. Since every cyclic polynomial is the sum of parts similar to S(r1“r2b. . .r, i'), the theorem is proved. Now let e, f be any two conjugate factors of p, so that ef = il, and let
lli=7i'i'7i+¢'i"f1Z+2e"l"- . 'i"7i+(/-1)e (i=I, 21- - -6) then the elementary symmetric functions Em, Enmf, &c., are cyclical functions of the roots of f(x) =o and therefore have rational values which can be calculated: consequently 171, 112, . sq., which are called the f-nomial periods, are the roots of an equation F(11) ='fle'l'5171° 1'l"- - . +v.=0
with rational integral coefficients. This is irreducible, and defines a field of order e contained in the field defined by f(x) =o. Moreover, the change of r into r” alters ni into "li+1, and we have the theorem that any cyclical function of -41, 172, . . . n, is rational. Now let h, k be any conjugate factors of f and put
Zi =f¢+f¢+1»+ff+m+~ - ~7d+(f-h)c (i= I, 21 3») then § 1, § '1+, , fm.. .§ '1+(;, 1), will be the roots of an equation GK) = § "'-1n§ ”"'1+f2§ ""°+- » - -Hn =0»
the coefficients of which are expressible as rational polynomials in 171. Dividing h into two conjugate factors, we can deduce from G(§ ') =0 another period equation, the coefficients of which are rational polynomials in 111, § '1, and so on. By choosing for e, h, &c., the successive prime factors of n, ending up with 2, we obtain a set of equations of prime degree, each rational in the roots of the preceding equations, and the last having r1 and 1'1-1 for its roots. Thus to take a very interesting historical case, let m = 17, so that p = 16 = 24, the equations are all quadratics, and if we take 3 as the primitive roct of 17, they are
1l”+17'4=0, s“'-'/it-I =0 s
2>~'-2s“>+(11§ -n+s“-3)=0, P2"'}P'l'I=0-If two quantities (real or complex) a and b are represented in the usual way by points in a plane, the roots of x2-l-ax+b=o will be represented by two points which can be found by a Euclidean construction, that is to say, one requiring only the use of rule and compass. Hence a regular polygon of seventeen sides can be inscribed in a given circle by means of a Euclidean construction; a fact first discovered by Gauss, who also found the general law, which is that a regular polygon of m sides can be inscribed in a circle by Euclidean construction if and only if ¢(m) is a power of 2; in other words m =2' P where P is a product of different odd primes, each of which is of the form 2"+1.
Returning to the case m=g*, we shall call the chain of equations F(1;) =O, &c., when each is of prime degree, a set of Galoisian auxiliaries. We can find different sets, because in forming them we can take the prime factors of pt in any order we like; but their number is always the same, and their degrees always form the same aggregate, namely, the prime factors of u. No other chain of auxiliaries having similar properties can be formecrcontaining fewer equations of a given prime degree p; a fact first stated by Gauss, to whom this theory is mainliy due. Thus if m=q'= we must have at least (fc-I) auxiliaries of or er q, and if q-I =2¢;lpB . . ., we must also have a quadratics, B equations of order p, an so on. For this reason a set of Galoisian auxiliaries may be regarded as providing the simplest solution of the equation f(x) =o
60. When m is an odd prime p, there is another very interesting way of solvin the equation (xp-1) + (x-1) =o. As before let (r1, 12, . . r, , § be its roots arranged in a cycle by means of a primitive root of x1"lEI (mod p); and let e be a primitive root of ef'-1 = I. Also let
01 =71+€f2+62T3+ . . . +EP°27p 1
0k=f1+ék72+£2k73+ . . +€ kfp 1 =2, 3, . . so that 0;, is derived from 01 by changing e into ek. The cyclical permutation (r1, r2, .rp-1) applied to gk converts it into e"'0;, ; hence 0101./0;at1 is unaltered, and may be expressed as a rational, and therefore as an integral function of e. It is found by calculation that we may put
m= I
¢k(€) =@'= §)+ eindm-l-kind(p+1-m) [k=I, Q
0'=+1 m=a
while
0101.4 = -p.
In the exponents of h, (e) the indices are taken to the base g used to establish the cyclical order (11, rg . r, , 1). Multiplying together the (p—2) preceding equalities, the result is
01""= -P1//1(e)1,02(e) .1//p 3(e) =R(e)
where R(e) is a rational integral function of e the degree of which, in its reduced form, is less than ¢>(p- I). Let p be any one definite root of x1'°1=R(e), and put 01 =p: then since
ok .
§ ;:=Pi¢z- ~ -'pkwe
must take 0;¢=p"///MP2 , bk 1 =R;, (e)p", where R1, (¢) is a rational function of e, which we may suppose put into its reduced integral form; and finally, by addition of the equations which define 01, 01, &c.,
(;D- I)f1=n+Rz(f)P”-I-Ra(f)p3+- - ~ +Rp 2(¢)P""-If in this formula we change p into e"'p, and 11 into r;H.1, it still remains true. 4
It will be observed that this second mode of solution employs a Lagrangian resolvent 01; considered merely as a solution it is neither so direct nor so fundamental as that of Gauss. But the form of the solution is very interesting; and the auxiliary numbers Me) have many curious properties, which have been investigated by Jacobi, Cauchy and Kronecker
61. When m =q'<, the discriminant of the corresponding cyclotomic field is =hq', where)=q" '(f<g-x-I). The prime q is equal to q", where p =¢(m) =q"" (q- 1), and q is a prime ideal of the first degree. If p is any rational prime distinct from g, and f the least exponent such that pf§ I (mod. m), f will be a factor of, u, and putting /.¢/f=e, we have p=p1p2 . p, ,, where pi, pg . . pe are different prime ideals each of the fth degree. There are similar theorems for the case when m is divisible by more than one rational prirne. Kummer has stated and proved laws of reciprocity for quadratic and higher residues in what are called regular fields, the definition of which is as follows. Let the field be R(e”"/P), where p is an odd prime; then this field is regular, and p is said to be a regular prime, when h, the number of ideal classes in the field, is not divisible by p. Kummer proved the very curious fact that p is regular if, and only if, it is not a factor of the denominators of the first Hp-3) Bernoullian numbers. He also succeeded in showing that in the field R(e2πi/p>) the equation αp+βp+γp=0 has no integral solutions whenever h is not divisible by p2. What is known as the “last” theorem of Fermat is his assertion that if m is any natural number exceeding 2, the equation xm+ym=zm has no rational solutions, except the obvious ones for which xyz=0. It would be sufficient to prove Fermat's theorem for all prime values of m; and whenever Kummer’s theorem last quoted applies, Fermat's theorem will hold. Fermat's theorem is true for all values of m such that 2<m<101, but no complete proof of it has yet been obtained.
Hilbert has studied in considerable detail what he calls Kummer fields, which are obtained by taking x, a primitive pth root of unity, and y any root of yi'-a=o, where a is any number in the field R(x) which is not a perfect pth power in that field. The Kummer field is then R (x, y), Consisting of) all rational functions of x and y. ()ther fields that have been discussed more or less are general cubic fields, some special bi quadratic and a few Abelian fields not cyclic. Among the applications of cyclotomy may be mentioned the proof which it affords of the theorem, first proved by Dirichlet, that if m, n are any two rational integers prime to each other, the linear form mx +n is capable of representing an infinite number of rimes.
62. Gauss's Sums.-Let m be any positive real integer; then S=m'I
i+i=-»»
QZSQFI/m=1 m
I-l-I V
r=o
This remarkable formula, when m is prime, contains results which were first obtained be' Gauss, and thence known as (§ auss's sums. The easiest method o proof is Kronecker's, which consists in finding the value of f {e21fi2”"'dz/(I~e2"i2)}, taken round an appropriate contour. It will be noticed that one result of the formula is that the square root of any integer can be expressed as a rational function of roots of unity.
The most important application of the formula is the deduction from it of the law of quadratic reciprocity for real primes: this was done by Gauss. .
63. One example may be given of some remarkable formulae giving explicit solutions of representations of numbers by certain quadratic forms. Let p be any odd prime of the form 7n+2; then we shall have p=7n+2=x2+7y2, where x is determined by the congruences (),
n .
2x~ (m0d P), x-3 (mod 7)-This
formula was obtained by Eisenstein, who proved it by investigating properties of integers in the field generated by 'l]a'2I1]"7=0| which is a component of the field generated b seventh roots- of unity. The first formula of this kind was given by gauss, and relates to the case p=4n+I =x' -|-y2; he conceals its connexion with complex numbers. Probably there are many others which have not yet been stated.
64. Higher Congruences. Functional Moduli.-Suppose that p is a real prime, and that f(x), ¢»(x) are polynomials in x with rational integral coefficients. The congruence f(x)E¢(x) (mod p) is identical when each coefficient of f is congruent, mod p, to the corresponding coefficient of ¢. It will be convenient to write, under these circumstances, f~¢(mod p) and to say that f, ¢ are equivalent, mod p. Every polynomial of degree h is equivalent to another of equal or lower degree, which has none of its coefficients negative, and each of them less than p. Such a polynomial, with unity for the coefficient of the highest power of x contained in it, may be called a reduced polynomial with respect to p. There are, in all, pf' reduced polynomials of degree h. A polynomial may or may not be equivalent to the product of two others of lower degree than itself; in the latter case it is said to be prime. In every case, F being any polynomial, there is an equivalence F~cf1f¢ ...fl where c is an integer and f, , ff2, ...f1 are prime functions; this resolution is unique. Moreover, it dollows from Fermat's theorem that {F(x)}P~F(xP), {F(x)}1'2..F(xP2), an soon
As in the case of equations, it may be proved that, when the modulus is prime, a congruence f(x)-E o (mod p) cannot have more in congruent roots than the index of the highest power of x in f(x), and that if xE£is a solution, f(x)-(x-£)f1(x), where f1(x) is another polynomial. The solutions of xPEx are all the residues ofp; hence xP~x-x(x+1) (x-+-2) . . .(x-l-p-I), where the right-hand expression is the product of all the linear functions which are prime to p. A generalization of this is contained in the formula
x(x"”"'-I)~Uf(x) (mod P)
where the product includes every prime function f(x) of which the degree is a factor of m. By a process similar to that employed in finding the equation satisfied by primitive mth roots of unity, we can find an expression for the product of all prime functions of a given degree m, and prove that their number is (m> I) I %(Pm 2pm/¢+EPm/ab )
where a, b, c are the different prime factors of m. Moreover, if F is any given function, we can find a resolution F~cF1F2 . . . F, ,, (mod p)
where c is numerical, F1 is the product. of all prime linear functions which divide F, F 2 is the product of all the prime quadratic factors, and so ont
65. By the functional congruence ¢(x)2»//(vc) (mod p, f(x)) is meant that polynomials U, V can be found such that 4>(x) ===, Z/(x)+pU+ Vf(x) identically. We might also writeq>(x)~/»(x) (mod p, f(x)); but this is not so necessary here as in the preceding case of a simple modulus. Let m be the degree of f(x); without loss of generality we may suppose that the coefficient cf x"' is unity, and it will be further assumed that f(x) is a prime function, mod p. Whatever the dimensions of ¢(x), there will be definite functions X(x), ¢, (x) such that 4>(x) =f(x)X(x) -l-q'>i(x) where ¢1(x) is of lower dimension than f(x); moreover, we may suppose qi-1(x) replaced by the equivalent reduced function 4>2(x) mod p. Finally then, ¢E4>2 (mod p, f(x)) where ¢2 is a reduced function, mod Q, of order not greater than (m-I). If we put pm =n, there will be in all (including zero) n residues to the compound modulus (p, f): let us denote these by Ri, R2, . . . R, ,. Then (cf. § 28) if we reject the one zero residue (Rn, suppose) and take any function¢ of which the residue is not zero, the residues of ¢»R1, 4>R2, . . ¢>R, , 1 will all be different, and we conclude that 4>""E I (mod 1), f). Every function therefore satisfies ¢"~¢> (mgd p, f); by putting ¢ =x we obtain the principal theorem stated in 64.
A still more comprehensive theory of compound moduli is due to Kronecker; it will be sufficiently illustrated by a particular case. Let m be a fixed natural number; X, Y, Z, T assigned polynomials, with rational integral coefficients, in the independent variables x, y, z; and let U be any polynomial of the same nature as X, Y, Z, T. We may write U~o (mod m, X, Y, Z, T) to express the fact that there are integral polynomials M, X', Y', Z', T' such that U =mM +X'X +Y'Y-I-Z'Z +T'T
identically. In this notation U-V means that U-V~o. The number of independent variables and the number of functions in the modulus are unrestricted; there may be no number m in the modulus, and there need not be more than one. This theory of Kronecker's is admirably adapted for the discussion of all algebraic problems of an arithmetical character, and is certain to attain a high degree of development.
t is worth mentioning that one of Gauss's proofs of the law of quadratic reciprocity (Gétt. Naehr. 1818) involves the principle of a compound modulus.
66. Forms of Higher Degree:-Except for the case alluded to at the end of § 55, the theory of orms of the third and higher degree is still guite fragmentary. . jordan has proved that the class number is nite. . Poincaré has discussed the classification of ternary and qliiaternary cubics. With regard to the ternary cubic it is known t at from any rational solution of f =o we can deduce another by a process whici is equivalent to finding the tangential of a point (x1, y1, zr) on the curve, that is, the point where the tangent at (xt, yt, zi) meets the curve again. We thus obtain a series of solutions (xl, yl, zt), (xg, y2, zz), &c., which may or may not be periodic. E. Lucas and J. Sylvester have proved that for certain cubics f =o has no rationa solutions; for instance x3-I-y3-Az3=o has rational solutions only if A=ab(a-|-b)/ca, where a, b, c are rational integers. Waring asserted that every natural number can be expressed as the sum of not more than 9 cubes, and also as the sum of not more than I9 fourth powers; these propositions have been neither proved nor disproved.
67. Results derived from Elliptic and Theta Functions.-For the sake of reference it will be convenient to give the expressions for the four lacobian theta functions. Let w be any complex quantity such that the real part of iw is negative; and let g =e"i~. Then A +ou
000(1J)=2qs2e2°"'i" = I -l-2g cos 21rv -I-2q4 cos 41r°v-I-2gP cos 61rv+ . . . oo
0°
=II(I-q'“)(1 +2Q2"1 COS 2r'v+q'~"“).
I
001('u) = I -2g cos 21rv-I-2g4 cos 41m -zqp cos 6-rrv+ . . . ='€I(I -q”)(I -2q”"' Cos 21rv+q*““).
6w(v) =2ql cos 1|-11+ zqf cos 31rv+2qi" cos 51rv+ . . =2qi cos 1rv°ff(1 -q2”)(1 +2q2' cos 2-:rv-l-q4=°), , 0n(v) =2q} sin -/rv-zqi sin 31r-v -I-2q”'° sin 51r-o-. . r =2q& sin 7l"U§ (I -g2°) (I-2q2-' cos 21rv+q4'). Instead of 0w(o), &c., we write 600, &c. Clearly 011=o; we have the important identities
611, = 7600010001 300' '= 0014 +9104
where 911' means the value of d0n(v)/dv for v =o. If, now, we put ~/x=g-£1 /K' =%:» 'l4=7l'0002U, so that κ'+x" = I, we shall have
@»»<»>= @»<»>= »<. @»<»<=»>=; .d
0—(-my) ~/ic snu, Bmw) Per u,6m< v) VK, nu, and, supposing for simplicity that iw is a real negative quantity,
- 0002 =2K, w119092 =2iK', to ='iK'/K,
the notation being that which is now usual for the elliptic functions. It is found that
25 sn 2Ku=2§ J¥ -Q sin (2.9-I) 1ru,
1r 1 I “Q
K so, 1
-5; cn 2Ku =2Z%é-éqcos (2s - I)1ru,
I
K °° ~
~; -dn 2Ku = ~§ +227-L- cos 2s1ru.
From the last formula, by putting u =0, we obtain
®
I +4§ j% =3§ =000”(I +2q+2q'+2qP+- - -)“, and hence, by expanding both sides in ascending powers of q, and e uating the coefficients of q", we arrive at a formula for the number of ways of expressing n as the sum of two squares. If 5 is any odd divisor of n, including I and n itself if n is odd, we find as the coefficient of g" in the expansion of the left-hand side 42(-I)“(6'1)§ on the right-hand side the coefficient enumerates all the solutions n = (=|=x)2+(=*=y)', taking account of the different signs (except for 02) and of the order in which the terms are written (except when x2 =y”). Thus if n is an odd prime of the form 4k-i-I, Z(-1)5(8 )=2, and the coefficient of q" is 8, which is right, because the one possible composition n = zz'-l-b” may be written n = (ia)2+ (=|=b)2 = (ib)2-1-(¢a)2, giving eight representations.
By methods of a similar character formulae can be found for the number of representations of a number as the sum of, 6, 8 squares respectively. The four-square theorem has been stated in § 41; the eight-square theorem is that the number of representations of a number as the sum of eight squares is sixteen times the sum of the cubes of its factors, if the given number is odd, while for an even number it is sixteen times the excess of the cubes of the even factors above the cubes of the odd factors. The five-square and seven square theorems have not been derived from q-series, but from the general theory of quadratic forms.
68. Still more remarkable results are deducible from the theory of the transformation of the theta functions. The elementary formulae are
0n(u, w-l-1) =e"i/“0n(u, w), 010(1l, w'l'Il=@"i/4910(14, <v)» 9o1(1»¢» w'i'I) =900(14. QF), 000(u1 w'l'1) =9ox(14, w), 5-iriuz/wall (2» "!') = "1;/ iW01l(u1 01), w w
Cdiuz/waio (2, -l> = V "if-v910(14, 02), £0 (D
e-1|-iu2/w9w(E' .l) =, /"' ¢¢, ,'90, (u, w), 0) C0
e'"i“2/"'000 (3, -3) =w/ -'Lw900(u, on), where 4 -iw is to be taken in such a way that its real part is positive. Taking the definition of /c given in § 67, and considering x as a function of w, we find
x(w +I) =10102/0012 =iK(<.o) /l<'(r.o), = 2 z= I
rc((D) 001 /000 K
For convenience let »<2(¢.i)=a: then the substitutions (w, w~|-I) and (w, -w“) convert zr into <r/(a-1) and (I -iv) respectively. Now if a, B, y, éare any real integers such that 115-H-y=1, the substitution o, (aa>+5)/(-yt.>+6)1 can be compounded of (w, w+I) and (w, -nfl); the effect on a will be the same as if we appl a corresponding substitution compounded of [<r, 11/(0-I)»} and £3 I -n]. But these are periodic and of order 3, 2 respectively]; therefore we cannot get more than six values of a, namely
0, I, ,' l J. fl, I,
is-1 I -a 17 a
and any symmetrical function of these will have the same value at any two equivalent places in the modular dissection (§ 33). Their sum is constant, but the sum of their squares may be put into the form
2(<f2-<1-4-1)
a'2(a - I)2 3
hence (af -a+ I)3 +<r'(¢r - I)2 has the same value at equivalent places. F. Klein writes
]=4C'12 ¢7 i'I)3 .
27<r2(¢r-1)2
this is a transcendental function of w. which is a special case of a Fuchsian or automorphic function. It is an analytical function of q2, and may be expanded in the form
J=;%n~2+v44+¢1q2+f2q4+ . .z
where cl, c2, &c., are rational integers.
69. Suppose, now, that a, b, c, d are rational integers, such that dv(a, , b, c, d)=I and ad-bc=n, a positive integer. Let (aw+b)/ (cw-f-d) =w'; then the equation ](w') = ](w) is satisfied if and only if w'—w, that is, if there are integers a, [3, 'y,5suchthata5-]3'y===1, and (uw-i-71) (7w+5) " (€w+<i)(¢1w-HS) =0-If we write, b(n) =nII(I +p'1), where the product extends to all prime factors (p) of n, it is found that the values of w fall into ¢(n) equivalent sets, so that when w is given there are not more than //(n) different values of ](w'). Putting ](w')=]', ](;v)=], we have a modular equation
f1(]'» J) =0
symmetrical in J, ]', with integral coefficients and of degree , b(n). Similarly when dv(a, b, c, d) =r we have an equation f-, (], ]) =o of order 1p(n/-rf); hence the complete modular equation for transformations of the nth order is
the degree of which is (n), the sum of the divisors of n. . Now if in F(J′,J) we put J′,J, the result is a polynomial in J alone, which we may call G(]). To every linear factor of G corresponds a class of quadratic forms of determinant (K2-4n) where »<'<4n and K is an integer or zero: conversely from every such form we can derive a linear factor (J - a) of G. Moreover, if with each form we associate its weight (§ 41) we find that with the notation of § 39 the degree of G is precisely EH(4n-xz)-en, Where 6f, =I when n is a square, and is zero in other cases. But this degree may be found in another way as follows. A complete representative set of transformations of order n is given by w'=(aw+b)/d, with ad=n, 0Eb<d; hence
Gt1>=11 no-
and by substituting for]'(w) and I their values in terms of q, we find that the lowest term in the factor expressed above is either q 2/1728 or q"l° /'1/1728, or a constant, according as a<d, a>d or a=d. I-fence if v is the order of G(]), so that its expansion in q begins with a term in q'2" we must have V =z<1 -d) +2 fi) =zd+>:a
d > V n d> V n a>V n
d> V n
= 22d
extending to all divisors of n which exceed √ n. Comparing this with the other value, we have
H(41Z-K2)=22d+€n =(n) +(n),
as stated in § 39.
70. Each of the singular moduli which are the roots of G(J) =0 corresponds to exactly one primitive class of definite quadratic forms, and conversely.
Corresponding to every given negative determinant -A there is an irreducible equation /(j) =0, where j = 1728 ], the coefficients of which are rational integers, and the degree of which is h(-A). The coefficient of the highest power of j is unity, so that j is an arithmetical integer, and its conjugate values belong one to each primitive class of determinant -A. By adjoining the square roots of the prime factors of A the function , b(j) may be resolved into the product of as many factors as there are genera of primitive classes, and the degree of each factor is equal to the number of classes in each genus. In particular, if {I, I, § (A+x)} is the only reduced form for the determinant -A, the value of j is a real negative rational cube. At the same time its approximate value is exp - zri-ii;-Q +744 = 744-effV A, so that, approximately, e"V 4=m3-I-744 where m is a rational integer. For instance effv 43=8847367 3-9997775 . .= 9503+744 very nearly, and for the class (I, I, 11) fihe exact value of j is-9603. Four and only four other similar determinants are known to exist, namely -11, -19, ~67, ~-163, although thousands have been classified. According to Hermite the decimal part of e1fV 163 begins with twelve nines; in this case Weber has shown that the exact value ofj is -218- 3-53-233-293.
71. The function(n3 is the most fundamental of 2 set of quantities called class-invariants. Let (a, b, 6) be the representative of any class of definite quadratic forms, and let w be the root of ax2-{-bx-I-c=0 which has a positive imaginary part; then F (w) is said to be a class invariant for (a, b, c) if FC'-;%' -§)=F(w) for all real integers a B, »y, 5 such that aB-/3'y=I. This is true for j(w) whatever w may be, and it is for this reason that j is so fundamental. But, as will be seen from the above examples, the value of j soon becomes so large that its calculation is impracticable. Moreover, there is the difficulty of constructing the modular equation f1(], j') =0 (§ 69), which has only been done in the cases when n =2, 3 (the latter by Smith in Proc. Lon-i. Math. Soc. ix. p. 242). For moderate values of A the difficulty can generally be removed by constructing algebraic functions of j. Suppose we have an irreducible equation
x"'-{-¢:1x""'1+ . . . -I-c, ,.=o,
the coefficients of which are rational functions ofj(w). If we apply any modular substitution w'=S(w), this leaves the equation unaltered, and consequently only per mutates the roots among themselves: thus if x, (<») is any definite root we shall have x1(w') = x;(w), where i may or may not be equal to 1. The group of unitary substitutions which leave all the roots unaltered is a factor of the complete modular group. If we put y =x(nw), y will satisfy an equation similar to that which defines x, with j' written forj; hence, since ], ]' are connected by the equation, (j, j') =o, there will be an equation m//(x, y) =o satisfied by x and y. By suitably choosing x we can in many cases find gl/(x, y) without knowing fig, j'); and then the equation 1//(x, x) =0 defines a set of singular mo uli, each one of which belongs to a certain value of w and all the quantities derived from it by the substitutions which leave x(w) unaltered. As one of the simplest examples, let n=2, xi*-j(w) =y3-j (w') =0. Then the equation connecting x, y in its complete form is of the ninth degree in each variable; but it can be proved that it has a rational factor, namely
y”-x”;v2+495xy+x=*-2* - 3” - 5“=0.
and if in this we put x=y=u, the result is 114-2u3-495u2+24.33.5“=o,
the roots of which are 12, 2O, ' 15, - 15. It remains to find the values of w, to which they belong. Writing 'yg(w) = “/j, it is found that we may define 'yz in such a way that 'y2(w+1)=e"2ffi/3'y2(w), 1/2(-w )='y2(w), whence it is found that aw Hg - f?(~/5+1f1+B5-5512)
W2 (~, ¢, ,+5):e 'v2(w).
We shall therefore have 7z(2w) =-y2(w) for all values of such that 2<-> =3%%<15~/37 = I, '15 +ve -l-B5-551250 (mod 3). Putting (a, B, 'y, 5)=(o, -1, 1, 0) the conditions are satisfied, and 2w=i/2. Now j(i)=1725, so that ~/2(z)=12; and since j(w) is positive for a pure imaginary, '~/2(z/ 2) =20. The remaining case is settled by putting
°;=°;+¢',
2 'yw +5
with a, 13, 'y, 6 satisfying the same conditions as before. One solution is (-1, 2, 1, 1) and hence w2+3w-}-4=O, so that 72 =-15. Besides 'Yz, other irrational invariants which have been used with effect are 'v3==~/ (j-1725), the moduli K, »<', their square and fourth roots, the functions f, fi, fz defined by f=2i('<'<')"I”' fi = VK' ~f1 fi = if '<-f» and the function 1;(nw)/q(w) where 1;(w) is defined by +°° $2-l-s I 2 of °°
= nr E (-)S 3 =L9 -, - =q1'2II(I -gh). n(<»> 9 oo I Q, ,311(3 3) I
72. Another powerful method, developed by C. F. Klein and K. E. R. Fricke, proceeds by discussing the deficiency of fi(j, j') =o considered as representing a curve. If this deficiency is zero, j and j may be expressed as rational functions of the same parameter, and this replaces the modular equation in the most convenient manner. For instance, when n =7, we may put
1= f- 2+'3'+“"lf'2+5*+'>' =¢<f>, f' =¢<f'>, -r-r' = 49.
The corresponding singular moduli are found by solving ¢»§ r) = ¢(-r'). For dehciency I we may find in a similar way two auxiliary functions x, y connected by some simple equation ul/(x, y) =0 not exceeding the fourth degree, and such that j, j' are each rational functions of x and y.
Hurwitz has extended this field of research almost indefinitely, not only by generalising the formulae for class-number sums, such as that in § 69, but also by bringing the modular-function theo ry into connexion with that of- algebraic correspondence and Abelian integrals. A comparatively simple example may help to lndicate the nature of these researches. From the formulae given at the beginning of § 67, we can deduce, by actual multiplication of the corresponding series,
+°° - “l'°° = .;
f;e'uo»=e=we°1e..= is lslqe/.><iq»” “iff .., , "ZX(m)qm/4 ['m='I| 5191 ' ' °
where
x<m> =>2 Isl
extended over all the representations m = E” -|-4112. In a' similar way ? 9'l10l.0 = 0009102601 = 22 ('- I)¥('”'1) X gm/9 E 9'1;901 = 0009109012 = Z( I) f(m“I)x(m) gm/4 If, now, we write
Mm):E (- 1)}(Z1)X(m) gm/2, jim) =22(- I)i(1;;1)X(m)gm/4, j3(w) =2E qm/4
we shall have
di1:dj2:dj3'=610:001:600
where610, 001, 000, are connected by the relation (§ 67) 0104'l'9014 '600 4 =0
which represents, in homogeneous co-ordinates, a quartic curve of deficiency 3. For this curve, or any equivalent algebraic figure, j, (¢0), j2(w) and j3(<») supply an independent set of Abelian integrals of the first kind. If we put x=/»<, y=/f<', it is found that d d d
§§ -=a].<w>. § =a2<<»>. $=iJ.<»»>.
so that the integrals which the algebraic theory gives in connexion with x4-l-y4-I=0 are directly
provided that we put x=/ n(w).
Other functions occur in this theory analogous to j1(w), but such that in the q-series which are the expansions of them the coefficients and exponents depend on representations of numbers by quaternary quadratic forms.
73. In the Berliner Sitzungsberzchte for the period 1883-1890, L. Kronecker published a very important series of articles on elliptic functions, which contain many arithmetical results of extreme elegance; some of these Kronecker had announced without proof many years before. A few will be quoted here, without any attempt at demonstration; but in order to understand them, it will be necessary to bear in mind two definitions. The first relates to the identified with jifw), j.(<.,), j, <..,), Legendre-]acobi symbol If a, b have a common factor we put (9 =0; while if a is odd and b=2"c, where c is odd, we put n
(lg) = . The other definition relates to the classification of discriminants of quadratic forms. If D is any number that can be such a discriminant, we must have DE0 or I (mod. 4), and in every case we can write D=D0Q2, where Q2 is a square factor of D, and D0 satisfies one of the following conditions, in which P denotes a product of different odd primes:-
D0=P, with PEI (mod 4)
=4P, PE - 1 (mod 4)
=8P, PE =f= 1 (mod 4)
are called fundamental discriminants; every discriminant is uniquely expressible as the product of a fundamental discriminant and a positive integral square. Now let D1, D2 be any two discriminants, then D1D2 is also a discriminant, and we may put D1D2=D=D0Q2, where Do is -fundamental: this being done, we shall have
lf f 'fi (Bti) (Df)
Do
Do
Numbers such as~ D0
T2 2 - 4-Ffhk)
D) T (Dil (Qi)
=l2 E-Fam* bmnc2
2 a b Cl;< + A m, " in < + + n)
wherewearetotake h, k=1,2, 3, . .+°°; m, n=0, =*=I, *2, . .. *oo except that, if D<0, the case m=n=o is excluded, and that, if D>o, (2am+bn)T§ nU Where (T, U) is the least positive solution of T2-DU2=4. The sum 2 applies to a system of representative a, b, c
primitive forms (a, b, c) for the determinant D, chosen so that a is prime to Q, and b, 5 are each divisible by all the prime factors of Q. A is any number prime to 2D and representable by (a, b, 6); and finally -r=2, 4, 6, I according as D'<-4, D= -4, D= 'Eg of D>0-The function F is quite arbitrary, subject only to the conditions that F(xy) =F(x)F(y), and that the sums on both S1d€3S are convergent. By putting F(x) =x-I-P, where P is a real positive quantity, it can be deduced from the foregoing that, if D2 is not a square, and if Di is different from 1,
2
fH(DiQ')H(D2Q“)=Lt 2: E (Q) <am2+1>m»+¢»2>-=-P p=o a, b, c A "'~" m
where the function H(d) is defined as follows for any discriminant d:- d = -A <0 fH<d> =-f% h< -A)
hon T+U/d
1> ° Hld) “ 2Ti1Og' l"- Uv E h(d) meaning the number of primitive forms for the determinant d. This is a generalisation of a theorem due to Dirichlet.
Then is another formula which, in a certain sense, is the generalisation of Gauss’s sums (§ 62) in cyclotomy. Let ψ(u, v) denote the function θ, ,(u+v)+¢9m(u)60i(v) and let D1, D2 be an two fundamental discriminants such that D1D2 is also fundamental and negative: then
τθ
(%> (Ei) =” f.ie>+ (s) i.#.@#<“m“+»”"r>
where, on the left-hand side, we are to sum for sg = 1, 2, 3 . . [Dali and on the right we are to take a complete set of representative primitive forms (a, b, c) for the determinant D, D2, and give to m, n all positive and negative integral values such that am'+bmn-I-cn' is odd. The quantity -r is 2, if D1Dq< -4, -r= if D, D2=-4, 'r=6 if DiD1= ~3. By putting D2=1, we obtain, after some easy transformations, ¢=a
5; 13) iQ <=4~/A fl-/am2+bmn+cn2
s=l (S sn A ~Fw2Z2g ),
which holds for any fundamental discriminant -A. For instance, taking to =iK'/K, andA=3, we have 0,02 =2»<K/1r, and 2gi('"2+m"'*"2)= &<;'Qsn ifg; a verification is afforded by making 2K approach the value rr, in which case q, K vanish, while the limit of qi/K is }, whence the limiting value of sni;-€ is that of 6g%/KJ 3, which =6/4»/ 3 =/3/2, as it should be.
Several of Kronecker's formulae connect the solution of the Pellian equation with elliptic modular functions: one example may be given here. Let D be a positive discriminant of the form 8n+ 5, let (T, U) be the least solution of T'-DU2=1: then, if h(D) is the number of primitive classes for the determinant D, (T- U~/ D)h(D) =l1(2»<»<')2
where the product on the right extends to a certain sixth part of those values of 2|<»<' which are singular, and correspond to the field S2(/ -D), or in other words are connected with the class invariant j(/ -D). For instance, if D =5, the equation to find (m<')2 is 4.5{(»<»<')2-1}3+(25+13/ 5)“(f<»<')'=o one root of which is given by (2m<')2=9-4/'5=T-U/ 5 which is right, because in this case h(D) = 1.
74. Frequency of Primes.-The distribution of primes in a finite interval (a, a-l-b) is very irregular, if we change a and keep b constant. Thus if we put n!=y, the numbers, .t+2, ;r+3, . (/.t-l-n-I) are all composite, so that we can form a run of consecutive composite numbers as extensive as we please; on the other hand, there is possibly no limit to the number of cases in which p and p-l-2 are both primes. Legendre was the first to find an approximate formula for F(x), the number of primes not exceeding x. He found by induction F(x) =x + (log, , x - 1 -08366)
which answers fairly well when x lies between foo and I,000,000, but becomes more and more inaccurate as x increases. Gauss found, by theoretical considerations (which, however, he does not explain), the approximate formula d
= x
F(x) L(x) 2 Eg x
(where, as in all that follows, log x is taken to the base e). This value is ultimately too large, but when x exceeds a million it is nearer the truth than the value given by Legendre's formula. By a singularly profound and original analysis, Riemann succeeded in finding a ormula, of the same type as Gauss's, but more exact for very large values of x. In its complete form it is very complicated; but, by omitting terms which ultimately vanish (for sufficiently large values of x) in comparison with those retained, the formula reduces to
F(x)=A+E(-I)“iL(x;””) (m=I, 2, 3, 5, 6. 7, II, - . -) m WL
where the summation extends to all positive imegfa,1 values of m which have no square factor, and /r is the number of different prime factors of rn, with the convention that when m=1, (- 1)# = 1. The symbol A denotes a constant, namely
®
(-1)# f dx
A z m Xl* zxixz-1) logxi
and L is used in the sense given above. P. L. Tchébichev obtained some remarkable results on the frequency of primes by an ingenious application of Stirling's theorem. One of these is that there will certainly be (k-I-1) primes between a and b. provided that
a<%~2x/IJ-Q R log 6 (log b)2-${(4k+25) -ggi where R=%; log 2+§ log 3+§ log 5-, Q log 3o=o-921292 .... From this may be inferred the truth of Bertrand's conjecture that there is always at least one prime between a and (za.-2) if;>a>7. l'chébichev's results were generalized and made more precise by Sylvester.
The actual calculation of the number of primes ina given interval may be effected by a formula constructed and used by D. F. E Meissel. The following table gives the values of F(n) for various values of n, according to Meisse1's determinations:- n F(n)
20,000 2,262
100,000 9,592
500,000 41,538
I,0f)0,000 78,498
Riemann's analysis mainly depends upon the properties of the function
s“(S>=§ ln" (n=1, 2, 3, ~ -),
considered as a function of the complex variable s. The above definition is only valid when the real part of s exceeds 1; but it can be generalized by writing
where the integral is taken from x='+oo along the axis of real quantities to x=e, where e is a very small positive quantity, then round a circle of radius f and centre at the origin, and finally from x=e to x=+°0 along the axis of real quantities. This function g'(z) is of great importance, and has been recently studied by von Mangoldt Landau and others.
Reference has alread been made to the fact that if l, m are coprimes the linear form fic--m includes an infinite number of primes. Now let (a, b, c) be any primitive quadratic form with a total generic character C; and let lx-|-m be a primitive linear form chosen so that all its values have the character C. Then it has been proved by Weber and Meyer that (a, b, c) is capable of representing an infinity of primes all of the linear form lx-4-m.
75. Arithmetical Functions. -This term is applied to symbols such as ¢(u), rI>(n), &c., which are associated with n by an intrinsic arithmetical definition. The function (I)(1z) was written fn by Euler, who investigated its properties, and by proving the formula oo -l-oc
II(1-g') = Eq§ <3f'-l-J) deduced the result that I “GO
s2+s
fn =f(n- 1) ~-f(n-2) -f(n-5)-. . . =E(-1)'-If where on the right hand we are to take all positive values of s such that 7%-%(3S2iS) is not negative, and to interpret fo as n, if this term occurs. ]. Liouville makes frequent use of this function in his papers, but denotes it by § '(n).
If the quantity x is positive and not integral, the symbol E(x) or xl is used to denote the integer (including zero) which is obtained by omitting the fractional part of x; thus E(/ 2) =1, E(o-7) =o, and so on. For some purposes it is convenient to extend the definition by putting E(-x) = - E(x), and agreeing that when x is a positive integer, E(x) =x-é; it is then possible to find a Fourier sine-series representing x-E(x) for all real values of x. The function E(x) has many curious and important properties, which have been investigated by Gauss, Hermite, Hacks, Prin sheim, Stern and others. What is perhaps the simplest roof of the iaw of quadratic reciprocity depends upon the fact that if p, q are two odd primes, and we put p=2h+1, q=2k~|-I
r=h s=k
2 is (52) + z E (iii) =hk=i(p- mg- 1)
r=r 9 s=1 Q
the truth of which is obvious, if we rule a rectangle 5” Xq” into 'unit squares, and draw its diagonal. This formula is auss's, but the geometrical proof is due to Eisenstem. Another useful formula is 1==m - I y
23 E (x -I-~) =E(mx) - E(.x), which is due to Hermite. 1=1 M
Various other arithmetical functions have been devised for particular purposes; two that deserve mention (both due to Kronecker) are 5;, ;, , which means 0 or 1 according as h, k are unequal or equal, and sgn x, which means x+
76. Transcendental Numbers.-It has been proved by Cantor that the aggregate of all algebraic numbers is countable. Hence immediately follows the proposition (first proved by Liouville) that there are numbers, both real and complex, which cannot be defined b any combination of a finite number of equations with rationafiintegral coefficients. Such numbers are said to be transcendental. Hermite first completely proved the transcendent character of e; and Lindemann, by a similar method, proved the transcendence of -rr. Thus it is now finally established that the quadrature of the circle is impossible, not only by rule and compass, but even with the help of any number of algebraic curves of any order when the coefficients in their equations are rational (see Hermite, C.R. lxxvii., 1873, and Lindemann, Math. Ann. xx., 1882). Another number which is almost certainly transcendent is Euler's constant C. It may be convenient to give here the following numerical values:
π | =3.14159 | 25535 | 89793 | 23846. . . |
e | =2.71828 | 18284 | 59045 | 23536. . . |
C | =0.57721 | 56649 | 01532 | 8606065. . . (Gauss-Nicolai) |
log10 = (π log10e) | =0.13493 | 41840. . . | (Weber), | |
the last of which is useful in calculating class-invariants. |
77. Miscellaneous Investigations.—The foregoing articles (§§ 24-76) give an outline of what may be called the analytical theory of numbers, which is mainly the work of the 19th century, though many of the researches of Lagrange, Legendre and Gauss, as well as all those of Euler, fall within the 18th. But after all, the germ of this remarkable development is contained in what is only a part of the original Diophantine analysis, of which, beyond question, Fermat was the greatest master. The spirit of this method is still vigorous in Euler; but the appearance of Gauss’s Disquisitiones arithmetical in 1801 transformed the whole subject, and gave it a new tendency which was strengthened by the discoveries of Cauchy, Jacobi, Eisenstein and Dirichlet. In recent times Edouard Lucas revived something of the old doctrine, and it can hardly be denied that the Diophantine method is the one that is really germane to the subject. Even the strange results obtained from elliptic and modular functions must somehow be capable of purely arithmetical proof without the use of infinite series. Besides this, the older arithmeticians have announced various theorems which have not been proved or disproved, and made a beginning of theories which are still in a more or less rudimentary stage. As examples of the latter may be mentioned the partition of numbers (see Numbers, Partition of, below), and the resolution of large numbers into their prime factors.
The general problem of partitions is to find all the integral solutions of a set of linear equations Σcixi=mi with integral coefficients, and fewer equations than there are variables. The solutions may be further restricted by other conditions-for instance, that all the variables are to be positive. This theory was begun by Euler: Sylvester gave lectures on the subject, of which some portions have been preserved; and various results of great generality have been discovered by P. A. MacMahon. The author last named has also considered Diophantine inequalities, a simple problem in which is “to enumerate all the solutions of 7x⪖13y in positive integers.”
The resolution of a given large number into its prime factors is still a problem of great difficulty, and tentative methods have to be applied. But a good deal has been done by Seelhoff, Lucas, Landry, A. J. C. Cunningham and Lawrence to shorten the calculation, especially when the number is given in, or can be reduced to, some particular form.
It is well known that Fermat was led to the erroneous conjecture (he did not affirm it) that 2m+1 is a prime whenever m is a power of 2. The first case of failure is when m=32; in fact 232+1 ≡ 0 (mod 641). Other known cases of failure are m=2n, with n=6, 12, 23, 26 respectively; at the same time, Eisenstein asserted that he had proved that the formula 2m+1 included an infinite number of primes. His proof is not extant; and no other has yet been supplied. Similar difficulties are encountered when We examine Mersenne’s numbers, which are those of the form 2p−1, with p a prime; the known cases for which a Mersenne number is prime correspond to p= 2, 3, 5, 7, 13, 17, 19, 31, 61.
A perfect number is one which, like 6 or 28, is the sum of its aliquot parts. Euclid proved that 2p−1 (2p−1) is perfect when (2p−1) is a prime: and it has been shown that this formula includes all perfect numbers which are even. It is not known whether any odd perfect numbers exist or not.
Friendly numbers (numeri amicabiles) are pairs such as 220, 284, each of which is the sum of the aliquot parts of the other. No general rules for constructing them appear to be known, but several have been found, in a more or less methodical way.
78. In conclusion it may be remarked that the science of arithmetic (q.v.) has now reached a stage when all its definitions, processes and results are demonstrably independent of any theory of variable or measurable quantities such as those postulated in geometry and mathematical physics; even the notion of a limit may be dispensed with, although this idea, as well as that of a variable, is often convenient. For the application of arithmetic to geometry and analysis, see Function.
Authorities.—W. H. and G. E. Young, The Theory of Sets of Points (Cambridge, 1906; contains bibliography of theory of aggregates); P. Bachmann, Zahlentheorie (Leipzig, 1892; the most complete treatise extant); Dirichlet-Dedekind, Vorlesungen über Zahlentheorie (Braunschweig, 3rd and 4th ed., 1879, 18); K. Hensel, Theorie der algebraischen Zahlen (Leipzig, 1908); H. J. S. Smith, Report on the Theory of Numbers (Brit. Ass. Rep., 1859–1863, 1865, or Coll. Math. Papers, vol. i.); D. Hilbert, “Bericht über die Theorie der algebraischen Zahlkörper” (in Jahresber. d. deutschen Math.-Vereinig., vol. iv., Berlin, 1897); Klein-Fricke, Elliptische Modulfunctionen (Leipzig, 1890-1892); H. Weber, Elliptische Functionen u. algebraische Zahlen (Braunschweig, 1891). Extensive bibliographies will be found in the Royal Society’s Subject Index, vol. i. (Cambridge, 1908) and Encycl. d. math. Wissenschaften, vol. i. (Leipzig, 1898). (G. B. M.)