Quaternions as a practical instrument of physical research


 I

Introduction

It is a curious phenomenon in the History of Mathematics that the greatest work of the greatest Mathematician of the century which prides itself upon being the most enlightened the world has yet seen, has suffered the most chilling neglect.

The cause of this is not at first sight obvious. We have here little to do with the benefit provided by Quaternions to Pure Mathematics. The reason for the neglect here may be that Hamilton himself has developed the Science to such an extent as to make successors an impossibility. One cannot however resist a strong suspicion that were the subject even studied we should hear more from Pure Mathematicians, of Hamilton’s valuable results. This reason at any rate cannot be assigned for the neglect of the Physical side of Quaternions. Hamilton has done but little in this field, and yet when we ask what Mathematical Physicists have been tempted by the bait to win easy laurels (to put the incentive on no higher grounds), the answer must be scarcely one. Prof. Tait is the grand exception to this. But well-known Physicist though he be, his fellow-workers for the most part render themselves incapable of appreciating his valuable services by studying the subject if at all only as dilettanti. The number who read a small amount in Quaternions is by no means small, but those who get further than what is recommended by Maxwell as imperatively necessary are but a small percentage of the whole.

I cannot help thinking that this state of affairs is owing chiefly to a prejudice. This prejudice is well seen in Maxwell’s well-known statement—“I am convinced that the introduction of the ideas, as distinguished from the operations and methods of Quaternions, will be of great use to us in all parts of our subject.”2 Now what I hold and what the main object of this essay is to prove is that the “operations and methods” of Quaternions are as much better qualified to deal with Physics than the ordinary ones as are the “ideas”.

But, what has produced this notion, that the subject of Quaternions is only a pretty toy that has nothing to do with the serious work of practical Physics? It must be the fact that it has hitherto produced few results that appeal strongly to Physicists. This I acknowledge, but that the deduction is correct I strongly disbelieve. As well might an instrument of which nobody has attempted to master the principles be blamed for not being of much use. Workers naturally find themselves while still inexperienced in the use of Quaternions incapable of clearly thinking through them and of making them do the work of Cartesian Geometry, and they conclude that Quaternions do not provide suitable treatment for what they have in hand. The fact is that the subject requires a slight development in order readily to apply to the practical consideration of most physical subjects. The first steps of this, which consist chiefly in the invention of new symbols of operation and a slight examination of their chief properties, I have endeavoured to give in the following pages.

I may now state what I hold to be the mission of Quaternions to Physics. I believe that Physics would advance with both more rapid and surer strides were Quaternions introduced to serious study to the almost total exclusion of Cartesian Geometry, except in an insignificant way as a particular case of the former. All the geometrical processes occurring in Physical theories and general Physical problems are much more graceful in their Quaternion than in their Cartesian garb. To illustrate what is here meant by “theory” and “general problem” let us take the case of Elasticity treated below. That by the methods advocated not only are the already well-known results of the general theory of Elasticity better proved, but more general results are obtained, will I think be acknowledged after a perusal of Section 7 to Section 11 below. That Quaternions are superior to Cartesian Geometry in considering the general problems of (1) an infinite isotropic solid, (2) the torsion and bending of prisms and cylinders, (3) the general theory of wires, I have endeavoured to shew in Section 11–Section 13. But for particular problems such as the torsion problem for a cylinder of given shape, we require of course the various theories specially constructed for the solution of particular problems such as Fourier’s theories, complex variables, spherical harmonics, &c. It will thus be seen that I do not propose to banish these theories but merely Cartesian Geometry.

So mistaken are the common notions concerning the pretensions of advocates of Quaternions that I was asked by one well-known Mathematician whether Quaternions furnished methods for the solution of differential equations, as he asserted that this was all that remained for Mathematics in the domain of Physics! Quaternions can no more solve differential equations than Cartesian Geometry, but the solution of such equations can be performed as readily, in fact generally more so, in the Quaternion shape as in the Cartesian. But that the sole work of Physical Mathematics to-day is the solution of differential equations I beg to question. There are many and important Physical questions and extensions of Physical theories that have little or nothing to do with such solutions. As witness I may call attention to the new Physical work which occurs below.

If only on account of the extreme simplicity of Quaternion notation, large advances in the parts of Physics now indicated, are to be expected. Expressions which are far too cumbrous to be of much use in the Cartesian shape become so simple when translated into Quaternions, that they admit of easy interpretation and, what is perhaps of more importance, of easy manipulation. Compare for instance the particular case of equation (15m) Section 9 below when F = 0 with the same thing as considered in Thomson and Tait’s Nat. Phil., App. C. The Quaternion equation is

ρ1S1Ψ D wΔ = 0.

The Cartesian exact equivalent consists of Thomson and Tait’s equations (7), viz.

d dx 2dw dA dα dx + 1 + dw db dα dz + dw dc dα dy + d dy 2dw dB dα dy + dw da dα dz + dw dc dα dx + 1 + d dz 2dw dC dα dz + dw da dα dy + dw db dα dx + 1 = 0,

and two similar equations.

Many of the equations indeed in the part of the essay where this occurs, although quite simple enough to be thoroughly useful in their present form, lead to much more complicated equations than those just given when translated into Cartesian notation.

It will thus be seen that there are two statements to make good:—(1) that Quaternions are in such a stage of development as already to justify the practically complete banishment of Cartesian Geometry from Physical questions of a general nature, and (2) that Quaternions will in Physics produce many new results that cannot be produced by the rival and older theory.

To establish completely the first of these propositions it would be necessary to go over all the ground covered by Mathematical Physical Theories, by means of our present subject, and compare the proofs with the ordinary ones. This of course is impossible in an essay. It would require a treatise of no small dimensions. But the principle can be followed to a small extent. I have therefore taken three typical theories and applied Quaternions to most of the general propositions in them. The three subjects are those of Elastic Solids, with the thermodynamic considerations necessary, Electricity and Magnetism, and Hydrodynamics. It is impossible without greatly exceeding due limits of space to consider in addition, Conduction of Heat, Acoustics, Physical Optics, and the Kinetic Theory of Gases. With the exception of the first of these subjects I do not profess even to have attempted hitherto the desired applications, but one would seem almost justified in arguing that, since Quaternions have been found so applicable to the subjects considered, they are very likely to prove useful to about the same extent in similar theories. Again, only in one of the subjects chosen, viz., Hydrodynamics, have I given the whole of the general theory which usually appears in text-books. For instance, in Electricity and Magnetism I have not considered Electric Conduction in three dimensions which, as Maxwell remarks, lends itself very readily to Quaternion treatment, nor Magnetic Induction, nor the Electro-Magnetic Theory of Light. Again, I have left out the consideration of Poynting’s theories of Electricity which are very beautifully treated by Quaternions, and I felt much tempted to introduce some considerations in connection with the Molecular Current theory of Magnetism. With similar reluctance I have been compelled to omit many applications in the Theory of Elastic Solids, but the already too large size of the essay admitted of no other course. Notwithstanding these omissions, I think that what I have done in this part will go far to bear out the truth of the first proposition I have stated above.

But it is the second that I would especially lay stress upon. In the first it is merely stated that Cartesian Geometry is an antiquated machine that ought to be thrown aside to make room for modern improvements. But the second asserts that the improved machinery will not only do the work of the old better, but will also do much work that the old is quite incapable of doing at all. Should this be satisfactorily established and should Physicists in that case still refuse to have anything to do with Quaternions, they would place themselves in the position of the traditional workmen who so strongly objected to the introduction of machinery to supplant manual labour.

But in a few months and synchronously with the work I have already described, to arrive at a large number of new results is too much to expect even from such a subject as that now under discussion. There are however some few such results to shew. I have endeavoured to advance each of the theories chosen in at least one direction. In the subject of Elastic Solids I have expressed the stress in terms of the strain in the most general case, i.e. where the strain is not small, where the ordinary assumption of no stress-couple is not made and where no assumption is made as to homogeneity, isotropy, &c. I have also obtained the equations of motion when there is given an external force and couple per unit volume of the unstrained solid. These two problems, as will be seen, are by no means identical. In Electrostatics I have considered the most general mechanical results flowing from Maxwell’s theory, and their explanation by stress in the dielectric. These results are not known, as might be inferred from this mode of statement, for to solve the problem we require to know forty-two independent constants to express the properties of the dielectric at a given state of strain at each point. These are the six coefficients of specific inductive capacity and their thirty-six differential coefficients with regard to the six coordinates of pure strain. But, as far as I am aware, only such particular cases of this have already been considered as make the forty-two constants reduce at most to three. In Hydrodynamics I have endeavoured to deduce certain general phenomena which would be exhibited by vortex-atoms acting upon one another. This has been done by examination of an equation which has not, I believe, been hitherto given. The result of this part of the essay is to lead to a presumption against Sir William Thomson’s Vortex-Atom Theory and in favour of Hicks’s.

As one of the objects of this introduction is to give a bird’s-eye view of the merits of Quaternions as opposed to Cartesian Geometry, it will not be out of place to give side by side the Quaternion and the Cartesian forms of most of the new results I have been speaking about. It must be premised, as already hinted, that the usefulness of these results must be judged not by the Cartesian but by the Quaternion form.

Cartesian form of some of the results to follow

Elasticity

Let the point (x,y,z) of an elastic solid be displaced to (x,y,z). The strain at any point that is caused may be supposed due to a pure strain followed by a rotation. In Section III. below, this pure strain is called ψ. Let its coordinates be e, f, g, a2, b2, c2; i.e. if the vector (ξ,η,ζ) becomes (ξ,η,ζ) by means of the pure strain, then

ξ = eξ + 1 2cη + 1 2bζ,  &c., &c.

Thus when the strain is small e, f, g reduce to Thomson and Tait’s 1 + e, 1 + f, 1 + g and a, b, c are the same both in their case and the present one. Now let the coordinates of Ψ, Section 9 below, be E, F, G, A2, B2, C2. Equation (15), Section 9 below, viz.3

Ψω = ψ2ω = χχω = 1Sρ1ρ2Sω2,

gives in our present notation

E = e2 + c24 + b24 = (dxdx)2 + (dydx)2 + (dzdx)2,                           &c., &c. A = a(f + g) + bc2 = 2{(dxdy)(dxdz) + (dydy)(dydz) + (dzdy)(dzdz)},                           &c., &c.

which shew that the present E, F, G, A2, B2, C2 are the A, B, C, a, b, c of Thomson and Tait’s Nat. Phil., App. C.

Let us put

J xyz x y z = J J yz y z = J11, &c., &c., J zx y z = J12,J yz z x = J21, &c., &c., &c., &c.

I have shown in Section 8 below that the stress-couple is quite independent of the strain. Thus we may consider the stress to consist of two parts—an ordinary stress PQRSTU as in Thomson and Tait’s Nat. Phil. and a stress which causes a couple per unit volume LMN. The former only of these will depend on strain. The result of the two will be to cause a force (as indeed can be seen from the expressions in Section 7 below) per unit area on the x-interface P, U + N2, T M2, and so for the other interfaces. If L, M, N be the external couple per unit volume of the unstrained solid we shall have

L = LJ,M = MJ,N = NJ,

for the external couple and the stress-couple are always equal and opposite. Thus the force on the x-interface becomes

P,U N2J,T + M2J

and similarly for the other interfaces.

To express the part of the stress (P &c.) which depends on the strain in terms of that strain, consider w the potential energy per unit volume of the unstrained solid as a function of E &c. In the general thermodynamic case w may be defined by saying that

w × ( the element of volume) = ( the intrinsic energy of the element) ( the entropy of the element ×its absolute temperature ×Joule’s coefficient).

Of course w may be, and indeed is in Section 8, Section 8 below, regarded as a function of e &c.

The equation for stress is (15b) Section 9 below, viz.,

Jϕ¯ω = 2χΨ D wχω = 2ρSρωS 1Ψ D w 2.

The second of the expressions is in terms of the strain and the third in terms of the displacement and its derivatives. In our present notation this last is

JP 2 = dx dx 2 dw dE + dx dy 2 dw dF + dx dz 2 dw dG beginalign1ex] + 2dx dy dx dz dw dA + 2dx dz dx dx dw dB + 2dx dx dx dy dw dC,  &c., &c. JS 2 = dy dx dz dx dw dE + dy dy dz dy dw dF + dy dz dz dz dw dG beginalign1ex] + dy dy dz dz + dy dz dz dy dw dA + dy dz dz dx + dy dx dz dz dw dB beginalign1ex] + dy dx dz dy + dy dy dz dx dw dC,  &c., &c.

In Section 8 I also obtain this part of the stress explicitly in terms of e, f, g, a, b, c, of w as a function of these quantities and of the axis and amount of rotation. But these results are so very complicated in their Cartesian shape that it is quite useless to give them.

To put down the equations of motion let Xx, Y x, Zx be the force due to stress on what before strain was unit area perpendicular to the axis of x. Similarly for Xy, &c. Next suppose that X, Y , Z is the external force per unit volume of the unstrained solid and let D be the original density of the solid. Then the equation of motion (15n) Section 9 below, viz. 

Dρ̈ = F + τΔ,

gives in our present notation

X + dXxdx + dXydy + dXzdz = ẍD, &c., &c.

It remains to express Xx &c. in terms of the displacement and LMN. This is done in equation (15l) Section 9 below, viz4

τω = 2ρS1Ψ D wω + 3V MV ρρSω 122S123Sρρρ.

In our present notation this consists of the following nine equations:

Xx = 2 dw dE dx dx + dw dC dx dy + dw dB dx dz + J12N J13M 2J , Y x = 2 dw dE dy dx + dw dC dy dy + dw dB dy dz + J13L J11N 2J , Zx = 2 dw dE dz dx + dw dC dz dy + dw dB dz dz + J11M J12L 2J ,

and six similar equations.

We thus see that in the case where LMN are zero, our present Xx, Xy, Xz are the PQR of Thomson and Tait’s Nat. Phil. App. C (d), and therefore equations (7) of that article agree with our equations of motion when we put both the external force and the acceleration zero.

These are some of the new results in Elasticity, but, as I have hinted, there are others in Section 8, Section 8 which it would be waste of time to give in their Cartesian form.

Electricity

In Section IV. below I have considered, as already stated, the most general mechanical results flowing from Maxwell’s theory of Electrostatics. I have shewn that here, as in the particular cases considered by others, the forces, whether per unit volume or per unit surface, can be explained by a stress in the dielectric. It is easiest to describe these forces by means of the stress.

Let the coordinates of the stress be PQRSTU. Then F1F2F3 the mechanical force, due to the field per unit volume, exerted upon the dielectric where there is no discontinuity in the stress, is given by

F1 = dPdx + dUdy + dTdz, &c., &c.

and (l,m,n) being the direction cosines of the normal to any surface, pointing away from the region considered

F = [lP + mU + nT]a []b, &c., &c.,

where a, b indicate the two sides of the surface and F1, F2, F3 is the force due to the field per unit surface.

It remains to find P &c. Let X, Y , Z be the electro-motive force, α, β, γ the displacement, w the potential energy per unit volume and Kxx, Kyy, Kzz, Kyz, Kzx, Kxy the coefficients of specific inductive capacity. Let 1 + e, 1 + f, 1 + g, a2, b2, c2 denote the pure part of the strain of the medium. The K’s will then be functions of e &c. and we must suppose these functions known, or at any rate we must assume the knowledge of both the values of the K’s and their differential coefficients at the particular state of strain in which the medium is when under consideration. The relations between the above quantities are

4πα = KxxX + KxyY + KzxZ, &c., &c. w = (Xα + Y β + Zγ)2 = (KxxX2 + K yyY 2 + K zzZ2 + 2K yzY Z + 2KzxZX + 2KxyXY )8π.

It is the second of these expressions for w which is assumed below, and the differentiations of course refer only to the K’s. The equation expressing P &c. in terms of the field is (21) Section IV below, viz. 

ϕω = 1 2V DωE Ψ D wω,

which in our present notation gives the following six equations

P = 1 2(αX + βY + γZ) dwde, &c., &c., S = 1 2(βZ + γY ) dwda, &c., &c.

I have shown in Section 14–Section 14 below that these results agree with particular results obtained by others.

Hydrodynamics

The new work in this subject is given in Section VI.—“The Vortex-Atom Theory.” It is quite unnecessary to translate the various expressions there used into the Cartesian form. I give here only the principal equation in its two chief forms, equation (9) Section 25 and equation (11) Section 25, viz. 

P + v σ22 + (4π)1(Sστu + umt)d𝔰 = H, P + v σ22 + (4π)1{d𝔰Su(V στ mσ) + ud(md𝔰)dt} = H.

In Cartesian notation these are

dpρ + V + q22 (4π)1{2[(x x)(wη vζ) + + ]r3 + (ct)r}dxdydz = H. dpρ + V + q22 (4π)1{(x x)[2(wη vζ) cu] + + }r3.dxdydz (4π)1{d(cdxdydz)dt}r = H.

The fluid here considered is one whose motion is continuous from point to point and which extends to infinity. The volume integral extends throughout space. The notation is as usual. It is only necessary to say that H is a function of the time only, r is the distance between the points x,y,z and x,y,z;

c = dudx + dvdy + dwdz;

ddt is put for differentiation which follows a particle of the fluid, and t for that which refers to a fixed point.

The explanation of the unusual length of this essay, which I feel is called for, is contained in the foregoing description of its objects. If the objects be justifiable, so must also be the length which is a necessary outcome of those objects.