Next Contents

1.1. ORBITS

Even a gas-free galaxy is a formidably complex dynamical system: ∼ 1011 stars and inconceivably more WIMPS are strongly coupled by their mutual gravitational field. As in any branch of theoretical physics we make progress by simplifying. The key simplification is that the motion of all these particles would be very similar if calculated in the gravitational field of an imaginary smoothed-out version of the galaxy. That is, we smear the mass of each particle over something like an interstellar distance and compute the gravitational field from this smooth mass distribution. Given the usefulness of this approximate gravitational field, our first task becomes to understand how particles move in smooth gravitational fields of this kind.

1.1.1. Quasiperiodicity and isolating integrals

The equations of motion of a particle in a typical gravitational potential Φ are readily integrated numerically. If the potential is axisymmetric, the component of angular momentum about the galaxy's symmetry axis, Lz, is conserved, and we can eliminate the azimuthal angle φ and its time derivative from the equations of motion, leaving us with two coupled ordinary differential equations to integrate:

Equation 1

(1.1)

where

Equation 2

(1.2)

Fig. 1.1 shows a couple of typical orbits obtained in this way. They have a nicely regular pattern of paths running to and fro. Given sufficient time such an orbit will carry the particle to any given point inside the bounding envelope, and when it reaches that point the particle will be moving with one of four velocities: dot{R} = ± vR and dot{z} = ± vz, where vR and vz are numbers that depend on the orbit and the location.

Figure 1

Figure 1.1. The motion in the (R, z) plane of a particle that orbits in an axisymmetric gravitational potential. The orbits have the same values of E and Lz but are nevertheless very different because they have different values of the mysterious “third integral”.

If the time series R(t) or z(t) obtained in this way is Fourier transformed, one finds that the spectrum consists of discrete lines, and moreover, any frequency appearing in the spectrum can be expressed as an integer linear combination of two fundamental frequencies ΩR and Ωz. That is,

Equation 3

(1.3)

where

Equation 4

(1.4)

with nR and nz being (possibly negative) integers. Thus the orbit is characterised by the two fundamental frequencies and countably many amplitudes Rn, zn and phases ψR nz n. An orbit with these properties is said to be quasiperiodic.

As the particle moves in the (R, z) plane, it rotates about the symmetry axis: its azimuthal coordinate φ can be found by simple quadrature:

Equation 5

(1.5)

The time series of φ will consist of sinusoidal contributions at discrete frequencies superimposed on a secularly increasing value, and the rate of the secular increase defines a third fundamental frequency Ωφ. Thus three fundamental frequencies will be required to express the frequency of any line in the spectrum of say x(t) as an integer linear combination of fundamental frequencies.

The result of integrating the equations of motion in potentials that are not axisymmetric cannot be displayed as nicely as in Fig. 1.1. But when you Fourier transform the time series x(t) of the coordinates of such orbits, you usually find that the spectra consist of lines that can be expressed as integer linear combinations of three fundamental frequencies. So a large fraction of orbits in galaxy-like potentials are quasiperiodic.

An integral of motion is a function I(x, v) on phase space that returns the same number no matter at what point along an orbit you evaluate it:

Equation 6

(1.6)

We say that I is an isolating integral if the set of phase-space points that satisfy equation (1.6) for some given value on the right side defines a smooth five-dimensional subset of phase space. For example, the energy E(x, v) = 1/2 v2 + Φ(x) is an isolating integral.

If an orbit is quasiperiodic, one may show that it admits at least three functionally independent isolating integrals. We can take E to be one of these, and in the case of an axisymmetric potential, Lz = xvyyvx can be taken to be another of them. Only in exceptional potentials do we have an analytic expression for a third isolating integral, but its existence is assured by the quasiperiodic nature of orbits – for a proof see Arnold (1978).

1.1.2. Angles and actions

It often happens that the key to solving a physics problem is to identify the coordinate system that's best suited to that problem. Hamiltonian mechanics allows us to use an extremely wide range of coordinates for phase space with ease – all “canonical coordinates” are intrinsically equal. In the following we shall denote by (x, p) a canonical system made up of coordinate xi that gives the star's position and its canonically conjugate momentum pi. In the simplest case xi is a Cartesian coordinate and pi = ẋi is its rate of change, but in some instances pi ≠ ẋi. Given that three isolating integrals Ii exist, a shrewd question to ask is “is there a canonical coordinate system in which the Ii, or some functions of them, act as momenta?” Since any function J(I1, I2, I3) of the Ii will itself be an integral of motion, we usefully increase our chances of our enquiry having the answer “yes” by opening the enquiry up to functions of our original integrals.

It turns out that only a highly restricted group of integrals are capable of playing the role of momenta. We call such integrals actions and reserve for them the letter J, so Jj(x, p) is the jth action of an orbit.

To understand why only special integrals can act as momenta, we should consider the subset M of points in phase space that satisfy

Equation 7

(1.7)

These equations impose three restrictions on the six phase-space coordinates (x, p). So the set M is a three-dimensional subset of phase space to which the particle is confined for all time. M is a subset of the energy hypersurface H(x, p) = E, where H is the Hamiltonian function and E is the particle's energy. If the orbit is bound, as we shall assume, the energy hypersurface is compact, so M is compact also. In practice M will also be a connected set, and it can then be shown that it is diffeomorphic to a 3-torus (see §§49, 50 of Arnold, 1978, for a proof).

What is a 3-torus? Think of a room with each point on the floor identified with the point on the ceiling that is vertically above it, each point on the left wall identified with the corresponding point on the right wall, and with points on the front and back walls similarly identified. Position within this room (M) is specified by the values of the coordinates θ1, θ2 and θ3 that are canonically conjugate to J1, J2 and J3, whose values specify which room (torus) we are in. They do so in a way that can be understood geometrically.

In Hamiltonian mechanics a fundamental role is played by the Poincaré invariant P. This is a number that we assign to any two-surface S in phase space through

Equation 8

(1.8)

so P(S) is the sum of the areas of the projections of S onto all the planes formed by each coordinate xi and its conjugate momentum pi. It's called an invariant because you get the same number no matter what canonical coordinates you use. Consequently, given any two-surface SM we can replace (xi, pi) by (θi, Ji) and write

Equation 9

(1.9)

and this vanishes because every point of S has the same value of Ji. In view of this fact we say M is null. It follows from the nullness of M that the value of any line integral ∫ dx · p through M between two given endpoints is the same for any path between those points that can be continuously distorted into some standard path without leaving M. To see why, take the difference between the integrals along two different paths Γ1 and Γ2 between a given pair of points. This difference of integrals is identical with the line integral along the closed path Γ1 − Γ2 in which we go out on Γ1 and back on Γ2. By Green's theorem this closed line integral is equal to the Poincaré invariant of the 2-surface that Γ2 sweeps out as it is deformed into Γ1. But this Poincaré invariant vanishes, so the original line integrals were equal. Consider now the line integral ∫ dx · p along the path from the front wall to the corresponding point on the back wall. We choose to evaluate this integral using the (θ, J) coordinates, and to take the path on which θ23=constant. Then we have

Equation 10

(1.10)

so J1 is equal to ∫ dx · p divided by the amount by which θ1 increments as we cross the room. We choose to scale the actions so θi increments by 2π on crossing the room, so

Equation 11

(1.11)

where Γi is any path that takes us once across the room.

If we want to find the values of the ordinary phase-space coordinates (x, p) at the point θ we need to do the following: (i) Choose a point (x0, p0) in the room and declare it to be θ = 0. Now integrate from the initial conditions x = x0, p = p0 when θ1 = 0 the coupled o.d.e.s

Equation 12

(1.12)

where [..] denotes a Poisson bracket, to discover the (x, p) coordinates of the points θ = (θ1, 0, 0). (ii) Starting from any of these points we can integrate the o.d.e.s

Equation 13

(1.13)

to discover the (x, p) coordinates of the points θ = (θ1, θ2, 0). (iii) Starting from any of the last-mentioned points we can integrate the third set of o.d.e.s to find the (x, p) coordinates of a general point θ. The key property of actions is that if we integrate say the first set of o.d.e.s from a point θa on one wall to the point θb at which the trajectory hits the opposite wall, we find that θb = θa + (2π, 0, 0) (Fig. 1.2 left). That is the trajectories generated by actions carry you from a point on one wall to the point on the opposite wall that we have identified with our starting point. If we form some other integrals Ii(J) by adopting three non-trivial functions of the actions, the trajectories generated by the Ii will not carry us from a point on a wall to the point on the opposite wall that has been identified with it (Fig. 1.2 right). Consequently, the variables that are canonically conjugate to the Ii cannot form a global coordinate system (although they can be used as coordinates locally).

Figure 2

Figure 1.2. Trajectories generated by an action (left) and by some other integral (right). In the left panel each horizontal curve joins points on the left and right boundaries that have been identified, whereas in the right panel the leftmost and rightmost points of these curves are at logically distinct points. Consequently, in the right panel the periodic continuation of each horizontal curve would not overlie the portion plotted. The vertical curves exhibit the same phenomenon. Consequently these curves define a global coordinate system only in the left panel.

In remembrance of the fact that θa and θb = θa + (2π, 0, 0) correspond to the same point in phase space, the canonically conjugate coordinates of actions are called angle variables. To see what flexibility we have in the choice of actions and angles, we observe that if we have new variables θ′ that are related to our original angle variables by linear equations

Equation 14

(1.14)

where n is a matrix with integer entries, then if any of the θ′i is incremented by 2π, all the θj will change by 2mπ, so we will return to the same place in phase space. Hence the θ′ provide a global coordinate system for a torus as effectively as the θ. To discover what actions correspond to the θ′, we write down the generating function of the canonical transformation (θ, J) ↔ (θ′, J′)

Equation 15

(1.15)

It is straightforward to check that S generates equation (1.14) and we have also

Equation 16

(1.16)

Thus the new actions are integer linear combinations of the old actions.

Any function on phase space can be expressed as a function of the angles and actions (θ, J). In particular the Hamiltonian can be expressed in this form. But while the angle variables vary as a particle orbits, its energy does not. So the Hamiltonian cannot depend on θ. Hence the Hamiltonian is a function H(J) of the actions only. This fact makes the equations of motion of the angle variables trivial:

Equation 17

(1.17)

Since H depends on J only, so must its partial derivatives Ωi. Since J is constant along the orbit, Ωi is too, so we can immediately integrate equation (1.17):

Equation 18

(1.18)

Thus the particle moves through its room at constant speed along straight lines whose slope is given by Ω = ∂ H / ∂ J.

Usually the frequencies are incommensurable and then the line will eventually come arbitrarily close to every point in the room (Fig. 1.3 left).

Figure 3

Figure 1.3. A particle moves through its room (torus) on a straight line that repeatedly hits a wall and reappears at the corresponding point on the opposite wall. If the components of the slope vector are rationally related, the line eventually closes on itself (right). In general it does not close and comes arbitrarily close to every point in the room (left). Black dots mark the start and end of the plotted portion of the trajectory.

Since the ordinary phase-space coordinates are periodic functions of the angle variables with period 2π they can be expanded in Fourier series

Equation 19

(1.19)

where the sum is over vectors with integer components and the Xin and ψin are constant amplitudes and phases. When we substitute into this expression our solution (1.18) of the equations of motion, we obtain

Equation 20

(1.20)

Equation (1.3) from which we started is an instance of this equation. Thus we have come full circle from the empirical fact that numerically integrated orbits are quasiperiodic to the fact that such spectra are a necessary consequence of these orbits having three isolating integrals.

In the generic case of incommensurable frequencies, we have a useful result, the time-averages theorem: when the frequencies are incommensurable, the fraction of its time a particle spends in a subset V of the torus is V d3 θ / (2π)3. From this the strong Jeans theorem follows: in a steady-state galaxy, the density of stars is uniform within incommensurable tori, so the density of stars in phase space is a function f(J) of the actions only. We call f the distribution function, often abbreviated DF.

It is useful to think of orbits as points in the three-dimensional space, action space, that has the three actions as is Cartesian coordinates. The strong Jeans theorem tells us that galactic equilibria are simply distributions of stars in this easily imagined space.

1.1.3. Epicycle approximation

It's instructive to examine a very useful model at this point. A star in an axisymmetric potential moves in the effective potential (1.2). It is physically obvious that the minimum of Φeff occurs at (Rg, 0), where the guiding-centre radius Rg is the radius of the circular orbit with the given angular momentum Lz. Since this is the potential's minimum, a Taylor expansion of Φeff around it will contain no linear terms and be of the form

Equation 21

(1.21)

Stars on sufficiently circular orbits will be confined to the region in which we need retain only the first three terms in this series. Consequently, their radial and vertical motions will be harmonic. The frequencies of these oscillations are Ωr = κ, the epicycle frequency and Ωz = ν, the vertical epicycle frequency. The solution to the equations of motion is

Equation 22

(1.22)

where X, ψr, Z and ψz are all constants. Clearly we can set

Equation 23

(1.23)

Then calculating pR = dot{R} and evaluating ∮d R pR we can show that X = √2Jr / κ, and similarly that Z = √2Jz / ν, so in the epicycle approximation the connection between ordinary coordinates and angle-action variables is

Equation 24

(1.24)

Next Contents