Boltzmann distributions

We have gained some understanding of the macroscopic properties of the air around us. For instance, we know something about its internal energy and specific heat capacity. How can we obtain some information about the statistical properties of the molecules which make up air? Consider a specific molecule: it constantly collides with its immediate neighbour molecules, and occasionally bounces off the walls of the room. These interactions ``inform'' it about the macroscopic state of the air, such as its temperature, pressure, and volume. The statistical distribution of the molecule over its own particular microstates must be consistent with this macrostate. In other words, if we have a large group of such molecules with similar statistical distributions, then they must be equivalent to air with the appropriate macroscopic properties. So, it ought to be possible to calculate the probability distribution of the molecule over its microstates from a knowledge of these macroscopic properties.

We can think of the interaction of a molecule with the air in a classroom as analogous to the interaction of a small system $A$ in thermal contact with a heat reservoir $A'$. The air acts like a heat reservoir because its energy fluctuations due to any interactions with the molecule are far too small to affect any of its macroscopic parameters. Let us determine the probability $P_r$ of finding system $A$ in one particular microstate $r$ of energy $E_r$ when it is in thermal equilibrium with the heat reservoir $A'$.

As usual, we assume fairly weak interaction between $A$ and $A'$, so that the energies of these two systems are additive. The energy of $A$ is not known at this stage. In fact, only the total energy of the combined system $A^{(0)} = A + A'$ is known. Suppose that the total energy lies in the range $E^{(0)}$ to $E^{(0)} + \delta E$. The overall energy is constant in time, since $A^{(0)}$ is assumed to be an isolated system, so

$\begin{displaymath} E_r + E' = E^{(0)}, \end{displaymath}$

(369)

where $E'$ denotes the energy of the reservoir $A'$. Let ${\mit\Omega}'(E')$ be the number of microstates accessible to the reservoir when its energy lies in the range $E'$ to $E'+\delta E$. Clearly, if system $A$ has an energy $E_r$ then the reservoir $A'$ must have an energy close to $E'=E^{(0)} - E_r$. Hence, since $A$ is in one definite state (i.e., state $r$), and the total number of states accessible to $A'$ is ${\mit\Omega}'(E^{(0)} - E_r)$, it follows that the total number of states accessible to the combined system is simply ${\mit\Omega}'(E^{(0)} - E_r)$. The principle of equal a priori probabilities tells us that the probability of occurrence of a particular situation is proportional to the number of accessible microstates. Thus,

$\begin{displaymath} P_r = C' \,{\mit\Omega}'(E^{(0)} - E_r), \end{displaymath}$

(370)

where $C'$ is a constant of proportionality which is independent of $r$. This constant can be determined by the normalization condition

$\begin{displaymath} \sum_r P_r = 1, \end{displaymath}$

(371)

where the sum is over all possible states of system $A$, irrespective of their energy.

Let us now make use of the fact that system $A$ is far smaller than system $A'$. It follows that $E_r\ll E^{(0)}$, so the slowly varying logarithm of $P_r$ can be Taylor expanded about $E' = E^{(0)}$. Thus,

$\begin{displaymath} \ln P_r = \ln C' +\ln {\mit\Omega}'(E^{(0)}) -\left[\frac{\partial \ln {\mit\Omega}'} {\partial E'} \right]_0 E_r +\cdots. \end{displaymath}$

(372)

Note that we must expand $\ln P_r$, rather than $P_r$ itself, because the latter function varies so rapidly with energy that the radius of convergence of its Taylor series is far too small for the series to be of any practical use. The higher order terms in Eq. (372) can be safely neglected, because $E_r\ll E^{(0)}$. Now the derivative

$\begin{displaymath} \left[\frac{\partial \ln {\mit\Omega}'}{\partial E'} \right]_0 \equiv \beta \end{displaymath}$

(373)

is evaluated at the fixed energy $E' = E^{(0)}$, and is, thus, a constant independent of the energy $E_r$. In fact, we know, from Sect. 5, that this derivative is just the temperature parameter $\beta = (k\,T)^{-1}$ characterizing the heat reservoir $A'$. Hence, Eq. (372) becomes

$\begin{displaymath} \ln P_r = \ln C' + \ln {\mit\Omega}'(E^{(0)}) - \beta\, E_r, \end{displaymath}$

(374)

giving

$\begin{displaymath} P_r = C \exp(-\beta \,E_r), \end{displaymath}$

(375)

where $C$ is a constant independent of $r$. The parameter $C$ is determined by the normalization condition, which gives

$\begin{displaymath} C^{-1} = \sum_r \exp(-\beta \,E_r), \end{displaymath}$

(376)

so that the distribution becomes

$\begin{displaymath} P_r = \frac{\exp(-\beta\, E_r)}{\sum_r \exp(-\beta \,E_r)}. \end{displaymath}$

(377)

This is known as the Boltzmann probability distribution, and is undoubtedly the most famous result in statistical physics.
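As a small numerical illustration of Eqs. (375)-(377), the sketch below evaluates the normalized probabilities for a list of state energies. The function name `boltzmann_probabilities` and the three-state example (energies 0, 1, 2 in units where $\beta = 1$) are assumptions made purely for illustration; they do not come from the notes.

```python
import math

def boltzmann_probabilities(energies, beta):
    """Return P_r = exp(-beta*E_r) / sum_r exp(-beta*E_r) for each state r."""
    # Shift by the lowest energy so the largest exponent is 0. The shift
    # cancels between numerator and denominator, so P_r is unchanged.
    e_min = min(energies)
    weights = [math.exp(-beta * (e - e_min)) for e in energies]
    norm = sum(weights)  # plays the role of 1/C in Eq. (376), up to the shift
    return [w / norm for w in weights]

# Hypothetical three-state system with energies 0, 1, 2 (units where beta = 1).
probs = boltzmann_probabilities([0.0, 1.0, 2.0], beta=1.0)
print(probs)
```

The probabilities sum to one by construction, and the ratio of any two of them depends only on the energy difference between the corresponding states.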

The Boltzmann distribution often causes confusion. People who are used to the principle of equal a priori probabilities, which says that all microstates are equally probable, are understandably surprised when they come across the Boltzmann distribution, which says that high energy microstates are markedly less probable than low energy states. However, there is no need for any confusion. The principle of equal a priori probabilities applies to the whole system, whereas the Boltzmann distribution only applies to a small part of the system. The two results are perfectly consistent. If the small system is in a microstate with a comparatively high energy then the rest of the system (i.e., the reservoir) has a slightly lower energy than usual (since the overall energy is fixed). The number of accessible microstates of the reservoir is a very strongly increasing function of its energy. It follows that when the small system has a high energy then significantly fewer states than usual are accessible to the reservoir, so the number of microstates accessible to the overall system is reduced, and, hence, the configuration is comparatively unlikely. The strong increase in the number of accessible microstates of the reservoir with increasing $E'$ gives rise to the strong (i.e., exponential) decrease in the likelihood of a state $r$ of the small system with increasing $E_r$. The exponential factor $\exp(-\beta \,E_r)$ is called the Boltzmann factor.
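The consistency argument can be checked numerically on a toy reservoir. The sketch below assumes, purely for illustration, that the reservoir is a collection of identical harmonic oscillators sharing an integer number of energy quanta (an Einstein-solid model that is not part of these notes), so that ${\mit\Omega}'$ is a binomial coefficient. It compares the exact ratio ${\mit\Omega}'(E^{(0)} - E_r)/{\mit\Omega}'(E^{(0)})$ with the Boltzmann factor $\exp(-\beta\,E_r)$, where $\beta$ is estimated from the reservoir alone via Eq. (373). The sizes `N_OSC` and `Q_TOTAL` are arbitrary choices, and energies are measured in quanta.

```python
import math

def ln_omega(n_osc, q):
    """ln of the number of ways to share q quanta among n_osc oscillators."""
    # Omega' = (q + n_osc - 1)! / (q! (n_osc - 1)!), evaluated via log-gamma.
    return math.lgamma(q + n_osc) - math.lgamma(q + 1) - math.lgamma(n_osc)

N_OSC = 10**6        # assumed reservoir size
Q_TOTAL = 2 * 10**6  # assumed total energy E^(0), in quanta

# beta = d ln Omega' / dE' at E' = E^(0), estimated by a central difference.
beta = (ln_omega(N_OSC, Q_TOTAL + 1) - ln_omega(N_OSC, Q_TOTAL - 1)) / 2.0

rows = []
for e_r in range(6):  # small-system energies E_r << E^(0)
    exact = math.exp(ln_omega(N_OSC, Q_TOTAL - e_r) - ln_omega(N_OSC, Q_TOTAL))
    approx = math.exp(-beta * e_r)
    rows.append((e_r, exact, approx))
    print(e_r, exact, approx)
```

For these sizes the two columns should agree closely, because the neglected second-order term in Eq. (372) is of order $E_r^{\,2}/E^{(0)}$ in these units.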

The Boltzmann distribution gives the probability of finding the small system $A$ in one particular state $r$ of energy $E_r$. The probability $P(E)$ that $A$ has an energy in the small range between $E$ and $E+\delta E$ is just the sum of all the probabilities of the states which lie in this range. However, since each of these states has approximately the same Boltzmann factor, this sum can be written

$\begin{displaymath} P(E) = C \,{\mit\Omega}(E)\,\exp(-\beta E), \end{displaymath}$

(378)

where ${\mit\Omega}(E)$ is the number of microstates of $A$ whose energies lie in the appropriate range. Suppose that system $A$ is itself a large system, but still very much smaller than system $A'$. For a large system, we expect ${\mit\Omega}(E)$ to be a very rapidly increasing function of energy, so the probability $P(E)$ is the product of a rapidly increasing function of $E$ and another rapidly decreasing function (i.e., the Boltzmann factor). This gives a sharp maximum of $P(E)$ at some particular value of the energy. The larger system $A$, the sharper this maximum becomes. Eventually, the maximum becomes so sharp that the energy of system $A$ is almost bound to lie at the most probable energy. As usual, the most probable energy is evaluated by looking for the maximum of $\ln P$, so

$\begin{displaymath} \frac{\partial \ln P}{\partial E} = \frac{\partial \ln {\mit\Omega}}{\partial E} -\beta =0, \end{displaymath}$

(379)

giving

$\begin{displaymath} \frac{\partial \ln {\mit\Omega}}{\partial E} = \beta. \end{displaymath}$

(380)

Of course, this corresponds to the situation in which the temperature of $A$ is the same as that of the reservoir. This is a result which we have seen before (see Sect. 5). Note, however, that the Boltzmann distribution is applicable no matter how small system $A$ is, so it is a far more general result than any we have previously obtained.
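To see how the maximum of $P(E)$ in Eq. (378) sharpens as $A$ grows, the following sketch assumes a power-law density of states, ${\mit\Omega}(E) \propto E^{\,f}$, where $f$ stands in for the number of degrees of freedom. This model, and the helper names `ln_p`, `most_probable_energy`, and `relative_width`, are illustrative assumptions rather than part of the notes. Under this assumption Eq. (380) gives a most probable energy of $f/\beta$, and the width of the peak relative to its position should shrink roughly as $f^{-1/2}$.

```python
import math

def ln_p(energy, f, beta):
    """ln P(E) up to an additive constant, assuming Omega(E) ~ E**f."""
    return f * math.log(energy) - beta * energy

def most_probable_energy(f, beta, n_grid=20001):
    """Locate the maximum of ln P by a brute-force scan over 0 < E <= 2 f / beta."""
    e_max = 2.0 * f / beta
    grid = [e_max * (i + 1) / n_grid for i in range(n_grid)]
    return max(grid, key=lambda e: ln_p(e, f, beta))

def relative_width(f, beta):
    """Gaussian width of the peak divided by the peak position."""
    e_star = f / beta  # solves d ln Omega / dE = beta for this model, Eq. (380)
    h = 1e-3 * e_star
    # Second derivative of ln P at the peak, by a central finite difference.
    curvature = (ln_p(e_star + h, f, beta) - 2.0 * ln_p(e_star, f, beta)
                 + ln_p(e_star - h, f, beta)) / h**2
    return math.sqrt(-1.0 / curvature) / e_star

results = {}
for f in (10, 1000, 100000):
    results[f] = (most_probable_energy(f, 1.0), relative_width(f, 1.0))
    print(f, results[f])
```

The scan only locates the peak to within one grid spacing, which is enough to compare it with $f/\beta$. The relative width falls by about a factor of ten for each hundredfold increase in $f$.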

Richard Fitzpatrick 2006-02-02