1. Introduction
The laws of thermodynamics are fundamental to the present understanding of nature [1, 2]. It is not surprising then to find they have a very wide range of applications beyond their original scope, such as to gravitation. The analogy between properties of black holes and thermodynamics could be extended to a complete correspondence, since a black hole in free space had been shown to radiate thermally with a temperature T=κ/2π, where κ is the surface gravity. One should be able to assign an entropy to a black hole given by SH=AH/4 where AH is the surface area of the black hole [3]. In the nineteenth century, the problem of reconciling time asymmetric behavior with time symmetric microscopic dynamics became a central issue in this area of physics [4]. Lord Kelvin wrote about the subjection of physical phenomenon to microscopic dynamical law. If then the motion of every particle of matter in the universe were precisely reversed at any instant, the course of nature would be simply reversed for ever after [5]. Physical processes, on the other hand, are irreversible, such as conduction of heat and diffusion processes [6, 7]. It subsequently became apparent that not only is there no conflict between reversible microscopic laws and irreversible microscopic behavior, but there are extremely strong reasons to expect the latter from the former. There are many reasons; for example, there exists a great disparity between microscopic and macroscopic scales and the fact that the events we observe in the macroworld are determined not only by the microscopic dynamics but also by the initial conditions or state of the system.
In the twentieth century, it became clear that the microworld was described by a different kind of physics along with mathematical ideas that need not be taken into account in describing the macroworld. This is the subject of quantum mechanics. Even though the new quantum equations have similar symmetry properties as their classical counterparts, it also reveals numerous phenomena that can contribute at this level to the problems mentioned above. These physical phenomena which play various roles include the phenomenon of quantum entanglement, the effect of decoherence in general, and the theory of measurements as well.
The purpose of this is to study the subject of entropy as it applies to quantum mechanics [8, 9]. Its definition is to be relevant to very small systems at the atomic and molecular level. Its relationship to entropies known at other scales can be examined. It is also important to relate this information from this new area of physics to the older and more established theories of thermodynamics and statistical physics [10, 11, 12, 13, 14, 15]. To summarize, many good reasons dictate that the arrow of time is specified by the direction of increase of the Boltzmann entropy, the von Neumann macroscopic entropy. To relate the quantum Boltzmann approach to irreversibility to measurement theory, the measuring apparatus must be included as a part of the closed quantum mechanical system.
Advertisement
2. Entropy and quantum mechanics
Boltzmann’s great insight was to connect the second law of thermodynamics with phase space volume. This he did by making the observation that for a dilute gas, log∣ΓM∣ is proportional up to terms negligible compared to the system size, to the thermodynamic entropy of Clausius. He then extended his insight about the relation between thermodynamic entropy and log∣ΓM∣ to all macroscopic systems, no matter what their composition. This gave a macroscopic definition of the observationally measureable entropy of equilibrium macroscopic systems. With this connection established, he generalized it to define an entropy for systems not in equilibrium.
Clearly, the macrostate Mx is determined by x, a point in phase space, and there are many such points, in fact a continuum, which correspond to the same M. Let ΓM then be the region in Γ consisting of all microstates x corresponding to a given macrostate M. Boltzmann associated with each microstate x of a macroscopic system M a number SB, which depends only on Mx, such that up to multiplicative and additive constants is given by
Sx=SBMx=kBlog∣ΓM∣.E1
This S is called the Boltzmann entropy of a classical system. The constant kB=1.38⋅10−16 erg/K is called Boltzmann’s constant, and if temperature is measured in ergs instead of Kelvin, it may be set to one. Boltzmann argued that due to large differences in the sizes of ΓM, SBxt will typically increase in a way which explains and describes the evolution of physical systems towards equilibrium.
The approach of Gibbs, which concentrates primarily on probability distributions or ensembles, is conceptually different from Boltzmann’s. The entropy of Gibbs for a microstate x of a macroscopic system is defined for an ensemble density ρx to be
SGρ=−kB∫Γρxlogρxdx.E2
In (2), ρx is the probability for the microscopic state of the system to be found in the phase space volume element dx. Suppose ρx is taken to be the generalized microcanonical ensemble associated with a macrostate M
ρMx=ΓM−1,x∈Γ;0,otherwise.E3
Then clearly
SGρM=kBlog∣ΓM∣=SBM.E4
The probability density for the system in the equilibrium macrostate ρMeq is the same as that for the microcanonical and equivalent to the canonical or grandcanonical ensemble when the system is of macroscopic size. The time development of SB and SG subsequent to some initial time when ρ=ρM is very different unless M=Meq when there is no further systematic change in M or ρ. In fact, SGρ never changes in time as long as x evolves according to Hamiltonian evolution, so ρ evolves according to the Liouville equation. Then SG does not give any indication that the system is evolving towards equilibrium. Thus the relevant entropy for understanding the time evolution of macroscopic systems is SB and not SG.
From the standpoint of mathematics, these expressions for classical entropies can be unified under the heading of the Boltzmann-Shannon-Gibbs entropy [16]. A very general form of entropy which includes those mentioned can be defined in a mathematically rigorous way. To do so, let ΩAμ be a finite measure space, ν a probability measure that is absolutely continuous with respect to μ, and its Radon-Nikodym derivative dν/dμ exists. The generalized BSG entropy is defined to be
SBSG=∫dνdμ⋅logdνdμdν,E5
when the integrand is integrable.
This includes the classical Boltzmann-Gibbs entropy when dμ and dν are given by
dμ=d3Npd3Nqℏ3N,dν=ρcldμ.E6
It also includes the Shannon entropy appearing in information theory in which
Ω=12…,μ1=μ2=…=1,νi=1.E7
In this case, (5) gives the entropy to be
S=−∑iρilogρi.E8
In attempting to translate these considerations to the quantum domain, it is immediately clear that a perfect analogy does not exist.
Although the situation is in many ways similar in quantum mechanics, it is not identical. The irreversible incompressible flow in phase space is replaced by the unitary evolution of wave functions in Hilbert space and velocity reversal of x by complex conjugation of the wave function. The analogue of the Gibbs entropy (2) of an ensemble is the von Neumann entropy of a density matrix ρ:
SvNρ=−kBTrρlogρ.E9
This formula was given by von Neumann. It generalizes the classical expression of Boltzmann and Gibbs to the realm of quantum mechanics. The density matrix with maximal entropy is the Gibbs state. The range of SvN is the whole of the extended real line 0∞, so to every number ζ with 0<ζ≤∞, there is a density matrix ρ such that SvNρ=ζ. Like the classical SGρ, this does not alter in time for an isolated system evolving under Schrödinger evolution. It has value zero whenever ρ represents a pure stare. Similar to SGρ, it is not most appropriate for describing the time symmetric behavior of isolated macroscopic systems. The Szilard engine composed of an atom is an example in which the entropy of a quantum object is made use of. von Neumann discusses the macroscopic entropy of a system, so a macrostate is described by specifying values of a set of commuting macroscopic observable operators Â, such as particle number, energy, and so forth, to each of the cells that make up the system corresponding to the eigenvalues aα, an orthogonal decomposition of the system’s Hilbert space H into linear subspaces Γ̂α in which the observables  take the values aα. Let Πα the projection into Γ̂α. von Neumann then defines the macroscopic entropy of a system with density matrix ρ˜ as
S˜macρ˜=kB∑α=1Npαρ˜log∣Γ̂α∣−kB∑α=1Npαρ˜logpαρ˜.E10
Here, pαρ˜ is the probability of finding the system with density matrix ρ˜ in the microstate Mα
pαρ˜=TrΠαρ˜,E11
and ∣Γ̂α∣ is the dimension of Γ̂α. An analogous definition is made for a system which is represented by a wave function Ψ; simply replace pαρ by pαΨ=ΨΠαΨ. In fact, ∣Ψ〉〈Ψ∣ just corresponds to a particular pure density matrix.
von Neumann justifies (10) by noting that
S˜macρ=−kBTrρ˜logρ˜=SvNρ˜,E12
for
ρ˜=∑αpα∣Γ̂α∣Πα,E13
and ρ˜ is macroscopically indistinguishable from ρ.
A correspondence can be made between the partitioning of classical phase space Γ and the decomposition of Hilbert space H and to define the natural quantum analogues to Boltzmann’s definition of SBM in (1) as
ŜBMα=kBlog∣Γ̂Mα∣E14
where ∣Γ̂Mα∣ is the dimension of Γ̂Mα. With definition (14) the first term on the right of (10) is just what would be stated for the expected value of the entropy of a classical system of whose macrostate we were unsure. The second part of (10) will be negligible compared to the first term for a macroscopic system, classical or quantum, and going to zero when divided by the number of particles.
Note the difference that in the classical case, the state of the system is described by x∈Γα for some α, so the system is always in one of the macrostates Mα. For a quantum system described by ρ or Ψ, this is not the case. There is no analogue of (1) for general ρ or Ψ. Even when the system is in a macrostate corresponding to a definite microstate at t0, only the classical system will be in a unique macrostate at time t. The quantum system will in general evolve into a superposition of different macrostates, as is the case in the Schrödinger Cat paradox. In this wave function, Ψ corresponding to a particular macrostate evolves into a linear combination of wave functions associated with very different macrostates. The classical limit is obtained by a prescription in which the density matrix is identified with a probability distribution in phase space and the trace is replaced by integration over phase space. The superposition principle excludes partitions of the Hilbert space: an orthogonal decomposition is all that is relevant.
2.1 Properties of entropy functions
Entropy functions have a number of characteristic properties which should be briefly described in the quantum case. The set of observables will be the bounded, self-adjoint operators with discrete spectra in a Hilbert space. The set of normal states can be taken to be the density operators or positive operators of trace one.
The entropy functional satisfies the following inequalities. Let λi>0 and ∑iλi=1. Then S has the concavity property:
S∑iλiρi≥∑iλiSρi,E15
with equality if all λi are equal.
Subadditivity holds with equality if and only if ρiρj=0, i≠j
S∑iλiρi≤∑iλiSρi−∑iλilogλi.E16
and
S∑iλiρi≤STBρ≤Sρ−∑kpklogpkE17
where the first equality holds iff TBρ=ρ and the second iff Sρk=Sρ for all k. The conditional entropy is defined to be
Sρ1ρ2=Trρ1logρ1−ρ1logρ2.E18
The formal expression will be interpreted as follows. If A,B are positive traceless operators with complete orthonormal sets of eigenstates ∣ai〉 and ∣bi〉, using a resolution of identity, ∑i〈ai∣AlogA∣ai〉=∑i,jaiAbjbjlogAai=∑i,jaiaibjlogaibjai so that
∑jbjAlogA−AlogB+B−Abj=∑jaiAlogA−AlogB+B−Aai=∑i,jaibi2ailogai−ailogbj+bj−ai=SAB.E19
Concavity of the function xlogx ensures the terms of the final sum are nonnegative. In order that Sρ1ρ2<∞, it is necessary that πρ1≤πρ2 where πW=suppW is the support projection of W, so ρ1<ρ2. From the definition, Sρ1ρ2≥0 with equality if ρ1=ρ2. If λρ1≤ρ2, for some λ∈01, Sρ1ρ2≤−logλ from operator monotony of logz. If ρ=∑iλiρi, then
Sρ=∑iλiSρi+∑iλiSρiρ,E20
which gives (15) and (16). If T is a trace-preserving operator, then ρ<Tρ, and
STρ=Sρ+SρTρ.E21
This is to say that T is entropy-increasing.
The concept of irreversibility is clearly going to be relevant to the subject at hand, so some thoughts related to it will be given periodically in what follows. A possible way to account for irreversibility in a closed system in nature is by the various types of course-graining. There are also strong reasons to suggest the arrow of time is provided by the direction of increase of the quantum form of the Boltzmann entropy. The measuring apparatus should be included as part of the closed quantum mechanical system in order to relate the quantum Boltzmann approach to irreversibility to the concept of a measurement. Let Sc be a composite system consisting of a macroscopic system S coupled to a measuring instrument I, so Sc=S+I, where I is a large but finite N-particle system. A set of course-grained mutually commuting extrinsic variables are provided whose eigenspaces correspond to the pointer positions of I. von Neumann’s picture of the measurement process is basic to the approach, but according to which, the coupling of S to I leads to the following effects. A pure state of S described by a linear combination ∑αcαψα of its orthonormal energy eigenstates is converted into a statistical mixture of these states for which cα2 is the probability of finding the system in state ψα. It also sends a certain set of classical or intercommuting, macroscopic variables M of I to values indicated by pointer readings that indicate which of the states is realized.
There is an amplification process of the S−I coupling where different microstates of S give rise to macroscopically different states of I. If I is designed to have readings which are in one-to-one correspondence with the eigenstates of S, it may be assumed index α of its microstates goes from 1 to n. Denote the projection operator for subspace K by Πα, then
ΠαΠβ=Παδαβ,∑αΠα=1Kα,E22
and each element of the abelian subalgebra of ℬ takes the form with Mα scalars
M=∑αMαΠα.E23
Define the projection operators:
πα=1⊗Πα,α=1,…,n.E24
Suppose A is measured on system S, initially in a state of the composite system described by a density matrix ρ. The value pα is obtained with probability τα=Trρπα. After the measurement, the state of the composite system is accounted for by the density matrix:
ρα=1ταπαρπα.E25
This is a mixture of states in each of which A has a definite value.
The transformation ρ→ρ˜=∑απαρπα may be viewed as a loss of information contained in non-diagonal terms ψαρπα′ with α≠β in ∑αα′παρπα′. When a sequence of measurements is carried out and a time evolution is permitted to occur between measurements leads one to assign to a sequence of events πα1t1πα2t2⋯παntn the probability distribution:
Pα=Trπα1tn⋯πα1t1ρπα1t1⋯παntn,E26
where ρ=ρ0, over the set of histories, where the πk satisfy (22) with Π replaced by the π. Let us define
Dα′α=Trπα1t1⋯παntnρπαntn⋯πα1t1.E27
The following definition can now be stated. A history is said to decohere if and only if
Dαα′=δα,α'ρα.E28
A state is called decoherent with respect to the set of πα if and only if
παρ0πβ=0,α≠β.E29
This implies that Trπα′ρπαA=0 for all α≠α′, which is equivalent to παρ=0 for all α. In contrast to infinite systems where there is no need to refer to a choice of projections, decoherent mixed states over the macroscopic observables can be described by relations between the density matrix and the projectors. They would be of the form ρm=∣Ψ〉〈Ψ∣ with ∣Ψ〉=∑αλαπαΦα such that ∑αλα2=1 and Φα∈H and satisfy
∑α≠α′πα′ρmπα+παρmπα′≠0.E30
The relative or conditional entropy between two states Sρ1ρ2 was defined in (18), and it plays a crucial role. It is worth stating a few of its properties, as some are necessary for the theorem:
Sρ1ρ2≥0.E31
Sρ1ρ2=0,ρ1=ρ2.E32
Sλρ1+1−λρ2λσ1+1−λσ2≤λSρ1σ1+1−λSρ2σ2.E33
When γ is a completely positive map, or embedding
Sρ1⋅γρ2⋅γ≤Sρ1ρ2.E34
The last two inequalities are known as joint concavity and monotonicity of the relative entropy. The following result may be thought of as a quantum version of the second law.
Theorem: Suppose the initial density matrix is decoherent at zero time (29) with respect to πα and have finite entropy
ρ0=∑απαρ0πα,Sρ0=−kBTrρ0logρ0<∞,E35
and it is not an equilibrium state of the system. Let ρtf, for tf>0, be any subsequent state of the system, possibly an equilibrium state. Then for an automorphic, unitary time evolution of the system between 0≤t≤tf
S0≤Stf,E36
where S0=Stf if and only if (e)∑α<βπαρtfπβ+πβρtfπα=0.
Proof: Set ρ′tf=∑απαρtfπα=ρtf⋅γ, so ρ′ is obtained from ρ by means of a completely positive map. It follows that
Sρ′tfρ′0=−Sρ′tf−kB∑αTrρtfπαlogρ0πα=−Stf−kBTr(ρtflogρ0≤Sρtfρ0=−Sρ0−Trρtflogρ0.E37
The first equality uses the cyclic property of the trace and the definition of ρ′. The second equality uses decoherence of ρ0, and the next inequality is a consequence of (34). The evolution is unitary and hence preserves entropy which is the last equality. This implies that St≥S0 and the equality condition (e) follows from (32). □
Of course, entropy growth as in the theorem is not necessarily monotonic in the time variable. For this reason, it is usual to refer to fixed initial and final states. For thermal systems, a natural choice of the final state is the equilibrium state of the system. It is the case in thermodynamics that irreversibility is manifested as a monotonic increase in the entropy. Thermodynamic entropy, it is thought, is related to the entropy of the states defined in both classical and quantum theory. Under an automorphic time evolution, the entropy is conserved. One application of an environment is to account for an increase. A type of course-graining becomes necessary together with the right conditions on the initial state to account for the arrow of time. In quantum mechanics, the course-graining seems to be necessary and may be thought of as a restriction of the algebra and can also be interpreted as leaving out unobservable quantum correlations. This may, for example, correspond to decoherence effects important in quantum measurements. Competing effects arise such as the fact that correlations becoming unobservable may lead to entropy increase. There is also the effect that a decrease in entropy might be due to nonautomorphic processes. Although both effects lead to irreversibility, they are not cooperative but rather contrary to one another. The observation that the second law does hold implies these nonautomorphic events must be rare in comparison with time scales relevant to thermodynamics.
Advertisement
3. Quantum mechanics and nonequilibrium thermodynamics
Some aspects of equilibrium thermodynamics are examined by considering an isothermal process. Since it is a quasistatic process, it may be decomposed into a sequence of infinitesimal processes. Assume initially the system has a Hamiltonian Hγ in thermal equilibrium at a temperature T. Boltzmann’s constant is set to one. The state is given by the Gibbs density operator ρ. This expression can also be written in terms of the energy eigenvalues εn and eigenvectors ∣n〉 of H. The probability of finding the system in state ∣n〉 is
pn=nρn=e−βεnZ.E38
The average external energy U of the system is given as
U=U=TrHρ=∑nεnpn.E39
When the parameter γ is changed to γ+dγ, both εn and pn as well as U change to
dU=∑ndεnpn+εndpn.E40
Each instantaneous infinitesimal process can be broken down into a part which is the work performed; the second is the heat transformed as the system relaxes to equilibrium. This breakup motivates us to define
δW=∑ndεnpn,δQ=∑nεndpn,E41
so dU=δQ+δW, and δ is used to indicate that heat and work are not exact differentials. The free energy of the system is defined to be F=−TlogZ, so dF=∑ndεnpn which means
δW=dF.E42
By integrating over the infinitesimal segments, we find W is
W=ΔF=ΔU−Q.E43
Inverting Eq. (38) for pn, we can solve for
εn=−TlogZpn.E44
Substituting into the relation for δQ, we get two terms, one proportional to logZ and the other to logpn. The term with logZ when the pk satisfy ∑kpk=1 is
−T∑nlogZdpn=−TlogZd∑npn=0,E45
It remains to study
δQ=−T∑ndpnlogpn.E46
By the chain rule
d∑npnlogpn=∑ndpnlogpn+∑ndpn=∑ndpnlogpn.E47
So δQ is not a function of the state but is related to the variation of something that is. Define the entropy S as usual from (9), S=−∑npnlogpn, and arrive at
δQ=TdS.E48
This relation only holds for infinitesimal processes. For finite and irreversible processes, there may be additional terms to the entropy change. This has been quite successful at describing many different types of physical system [17, 18, 19].
A deep insight has come recently into the properties of nonequilibrium thermodynamics which could be achieved by regarding work as a random variable. For example, consider a process in which a piston is used to compress a gas in a cylinder. Due to the nature of the gas and its chaotic motion, each time the piston is pressed, the gas molecules exert a back reaction with a different force. This means the work needed to achieve a given compression changes each time something is carried out.
Usually a knowledge of nonequilibrium processes is restricted to inequalities such as the Jarzynski inequality. He was able to show by interpreting work W as a random variable that an inequality can be obtained, even for a process performed arbitrarily far from equilibrium.
Suppose the system is always prepared in the same state initially. A process is carried out and the total work W performed is measured. Repeating this many times, a probability distribution for the work PW can be constructed. An average for W can be computed using PW as
W=∫PWdW.E49
Jarzynski showed that the statistical average of e−βW satisfies
e−βW=e−βΔF,E50
where ΔF=FTγf−FTγi. It holds for a process performed arbitrarily far from equilibrium. Now the inequality W≥ΔF is contained in (50) and can be realized by applying Jensen’s inequality, which states that e−βW≥e−βW.
In macroscopic systems, individual measurements are usually very close to the average by the law of large numbers. For mictoscopic systems, this is usually not true. In fact, the individual realizations of W may be smaller than ΔF. These cases would be local violations of the second law but for large systems become extremely rare. If the function PW is known, the probability of a local violation of the second law is
PW<ΔF=∫−∞ΔFPWdW.E51
To get (50) requires detailed knowledge of the system’s dynamics, be it classical, quantum, unitary, or whatever.
Consider nonunitary quantum dynamics. Initially, the system has Hamiltonian Hi=Hγi. The system was in thermal equilibrium with a bath at temperature T. The initial state of the system is the Gibbs thermal density matrix (38). Let εni and ∣n〉 denote the initial eigenvalues and eigenvectors of Hi as εni is obtained with probability pn=e−βεni/Z.
Immediately after this measurement, γ changes from γ0=γi to γτ=γf according to the rule γt. If it is assumed the contact with the bath is very weak during this process, the state of the system evolves according to
∣ψt〉=Ut∣n〉,E52
where U is the unitary evolution operator which satisfies Schrödinger’s equation, i∂tU=HtU, U0=1.
The Hamiltonian is Hγf at the end and has energy levels εmf, eigenvectors ∣m〉, so the probability εnf measured is ∣〈mψτ〉2=∣〈m∣Uτn〉2. This may be interpreted as the conditional probability a system in ∣n〉 will be in ∣m〉 after time τ.
No heat has been exchanged with the environment, so any change in the environment has to be attributed to the work performed by the external agent and is
W=εmf−εni,E53
where both εni and εmf are fluctuating and change during each realization of the experiment. The first εni is random due to thermal fluctuations and εmf is random due to quantum fluctuations in W as a random variable by (53).
To get an expression for PW obtained by repeating the process several times, this is a two-step measurement process. From probability theory, if A,B are two events, the total probability pAB that both events have occurred is
pAB=pABpB,E54
where pB is the probability B which occurs and pAB is the conditional probability B that has occurred. The probability of both events that have occurred is ∣〈m∣Uτn〉2pn. Since we are interested in the work performed, we write
PW=∑n,m∣〈m∣Uτn〉2pnδW−εmf−εni.E55
And some over all allowed events, weighted by their probabilities, and arrange the terms according to the values εmf−εni. In most systems, there are present a rather large number of allowed levels, and even more allowed differences εmf−εni. It is more efficient to use the Fourier transform
Gy=eiyW=∫−∞∞PWeiyWdW.E56
This has the inverse Fourier transform
PW=12π∫−∞∞dyGye−iyW.E57
Using (55), we obtain that
Gy=∑n,m∣〈m∣Un〉2pneiyεmf−εni=∑n,mnU†eiyεmfmmUe−iyεnipnn=∑n,mnU†eiyHfmmUe−iyHiρn=TrU†τeiyHfUτe−iyHiρ.E58
Hence, it may be concluded that
G=TrU†τeiyHfUτe−iyHiρ.E59
This turns out to be somewhat easier to work with than PW, and (59) plays a similar role as Z in equilibrium statistical mechanics. From Gy, the statistical moments of W can be found by expanding
Gy=eiyW=1+iyW−y22W2−y36W3+⋯.E60
A formula for the quantum mechanical formula for the moments can be found as well. The average work is W=Hf−Hi, where for any operator A, we have At=TrU†tAUtρ as the expectation value of A at time t. This follows from the fact that the state of the system at t is ρt=UtρUt†. From the definition of G, it ought to be the case that Gy=iβ=e−βW. However, ρ in (38) and (59) yields
Giβ=1ZiTrU†e−βHfU=1ZiTreβHf=ZfZi.E61
Using Z=e−βF, (61) yields (50)
Giy=e−βW=e−βΔF.E62
Nothing has been assumed about the speed of this process. Thus inequality (50) must hold for a process arbitrarily far from equilibrium.
Advertisement
4. Heat flow from environment approach
There is another somewhat different way in which the Jarzynski inequality can be generalized to quantum dynamics. In a classical system, the energy of the system can be continuously measured as well as the flow of heat and work. Continuous measurement is not possible in quantum mechanics without disrupting the dynamics of the system [20].
A more satisfactory approach is to realize that although work cannot be continuously measured, the heat flow from the environment can be measured. To this end, the system of interest is divided into a system of interest and a thermal bath. The ambient environment is large, and it rapidly decoheres and remains at thermal equilibrium, uncorrelated and unentangled with the system. Consequently, we can measure the change in energy of the bath −Q without disturbing the dynamics of the system. The open-system Jarzynski identity is expressed as
e−βW=e−βEfeβQeβEi=e−βΔF.E63
For a system that has equilibrated with Hamiltonian H interacting with a thermal bath at temperature T, the equilibrium density matrix is ρ=eβH/Z=e−βF−βH, where β=1/KBT. The dynamics of an open quantum system is described by a quantum operator ρ˜=Sρ, a linear trace-preserving, complete positive map of operators. Any such complete positive superoperator has an operator-sum representation
Sρ=∑αAαρAα†.E64
Conversely, any operator-sum represents a complete positive superoperator. The set of operators Aα is often called Krauss operators. The superoperator is trace-preserving and conserves probability if ∑αAα†Aα=I. In the simplest case, the dynamics of an isolated quantum system is described by a single unitary operator U†=U−1.
The interest here is in the dynamics of a quantum system governed by a time-dependent Hamiltonian weakly coupled to an extended, thermal environment. Let the total Hamiltonian be
H=HSt⊗IB+IS⊗HB+εHint,E65
where IS and IB are system and bath identity operators, HSt the system Hamiltonian, HB the bath Hamiltonian, and Hint the bath-system interaction with ε a small parameter. Assume initially the system and environment are uncorrelated such that the initial combined state is ρS⊗ρeqB, where ρeqB is the thermal density equilibrium matrix of the bath.
By following the unitary dynamics of the combined total system for a finite time and measuring the final state of the environment, a quantum operator description of the system dynamics can also be obtained:
SstρS=TrBUρS⊗ρeqBU†=∑i,f〈bf∣U(ρS⊗(∑ie−βεiBZB∣bi〉bi))U†bf=1ZB∑i,fe−βεiBbfUbiρSbiU†bf.E66
Here U is the unitary evolution operator of the total system
U=expiℏ∫stHτdτ,E67
and TrB is the partial trace over the bath degrees of freedom, εiB are the energy eigenvalues, b〉 is the orthonormal energy eigenvectors of the bath, and ZB is the bath partition function. Assume the bath energy states are nondegenerate. Then (66) implies the Krauss operators for this dynamics are
Ai,f=1ZBe−βεiB/2bfUbi.E68
Suppose the environment is large, with a characteristic relaxation time short compared with the bath-system interactions, and the system-bath coupling ε is small. The environment remains near thermal equilibrium, unentangled and uncorrelated with the system. The system dynamics of each consecutive time interval can be described by a superoperator derived as in (66) which can then be chained together to form a quantum Markov chain:
ρt=St−1t⋯Ss+1s+2Sss+1ρ.E69
The Hermitian operator of a von Neumann-type measurement can be broken up into a set of eigenvalues λσ and orthonormal projection operators πσ such that H=∑σλσπσ. In a more general sense, the measured operator of a positive operator-valued measurement need not be projectors or orthonormal. The probability of observing the a-th outcome is
pa=TrAaρAa†.E70
The state of the system after this interaction is
ρ˜a=AaρAa†TrAaρAa†.E71
The result of the measurement can be represented by using a Hermitian map superoperator A:
A=∑αaαAαρAα†.E72
An operator-value sum maps Hermitian operators into Hermitian operators:
AH†=aαAαHAᆆ=∑αaαA††H†Aα†=AH.E73
In the other direction, any Hermitian map has an operator-value-mean representation. Hermitian maps provide a particularly concise and convenient representation of sequential measurements and correlation functions. For example, suppose Hermitian map A represents a measurement at time 0, C is a different measurement at time t, and the quantum operation St represents the system evolution between the measurements. The expectation value of a single measurement is
a=TrAρ=∑αaαTrAαρAα†=∑αpαaα.E74
The correlation function bta0 can be expressed as
bta0=TrBStAρ0=∑α,βaαbβTrBαStAαρ0Aα†Bβ†.E75
It may be shown that just as every Hermitian operator represents some measurement on the Hilbert space of pure states, every Hermitian map can be associated with some measurement on the Liouville space of mixed states.
A Hermitian map representation of heat flow can now be constructed under assumptions that the bath and system Hamiltonian are constant during the measurement and the bath-system coupling is very small. A measurement on the total system is constructed, and thus the bath degrees of freedom are projected out. This leaves a Hermitian map superoperator that acts on the system density matrix alone. Let us describe the measurement process and mathematical formulation together.
Begin with a composite system which consists of the bath, initially in thermal equilibrium weakly coupled to the system:
ρS⊗ρeqB.E76
Measure the initial energy eigenstate of the bath so based on (76):
(IS⊗∣bi〉〈bi∣)ρS⊗ρeqB(IS⊗∣bj〉〈bj∣).E77
Now allow the system to evolve together with the bath for some time:
U(IS⊗∣bi〉〈bi∣)ρS⊗ρeqB(IS⊗∣bj〉〈bj∣)U†.E78
Finally, measure the final energy eigenstate of the bath:
(IS⊗∣bi〉〈bf∣)U(IS⊗∣bi〉〈bi∣)ρS⊗ρeqB(IS⊗∣bj〉〈bj∣)U†(IS⊗∣bf〉〈bf∣).E79
Taking the trace over the bath degrees of freedom produces the final normalized system density matrix where trace over S gives the probability of observing the given initial and final bath eigenstates. Multiply by the Boltzmann weighted heat, and sum over the initial and final bath states to obtain the desired average Boltzmann weighted heat flow:
eβQ=∑i,fe−βεfB−εiBTrSTrB(IS⊗∣bf〉〈bf∣)U(IS⊗∣bi〉〈bi∣)ρS⊗ρeqB(IS⊗∣bj〉〈bj∣)U†(IS⊗∣bj〉〈bj∣).E80
Replace the heat bath Hamiltonian by IS⊗HB=H−HSt⊗IB−εHint. The total Hamiltonian commutes with the unitary dynamics and cancels. The interaction Hamiltonian can be omitted in the small coupling limit giving
eβQ=TrSTrBeβHS/2⊗ISUe−β/2HS⊗IBρS⊗ρeqBe−βHS/2⊗IBU†eβHS/2⊗IBE81
Collecting the terms acting on the bath and system separately and replacing the Krauss operators describing the reduced dynamics of the system, the result is
eβQ=TrSeβHS/2TrBUe−βHS/2ρSe−βHS/2⊗ρeqBU†eβHS/2=TrS∑αeβHS/2Aαe−βHS/2ρSeβHS/2Aα†e−βHS/2.E82
To summarize, it has been found that the average Boltzmann weighted heat flow is represented by
eβQ=TrR−1SRρS.E83
where S represents the reduced dynamics of the system. The Hermitian map superoperator Rt is given by
Rtρ=e−βHt/2ρeβHt/2.E84
The paired Hermitian map superoperators act at the start and end of a time interval. They give a measure of the change in the energy of the system over that interval. This procedure does not disturb the system beyond that already incurred by coupling the system to the environment. The Jarzynski inequality now follows by applying this Hermitian map and quantum formalism. Discretize the experimental time into a series of discrete intervals labeled by an integer t.
The system Hamiltonian is fixed within each interval. It changes only in discrete jumps at the boundaries. The heat flow can be measured by wrapping the superoperator time evolution of each time interval St along with the corresponding Hermitian map measurements Rt−1SRt. In a similar fashion, the measurement of the Boltzmann weighted energy change of the system can be measured with e−βΔE=TrRτSRτ−1. The average Boltzmann weighted work of a driven, dissipative quantum system can be expressed as
e−βW=TrRτ∏tRt−1StRtRτ−1ρ0eq,E85
In (85), ρeqt is the system equilibrium density matrix when the system Hamiltonian is HtS.
This product actually telescopes due to the structure of the energy change Hermitian map (84) and the equilibrium density matrix (65). This leaves only the free energy difference between the initial and final equilibrium ensembles, as can be seen by writing out the first few terms
e−βW=TrRτRτ−1SτRτ⋯R2−1S2R2R1−1S1R1R0−1ρeq0=TrτRτ−1SτRτ⋯R2−1S2R2R1−1S1R1IZ0=Tr[RτRτ−1SτRτ⋯R2−1S2R2(R1−1S1ρeq1Z1Z0]⋯=ZτZ0=e−βΔF=e−βΔF.E86
In the limit in which the time intervals are reduced to zero, the inequality can be expressed in the continuous Lindblad form:
e−βW=TrRtexp∫0tRξ−1SξRξdξR0−1ρ0eq=e−βΔF.E87
Advertisement
5. A model quantum spin system
A magnetic resonance experiment can be used to illustrate how these ideas can be applied in practice. A sample of noninteracting spin-1/2 particles are placed in a strong magnetic field B0 which is directed along the z direction. Denote by σj,j=x,y,z the usual Pauli matrices and 1 the 2×2 identity matrix. It is assumed the motion of the system is unitary. Then the spin is governed by the Hamiltonian:
H0=−12B0σz.E88
In units where ℏ is one, B0 represents the characteristic precession frequency of the spin. Since Ho is diagonal in the ∣±〉 basis that diagonalizes σz, the matrix exponential and partition function are given by
e−H/T=eB0/2T00e−B0/2T,Z=Tre−H/T=2coshB02T,E89
If we set σ˜ to be the equilibrium magnetization of the system, σ˜=σxth, the thermal density matrix is
ρ=ρth=121+σ˜001−σ˜,σ˜=tanhB0T.E90
and σ˜ corresponds to the parametric response of a spin-1/2 particle.
The work segment is implemented by introducing a very small field of amplitude B rotating in the xy plane with frequency ω. The work parameter is governed by the field
B=Bsinωtcosωt0.E91
Typically, B0≈ωT and B≈0.01T, so we may approximate B<<B0. The total Hamiltonian is the combination
Ht=−B02σz−B2σzsinωt+σycosωt.E92
The oscillating field plays the role of a perturbation which although weak may initiate transitions between the up and down spin states and will be most frequent at the resonance condition ω=B0, so the driving frequency matches the natural oscillation frequency.
The time evolution operator Ut is calculated now. To do this, define a new operator Vt by means of the equation
Ut=eiωtσz/2Vt.E93
Substituting (43) into the evolution equation for Ut, i∂tU=HtU, U0=1. It is found that Vt obeys the Schrödinger equation:
i∂V∂t=H˜tV,V0=1,E94
It is found that Vt satisfies
i∂V∂t=12ωσz−B0σz−Be−iωtσz/2σxsinωt+σycosωteiωσz/2Vt.E95
Using the commutation relations of the Pauli matrices and the fact that
e−iωσz=1cosωt2−iσzsinωt2,E96
it is found that the terms in the evolution equation can be simplified
e−iασzσxeiασz=1cosα−iσxsinασx1cosα+iσzsinα=σx+2sinαcosασy−2iσzσysin2α=σx+2sinαcosασysin2α,E97
e−iασzσyeiασz=(σycosα−iσxσysinα1cosα+iσzsinα=σy−2sinαcosασx+2iσzσsin2α.E98
By means of these results, it remains to simplify
e−iωtσz/2σzsinωt+σycosωteiωtσz/2=σzsinωt−sinωt+cosωtsinωt−cosωtsinωt+σysin2ωt+cosωt−cosωt+cos2ωt=σy.E98a
Taking these results to (95), we arrive at
i∂V∂t=H1V,H1=−12B0−ωσz−12Bσy.E99
This means Vt evolves according to a time-dependent Hamiltonian, so the solution can be written as
Vt=e−iH1t,E100
and the full-time evolution operator is given by
Ut=eiωtσz/2e−iH1t.E101
Since the operators σy and σz do not commute, the exponentials in (101) cannot be using the usual addition rule.
To express (100) otherwise, suppose M is an arbitrary matrix such that M2=1. When α is an arbitrary parameter, power series expansion of e−iαM yields
e−iαM=1cosα−iMsinα.E102
Now H1 can be put in equivalent form
H1=Ω2σzcosϑ+σysinϑ,Ω=B0−ω2+B2,tanϑ=BB0−ω,E103
Since σi2=1, it follows that
σzcosϑ+σysinϑ2=1.E104
Consequently, (100) can be used to prove that Vt is given by
e−iH1t=1cosΩ2t+iσzcosϑ+σysinϑsinΩ2t=cosΩ2t+icosϑsinΩ2tsinϑsinΩ2t−sinϑsinΩ2tcosΩ2t−icosϑsinΩ2tE105
Since
eiωσzt/2=eiωt/200e−iωt/2E106
the evolution operator is then given by
Ut=utvt−v∗tu∗t.E107
The functions ut and vt in (107) are given as
ut=eiωt/2cosΩ2t+isinϑsinΩ2t,vt=eiωt/2⋅sinϑ⋅sinΩ2t.E108
Apart from a phase factor, the final result depends only on Ω and ϑ, and these in turn depend on B0, B, and ω through (108). To understand the physics of Ut a bit better, suppose the system is initially in the pure state ∣+〉. The probability will be found in state ∣−〉 after time t is
∣〈−∣Ut+〉2=v2.E109
This expression represents the transition probability per unit time a transition will occur. Since the unitarity condition U†U=1 implies that u2+v2=1, we conclude u2 is the probability when no transition occurs. Note v is proportional to sinϑ, which gives a physical meaning to ϑ. It represents the transition probability and reaches a maximum when ω=B0 at resonance where Ω=B, so u and v simplify to
ut=eiωt/2cosB2t,vt=eiωt/2sinB2t.E110
Now that Ut is known, the evolution of any observable A can be calculated
At=TrU†tAUtρ.E111
If A is replaced by σz in (111), we obtain
σzt=Tru∗t−vtv∗tutσzutvt−v∗tu∗t121+σ˜001−σ˜=σ˜u2−v2=σ˜1−2v2.E112
Substituting v2, this takes the form
σzt=σ˜cos2ϑ+sin2ϑcosΩt=tanhB02Tcos2ϑ+sin2ϑcosΩt.E113
Consider the average work. Suppose B<<B0, so the unperturbed Hamiltonian H0 can be used instead of the full Hamiltonian Ht when expectation values of quantities are calculated which are related to the energy. Let us determine the energy of the system at any t by taking operator A to be H0:
H0t=−B02σzt=−12B0TrU†tσzUtρ)=−12B0Tru∗−vv∗u100−1uv−v∗u∗121+σ˜001−σ˜=−B04Tru∗−vv∗uuvv∗−u∗1+σ˜001−σ˜=−14B0σ˜1−2v2.E114
The average work at time t is simply the difference between the energy st time t1 and t=0. Since v0=0, this difference is
Wt=−B02σ˜1−2v2+B02σ˜=σ˜B0v2=σ˜B0sin2ϑsin2Ω2t=σ˜B0BΩ2sin2Ω2t,E115
since sin2ϑ=1−cos2ϑ=B2/Ω2. The average work oscillates indefinitely with frequency Ω/2. This is a consequence of the fact the time evolution is unitary. The amplitude multiplying the average work is proportional to the initial magnetization σ˜ and B2/Ω2, so the ratio is a Lorentzian function.
The equilibrium free energy is F=−TlogZ where Z=2coshB0/2T. The free energy of the initial state at t=0 and final state at any arbitrary time is the same yielding
ΔF=0.E116
This is a consequence of the fact that B<<B0. According to W≥F, it should be expected that
Wt≥ΔF=0.E117
Given the matrices for Ut and ρ that have been determined so far, the function G can be computed:
Gy=TrU†yeixHfUye−iyHiρ=Tru∗−vv∗ue−iyB0/200eiyB0/2uv−v∗u∗eiyB0/200e−iyB0/2121+σ˜00121−σ˜=u2+12(1+σ˜eiyB0+1−σ˜e−iyB0v2.E118
Set x=iβ and recall use definition (42) for σ˜ in the second term of (118) to give
1+tanhβ2B0e−βB0+1−tanhβ2B0eβB0=2coshβB0−2tanhβ2B0sinhβB0=1.E119
Substituting (119) into (118), we can conclude
e−βW=Giβ=u2+v2=1.E120
This is the Jarzynski inequality, since it is the case that ΔF=0 here. The statistical moments of the work can be obtained by writing an expression for G into a power series
Gy=u2+v21+iσ˜B0y−12B02y2+⋯.E121
From (121), the first and second moments can be obtained; for example
W=σ˜B0v2,W2=B02v2.E122
As a consequence, the variance of the work can be determined
varW=W2−W2=B02v2−σ˜2B02v4−B02v21−σ˜2v2.E123
A final calculation that may be considered is the full distribution of work PW. Now PW is the inverse Fourier transform of Gy:
PW=12π∫−∞∞dyGye−iyW.E124
Using the Fourier integral form of the delta function, (124) can be written as
PW=12π∫−∞∞dyGye−iyW=12π∫−∞∞dyut2+vt2121+σ˜eiB0y+121−σ˜e−iB0ye−iyW.=u2δW+12vt2δW−B0+12vt21+σ˜δW+B0.E125
Work taken as a random variable can take three values W=0,+B0,−B0 where B0 is the energy spacing between the up and down states. The event W=B0 corresponds to the case where the spin was originally up and then reversed, so an up-down transition. The energy change is B0/2−−B0/2=B0. Similarly, W=−B0 is the opposite flip from this one, and W=0 is the case with no spin flip.
The second law would have us think that W>0, but a down-up flip should have W=−B0, so PW=−B0 is the probability of observing a local violation of the second law. However, since PW=±B0 is proportional to 1±σ˜, up-down flips are always more likely than down-up. This ensures that W≥0, so violations of the second law are always exceptions to the rule and never dominate.
The work performed by an external magnetic field on a single spin-1/2 particle has been studied so far. The energy differences mentioned correspond to the work. For noninteracting particles, energy is additive. Hence the total work W which is performed during a certain process is the sum of works performed on each individual particle W=W1+⋯+WN. Since the spins are all independent and energy is an extensive variable, it follows that W=NW. where W is the average work from (115).
Advertisement
6. Conclusions
We have tried to give an introduction to this frontier area that lies in between that of thermodynamics and quantum mechanics in such a way as to be comprehensible. There are many other areas of investigation presently which have had interesting repercussions for this area as well. There is a growing awareness that entanglement facilitates reaching equilibrium [21, 22, 23]. It is then worth mentioning that the ideas of einselection and entanglement with the environment can lead to a time-independent equilibrium in an individual quantum system and statistical mechanics can be done without ensembles. However, there is really a lot of work yet to be done in these blossoming areas and will be left for possible future expositions.