Open access peer-reviewed chapter - ONLINE FIRST

Nonlinear Generalized Schrödinger’s Equations by Lifting Hamilton-Jacobi’s Formulation of Classical Mechanics

By Gérard Gouesbet

Submitted: June 28th 2021. Reviewed: August 23rd 2021. Published: September 28th 2021.

DOI: 10.5772/intechopen.100068


It is well known that, by taking a limit of Schrödinger’s equation, we may recover Hamilton-Jacobi’s equation, which governs one of the possible formulations of classical mechanics. Conversely, we may start from Hamilton-Jacobi’s equation and, by using a lifting principle, reach a set of nonlinear generalized Schrödinger’s equations. The classical Schrödinger’s equation then occurs as the simplest equation in the set.


  • Schrödinger’s equation
  • Hamilton-Jacobi’s equation
  • correspondence principle
  • lifting principle

1. Introduction

Schrödinger’s equation is the fundamental equation of quantum mechanics. Using a correspondence principle, we may recover the classical limit of mechanics in the form of Hamilton-Jacobi’s equation. This is a top-down process, from a general theory to a restricted limit theory, i.e. from quantum mechanics to classical mechanics. We may use another principle, which I call a lifting principle and which, starting from Hamilton-Jacobi’s equation, allows one, through a bottom-up process, to reach a set of generalized Schrödinger’s equations encompassing nonlinear terms. From this generalized set, we may turn back to a top-down process. In a first step, we recover the classical Schrödinger’s equation as, in some sense, the simplest equation in the set and, in a second step, we recover classical mechanics from quantum mechanics, using again a correspondence principle.

The chapter is organized as follows. Section 2 recalls Hamilton-Jacobi’s equation of classical mechanics which, in the present chapter, may be viewed as a turning point, both the end of a top-down process and the beginning of a bottom-up process. Section 3 exemplifies a way to obtain Schrödinger’s equation by using an analogy relying on Hamilton-Jacobi’s equation. Section 4 expounds the bottom-up process from Hamilton-Jacobi’s equation to a set of generalized Schrödinger’s equations. Section 5 provides a complementary discussion while Section 6 is a conclusion.


2. Hamilton-Jacobi’s formulation of classical mechanics

Classical mechanics can be cast in four different formulations, which are mathematically and empirically equivalent: Newton’s, Lagrange’s, Hamilton’s and Hamilton-Jacobi’s formulations. In the present chapter, we rely on Hamilton-Jacobi’s formulation, see for instance Louis de Broglie [1], Blokhintsev [2], Landau and Lifchitz [3], and Holland [4]. This formulation of the nonrelativistic classical mechanics of a material point relies on an equation, which I shall call Hamilton-Jacobi’s equation, reading as:

$$-\frac{\partial S}{\partial t} = \frac{1}{2m}\sum_j\left(\frac{\partial S}{\partial x_j}\right)^2 + V \tag{1}$$

This equation allows one to study the motion of a particle of mass $m$ in a potential $V = V(x_j, t)$. The $x_j$’s denote Cartesian coordinates and $t$ is the time. The field $S = S(x_j, t)$ is a real field that I shall call Jacobi’s field. Eq. (1) has to be complemented by two other equations reading as:

$$W = -\frac{\partial S}{\partial t} \tag{2}$$

$$p_j = \frac{\partial S}{\partial x_j} \tag{3}$$

in which $W$ is the energy and $p_j$ is the momentum. From Eq. (2), we see that $S$ is an action (energy multiplied by time) and, from now on, we may call it the action. Also, inserting Eqs. (2) and (3) into Eq. (1), we obtain $W = T + V$, which should be enough to convince us of the equivalence between Newton’s and Hamilton-Jacobi’s formulations. For a conservative motion, the energy (which we denote $E$ in that case) is constant along each particular motion, and Eq. (2) implies:

$$S(x_j, t) = S_0(x_j) - Et \tag{4}$$

Inserting Eq. (4) into Eq. (1), we obtain:

$$\frac{1}{2m}\sum_j\left(\frac{\partial S_0}{\partial x_j}\right)^2 + V = E \tag{5}$$

We now consider the locus of the points for which $S_0$ possesses a given value $C_0$:

$$S_0(x_j) = C_0 \tag{6}$$

Eq. (6) shows that the locus is a time-independent surface. There is one surface, and only one, containing a given point $P$ of space, according to $C_0 = S_0(x_j^P)$. The whole space is therefore filled by a set of motionless surfaces forming what I call Jacobi’s static field. From Eqs. (3) and (4), we have:

$$p_j = \frac{\partial S}{\partial x_j} = \frac{\partial S_0}{\partial x_j} \tag{7}$$

Therefore, $p_j$ is the gradient of $S$ (and of $S_0$). This means that trajectories are orthogonal to the surfaces of constant $S$ (and to the surfaces of constant $S_0$). Next, we consider the locus of the points for which the action $S$ possesses a given value $C$:

$$S(x_j, t) = C \tag{8}$$

Eq. (8) shows that the locus is still a surface, but one which now depends on time. As time goes on, the surface moves and, in general, experiences a deformation. For a given time $t$, the moving surface $S(x_j, t) = C$ coincides with a motionless surface $S_0(x_j) = C_0$, according to, from Eq. (4): $C = C_0 - Et$. Therefore, as time goes on, the moving surface $S = C$ sweeps over all motionless surfaces $S_0 = C_0$.

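We now consider a fictitious point $P$, pertaining to the surface $S = C$, and therefore moving with it, with the constraint that its displacement remains orthogonal to the swept surfaces $S_0 = C_0$. The velocity of the moving surface may then be defined as:

$$w_j = \frac{dx_j}{dt} \tag{9}$$

in which $dx_j$ is an infinitesimal displacement of the point $P$. But we have:

$$dS = \frac{\partial S}{\partial t}\,dt + \sum_j \frac{\partial S}{\partial x_j}\,dx_j = 0 \tag{10}$$

that is to say:

$$-E\,dt + \sum_j p_j\,dx_j = 0 \tag{11}$$

leading to:

$$\sum_j p_j w_j = E \tag{12}$$

But $w_j$ (modulus: $w$) is collinear with $p_j$ (modulus: $p$). Hence, with $E$ positive, we obtain:

$$w = \frac{E}{p} \tag{13}$$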

We are therefore facing two different velocities: (i) the velocity $v = p/m$ of the material point and (ii) the velocity $w = E/p$ of the fictitious point $P$. Finally, inserting Eq. (13) into Eq. (5), we obtain:


We then remark that Newton’s formulation relies on the existence of trajectories, while Hamilton-Jacobi’s formulation relies both on trajectories and on a field filling the space. Hamilton-Jacobi’s formulation is the first one in which the motion of a localized object has been associated with a space-filling field. In other words, Hamilton-Jacobi’s formulation is nonlocal. This nonlocality actually anticipates the nonlocality of quantum mechanics, and the space-filling field $S$ is an anticipation as well of a space-filling field of quantum mechanics. It has furthermore been argued that Newton’s and Hamilton-Jacobi’s formulations, although empirically equivalent, are ontologically contradictory, representing an example of the Duhem-Quine ontological underdetermination of theory by experience [5, 6].
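
As an illustration of the two velocities, consider a free particle ($V = 0$) moving along a single axis $x$. The sketch below assumes the sign conventions $W = -\partial S/\partial t$ and $p = \partial S/\partial x$ (so that $S = S_0 - Et$); the particular choice of $S_0$ is an illustrative assumption rather than something taken from the chapter.

```latex
\begin{align*}
S_0(x) &= p\,x, & S(x,t) &= p\,x - E\,t, \\
\frac{1}{2m}\left(\frac{dS_0}{dx}\right)^2 &= \frac{p^2}{2m} = E, \\
v &= \frac{p}{m}, & w &= \frac{E}{p} = \frac{p}{2m} = \frac{v}{2}.
\end{align*}
```

Under these assumptions, the surfaces of constant action are planes orthogonal to the trajectory, and they advance at half the speed of the particle, which makes the distinction between the two velocities concrete.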


3. Guessing Schrödinger’s derivation

Strictly speaking, there is no derivation of Schrödinger’s equation but a variety of guessing approaches, with different flavors depending on the preferences of the authors. Basically, however, Schrödinger’s equation was introduced in [7, 8] in its stationary form and in [9] in its time-dependent form. An English translation is available in [10] and a French translation in [11]. The derivation relies on an analogy between Hamilton-Jacobi’s formulation of classical mechanics and geometrical optics. As is rather usual when something new is presented for the first time, Schrödinger’s argument is more complicated than necessary. For instance, it relies on the use of non-Cartesian coordinates and on a non-Euclidean interpretation of the configuration space, requiring the use of covariant and contravariant components of vectors (more generally, of tensors), which may be unfamiliar to some readers. Feynman even commented that some arguments invoked by Schrödinger are erroneous [12]. Without showing any disrespect to Schrödinger’s work, I prefer to present a more recent exposition extracted from Winogradski [13], who defended her thesis under the supervision of Louis de Broglie.

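We begin with scalar wave optics and with the corresponding wave equation reading as:

$$\sum_j \frac{\partial^2 \Psi}{\partial x_j^2} - \frac{1}{u^2}\frac{\partial^2 \Psi}{\partial t^2} = 0 \tag{15}$$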

in which $u = u(x_j, t)$ is the velocity of the wave $\Psi(x_j, t)$. We may also introduce the refractive index $n$ of the medium according to $n = c/u$, in which $c$ is the speed of light. We now consider a steady medium ($\partial n/\partial t = 0$) which may support monochromatic waves of angular frequency $\omega$, reading as:

$$\Psi(x_j, t) = \Psi_0(x_j)\,e^{-i\omega t} \tag{16}$$

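Because $\Psi$ and $\Psi_0$ are, in general, complex fields, we set:

$$\Psi_0(x_j) = A(x_j)\,e^{i\phi_0(x_j)} \tag{17}$$

leading to:

$$\Psi(x_j, t) = A(x_j)\,e^{i\phi(x_j, t)} \tag{18}$$

$$\phi(x_j, t) = \phi_0(x_j) - \omega t \tag{19}$$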

In these expressions, $\Psi_0$ is a complex amplitude, $A$ a real amplitude, and $\phi(x_j, t)$ and $\phi_0(x_j)$ are phases. We may then introduce the wave-number vector reading as:

$$k_j = \frac{\partial \phi_0}{\partial x_j} \tag{20}$$

The wave-number $k$ is defined as $k = \left(\sum_j k_j^2\right)^{1/2}$ and the wave-length $\lambda$ is defined by $\lambda = 2\pi/k$. Also, we have:


Inserting Eq. (16) into Eq. (15), we obtain:

$$\sum_j \frac{\partial^2 \Psi_0}{\partial x_j^2} + \frac{\omega^2}{u^2}\,\Psi_0 = 0 \tag{22}$$

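Next, inserting Eq. (17) into Eq. (22), we obtain two equations relating the real amplitude $A$ and the phase $\phi_0$:

$$\sum_j \frac{\partial^2 A}{\partial x_j^2} - A\sum_j\left(\frac{\partial \phi_0}{\partial x_j}\right)^2 + \frac{\omega^2}{u^2}\,A = 0 \tag{23}$$

$$2\sum_j \frac{\partial A}{\partial x_j}\frac{\partial \phi_0}{\partial x_j} + A\sum_j \frac{\partial^2 \phi_0}{\partial x_j^2} = 0 \tag{24}$$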

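If the medium, besides being steady, is homogeneous ($\partial n/\partial x_j = 0$), the wave equation admits plane wave solutions reading as:

$$\Psi(x_j, t) = A\,\exp\Big[i\Big(\sum_j k_j x_j - \omega t\Big)\Big] \tag{25}$$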

in which $A$, $k_j$ and $\omega$ are constant quantities, and $\lambda$ becomes the spatial period of the wave along the direction of propagation.
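
A quick check, sketched below, shows what the plane-wave form implies. It assumes the standard scalar wave equation $\Delta\Psi = u^{-2}\,\partial^2\Psi/\partial t^2$ and the sign convention $\exp[i(k_j x_j - \omega t)]$ with summation over the repeated index $j$; both are assumptions made for illustration.

```latex
\begin{align*}
\Psi &= A \exp\!\big[i\,(k_j x_j - \omega t)\big], \\
\Delta \Psi &= -k^2\,\Psi, \qquad \frac{\partial^2 \Psi}{\partial t^2} = -\omega^2\,\Psi, \\
\Delta \Psi - \frac{1}{u^2}\frac{\partial^2 \Psi}{\partial t^2} = 0
&\;\Longrightarrow\; k^2 = \frac{\omega^2}{u^2}, \qquad \lambda = \frac{2\pi}{k} = \frac{2\pi u}{\omega}.
\end{align*}
```

In a homogeneous steady medium the wave-number is thus fixed by the angular frequency and the wave velocity, and the wave-length is indeed a constant spatial period.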

We are now sufficiently equipped to turn to a discussion of geometrical optics, which is an approximation to wave optics. This approximation is valid whenever the optical wave approximately behaves as a plane wave over a distance of the order of the wave-length $\lambda$, that is to say when $A(x_j)$ and $k_j = \partial\phi_0/\partial x_j$ are approximately constant over $\lambda$. Equivalently, we may take the limit $\lambda \to 0$. There is a rigorous but tedious way to take this limit by examining the relative variations $\Delta A/A$ and $\Delta k_j/k$ over $\lambda$, in the direction $x_k$, relying on Taylor expansions. I shall rather use heuristic but sufficiently convincing arguments, which furthermore lead to the correct results. Because $A$ is approximately a constant, Eq. (23) reduces to:

$$\sum_j\left(\frac{\partial \phi_0}{\partial x_j}\right)^2 = \frac{\omega^2}{u^2} \tag{26}$$

Furthermore, because $k_j = \partial\phi_0/\partial x_j$ is approximately a constant too, Eq. (24) reduces to the identity $0 = 0$. Therefore, Eq. (26) is the geometrical optics version of wave optics. Eqs. (23) and (24), i.e. two equations, have collapsed into a single one. We observe that Eq. (26) contains the phase $\phi_0$, but no longer contains the amplitude $A$. This means that the concept of amplitude has no meaning, in a strict sense defined by the above derivation, in geometrical optics (this does not prevent one from building geometrical optics models using the concept of amplitude).
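
The collapse of two equations into one can be made explicit. The sketch below assumes that the stationary wave equation has the Helmholtz form $\Delta\Psi_0 + (\omega^2/u^2)\Psi_0 = 0$ and that $\Psi_0 = A\,e^{i\phi_0}$ with real $A$, $\phi_0$ and $u$; the operator notation $\nabla$, $\Delta$ is introduced here for brevity and is not the chapter’s index notation.

```latex
\begin{align*}
\Delta\!\left(A\,e^{i\phi_0}\right)
 &= \left[\Delta A - A\,(\nabla\phi_0)^2
    + i\left(2\,\nabla A\cdot\nabla\phi_0 + A\,\Delta\phi_0\right)\right] e^{i\phi_0}, \\
\text{real part:}\quad
 & \Delta A - A\,(\nabla\phi_0)^2 + \frac{\omega^2}{u^2}\,A = 0, \\
\text{imaginary part:}\quad
 & 2\,\nabla A\cdot\nabla\phi_0 + A\,\Delta\phi_0 = 0, \\
A,\ \nabla\phi_0 \approx \text{const.}\quad
 & \Longrightarrow\quad (\nabla\phi_0)^2 = \frac{\omega^2}{u^2}
   \quad\text{and}\quad 0 = 0 .
\end{align*}
```

When $A$ and $\nabla\phi_0$ are treated as constants, every derivative of $A$ and every second derivative of $\phi_0$ drops out, which is why only the phase equation survives.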

Also, from Eqs. (20) and (26), we have:

$$k^2 = \sum_j k_j^2 = \frac{\omega^2}{u^2} \tag{27}$$

Now, similarly to $S_0$ and $S$, $\phi_0$ and $\phi$ define equiphase surfaces satisfying the following obvious analogous results. The locus of the points for which $\phi_0$ possesses a given value $C_0$, i.e. $\phi_0(x_j) = C_0$, is a time-independent equiphase surface. There is one surface, and only one, containing a given point $P$ of space, given by $C_0 = \phi_0(x_j^P)$. The whole space is therefore filled by a set of motionless surfaces forming the static phase field. The trajectories orthogonal to these surfaces are called rays. The locus of the points for which $\phi$ possesses a given value $C$, i.e. $\phi(x_j, t) = C$, is a time-dependent equiphase surface. For a given time $t$, the moving equiphase surface $\phi = C$ coincides with a motionless equiphase surface $\phi_0 = C_0$. As time goes on, the moving surface $\phi = C$ sweeps over all motionless surfaces $\phi_0 = C_0$.

Assembling the results obtained for the conservative Hamilton-Jacobi’s classical mechanics and for geometrical optics, we obtain a remarkable analogy exhibited in Table 1.

Table 1. Analogy between Hamilton-Jacobi’s classical mechanics and geometrical optics (columns: classical mechanics; geometrical optics).

This analogy was discovered by Hamilton about one century (!) before its use in the discovery of Schrödinger’s equation, see Refs. [14, 15], references therein and prior references from Hamilton. Formally, we may express the same structure by using a mechanical language or an optical language. Both languages may be translated, from one to the other, by using a dictionary D exhibited in Table 2, where the newly introduced constant $G$ has the dimension of an action.

Table 2. The dictionary (e.g. trajectory ↔ ray).

An analogy is not necessarily significant, but any analogy should be, at least tentatively, taken seriously. If the analogy is fully meaningless, then the value of the constant $G$ does not matter, and any value for $G$ would do. A contrario, if the analogy is somehow meaningful, that is to say if the motion of a material point can be somehow associated with the propagation of a certain scalar field (the point of view taken very seriously by Louis de Broglie in his double solution), then the constant $G$ should be a new fundamental constant of nature. We now know that the analogy under study may be taken seriously enough, and that it eventually leads to $G = \hbar$. Lines (c) and (d) of Table 2 then lead to:

$$p_j = \hbar k_j \tag{28}$$

$$E = \hbar\omega \tag{29}$$

which we call de Broglie, or Einstein-de Broglie relations. Eq. (28) expresses an equivalence between momentum (mechanical language) and wave-number (optical language), while Eq. (29) expresses an equivalence between energy (mechanical language) and angular frequency (optical language).
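
To give an order of magnitude, the worked arithmetic below applies the momentum and wave-number equivalence to a nonrelativistic electron of kinetic energy 100 eV. It assumes the relation $p = \hbar k$ (i.e. $G = \hbar$) together with rounded values of the physical constants; the choice of particle and energy is purely illustrative.

```latex
\begin{align*}
p &= \sqrt{2 m_e E}
   = \sqrt{2 \times 9.109\times10^{-31}\,\mathrm{kg} \times 1.602\times10^{-17}\,\mathrm{J}}
   \approx 5.40\times10^{-24}\,\mathrm{kg\,m\,s^{-1}}, \\
k &= \frac{p}{\hbar} \approx \frac{5.40\times10^{-24}}{1.055\times10^{-34}}\,\mathrm{m^{-1}}
   \approx 5.12\times10^{10}\,\mathrm{m^{-1}}, \\
\lambda &= \frac{2\pi}{k} \approx 1.23\times10^{-10}\,\mathrm{m}.
\end{align*}
```

The resulting wave-length is of the order of atomic dimensions, far below any scale at which a classical description would resolve it.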

The situation we are facing is now sketched in Figure 1. First, we possess an analogy between Hamilton-Jacobi’s classical mechanics and geometrical optics, expressed by a dictionary D. Second, geometrical optics is an approximation to scalar wave optics. Figure 1 then exhibits three filled rectangles, and we may feel intuitively but clearly that something is lacking, corresponding to the fourth, empty rectangle. To fill this rectangle, we apply the dictionary D to wave optics. From the dictionary of Table 2, with $G = \hbar$, we have:

$$\frac{\omega^2}{u^2} = k^2 = \frac{p^2}{\hbar^2} = \frac{2m(E - V)}{\hbar^2} \tag{30}$$

Figure 1. Guessing Schrödinger’s equation.

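We may then translate Eq. (22) to:

$$\sum_j \frac{\partial^2 \Psi_0}{\partial x_j^2} + \frac{2m}{\hbar^2}\,(E - V)\,\Psi_0 = 0 \tag{31}$$

which is exactly the time-independent (stationary) Schrödinger’s equation. Likewise, Eq. (16) is translated to:

$$\Psi(x_j, t) = \Psi_0(x_j)\,e^{-iEt/\hbar} \tag{32}$$

and we readily establish that $\Psi$ also satisfies Eq. (31), which we had better rewrite as:

$$-\frac{\hbar^2}{2m}\sum_j \frac{\partial^2 \Psi}{\partial x_j^2} + V\Psi = E\Psi \tag{33}$$

Next, we can eliminate $E$ from Eq. (33) by using Eq. (32). The “simplest” way to do it is to write:

$$E\Psi = i\hbar\,\frac{\partial \Psi}{\partial t} \tag{34}$$

leading to:

$$i\hbar\,\frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m}\sum_j \frac{\partial^2 \Psi}{\partial x_j^2} + V\Psi \tag{35}$$

which is the general time-dependent Schrödinger’s equation. Invoking the “simplest” way to obtain Eq. (34) rules out awkward expressions such as the one obtained by differentiating Eq. (32) twice with respect to time, i.e.:

$$E^2\Psi = -\hbar^2\,\frac{\partial^2 \Psi}{\partial t^2} \tag{36}$$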

4. Deriving a set of generalized Schrödinger’s equations

There are good reasons to believe that classical mechanics is suspect. One of them is the existence of singularities in classical mechanics, such as those exhibited by the mechanical rainbow [16, 17]. If we trust a non-singularity principle stating that “local infinity in physics is not admissible” [18], we arrive at the conclusion that we must build a wave mechanics (nowadays better known as “quantum mechanics”). For this, we decide to start from what we know (actually what we are supposed to know), namely classical mechanics. We are looking for a wave mechanics based on a wave $\Psi(x_j, t)$ which should have the virtue of washing out the singularities exhibited by classical mechanics. The most general form for a wave reads as:


in which $T = T(x_j, t)$ is a complex dimensionless phase. At this stage, our amount of knowledge is supposed to be very limited. We only possess one field $S(x_j, t)$ for classical mechanics and two fields $\Psi(x_j, t)$ and $T(x_j, t)$ for wave mechanics. These fields are the only quantities involved in the problem. Therefore, we have to search for a relationship between $\Psi$ and $S$ (first option), or between $T$ and $S$ (second option). Because $T$ and $S$ possess the same nature (they are fields without being waves), I prefer the second option. Of course, the first option is likely to be valid as well, but it would certainly lead to more complicated derivations and equations.

For the relationship between $T$ and $S$, we could search for $T(S)$ or for $S(T)$. Because wave mechanics ($T$) is assumed to be more general than classical mechanics ($S$), it is apparent that we had better try to determine $T(S)$ rather than the inverse version $S(T)$. We therefore have to explicitly consider $T(x_j, t) = T(S(x_j, t))$. However, this is to be slightly corrected. Indeed, $T$ is dimensionless while $S$ is an action (the action). This will require us to introduce a new constant, which will be denoted $g$.

Now, I invoke a principle that I call the lifting principle (to be commented on a bit more once the demonstration is completed). This principle tells us something very simple, which even looks a bit tautological: classical mechanics is an approximation to wave mechanics. Rather than simply using the argument $S$ in $T(S)$, we then have to look for a function $T(\bar{S})$ in which the functional argument $\bar{S} = \bar{S}(x_j, t)$ reads as:


in which $g$ is a constant having the dimension of an action, $S_1$ is a correcting function, and $\varepsilon$ is a small parameter. To recover classical mechanics from wave mechanics, we shall have to take the limit $\varepsilon \to 0$ so that, the constant $g$ being dismissed, we are left with the field $S$ (and with its equation). Also, we can take $\varepsilon \in \mathbb{R}$. Indeed, if $\varepsilon$ were complex, it would exhibit a phase factor which could be absorbed in $S_1$. Similarly, the prefactor “i” which is introduced for convenience could be absorbed in $S_1$.
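
The absorption argument can be written in a few lines. The sketch below only assumes that the correction enters through the product $\varepsilon S_1$; the primed symbol $S_1'$ and the angle $\theta$ are hypothetical names introduced for illustration.

```latex
\begin{align*}
\varepsilon &= |\varepsilon|\,e^{i\theta}, \qquad \theta \in \mathbb{R}, \\
\varepsilon\,S_1 &= |\varepsilon|\,\big(e^{i\theta} S_1\big) = |\varepsilon|\,S_1',
\qquad S_1' \equiv e^{i\theta} S_1 , \\
i\,\varepsilon\,S_1 &= |\varepsilon|\,\big(i\,e^{i\theta} S_1\big).
\end{align*}
```

Since $S_1$ is an as yet undetermined correcting function, relabelling $S_1' \to S_1$ leaves the form of the argument unchanged with a real small parameter, and the same relabelling disposes of the prefactor $i$.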

The function $T(\bar{S})$ may be explicitly written as:


in which we used a subscript $\varepsilon$ to insist on the fact that $T$ depends on $\varepsilon$. Eq. (39) may give the feeling that we are dealing with a restricted first-order perturbation approach. However, instead of Eq. (38), let us assume:


This can be rewritten as:


which, relabelling, identifies with Eq. (38).

We are now looking for a differential equation satisfied by the wave $\Psi$, involving partial derivatives with respect to $x_j$ and $t$. This equation must be fundamental, that is to say it must contain lowest-order derivatives compatible with the constraints imposed by the problem under study. Once the fundamental equation is obtained, we can of course generate other equations by further differentiating with respect to $x_j$ and $t$, but such extra-equations are said to be non-fundamental.

We begin with the assumption that, besides derivatives with respect to $x_j$, the wave equation only contains the first derivative $\partial\Psi/\partial t$ with respect to time. We shall later comment on the use of higher-order derivatives with respect to time.

The derivative $\partial\Psi/\partial t$ may always be written as:


in which we again use a subscript $\varepsilon$ to insist on the dependence on $\varepsilon$. Also, $K$ is an extra-field (i.e. a function of time and space, but not a dynamical field possessing its own differential equation), possibly a constant, and $\partial\Psi$ represents a set of arguments formed from various derivatives of $\Psi$ with respect to $x_j$:


The set $\partial\Psi$ is infinite and there is a systematic way to generate all arguments of the set. For instance, the subset generated by $\Psi_{ijk}$ contains $\Psi_{ijk}\Psi_i\Psi_j\Psi_k$, $\Psi_{ijk}\Psi_{ij}\Psi_k$, …, and other arguments obtained by using complex conjugations.

We may also express the derivative $\partial\Psi/\partial t$ from Eqs. (37) and (39), so that we obtain:


We rewrite Eq. (44) as:


or, invoking Eq. (42):


But Hamilton-Jacobi’s equation (and the lifting principle) implies that the r.h.s. of Eq. (46) must contain a term with no derivative, associated with $V$ in Eq. (1), and a term involving $(\partial S/\partial x_j)^2$, associated with the first term in the r.h.s. of Eq. (1). These terms have to be involved in the function $f_\varepsilon$. Upon investigation, we find that the term involving $(\partial S/\partial x_j)^2$ can only be generated by $\Psi_{jj}$, which indeed is found to be:


in which:


We therefore set, without any loss of generality:


in which $h_\varepsilon$ is a complementary function, possibly including non-linear terms, and which could also possibly annihilate the terms $a\,\partial^2\Psi/\partial x_j^2$ and $b\Psi$ if, eventually, we were to find that they should be zero.

The evolution Eq. (42) then takes the form:


and our next task is to evaluate $a$ and $b$.

To this purpose, we now return to Eq. (46) and insert in it Eqs. (49) and (47), leading to:




In the classical limit ($\varepsilon \to 0$), Eq. (51) simplifies to:


which must identify with Hamilton-Jacobi’s equation. Under the proviso to be checked later that the r.h.s. of Eq. (52) must be vanishingly small, we then obtain, from the l.h.s.:


in which $T_0 = T_0(S/g)$ and $\bar{S}$ therefore reduces to $S/g$. Eq. (53) implies:


We must now recall that the coefficient $b$ has actually been set as a function $b(x_j, t)$, and Eq. (50) shows that it must pertain to the wave mechanical level. In other words, it does not pertain to the classical mechanical level, that is to say, as a rational demand, we would not like it to depend on $S$. Therefore, $dT_0/d\bar{S}$ must be a constant that we denote as $C_1$.

From Eq. (55), we then have:


With $d^2T_0/d\bar{S}^2 = 0$ (since the first derivative is a constant), Eq. (54) then implies:


Inserting Eqs. (56) and (57) into Eq. (50), we then obtain:


Concerning the constant $C_1$, I have (at least at the present time) no theoretical reason to assign a value to it.

Let $R$ denote the r.h.s. of Eq. (52). We still have to check that it is vanishingly small. With Eq. (57), we obtain:


which is indeed $0$ in the limit $g \to 0$. This implies that $g$ is a small action, actually so small that it could not be detected in a classical framework.

Eq. (58) is the main result of this subsection. It provides a set of generalized Schrödinger’s equations, it being admitted that they are evolution equations (first derivative with respect to time), obtained by a deformation of Hamilton-Jacobi’s equation, according to the lifting principle. The classical Schrödinger’s equation is, in a certain sense, the simplest equation in the set. It is obtained by setting the nonlinear term $h_\varepsilon$ to $0$ and $C_1$ to $1$, while the constant $g$ identifies with Planck’s constant $\hbar$. This is equivalent to saying that in Eqs. (49) and (50), only the $a$- and $b$-terms in the r.h.s. of the equations, required to match Hamilton-Jacobi’s equation in the classical limit, are retained.

Let us note that the function $h_\varepsilon$ in Eq. (58) may be significant because it allows one to introduce non-linear wave equations. Non-linear Schrödinger’s equations in quantum theory are considered in many papers in the literature. For example, they are comprehensively discussed by Doebner and Goldin in [19], and in many references therein. We may also meet such equations in the Bohm-Bub hidden-variables theory [20], or with the Ghirardi-Rimini-Weber equation for spontaneous collapse of the wave function [21]. More generally, non-linear equations may provide a solution to the measurement problem insofar as linear equations, in utmost rigor, do not allow one to get rid of quantum superpositions. This fact has recently been heavily emphasized by R. Penrose in one of his books [22]. A word of caution is however required, namely that, according to Gisin [23], “the Schrödinger evolution is the only quantum evolution that is deterministic and compatible with relativity”. Hence, “the fact that a deterministic evolution compatible with relativity must be linear puts heavy doubts on the possibility to solve the measurement problem […] by adding non linear terms to the Schrödinger equation”.


5. Complementary discussion

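From the generalized Schrödinger’s Eq. (58) we may recover the classical Schrödinger’s equation, as we have commented, by setting $h_\varepsilon = 0$, $C_1 = 1$ and $g = \hbar$, leading to:

$$i\hbar\,\frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m}\sum_j \frac{\partial^2 \Psi}{\partial x_j^2} + V\Psi \tag{60}$$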

This is a first application of the correspondence principle. A second application of this correspondence principle afterward allows one to recover the classical Hamilton-Jacobi’s equation from Schrödinger’s equation, as discussed for instance by Blokhintsev [2]. From the generalized Schrödinger’s equation, we therefore recover the classical Hamilton-Jacobi’s equation by a two-step top-down process, applying the correspondence principle twice. Another approach is to use Eq. (58) as an Ansatz in the form:


and to pursue the game with the correspondence principle to recover, using again a two-step approach, Hamilton-Jacobi’s equation. But the use of an Ansatz is less rigorous than the lifting principle because it carries the risk of making the Ansatz too simple, and therefore of omitting significant terms. Note, however, that we have implicitly made the assumption that the state of the wave is defined by the wave $\psi$ itself, so that we have obtained what is called an evolution equation. The use of a second-order derivative with respect to time would require, for integration, having the state defined by $\psi$ and by its first derivative (with similar considerations for higher-order derivatives with respect to time), so that the result would not be an evolution equation. Therefore, in utmost rigor, what we have demonstrated is that Schrödinger’s equation is the simplest evolution equation satisfying the lifting principle.
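
For reference, the second application of the correspondence principle mentioned above can be sketched in a standard textbook manner. The block assumes the usual linear Schrödinger equation and the substitution $\psi = \exp(iS/\hbar)$ with a generally complex $S$; this is a common illustration and not necessarily the route followed in [2].

```latex
\begin{align*}
&i\hbar\,\frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2m}\,\Delta\psi + V\psi,
\qquad \psi = e^{\,iS/\hbar}, \\
&i\hbar\,\frac{\partial\psi}{\partial t} = -\frac{\partial S}{\partial t}\,\psi,
\qquad
\Delta\psi = \left[\frac{i}{\hbar}\,\Delta S - \frac{1}{\hbar^2}\,(\nabla S)^2\right]\psi, \\
&-\frac{\partial S}{\partial t} = \frac{1}{2m}\,(\nabla S)^2 + V - \frac{i\hbar}{2m}\,\Delta S, \\
&\hbar \to 0:\qquad -\frac{\partial S}{\partial t} = \frac{1}{2m}\,(\nabla S)^2 + V .
\end{align*}
```

The only term that distinguishes the wave equation from Hamilton-Jacobi’s equation in this sketch is proportional to $\hbar$, which is consistent with the requirement that the constant of action be undetectably small at the classical level.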

To clearly emphasize the difference between the correspondence and the lifting principles, let us consider two theories, denoted $T_G$ ($G$ standing for “general”) and $T_A$ ($A$ standing for “approximate”). By taking some kind of limit on $T_G$, we must recover $T_A$, a top-down process that may be denoted as $T_G \to T_A$. We then say that $T_G$ satisfies a correspondence principle with respect to $T_A$. If $T_G$ is unknown and under construction, any valid candidate, say $T_{G1}$, $T_{G2}$, … must satisfy the correspondence principle: $T_{G1} \to T_A$, $T_{G2} \to T_A$, …. If it does not, it is not valid and must be rejected. If several valid candidates are retained, then the discrimination among the candidates may need to rely on other considerations, or may even remain undecidable, such as when dealing with the Duhem-Quine underdetermination of theories by experiments. The lifting principle is a bottom-up process: $T_A \to T_G$. It starts from a theory relying on an equation (or a set of equations) which is acknowledged to be valid within a certain domain of applicability and extends this domain of validity by extending the original equation (or set of equations) under conditions defined by physical requirements.

For example, the lifting principle tells us that classical mechanics is an approximation to quantum mechanics. Therefore, quantum mechanics must indeed satisfy a correspondence principle, meaning that the correspondence principle is contained in the lifting principle. However, as we have seen, it does not identify with it. What we have done to use it is to start from $T_A$ and find a way to reach candidates for $T_G$. However, the word “lifting” may have other meanings, for instance in the theory of nonlinear dynamics when, to study a low-dimensional system, it can be easier to study its elevation in a higher-dimensional system [24, 25]. On the one hand, the higher-dimensional system must satisfy a correspondence principle. On the other hand, it is said that it is obtained as a result of the “lifting” of the low-dimensional system. My choice of the word “lifting” in the context of the present chapter is the result of my borrowing it from the context of chaos theory.

Another point of view may be taken by using a metaphor from Feynman [12] according to which the correspondence principle proceeds from one object to its shadow (and there is one shadow for one object) while the lifting principle proceeds from a shadow to objects (and there are several possible objects for a given shadow). Our results agree with this expectation. We did not reach Schrödinger’s equation, but rather a set of generalized Schrödinger’s equations. The derivation of Schrödinger, and all Schrödinger-like derivations, reach a single result because they use analogies, guesses and trials, with more or less implicit assumptions. Conversely, the use of the lifting principle simultaneously provides the whole set of admissible possibilities with a minimal number of assumptions (namely that we have to deal with an evolution equation). All candidates are reached in a single step.


6. Conclusion

The realm of nonlinear Schrödinger’s equations is very rich, with many applications such as to fluid mechanics, solitons, nonlinear optics and Bose-Einstein condensates. In the present chapter, we have demonstrated, using a lifting principle, that such equations occur naturally as a generalization of Hamilton-Jacobi’s formulation of classical mechanics, without claiming, however, that the nonlinear equations obtained by the lifting process identify with the nonlinear Schrödinger’s equations used in other contexts (this would require another specific study outside the scope of the present chapter). The material presented in this chapter is extracted from a book, namely [26]. It is however presented here under a single roof and might then attract the interest of other readers.


© 2021 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution 3.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite and reference

Gérard Gouesbet (September 28th 2021). Nonlinear Generalized Schrödinger’s Equations by Lifting Hamilton-Jacobi’s Formulation of Classical Mechanics [Online First], IntechOpen, DOI: 10.5772/intechopen.100068. Available from:
