Calculus of variations
































Calculus of variations is a field of mathematical analysis that uses variations, which are small changes in functions and functionals, to find maxima and minima of functionals: mappings from a set of functions to the real numbers.[Note 1] Functionals are often expressed as definite integrals involving functions and their derivatives. Functions that maximize or minimize functionals may be found using the Euler–Lagrange equation of the calculus of variations.


A simple example of such a problem is to find the curve of shortest length connecting two points. If there are no constraints, the solution is obviously a straight line between the points. However, if the curve is constrained to lie on a surface in space, then the solution is less obvious, and possibly many solutions may exist. Such solutions are known as geodesics. A related problem is posed by Fermat's principle: light follows the path of shortest optical length connecting two points, where the optical length depends upon the material of the medium. One corresponding concept in mechanics is the principle of least action.


Many important problems involve functions of several variables. Solutions of boundary value problems for the Laplace equation satisfy the Dirichlet principle. Plateau's problem requires finding a surface of minimal area that spans a given contour in space: a solution can often be found by dipping a frame in a solution of soap suds. Although such experiments are relatively easy to perform, their mathematical interpretation is far from simple: there may be more than one locally minimizing surface, and they may have non-trivial topology.




Contents

  • 1 History
  • 2 Extrema
  • 3 Euler–Lagrange equation
    • 3.1 Example
  • 4 Beltrami identity
  • 5 Du Bois-Reymond's theorem
  • 6 Lavrentiev phenomenon
  • 7 Functions of several variables
    • 7.1 Dirichlet's principle
    • 7.2 Generalization to other boundary value problems
  • 8 Eigenvalue problems
    • 8.1 Sturm–Liouville problems
    • 8.2 Eigenvalue problems in several dimensions
  • 9 Applications
    • 9.1 Fermat's principle
      • 9.1.1 Snell's law
      • 9.1.2 Fermat's principle in three dimensions
        • 9.1.2.1 Connection with the wave equation
    • 9.2 Action principle
  • 10 Variations and sufficient condition for a minimum
  • 11 See also
  • 12 Notes
  • 13 References
  • 14 Further reading
  • 15 External links





History


The calculus of variations may be said to begin with Newton's minimal resistance problem in 1687, followed by the brachistochrone curve problem raised by Johann Bernoulli (1696).[2] It immediately occupied the attention of Jakob Bernoulli and the Marquis de l'Hôpital, but Leonhard Euler first elaborated the subject, beginning in 1733. Lagrange was influenced by Euler's work to contribute significantly to the theory. After Euler saw the 1755 work of the 19-year-old Lagrange, Euler dropped his own partly geometric approach in favor of Lagrange's purely analytic approach and renamed the subject the calculus of variations in his 1756 lecture Elementa Calculi Variationum.[3][4][Note 2]


Legendre (1786) laid down a method, not entirely satisfactory, for the discrimination of maxima and minima. Isaac Newton and Gottfried Leibniz also gave some early attention to the subject.[5] To this discrimination Vincenzo Brunacci (1810), Carl Friedrich Gauss (1829), Siméon Poisson (1831), Mikhail Ostrogradsky (1834), and Carl Jacobi (1837) have been among the contributors. An important general work is that of Sarrus (1842), which was condensed and improved by Cauchy (1844). Other valuable treatises and memoirs have been written by Strauch (1849), Jellett (1850), Otto Hesse (1857), Alfred Clebsch (1858), and Carll (1885), but perhaps the most important work of the century is that of Weierstrass. His celebrated course on the theory is epoch-making, and it may be asserted that he was the first to place it on a firm and unquestionable foundation. The 20th and the 23rd Hilbert problems, published in 1900, encouraged further development.[5]


In the 20th century David Hilbert, Emmy Noether, Leonida Tonelli, Henri Lebesgue and Jacques Hadamard among others made significant contributions.[5] Marston Morse applied calculus of variations in what is now called Morse theory.[6] Lev Pontryagin, Ralph Rockafellar and F. H. Clarke developed new mathematical tools for the calculus of variations in optimal control theory.[6] The dynamic programming of Richard Bellman is an alternative to the calculus of variations.[7][8][9]



Extrema


The calculus of variations is concerned with the maxima or minima (collectively called extrema) of functionals. A functional maps functions to scalars, so functionals have been described as "functions of functions." Functionals have extrema with respect to the elements y of a given function space defined over a given domain. A functional J[y] is said to have an extremum at the function f if ΔJ = J[y] − J[f] has the same sign for all y in an arbitrarily small neighborhood of f.[Note 3] The function f is called an extremal function or extremal.[Note 4] The extremum J[f] is called a local maximum if ΔJ ≤ 0 everywhere in an arbitrarily small neighborhood of f, and a local minimum if ΔJ ≥ 0 there. For a function space of continuous functions, extrema of corresponding functionals are called weak extrema or strong extrema, depending on whether the first derivatives of the continuous functions are respectively all continuous or not.[11]


Both strong and weak extrema of functionals are for a space of continuous functions but weak extrema have the additional requirement that the first derivatives of the functions in the space be continuous. Thus a strong extremum is also a weak extremum, but the converse may not hold. Finding strong extrema is more difficult than finding weak extrema.[12] An example of a necessary condition that is used for finding weak extrema is the Euler–Lagrange equation.[13][Note 5]



Euler–Lagrange equation



Finding the extrema of functionals is similar to finding the maxima and minima of functions. The maxima and minima of a function may be located by finding the points where its derivative vanishes (i.e., is equal to zero). The extrema of functionals may be obtained by finding functions where the functional derivative is equal to zero. This leads to solving the associated Euler–Lagrange equation.[Note 6]


Consider the functional


$$J[y]=\int_{x_1}^{x_2}L(x,y(x),y'(x))\,dx,$$

where

  • x1, x2 are constants,
  • y(x) is twice continuously differentiable,
  • y′(x) = dy/dx,
  • L(x, y(x), y′(x)) is twice continuously differentiable with respect to its arguments x, y, y′.


If the functional J[y ] attains a local minimum at f , and η(x) is an arbitrary function that has at least one derivative and vanishes at the endpoints x1 and x2 , then for any number ε close to 0,


$$J[f]\leq J[f+\varepsilon\eta].$$

The term εη is called the variation of the function f and is denoted by δf .[1]


Substituting  f + εη for y  in the functional J[ y ] , the result is a function of ε,


$$\Phi(\varepsilon)=J[f+\varepsilon\eta].$$

Since the functional J[ y ] has a minimum for y = f , the function Φ(ε) has a minimum at ε = 0 and thus,[Note 7]


$$\Phi'(0)\equiv\left.\frac{d\Phi}{d\varepsilon}\right|_{\varepsilon=0}=\int_{x_1}^{x_2}\left.\frac{dL}{d\varepsilon}\right|_{\varepsilon=0}dx=0.$$

Taking the total derivative of L[x, y, y′], where y = f + εη and y′ = f′ + εη′ are functions of ε but x is not,

$$\frac{dL}{d\varepsilon}=\frac{\partial L}{\partial y}\frac{dy}{d\varepsilon}+\frac{\partial L}{\partial y'}\frac{dy'}{d\varepsilon}$$

and since dy/dε = η and dy′/dε = η′,

$$\frac{dL}{d\varepsilon}=\frac{\partial L}{\partial y}\eta+\frac{\partial L}{\partial y'}\eta'.$$

Therefore,


$$\begin{aligned}
\int_{x_1}^{x_2}\left.\frac{dL}{d\varepsilon}\right|_{\varepsilon=0}dx
&=\int_{x_1}^{x_2}\left(\frac{\partial L}{\partial f}\eta+\frac{\partial L}{\partial f'}\eta'\right)dx\\
&=\int_{x_1}^{x_2}\left(\frac{\partial L}{\partial f}\eta+\frac{d}{dx}\left(\frac{\partial L}{\partial f'}\eta\right)-\eta\frac{d}{dx}\frac{\partial L}{\partial f'}\right)dx\\
&=\int_{x_1}^{x_2}\left(\frac{\partial L}{\partial f}\eta-\eta\frac{d}{dx}\frac{\partial L}{\partial f'}\right)dx+\left.\frac{\partial L}{\partial f'}\eta\right|_{x_1}^{x_2}
\end{aligned}$$

where L[x, y, y ′] → L[x, f, f ′] when ε = 0 and we have used integration by parts. The last term vanishes because η = 0 at x1 and x2 by definition. Also, as previously mentioned the left side of the equation is zero so that


$$\int_{x_1}^{x_2}\eta\left(\frac{\partial L}{\partial f}-\frac{d}{dx}\frac{\partial L}{\partial f'}\right)dx=0.$$

According to the fundamental lemma of calculus of variations, the part of the integrand in parentheses is zero, i.e.


$$\frac{\partial L}{\partial f}-\frac{d}{dx}\frac{\partial L}{\partial f'}=0$$

which is called the Euler–Lagrange equation. The left hand side of this equation is called the functional derivative of J[f] and is denoted δJ/δf(x) .


In general this gives a second-order ordinary differential equation which can be solved to obtain the extremal function f(x) . The Euler–Lagrange equation is a necessary, but not sufficient, condition for an extremum J[f]. A sufficient condition for a minimum is given in the section Variations and sufficient condition for a minimum.
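The vanishing of Φ′(0) at an extremal can be checked numerically. The following is a minimal sketch under stated assumptions: the integrand L = y′² + y² on [0, 1], the boundary values y(0) = 0 and y(1) = 1, the test function η(x) = sin(πx), and the helper name `integrate` are all chosen here for illustration and do not come from the text. For this integrand the Euler–Lagrange equation reads y″ = y, so the extremal is f(x) = sinh(x)/sinh(1).

```python
import math

def integrate(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

# Assumed extremal of L = y'^2 + y^2 with y(0) = 0, y(1) = 1 (solves y'' = y).
def f(x):
    return math.sinh(x) / math.sinh(1.0)

def df(x):
    return math.cosh(x) / math.sinh(1.0)

# Assumed variation direction; it vanishes at both endpoints.
def eta(x):
    return math.sin(math.pi * x)

def deta(x):
    return math.pi * math.cos(math.pi * x)

def phi(eps):
    """Phi(eps) = J[f + eps * eta] for L = y'^2 + y^2 on [0, 1]."""
    def integrand(x):
        return (df(x) + eps * deta(x)) ** 2 + (f(x) + eps * eta(x)) ** 2
    return integrate(integrand, 0.0, 1.0)

step = 1e-4
dphi0 = (phi(step) - phi(-step)) / (2 * step)  # central-difference estimate of Phi'(0)
print("Phi'(0) is approximately", dphi0)
print("Phi(0.1) - Phi(0) =", phi(0.1) - phi(0.0))
print("Phi(-0.1) - Phi(0) =", phi(-0.1) - phi(0.0))
```

The derivative estimate should be zero up to quadrature and rounding error, while both finite perturbations raise the value of the functional, as expected at a minimum.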



Example


In order to illustrate this process, consider the problem of finding the extremal function y = f (x) , which is the shortest curve that connects two points (x1, y1) and (x2, y2) . The arc length of the curve is given by


$$A[y]=\int_{x_1}^{x_2}\sqrt{1+[y'(x)]^2}\,dx,$$

with


$$y'(x)=\frac{dy}{dx},\quad y_1=f(x_1),\quad y_2=f(x_2).$$

The Euler–Lagrange equation will now be used to find the extremal function f (x) that minimizes the functional A[y ] .


$$\frac{\partial L}{\partial f}-\frac{d}{dx}\frac{\partial L}{\partial f'}=0$$

with


$$L=\sqrt{1+[f'(x)]^2}.$$

Since f does not appear explicitly in L , the first term in the Euler–Lagrange equation vanishes for all f (x) and thus,


$$\frac{d}{dx}\frac{\partial L}{\partial f'}=0.$$

Substituting for L and taking the derivative,


$$\frac{d}{dx}\,\frac{f'(x)}{\sqrt{1+[f'(x)]^2}}=0.$$

Thus


$$\frac{f'(x)}{\sqrt{1+[f'(x)]^2}}=c,$$

for some constant c. Then


$$\frac{[f'(x)]^2}{1+[f'(x)]^2}=c^2,$$

where


$$0\leq c^2<1.$$

Solving, we get


$$[f'(x)]^2=\frac{c^2}{1-c^2}$$

which implies that


$$f'(x)=m$$

is a constant and therefore that the shortest curve that connects two points (x1, y1) and (x2, y2) is

$$f(x)=mx+b\qquad\text{with}\quad m=\frac{y_2-y_1}{x_2-x_1}\quad\text{and}\quad b=\frac{x_2y_1-x_1y_2}{x_2-x_1}$$

and we have thus found the extremal function f(x) that minimizes the functional A[y] so that A[f] is a minimum. Note that y = f(x) is the equation for a straight line, in other words, the shortest distance between two points is a straight line.[Note 8]
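The result can be illustrated numerically by comparing the arc length of the straight line with that of nearby curves through the same endpoints. This is a rough sketch, not a proof: the endpoints (0, 0) and (1, 2), the sine-shaped perturbation, and the function names `arc_length` and `bent` are assumptions made for the example.

```python
import math

def arc_length(y, x1, x2, n=20000):
    """Approximate the arc length of y(x) on [x1, x2] by a fine polyline."""
    h = (x2 - x1) / n
    total = 0.0
    prev = y(x1)
    for i in range(1, n + 1):
        cur = y(x1 + i * h)
        total += math.hypot(h, cur - prev)
        prev = cur
    return total

# Hypothetical endpoints chosen for illustration.
x1, y1, x2, y2 = 0.0, 0.0, 1.0, 2.0
m = (y2 - y1) / (x2 - x1)
b = (x2 * y1 - x1 * y2) / (x2 - x1)

def line(x):
    return m * x + b

def bent(eps):
    """The line plus eps * sin(...), a perturbation that keeps both endpoints fixed."""
    return lambda x: line(x) + eps * math.sin(math.pi * (x - x1) / (x2 - x1))

straight = arc_length(line, x1, x2)
excess = {eps: arc_length(bent(eps), x1, x2) - straight for eps in (0.5, 0.1, 0.01)}

print("straight line:", straight, "distance formula:", math.hypot(x2 - x1, y2 - y1))
for eps, extra in excess.items():
    print("eps =", eps, "extra length =", extra)
```

Every perturbed curve comes out longer than the line, and the extra length shrinks roughly like ε², which is the behaviour expected near a minimum where the first variation vanishes.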



Beltrami identity


In physics problems it frequently turns out that ∂L/∂x = 0, i.e., the integrand depends on x only through y(x) and y′(x), and x does not appear separately. In that case, the Euler–Lagrange equation can be simplified to the Beltrami identity:[14]

$$L-f'\frac{\partial L}{\partial f'}=C,$$

where C is a constant. The left-hand side is, up to sign, the Legendre transformation of L with respect to f′.

The intuition behind this result is that, if the variable x is actually time, then the statement ∂L/∂x = 0 implies that the Lagrangian is time-independent. By Noether's theorem, there is an associated conserved quantity: the Hamiltonian, which (often) coincides with the energy of the system. This is (minus) the constant in Beltrami's identity.
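The identity follows from the Euler–Lagrange equation in a few lines. The derivation below is a standard sketch, assuming that f is twice continuously differentiable, as in the preceding sections.

```latex
% Total derivative of L along the extremal, using \partial L / \partial x = 0:
\frac{dL}{dx}
  = \frac{\partial L}{\partial f}\,f' + \frac{\partial L}{\partial f'}\,f''.

% Replace \partial L / \partial f using the Euler--Lagrange equation
% \partial L / \partial f = \frac{d}{dx}\frac{\partial L}{\partial f'}:
\frac{dL}{dx}
  = f'\,\frac{d}{dx}\frac{\partial L}{\partial f'} + f''\,\frac{\partial L}{\partial f'}
  = \frac{d}{dx}\left(f'\,\frac{\partial L}{\partial f'}\right).

% Hence the difference has zero derivative and is constant:
\frac{d}{dx}\left(L - f'\,\frac{\partial L}{\partial f'}\right) = 0
\quad\Longrightarrow\quad
L - f'\,\frac{\partial L}{\partial f'} = C.
```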



Du Bois-Reymond's theorem


The discussion thus far has assumed that extremal functions possess two continuous derivatives, although the existence of the integral J requires only first derivatives of trial functions. The condition that the first variation vanishes at an extremal may be regarded as a weak form of the Euler–Lagrange equation. The theorem of Du Bois-Reymond asserts that this weak form implies the strong form. If L has continuous first and second derivatives with respect to all of its arguments, and if


$$\frac{\partial^2 L}{\partial f'^2}\neq 0,$$

then f has two continuous derivatives, and it satisfies the Euler–Lagrange equation.



Lavrentiev phenomenon


Hilbert was the first to give good conditions for the Euler–Lagrange equations to give a stationary solution. Within a convex area and a positive thrice differentiable Lagrangian the solutions are composed of a countable collection of sections that either go along the boundary or satisfy the Euler–Lagrange equations in the interior.


However Lavrentiev in 1926 showed that there are circumstances where there is no optimum solution but one can be approached arbitrarily closely by increasing numbers of sections. For instance the following:



$$L(t,x,x')=(x^3-t)^2x'^6,$$

$$x(0)=0,\quad x(1)=1.$$


Here a zig zag path gives a better solution than any smooth path and increasing the number of sections improves the solution.



Functions of several variables


For example, if φ(x,y) denotes the displacement of a membrane above the domain D in the x,y plane, then its potential energy is proportional to its surface area:


$$U[\varphi]=\iint_D\sqrt{1+\nabla\varphi\cdot\nabla\varphi}\,dx\,dy.$$

Plateau's problem consists of finding a function that minimizes the surface area while assuming prescribed values on the boundary of D; the solutions are called minimal surfaces. The Euler–Lagrange equation for this problem is nonlinear:


$$\varphi_{xx}(1+\varphi_y^2)+\varphi_{yy}(1+\varphi_x^2)-2\varphi_x\varphi_y\varphi_{xy}=0.$$

See Courant (1950) for details.



Dirichlet's principle


It is often sufficient to consider only small displacements of the membrane, whose energy difference from no displacement is approximated by


$$V[\varphi]=\frac{1}{2}\iint_D\nabla\varphi\cdot\nabla\varphi\,dx\,dy.$$

The functional V is to be minimized among all trial functions φ that assume prescribed values on the boundary of D. If u is the minimizing function and v is an arbitrary smooth function that vanishes on the boundary of D, then the first variation of V[u + εv] must vanish:

$$\left.\frac{d}{d\varepsilon}V[u+\varepsilon v]\right|_{\varepsilon=0}=\iint_D\nabla u\cdot\nabla v\,dx\,dy=0.$$

Provided that u has two derivatives, we may apply the divergence theorem to obtain


$$\iint_D\nabla\cdot(v\nabla u)\,dx\,dy=\iint_D\left(\nabla u\cdot\nabla v+v\nabla\cdot\nabla u\right)dx\,dy=\int_C v\frac{\partial u}{\partial n}\,ds,$$

where C is the boundary of D, s is arclength along C and ∂u/∂n is the normal derivative of u on C. Since v vanishes on C and the first variation vanishes, the result is

$$\iint_D v\nabla\cdot\nabla u\,dx\,dy=0$$

for all smooth functions v that vanish on the boundary of D. The proof for the case of one dimensional integrals may be adapted to this case to show that

$$\nabla\cdot\nabla u=0$$

in D.
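A discrete analogue makes the minimizing property concrete. The sketch below replaces V by a sum of squared differences over the edges of a uniform grid on the unit square; the grid size, the harmonic function u = x² − y², the perturbation v, and the function names are assumptions introduced for illustration. Because the five-point discrete Laplacian of x² − y² is zero, u also minimizes the discrete energy among grid functions with the same boundary values.

```python
def dirichlet_energy(u):
    """Discrete Dirichlet energy: half the sum of squared differences across
    all horizontal and vertical grid edges. On a uniform 2-D grid the mesh
    width cancels, since (du / h)^2 * h^2 = du^2."""
    n = len(u)
    total = 0.0
    for i in range(n):
        for j in range(n):
            if i + 1 < n:
                total += (u[i + 1][j] - u[i][j]) ** 2
            if j + 1 < n:
                total += (u[i][j + 1] - u[i][j]) ** 2
    return 0.5 * total

n = 21
xs = [k / (n - 1) for k in range(n)]

# Assumed harmonic function u = x^2 - y^2 sampled on the grid.
u = [[x * x - y * y for y in xs] for x in xs]

# Assumed perturbation v; it vanishes on the boundary of the unit square.
v = [[x * (1 - x) * y * (1 - y) for y in xs] for x in xs]

def perturbed(eps):
    """Grid values of u + eps * v."""
    return [[u[i][j] + eps * v[i][j] for j in range(n)] for i in range(n)]

base = dirichlet_energy(u)
increase = {eps: dirichlet_energy(perturbed(eps)) - base for eps in (1.0, -1.0, 0.1)}
for eps, extra in increase.items():
    print("eps =", eps, "energy increase =", extra)
```

The increase is positive for every ε tried and is nearly the same for ε = 1 and ε = −1, which reflects the vanishing of the term linear in ε.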

The difficulty with this reasoning is the assumption that the minimizing function u must have two derivatives. Riemann argued that the existence of a smooth minimizing function was assured by the connection with the physical problem: membranes do indeed assume configurations with minimal potential energy. Riemann named this idea the Dirichlet principle in honor of his teacher Peter Gustav Lejeune Dirichlet. However Weierstrass gave an example of a variational problem with no solution: minimize


$$W[\varphi]=\int_{-1}^{1}(x\varphi')^2\,dx$$

among all functions φ that satisfy φ(−1) = −1 and φ(1) = 1. W can be made arbitrarily small by choosing piecewise linear functions that make a transition between −1 and 1 in a small neighborhood of the origin. However, there is no function that makes W = 0.[15] Eventually it was shown that Dirichlet's principle is valid, but it requires a sophisticated application of the regularity theory for elliptic partial differential equations; see Jost and Li-Jost (1998).
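The behaviour of W along such a sequence can be computed directly. In this sketch the transition half-width δ and the function name `weierstrass_W` are introduced for illustration: φ equals −1 for x < −δ, x/δ for |x| ≤ δ, and 1 for x > δ, so that φ′ = 1/δ on the transition interval and W[φ] = 2δ/3 by direct integration.

```python
def weierstrass_W(delta, n=100000):
    """Midpoint-rule value of W[phi] = integral over [-1, 1] of (x * phi'(x))^2
    for the piecewise linear phi with transition half-width delta."""
    h = 2.0 / n
    total = 0.0
    for i in range(n):
        x = -1.0 + (i + 0.5) * h
        slope = 1.0 / delta if abs(x) < delta else 0.0  # phi'(x)
        total += (x * slope) ** 2 * h
    return total

values = {delta: weierstrass_W(delta) for delta in (0.5, 0.1, 0.01)}
for delta, w in values.items():
    print("delta =", delta, "W =", w, "2*delta/3 =", 2 * delta / 3)
```

The computed values track 2δ/3 and tend to zero with δ, yet no admissible φ attains zero: W = 0 would force φ′ = 0 away from the origin, which is incompatible with continuity and the boundary values.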



Generalization to other boundary value problems


A more general expression for the potential energy of a membrane is


$$V[\varphi]=\iint_D\left[\frac{1}{2}\nabla\varphi\cdot\nabla\varphi+f(x,y)\varphi\right]dx\,dy+\int_C\left[\frac{1}{2}\sigma(s)\varphi^2+g(s)\varphi\right]ds.$$

This corresponds to an external force density f(x,y) in D, an external force g(s) on the boundary C, and elastic forces with modulus σ(s) acting on C. The function that minimizes the potential energy with no restriction on its boundary values will be denoted by u. Provided that f and g are continuous, regularity theory implies that the minimizing function u will have two derivatives. In taking the first variation, no boundary condition need be imposed on the increment v. The first variation of V[u + εv] is given by

$$\iint_D\left[\nabla u\cdot\nabla v+fv\right]dx\,dy+\int_C\left[\sigma uv+gv\right]ds=0.$$

If we apply the divergence theorem, the result is


$$\iint_D\left[-v\nabla\cdot\nabla u+vf\right]dx\,dy+\int_C v\left[\frac{\partial u}{\partial n}+\sigma u+g\right]ds=0.$$

If we first set v=0 on C, the boundary integral vanishes, and we conclude as before that


$$-\nabla\cdot\nabla u+f=0$$

in D. Then if we allow v to assume arbitrary boundary values, this implies that u must satisfy the boundary condition


$$\frac{\partial u}{\partial n}+\sigma u+g=0$$

on C. Note that this boundary condition is a consequence of the minimizing property of u: it is not imposed beforehand. Such conditions are called natural boundary conditions.


The preceding reasoning is not valid if σ vanishes identically on C. In such a case, we could allow a trial function φ ≡ c, where c is a constant. For such a trial function,

$$V[c]=c\left[\iint_D f\,dx\,dy+\int_C g\,ds\right].$$

By appropriate choice of c, V can assume any value unless the quantity inside the brackets vanishes. Therefore, the variational problem is meaningless unless


$$\iint_D f\,dx\,dy+\int_C g\,ds=0.$$

This condition implies that net external forces on the system are in equilibrium. If these forces are in equilibrium, then the variational problem has a solution, but it is not unique, since an arbitrary constant may be added. Further details and examples are in Courant and Hilbert (1953).



Eigenvalue problems


Both one-dimensional and multi-dimensional eigenvalue problems can be formulated as variational problems.



Sturm–Liouville problems



The Sturm–Liouville eigenvalue problem involves a general quadratic form


$$Q[\varphi]=\int_{x_1}^{x_2}\left[p(x)\varphi'(x)^2+q(x)\varphi(x)^2\right]dx,$$

where φ is restricted to functions that satisfy the boundary conditions


$$\varphi(x_1)=0,\quad\varphi(x_2)=0.$$

Let R be a normalization integral


$$R[\varphi]=\int_{x_1}^{x_2}r(x)\varphi(x)^2\,dx.$$

The functions p(x) and r(x) are required to be everywhere positive and bounded away from zero. The primary variational problem is to minimize the ratio Q/R among all φ satisfying the endpoint conditions. It is shown below that the Euler–Lagrange equation for the minimizing u is

$$-(pu')'+qu-\lambda ru=0,$$

where λ is the quotient

$$\lambda=\frac{Q[u]}{R[u]}.$$

It can be shown (see Gelfand and Fomin 1963) that the minimizing u has two derivatives and satisfies the Euler–Lagrange equation. The associated λ will be denoted by λ1; it is the lowest eigenvalue for this equation and boundary conditions. The associated minimizing function will be denoted by u1(x). This variational characterization of eigenvalues leads to the Rayleigh–Ritz method: choose an approximating u as a linear combination of basis functions (for example trigonometric functions) and carry out a finite-dimensional minimization among such linear combinations. This method is often surprisingly accurate.
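A small Rayleigh–Ritz computation illustrates the accuracy claim. The sketch below assumes the model problem p = 1, q = 0, r = 1 on [0, 1], whose exact lowest eigenvalue is π², and uses the polynomial basis x(1 − x) and x²(1 − x)², which satisfies the endpoint conditions; the problem, the basis, and the helper names `simpson` and `gram` are illustrative choices, not taken from the text. Minimizing Q/R over the span of two basis functions reduces to the smaller root of det(A − λB) = 0, where A and B are the 2×2 matrices of Q and R in that basis.

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

# Assumed basis functions (both vanish at 0 and 1) and their derivatives.
basis = [lambda x: x * (1 - x), lambda x: (x * (1 - x)) ** 2]
dbasis = [lambda x: 1 - 2 * x, lambda x: 2 * x * (1 - x) * (1 - 2 * x)]

def gram(funcs):
    """Matrix of integrals of funcs[i] * funcs[j] over [0, 1]."""
    return [[simpson(lambda x: fi(x) * fj(x), 0.0, 1.0) for fj in funcs]
            for fi in funcs]

A = gram(dbasis)  # matrix of Q for p = 1, q = 0
B = gram(basis)   # matrix of R for r = 1

# One trial function: the Rayleigh quotient itself.
single = A[0][0] / B[0][0]

# Two trial functions: smaller root of det(A - lam * B) = c2*lam^2 + c1*lam + c0.
c2 = B[0][0] * B[1][1] - B[0][1] ** 2
c1 = -(A[0][0] * B[1][1] + A[1][1] * B[0][0] - 2 * A[0][1] * B[0][1])
c0 = A[0][0] * A[1][1] - A[0][1] ** 2
lam1 = (-c1 - math.sqrt(c1 * c1 - 4 * c2 * c0)) / (2 * c2)

print("one basis function: ", single)
print("two basis functions:", lam1)
print("exact pi^2:         ", math.pi ** 2)
```

Both estimates lie above π², as the variational characterization requires, and adding the second basis function brings the estimate much closer to the exact value.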


The next smallest eigenvalue and eigenfunction can be obtained by minimizing Q under the additional constraint


$$\int_{x_1}^{x_2}r(x)u_1(x)\varphi(x)\,dx=0.$$

This procedure can be extended to obtain the complete sequence of eigenvalues and eigenfunctions for the problem.


The variational problem also applies to more general boundary conditions. Instead of requiring that φ vanish at the endpoints, we may not impose any condition at the endpoints, and set


$$Q[\varphi]=\int_{x_1}^{x_2}\left[p(x)\varphi'(x)^2+q(x)\varphi(x)^2\right]dx+a_1\varphi(x_1)^2+a_2\varphi(x_2)^2,$$

where a1 and a2 are arbitrary. If we set φ = u + εv, the first variation for the ratio Q/R is

$$V_1=\frac{2}{R[u]}\left(\int_{x_1}^{x_2}\left[p(x)u'(x)v'(x)+q(x)u(x)v(x)-\lambda r(x)u(x)v(x)\right]dx+a_1u(x_1)v(x_1)+a_2u(x_2)v(x_2)\right),$$

where λ is given by the ratio Q[u]/R[u] as previously. After integration by parts,

$$\frac{R[u]}{2}V_1=\int_{x_1}^{x_2}v(x)\left[-(pu')'+qu-\lambda ru\right]dx+v(x_1)\left[-p(x_1)u'(x_1)+a_1u(x_1)\right]+v(x_2)\left[p(x_2)u'(x_2)+a_2u(x_2)\right].$$

If we first require that v vanish at the endpoints, the first variation will vanish for all such v only if


$$-(pu')'+qu-\lambda ru=0\quad\text{for}\quad x_1<x<x_2.$$

If u satisfies this condition, then the first variation will vanish for arbitrary v only if


$$-p(x_1)u'(x_1)+a_1u(x_1)=0,\quad\text{and}\quad p(x_2)u'(x_2)+a_2u(x_2)=0.$$

These latter conditions are the natural boundary conditions for this problem, since they are not imposed on trial functions for the minimization, but are instead a consequence of the minimization.
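A simple special case shows how the natural boundary conditions select the eigenfunctions. The choices p = 1, q = 0, r = 1 and a1 = a2 = 0 below are assumptions made for illustration.

```latex
% With p = 1, q = 0, r = 1 and a_1 = a_2 = 0 the Euler--Lagrange equation
% and the natural boundary conditions reduce to a Neumann problem:
-u'' = \lambda u \quad (x_1 < x < x_2), \qquad u'(x_1) = 0, \quad u'(x_2) = 0.

% Its eigenfunctions and eigenvalues are
u_k(x) = \cos\!\left(\frac{k\pi\,(x - x_1)}{x_2 - x_1}\right), \qquad
\lambda_k = \left(\frac{k\pi}{x_2 - x_1}\right)^{2}, \qquad k = 0, 1, 2, \dots

% The lowest eigenvalue is \lambda_0 = 0 with constant eigenfunction,
% consistent with Q[\varphi] = \int_{x_1}^{x_2} \varphi'(x)^2\,dx \ge 0
% and Q[\text{const}] = 0.
```

No condition was imposed on the trial functions at the endpoints, yet every minimizer has vanishing derivative there.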



Eigenvalue problems in several dimensions


Eigenvalue problems in higher dimensions are defined in analogy with the one-dimensional case. For example, given a domain D with boundary B in three dimensions we may define


$$Q[\varphi]=\iiint_D p(X)\nabla\varphi\cdot\nabla\varphi+q(X)\varphi^2\,dx\,dy\,dz+\iint_B\sigma(S)\varphi^2\,dS,$$

and

$$R[\varphi]=\iiint_D r(X)\varphi(X)^2\,dx\,dy\,dz.$$

Let u be the function that minimizes the quotient Q[φ]/R[φ], with no condition prescribed on the boundary B. The Euler–Lagrange equation satisfied by u is

$$-\nabla\cdot(p(X)\nabla u)+q(X)u-\lambda r(X)u=0,$$

where

$$\lambda=\frac{Q[u]}{R[u]}.$$

The minimizing u must also satisfy the natural boundary condition

$$p(S)\frac{\partial u}{\partial n}+\sigma(S)u=0,$$

on the boundary B. This result depends upon the regularity theory for elliptic partial differential equations; see Jost and Li-Jost (1998) for details. Many extensions, including completeness results, asymptotic properties of the eigenvalues and results concerning the nodes of the eigenfunctions are in Courant and Hilbert (1953).



Applications



Some applications of the calculus of variations include:

  • The derivation of the catenary shape
  • Newton's minimal resistance problem
  • The brachistochrone problem
  • Isoperimetric problems
  • Geodesics on surfaces
  • Minimal surfaces and Plateau's problem
  • Optimal control



Fermat's principle


Fermat's principle states that light takes a path that (locally) minimizes the optical length between its endpoints. If the x-coordinate is chosen as the parameter along the path, and y = f(x) along the path, then the optical length is given by

$$A[f]=\int_{x=x_0}^{x_1}n(x,f(x))\sqrt{1+f'(x)^2}\,dx,$$

where the refractive index n(x,y) depends upon the material. If we try f(x) = f0(x) + εf1(x), then the first variation of A (the derivative of A with respect to ε) is

$$\delta A[f_0,f_1]=\int_{x=x_0}^{x_1}\left[\frac{n(x,f_0)f_0'(x)f_1'(x)}{\sqrt{1+f_0'(x)^2}}+n_y(x,f_0)f_1\sqrt{1+f_0'(x)^2}\right]dx.$$

After integration by parts of the first term within brackets, we obtain the Euler–Lagrange equation

$$-\frac{d}{dx}\left[\frac{n(x,f_0)f_0'}{\sqrt{1+f_0'^2}}\right]+n_y(x,f_0)\sqrt{1+f_0'(x)^2}=0.$$

The light rays may be determined by integrating this equation. This formalism is used in the context of Lagrangian optics and Hamiltonian optics.
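As a rough numerical illustration of the first-variation formula above, the following Python sketch compares the bracketed integral with a finite-difference derivative of A with respect to ε. The refractive index n(x, y) = 1 + 0.5y, the trial path f0(x) = x², and the perturbation f1(x) = sin(πx) are arbitrary assumptions chosen only for the example; they do not come from the text.

```python
import math

def simpson(g, a, b, m=2000):
    """Composite Simpson rule on [a, b] with m (even) subintervals."""
    step = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * step)
    return total * step / 3

# Assumed (hypothetical) medium: index increases linearly with height y.
def n(x, y):
    return 1.0 + 0.5 * y

def n_y(x, y):          # partial derivative of n with respect to y
    return 0.5

x0, x1 = 0.0, 1.0
f0 = lambda x: x * x                              # trial path (assumption)
df0 = lambda x: 2.0 * x
f1 = lambda x: math.sin(math.pi * x)              # perturbation, zero at both ends
df1 = lambda x: math.pi * math.cos(math.pi * x)

def optical_length(eps):
    """A[f0 + eps*f1], evaluated by quadrature."""
    return simpson(
        lambda x: n(x, f0(x) + eps * f1(x))
        * math.sqrt(1.0 + (df0(x) + eps * df1(x)) ** 2),
        x0, x1)

def first_variation():
    """The integral formula for delta A[f0, f1]."""
    def integrand(x):
        root = math.sqrt(1.0 + df0(x) ** 2)
        return (n(x, f0(x)) * df0(x) * df1(x) / root
                + n_y(x, f0(x)) * f1(x) * root)
    return simpson(integrand, x0, x1)

eps = 1e-4
numeric = (optical_length(eps) - optical_length(-eps)) / (2 * eps)
formula = first_variation()
print(numeric, formula)   # the two values should agree closely
```

Because f0 here is not a light ray, the first variation is generally nonzero; the point is only that the formula and the derivative with respect to ε agree.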



Snell's law


There is a discontinuity of the refractive index when light enters or leaves a lens. Let



$$n(x,y)=n_{-}\quad\text{if}\quad x<0,$$

$$n(x,y)=n_{+}\quad\text{if}\quad x>0,$$


where $n_{-}$ and $n_{+}$ are constants. Then the Euler–Lagrange equation holds as before in the region where x<0 or x>0, and in fact the path is a straight line there, since the refractive index is constant. At x=0, f must be continuous, but f' may be discontinuous. After integration by parts in the separate regions and using the Euler–Lagrange equations, the first variation takes the form


$$\delta A[f_0,f_1]=f_1(0)\left[n_{-}\frac{f_0'(0_{-})}{\sqrt{1+f_0'(0_{-})^2}}-n_{+}\frac{f_0'(0_{+})}{\sqrt{1+f_0'(0_{+})^2}}\right].$$

The factor multiplying $n_{-}$ is the sine of the angle of the incident ray with the x axis, and the factor multiplying $n_{+}$ is the sine of the angle of the refracted ray with the x axis. Snell's law for refraction requires that these terms be equal. As this calculation demonstrates, Snell's law is equivalent to the vanishing of the first variation of the optical path length.
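The equivalence can be checked numerically with a small sketch. Assuming two hypothetical endpoints on either side of the interface x = 0 and illustrative indices n₋ = 1.0 and n₊ = 1.5, the code below minimizes the optical length over the height at which a piecewise-straight path crosses the interface, then compares the two terms in the bracket. The golden-section search is just one convenient minimizer, not a method taken from the text.

```python
import math

n_minus, n_plus = 1.0, 1.5        # illustrative values (assumption)
ax, ay = -1.0, 0.0                # start point, in the region x < 0
bx, by = 1.0, 1.0                 # end point, in the region x > 0

def optical_length(y):
    """Optical length of the two-segment path crossing x = 0 at height y."""
    return (n_minus * math.hypot(0.0 - ax, y - ay)
            + n_plus * math.hypot(bx - 0.0, by - y))

def golden_section_min(g, lo, hi, tol=1e-10):
    """Minimize a unimodal function g on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    while hi - lo > tol:
        if g(c) < g(d):
            hi, d = d, c
            c = hi - inv_phi * (hi - lo)
        else:
            lo, c = c, d
            d = lo + inv_phi * (hi - lo)
    return (lo + hi) / 2.0

y_star = golden_section_min(optical_length, ay, by)

# Slopes f'(0-) and f'(0+) of the two straight segments.
slope_minus = (y_star - ay) / (0.0 - ax)
slope_plus = (by - y_star) / (bx - 0.0)

# f' / sqrt(1 + f'^2) is the sine of the angle with the x axis.
sin_minus = slope_minus / math.sqrt(1.0 + slope_minus ** 2)
sin_plus = slope_plus / math.sqrt(1.0 + slope_plus ** 2)

print(n_minus * sin_minus, n_plus * sin_plus)   # should nearly coincide
```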



Fermat's principle in three dimensions


It is expedient to use vector notation: let $X=(x_1,x_2,x_3)$, let t be a parameter, let $X(t)$ be the parametric representation of a curve C, and let $\dot{X}(t)$ be its tangent vector. The optical length of the curve is given by


$$A[C]=\int_{t=t_0}^{t_1} n(X)\sqrt{\dot{X}\cdot\dot{X}}\,dt.$$

Note that this integral is invariant with respect to changes in the parametric representation of C. The Euler–Lagrange equations for a minimizing curve have the symmetric form


$$\frac{d}{dt}P=\sqrt{\dot{X}\cdot\dot{X}}\,\nabla n,$$

where


$$P=\frac{n(X)\,\dot{X}}{\sqrt{\dot{X}\cdot\dot{X}}}.$$

It follows from the definition that P satisfies


$$P\cdot P=n(X)^2.$$

Therefore, the integral may also be written as


$$A[C]=\int_{t=t_0}^{t_1} P\cdot\dot{X}\,dt.$$

This form suggests that if we can find a function ψ whose gradient is given by P, then the integral A is given by the difference of ψ at the endpoints of the interval of integration. Thus the problem of studying the curves that make the integral stationary can be related to the study of the level surfaces of ψ. In order to find such a function, we turn to the wave equation, which governs the propagation of light. This formalism is used in the context of Lagrangian optics and Hamiltonian optics.
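The invariance of the optical length under reparametrization, noted earlier in this section, can be illustrated with a short sketch. The helix, the refractive index n(X) = 1 + 0.1·x₃, and the reparametrization t = τ² are all assumptions made up for the example. The tangent vector is approximated by central differences.

```python
import math

def simpson(g, a, b, m=2000):
    """Composite Simpson rule on [a, b] with m (even) subintervals."""
    step = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * step)
    return total * step / 3

def n(X):
    """Assumed refractive index, varying with the third coordinate."""
    return 1.0 + 0.1 * X[2]

def optical_length(curve, t0, t1, h=1e-6):
    """A[C] = integral of n(X) * |dX/dt| dt, with a finite-difference tangent."""
    def integrand(t):
        fwd, back = curve(t + h), curve(t - h)
        speed = math.sqrt(sum(((p - q) / (2 * h)) ** 2 for p, q in zip(fwd, back)))
        return n(curve(t)) * speed
    return simpson(integrand, t0, t1)

helix = lambda t: (math.cos(t), math.sin(t), t)   # example curve C
reparam = lambda tau: helix(tau * tau)            # same curve, different parameter

A1 = optical_length(helix, 0.0, 1.0)
A2 = optical_length(reparam, 0.0, 1.0)
print(A1, A2)   # the same curve traversed at a different rate gives the same A
```

For this particular helix the speed is constant (√2), so the integral can also be evaluated by hand as 1.05·√2, which gives an independent check.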



Connection with the wave equation

The wave equation for an inhomogeneous medium is


$$u_{tt}=c^2\,\nabla\cdot\nabla u,$$

where c is the velocity, which generally depends upon X. Wave fronts for light are characteristic surfaces for this partial differential equation: they satisfy


$$\varphi_t^2=c(X)^2\,\nabla\varphi\cdot\nabla\varphi.$$

We may look for solutions in the form


$$\varphi(t,X)=t-\psi(X).$$

In that case, ψ satisfies


$$\nabla\psi\cdot\nabla\psi=n^2,$$

where $n=1/c$. According to the theory of first-order partial differential equations, if $P=\nabla\psi$, then P satisfies


$$\frac{dP}{ds}=n\,\nabla n,$$

along a system of curves (the light rays) that are given by


$$\frac{dX}{ds}=P.$$

These equations for solution of a first-order partial differential equation are identical to the Euler–Lagrange equations if we make the identification


$$\frac{ds}{dt}=\frac{\sqrt{\dot{X}\cdot\dot{X}}}{n}.$$

We conclude that the function ψ is the value of the minimizing integral A as a function of the upper end point. That is, when a family of minimizing curves is constructed, the values of the optical length satisfy the characteristic equation corresponding to the wave equation. Hence, solving the associated partial differential equation of first order is equivalent to finding families of solutions of the variational problem. This is the essential content of the Hamilton–Jacobi theory, which applies to more general variational problems.
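A simple special case makes this concrete. In a medium with constant index (an assumption made here purely for illustration), the rays from a fixed source point are straight lines, so the optical length to the end point X is n times the distance from the source. The sketch below checks by finite differences that this ψ satisfies ∇ψ·∇ψ = n²; the source location, index value, and sample point are arbitrary.

```python
import math

n0 = 1.5                      # constant refractive index (assumption)
source = (0.0, 0.0, 0.0)      # lower end point of every ray (assumption)

def psi(X):
    """Optical length from the source to X along a straight ray."""
    return n0 * math.sqrt(sum((xi - si) ** 2 for xi, si in zip(X, source)))

def gradient(f, X, h=1e-5):
    """Central-difference gradient of a scalar function of three variables."""
    g = []
    for i in range(3):
        up, down = list(X), list(X)
        up[i] += h
        down[i] -= h
        g.append((f(up) - f(down)) / (2 * h))
    return g

X = (0.3, -1.2, 2.0)          # arbitrary sample point away from the source
P = gradient(psi, X)          # P = grad psi
lhs = sum(p * p for p in P)   # grad psi . grad psi

print(lhs, n0 ** 2)           # both sides of the equation for psi
```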



Action principle



In classical mechanics, the action, S, is defined as the time integral of the Lagrangian, L. The Lagrangian is the difference of energies,


$$L=T-U,$$

where T is the kinetic energy of a mechanical system and U its potential energy. Hamilton's principle (or the action principle) states that the motion of a conservative holonomic (integrable constraints) mechanical system is such that the action integral


$$S=\int_{t=t_0}^{t_1} L(x,\dot{x},t)\,dt$$

is stationary with respect to variations in the path x(t).
The Euler–Lagrange equations for this system are known as Lagrange's equations:


$$\frac{d}{dt}\frac{\partial L}{\partial\dot{x}}=\frac{\partial L}{\partial x},$$

and they are equivalent to Newton's equations of motion (for such systems).
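As a hedged illustration of the stationarity statement, the sketch below uses a unit-mass, unit-stiffness harmonic oscillator (an example chosen here, with L = ẋ²/2 − x²/2) on the interval [0, 1]. It estimates the derivative of the action with respect to ε along x + εh for two paths with the same endpoints: the true motion x(t) = sin t, and a straight line that does not satisfy Lagrange's equation. The perturbation h(t) = sin(πt) is also an assumption.

```python
import math

def simpson(g, a, b, m=2000):
    """Composite Simpson rule on [a, b] with m (even) subintervals."""
    step = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * step)
    return total * step / 3

def lagrangian(x, v):
    """L = T - U for a harmonic oscillator with m = k = 1 (assumption)."""
    return 0.5 * v * v - 0.5 * x * x

t0, t1 = 0.0, 1.0
h = lambda t: math.sin(math.pi * t)               # vanishes at t0 and t1
dh = lambda t: math.pi * math.cos(math.pi * t)

def action(x, dx, eps):
    """S evaluated on the varied path x + eps*h."""
    return simpson(
        lambda t: lagrangian(x(t) + eps * h(t), dx(t) + eps * dh(t)), t0, t1)

def first_variation(x, dx, eps=1e-3):
    """Central-difference estimate of dS/d(eps) at eps = 0."""
    return (action(x, dx, eps) - action(x, dx, -eps)) / (2 * eps)

# True motion: x(t) = sin t satisfies x'' = -x.
dS_true = first_variation(math.sin, math.cos)

# Straight line with the same endpoints; not a solution of Lagrange's equation.
slope = math.sin(1.0)
dS_line = first_variation(lambda t: slope * t, lambda t: slope)

print(dS_true, dS_line)   # first is essentially zero, second is not
```

For the straight line, the first variation can be worked out by hand as −sin(1)/π, which the numerical value should match.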


The conjugate momenta p are defined by


$$p=\frac{\partial L}{\partial\dot{x}}.$$

For example, if


$$T=\frac{1}{2}m\dot{x}^2,$$

then


$$p=m\dot{x}.$$

Hamiltonian mechanics results if the conjugate momenta are introduced in place of $\dot{x}$ by a Legendre transformation of the Lagrangian L into the Hamiltonian H defined by


$$H(x,p,t)=p\,\dot{x}-L(x,\dot{x},t).$$

The Hamiltonian is the total energy of the system: H = T + U.
Analogy with Fermat's principle suggests that solutions of Lagrange's equations (the particle trajectories) may be described in terms of level surfaces of some function of X. This function is a solution of the Hamilton–Jacobi equation:


$$\frac{\partial\psi}{\partial t}+H\left(x,\frac{\partial\psi}{\partial x},t\right)=0.$$
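The Legendre transformation and the statement H = T + U can be checked on a small example. The sketch below assumes a particle with kinetic energy T = mẋ²/2 and a quadratic potential U = kx²/2 (the potential and the numerical values are hypothetical). The momentum is obtained by differentiating L numerically with respect to the velocity, and H is kept as a function of x and ẋ for simplicity rather than being re-expressed in terms of p.

```python
m, k = 2.0, 3.0   # mass and stiffness (illustrative assumptions)

def kinetic(v):
    return 0.5 * m * v * v

def potential(x):
    return 0.5 * k * x * x

def lagrangian(x, v):
    """L = T - U."""
    return kinetic(v) - potential(x)

def momentum(x, v, h=1e-6):
    """p = dL/dv, approximated by a central difference in the velocity."""
    return (lagrangian(x, v + h) - lagrangian(x, v - h)) / (2 * h)

def hamiltonian(x, v):
    """H = p*v - L, evaluated at the velocity v (not yet rewritten in terms of p)."""
    return momentum(x, v) * v - lagrangian(x, v)

x, v = 0.7, -1.3              # arbitrary state
p = momentum(x, v)
H = hamiltonian(x, v)
total_energy = kinetic(v) + potential(x)

print(p, m * v)               # p should match m * v
print(H, total_energy)        # H should match T + U
```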


Variations and sufficient condition for a minimum


Calculus of variations is concerned with variations of functionals, which are small changes in the functional's value due to small changes in the function that is its argument. The first variation[Note 9] is defined as the linear part of the change in the functional, and the second variation[Note 10] is defined as the quadratic part.[16]


For example, if J[y] is a functional with the function y = y(x) as its argument, and there is a small change in its argument from y to y + h, where h = h(x) is a function in the same function space as y, then the corresponding change in the functional is



$$\Delta J[h]=J[y+h]-J[y].$$ [Note 11]

The functional J[y] is said to be differentiable if



$$\Delta J[h]=\phi[h]+\epsilon\|h\|,$$

where φ[h] is a linear functional,[Note 12] ||h|| is the norm of h,[Note 13] and ε → 0 as ||h|| → 0. The linear functional φ[h] is the first variation of J[y] and is denoted by[20]



$$\delta J[h]=\phi[h].$$

The functional J[y] is said to be twice differentiable if



$$\Delta J[h]=\phi_1[h]+\phi_2[h]+\epsilon\|h\|^2,$$

where φ₁[h] is a linear functional (the first variation), φ₂[h] is a quadratic functional,[Note 14] and ε → 0 as ||h|| → 0. The quadratic functional φ₂[h] is the second variation of J[y] and is denoted by[22]



$$\delta^2 J[h]=\phi_2[h].$$

The second variation δ²J[h] is said to be strongly positive if



$$\delta^2 J[h]\geq k\|h\|^2,$$

for all h and for some constant k > 0.[23]


Using the above definitions, especially the definitions of first variation, second variation, and strongly positive, the following sufficient condition for a minimum of a functional can be stated.




Sufficient condition for a minimum:
The functional J[y] has a minimum at y = ŷ if its first variation δJ[h] = 0 at y = ŷ and its second variation δ²J[h] is strongly positive at y = ŷ.[24][Note 15][Note 16]
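To see these definitions at work, consider a sketch with the simple functional J[y] = ∫₀¹ y′(x)² dx, the candidate ŷ(x) = x, and perturbations h(x) = ε sin(πx) that vanish at both endpoints. All of these choices are assumptions made for illustration. For this functional ΔJ[h] splits exactly into a linear part 2∫ŷ′h′ dx and a quadratic part ∫h′² dx, and the code evaluates each piece numerically.

```python
import math

def simpson(g, a, b, m=2000):
    """Composite Simpson rule on [a, b] with m (even) subintervals."""
    step = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * step)
    return total * step / 3

def J(dy):
    """J[y] = integral of y'(x)^2 over [0, 1]; only the derivative is needed."""
    return simpson(lambda x: dy(x) ** 2, 0.0, 1.0)

dy_hat = lambda x: 1.0        # derivative of the candidate y_hat(x) = x

results = []
for eps in (0.1, 0.01):
    # h(x) = eps * sin(pi x), so h'(x) = eps * pi * cos(pi x) and max|h| = eps.
    dh = lambda x, e=eps: e * math.pi * math.cos(math.pi * x)
    delta_J = J(lambda x: dy_hat(x) + dh(x)) - J(dy_hat)             # J[y+h] - J[y]
    first = simpson(lambda x: 2.0 * dy_hat(x) * dh(x), 0.0, 1.0)     # linear part
    second = simpson(lambda x: dh(x) ** 2, 0.0, 1.0)                 # quadratic part
    results.append((eps, delta_J, first, second))
    print(eps, delta_J, first, second, second / eps ** 2)
```

The linear part vanishes and the change in J equals the quadratic part, which here is ε²π²/2 and so stays a fixed positive multiple of ||h||² = ε². A single family of perturbations does not by itself establish strong positivity, but it is consistent with ŷ being a minimum.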





See also




  • First variation

  • Isoperimetric inequality

  • Variational principle

  • Variational bicomplex

  • Fermat's principle

  • Principle of least action

  • Infinite-dimensional optimization

  • Functional analysis

  • Ekeland's variational principle

  • Inverse problem for Lagrangian mechanics

  • Obstacle problem

  • Perturbation methods

  • Young measure

  • Optimal control

  • Direct method in calculus of variations

  • Noether's theorem

  • De Donder–Weyl theory

  • Variational Bayesian methods

  • Chaplygin problem

  • Nehari manifold

  • Hu Washizu principle

  • Luke's variational principle

  • Mountain pass theorem

  • Category:Variational analysts

  • Measures of central tendency as solutions to variational problems

  • Stampacchia Medal

  • Fermat Prize

  • Convenient vector space




Notes





  1. ^ Whereas elementary calculus is about infinitesimally small changes in the values of functions without changes in the function itself, calculus of variations is about infinitesimally small changes in the function itself, which are called variations.[1]


  2. ^ "Euler waited until Lagrange had published on the subject in 1762 ... before he committed his lecture ... to print, so as not to rob Lagrange of his glory. Indeed, it was only Lagrange's method that Euler called Calculus of Variations."[3]


  3. ^ The neighborhood of f is the part of the given function space where | y - f| < h over the whole domain of the functions, with h a positive number that specifies the size of the neighborhood.[10]


  4. ^ Note the difference between the terms extremal and extremum. An extremal is a function that makes a functional an extremum.


  5. ^ For a sufficient condition, see section Variations and sufficient condition for a minimum.


  6. ^ The following derivation of the Euler–Lagrange equation corresponds to the derivation on pp. 184–5 of:
    Courant, R.; Hilbert, D. (1953). Methods of Mathematical Physics. Vol. I (First English ed.). New York: Interscience Publishers, Inc. ISBN 978-0471504474.



  7. ^ The product εΦ′(0) is called the first variation of the functional J and is denoted by δJ. Some references define the first variation differently by leaving out the ε factor.


  8. ^ As an historical note, this is an axiom of Archimedes. See e.g. Kelland, Philip (1843). Lectures on the principles of demonstrative mathematics. Google Books. p. 58.


  9. ^ The first variation is also called the variation, differential, or first differential.


  10. ^ The second variation is also called the second differential.


  11. ^ Note that Δ J[h] and the variations below, depend on both y and h. The argument y has been left out to simplify the notation. For example, Δ J[h] could have been written Δ J[y ; h].[17]


  12. ^ A functional φ[h] is said to be linear if φ[αh] = α φ[h]   and   φ[h1 +h2] = φ[h1] + φ[h2] , where h, h1, h2 are functions and α is a real number.[18]


  13. ^ For a function h = h(x) that is defined for a ≤ x ≤ b, where a and b are real numbers, the norm of h is its maximum absolute value, i.e. ||h|| = max |h(x)| for a ≤ x ≤ b.[19]


  14. ^ A functional is said to be quadratic if it is a bilinear functional with two argument functions that are equal. A bilinear functional is a functional that depends on two argument functions and is linear when each argument function in turn is fixed while the other argument function is variable.[21]


  15. ^ For other sufficient conditions, see Gelfand & Fomin 2000:


    • Chapter 5: "The Second Variation. Sufficient Conditions for a Weak Extremum" – Sufficient conditions for a weak minimum are given by the theorem on p. 116.


    • Chapter 6: "Fields. Sufficient Conditions for a Strong Extremum" – Sufficient conditions for a strong minimum are given by the theorem on p. 148.




  16. ^ One may note the similarity to the sufficient condition for a minimum of a function, where the first derivative is zero and the second derivative is positive.




References





  1. ^ ab Courant & Hilbert 1953, p. 184


  2. ^ Gelfand, I. M.; Fomin, S. V. (2000). Silverman, Richard A., ed. Calculus of variations (Unabridged repr. ed.). Mineola, New York: Dover Publications. p. 3. ISBN 978-0486414485.


  3. ^ ab Thiele, Rüdiger (2007). "Euler and the Calculus of Variations". In Bradley, Robert E.; Sandifer, C. Edward. Leonhard Euler: Life, Work and Legacy. Elsevier. p. 249. ISBN 9780080471297.


  4. ^ Goldstine, Herman H. (2012). A History of the Calculus of Variations from the 17th through the 19th Century. Springer Science & Business Media. p. 110. ISBN 9781461381068.


  5. ^ abc van Brunt, Bruce (2004). The Calculus of Variations. Springer. ISBN 0-387-40247-0.


  6. ^ ab Ferguson, James (2004). "Brief Survey of the History of the Calculus of Variations and its Applications". arXiv:math/0402357.


  7. ^ Dimitri Bertsekas. Dynamic programming and optimal control. Athena Scientific, 2005.


  8. ^ Bellman, Richard E. (1954). "Dynamic Programming and a new formalism in the calculus of variations" (PDF). Proc. Natl. Acad. Sci. 40 (4): 231–235. Bibcode:1954PNAS...40..231B. doi:10.1073/pnas.40.4.231. PMC 527981. PMID 16589462.


  9. ^ Kushner, Harold J. (2004). "Richard E. Bellman Control Heritage Award". American Automatic Control Council. Retrieved 2013-07-28. See 2004: Harold J. Kushner: regarding Dynamic Programming, "The calculus of variations had related ideas (e.g., the work of Caratheodory, the Hamilton-Jacobi equation). This led to conflicts with the calculus of variations community."


  10. ^ Courant, R; Hilbert, D (1953). Methods of Mathematical Physics. Vol. I (First English ed.). New York: Interscience Publishers, Inc. p. 169. ISBN 978-0471504474.


  11. ^ Gelfand & Fomin 2000, pp. 12–13


  12. ^ Gelfand & Fomin 2000, p. 13


  13. ^ Gelfand & Fomin 2000, pp. 14–15


  14. ^ Weisstein, Eric W. "Euler-Lagrange Differential Equation." From MathWorld--A Wolfram Web Resource. See Eq. (5).


  15. ^ The resulting controversy over the validity of Dirichlet's principle is explained in http://turnbull.mcs.st-and.ac.uk/~history/Biographies/Riemann.html.


  16. ^ Gelfand & Fomin 2000, pp. 11–12, 99


  17. ^ Gelfand & Fomin 2000, p. 12, footnote 6


  18. ^ Gelfand & Fomin 2000, p. 8


  19. ^ Gelfand & Fomin 2000, p. 6


  20. ^ Gelfand & Fomin 2000, pp. 11–12


  21. ^ Gelfand & Fomin 2000, pp. 97–98


  22. ^ Gelfand & Fomin 2000, p. 99


  23. ^ Gelfand & Fomin 2000, p. 100


  24. ^ Gelfand & Fomin 2000, p. 100, Theorem 2




Further reading



  • Benesova, B. and Kruzik, M.: Weak Lower Semicontinuity of Integral Functionals and Applications. SIAM Review 59(4) (2017), 703–766.


  • Dacorogna, Bernard. Introduction to the Calculus of Variations (3rd Edition), 2014, World Scientific Publishing,
    ISBN 978-1-78326-551-0.


  • Bolza, O.: Lectures on the Calculus of Variations. Chelsea Publishing Company, 1904, available on Digital Mathematics library. 2nd edition republished in 1961, paperback in 2005,
    ISBN 978-1-4181-8201-4.

  • Cassel, Kevin W.: Variational Methods with Applications in Science and Engineering, Cambridge University Press, 2013.

  • Clegg, J.C.: Calculus of Variations, Interscience Publishers Inc., 1968.


  • Courant, R.: Dirichlet's principle, conformal mapping and minimal surfaces. Interscience, 1950.

  • Elsgolc, L.E.: Calculus of Variations, Pergamon Press Ltd., 1962.

  • Forsyth, A.R.: Calculus of Variations, Dover, 1960.

  • Fox, Charles: An Introduction to the Calculus of Variations, Dover Publ., 1987.

  • Jost, J. and X. Li-Jost: Calculus of Variations. Cambridge University Press, 1998.

  • Lebedev, L.P. and Cloud, M.J.: The Calculus of Variations and Functional Analysis with Optimal Control and Applications in Mechanics, World Scientific, 2003, pages 1–98.

  • Logan, J. David: Applied Mathematics, 3rd Ed. Wiley-Interscience, 2006

  • Roubicek, T.: Calculus of variations. Chap.17 in: Mathematical Tools for Physicists. (Ed. M. Grinfeld) J. Wiley, Weinheim, 2014,
    ISBN 978-3-527-41188-7, pp. 551–588.


  • Sagan, Hans: Introduction to the Calculus of Variations, Dover, 1992.

  • Weinstock, Robert: Calculus of Variations with Applications to Physics and Engineering, Dover, 1974 (reprint of 1952 ed.).


  • Chapter 8: Calculus of Variations, from Optimization for Engineering Systems, by Ralph W. Pike, Louisiana State University.



External links




  • Variational calculus. Encyclopedia of Mathematics.


  • calculus of variations. PlanetMath.


  • Calculus of Variations. MathWorld.


  • Calculus of variations. Example problems.


  • Mathematics - Calculus of Variations and Integral Equations. Lectures on YouTube.

  • Selected papers on Geodesic Fields. Part I, Part II.








