# Lagrangian mechanics

Marcos Duarte

A large number of applications of classical mechanics are based directly on [Newton's laws of motion](https://en.wikipedia.org/wiki/Newton's_laws_of_motion). However, there are other ways of formulating the equations of motion, such as [Lagrangian mechanics](https://en.wikipedia.org/wiki/Lagrangian_mechanics), introduced by [Joseph-Louis Lagrange](https://en.wikipedia.org/wiki/Joseph-Louis_Lagrange) in the late 18th century. They are not new theories, because they derive from Newton's laws, but are different ways of expressing the same theory by means of more advanced mathematical concepts. In many respects they are more elegant than the Newtonian formulation, and in some cases more powerful, because they allow the solution of problems that would be very difficult to solve directly from Newton's laws.

In short, Lagrangian mechanics is a reformulation of classical mechanics in which the equations of motion are obtained from the kinetic and potential energies of the system (scalar quantities) expressed in generalized coordinates, instead of being deduced from the forces on the system (vector quantities) expressed in Cartesian coordinates.

Let's deduce the Lagrange equations next, but first we will outline the basics of the Newtonian approach.

## Description of motion 

One can describe the motion of a particle by specifying its position with respect to a frame of reference in the three-dimensional space as a function of time:

\begin{equation} x(t),\, y(t),\, z(t) \quad \equiv \quad x_i(t) \quad i=1,\dotsc,3 \end{equation}

A system of $N$ particles requires $3N$ such functions to describe its motion:

\begin{equation} x_i(t) \quad i=1,\dotsc,3N \end{equation}

The basic problem in classical mechanics is to find ways to determine functions such as these, also known as equations of motion, capable of specifying the position of objects over time, for any mechanical situation. Assuming as known the meaning of $x_i(t)$, one can define the components of velocity, $v_i$, and acceleration, $a_i$, at time $t$, as:

\begin{equation} v_i(t) = \frac{\mathrm d x_i(t)}{\mathrm d t} = \dot{x}_i(t) \end{equation}

\begin{equation} a_i(t) = \frac{\mathrm d^2 x_i(t)}{\mathrm d t^2} = \dot{v}_i(t) \end{equation}

where we used Newton's notation for differentiation (also called dot notation), in which a dot is placed over the dependent variable.  
Of note, [Joseph Louis Lagrange](https://pt.wikipedia.org/wiki/Joseph-Louis_Lagrange) introduced the prime mark to denote a derivative: $x'(t)$. Read more about the different notations for differentiation at [Wikipedia](https://en.wikipedia.org/wiki/Notation_for_differentiation).
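The definitions of velocity and acceleration above can be illustrated numerically. The following is a minimal sketch, assuming a hypothetical trajectory $x(t)=\sin(2t)$ and central finite differences; the function names and step sizes are illustrative choices, not part of the original text.

```python
import math

def position(t):
    """Hypothetical 1D trajectory, x(t) = sin(2 t), chosen only for illustration."""
    return math.sin(2.0 * t)

def velocity(x, t, h=1e-5):
    """Central-difference estimate of dx/dt at time t."""
    return (x(t + h) - x(t - h)) / (2.0 * h)

def acceleration(x, t, h=1e-4):
    """Central-difference estimate of d^2x/dt^2 at time t."""
    return (x(t + h) - 2.0 * x(t) + x(t - h)) / h**2

t0 = 0.3
v_num, a_num = velocity(position, t0), acceleration(position, t0)
v_exact = 2.0 * math.cos(2.0 * t0)    # analytical first derivative of sin(2 t)
a_exact = -4.0 * math.sin(2.0 * t0)   # analytical second derivative of sin(2 t)
print(v_num, v_exact)
print(a_num, a_exact)
```

The numerical and analytical values should agree to several decimal places; the agreement degrades if the step `h` is made much larger or much smaller.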

## Laws of motion

[Newton's laws of motion](https://en.wikipedia.org/wiki/Newton's_laws_of_motion) laid the foundation for classical mechanics. They describe the relationship between the motion of a body and the forces acting upon it. Here are the three laws in Newton's own words ([*Philosophiæ Naturalis Principia Mathematica*](http://archive.org/details/newtonspmathema00newtrich), p. 83 of Book I of the first American edition):

> LAW I    
> *Every body perseveres in its state of rest, or of uniform motion in a right line, unless it is compelled to change that state by forces impressed thereon.*   
> LAW II    
> *The alteration of motion is ever proportional to the motive force impressed; and is made in the direction of the right line in which that force is impressed.*   
> LAW III   
> *To every action there is always opposed an equal reaction: or the mutual actions of two bodies upon each other are always equal, and directed to contrary parts.*   

Regarding the second law, by alteration of motion, Newton was referring to the change in time of the body's linear momentum, which is defined as:
 
\begin{equation} \mathbf{p} = m\mathbf{v} \end{equation}

If the mass of the body is constant, the second law can be expressed by the well known equation:

\begin{equation} \mathbf{F} = \frac{\mathrm d \mathbf{p}}{\mathrm d t}=\frac{\mathrm d \big(m\mathbf{v}\big)}{\mathrm d t} = m\mathbf{a} \end{equation}

Using Newton's second law, to determine the position of the body we have to solve the following second-order ordinary differential equation for each coordinate:

\begin{equation} \frac{\mathrm d^2 x_i(t)}{\mathrm d t^2} = \frac{F_i}{m} \end{equation}

which, in vector form, has the general solution (the two constants of integration are fixed by the initial position and velocity):

\begin{equation} \mathbf{x}(t) = \int\!\bigg(\int\frac{\mathbf{F}}{m} \mathrm{d}t\bigg)\mathrm{d}t \end{equation}
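The double integration can be sketched numerically. The example below assumes a force that depends only on time, a hypothetical helper `integrate_motion`, and illustrative values for mass, initial position, and initial velocity; it uses the trapezoidal rule, which is one of many possible integration schemes.

```python
def integrate_motion(force, m, x0, v0, t_end, n=1000):
    """Integrate dv/dt = F(t)/m and dx/dt = v with the trapezoidal rule."""
    dt = t_end / n
    x, v = x0, v0
    for k in range(n):
        t = k * dt
        a_start = force(t) / m          # acceleration at the start of the step
        a_end = force(t + dt) / m       # acceleration at the end of the step
        v_new = v + 0.5 * (a_start + a_end) * dt
        x += 0.5 * (v + v_new) * dt
        v = v_new
    return x, v

m, g = 2.0, 9.81                        # hypothetical mass and gravity
t_end, x0, v0 = 1.5, 10.0, 3.0          # hypothetical initial conditions

def weight(t):
    """Constant force (the weight of the body), independent of time."""
    return -m * g

x_num, v_num = integrate_motion(weight, m, x0, v0, t_end)
x_exact = x0 + v0 * t_end - 0.5 * g * t_end**2   # closed form for constant force
v_exact = v0 - g * t_end
print(x_num, x_exact)
print(v_num, v_exact)
```

For a constant force the trapezoidal rule reproduces the closed-form solution up to rounding; for a general time-dependent force it is only an approximation whose error shrinks as `n` grows.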

A related physical quantity is the mechanical energy, which is the sum of the kinetic and potential energies.  
The kinetic energy, $T$, of a particle is given by:

\begin{equation} T = \frac{1}{2}mv^2 \end{equation}

The kinetic energy of a particle can be expressed in terms of its linear momentum:

\begin{equation} T = \frac{p^2}{2m} \end{equation}

And for a given coordinate of the particle's motion, its linear momentum can be obtained from its kinetic energy by:

\begin{equation} p_i = \frac{\partial T}{\partial v_i} \end{equation}
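The relation between momentum and kinetic energy can be checked numerically. This is a small sketch with hypothetical values for the mass and velocity; the partial derivative $\partial T/\partial v_i$ is estimated by central differences.

```python
def kinetic_energy(m, v):
    """T = (1/2) m (vx^2 + vy^2 + vz^2) for a velocity given as a list."""
    return 0.5 * m * sum(vi**2 for vi in v)

def momentum_from_T(m, v, i, h=1e-6):
    """Central-difference estimate of dT/dv_i."""
    v_plus, v_minus = list(v), list(v)
    v_plus[i] += h
    v_minus[i] -= h
    return (kinetic_energy(m, v_plus) - kinetic_energy(m, v_minus)) / (2.0 * h)

m = 1.5                      # hypothetical mass
v = [2.0, -1.0, 0.5]         # hypothetical velocity components
p_num = [momentum_from_T(m, v, i) for i in range(3)]
p_exact = [m * vi for vi in v]                       # p_i = m v_i
T_from_p = sum(pi**2 for pi in p_exact) / (2.0 * m)  # T = p^2 / (2 m)
print(p_num, p_exact)
print(T_from_p, kinetic_energy(m, v))
```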

The potential energy, $V$, is the stored energy of a particle, and its form depends on the force acting on the particle. For example, for a conservative force that depends solely on the particle's position, such as the force due to the gravitational field near the Earth's surface or due to a linear spring, force and potential energy are related (in one dimension) by:

\begin{equation} F = - \frac{\partial V}{\partial x} \end{equation}
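As an illustration of the relation between force and potential energy, the sketch below differentiates two potentials numerically: a linear spring, $V=\frac{1}{2}kx^2$, and gravity near the Earth's surface, $V=mgy$. The parameter values and function names are hypothetical.

```python
def force_from_potential(V, x, h=1e-6):
    """F = -dV/dx, with the derivative estimated by central differences."""
    return -(V(x + h) - V(x - h)) / (2.0 * h)

k, m, g = 50.0, 2.0, 9.81   # hypothetical spring constant, mass, gravity

def V_spring(x):
    """Potential energy of a linear spring stretched by x."""
    return 0.5 * k * x**2

def V_gravity(y):
    """Gravitational potential energy at height y."""
    return m * g * y

F_spring = force_from_potential(V_spring, 0.1)    # compare with -k * x
F_gravity = force_from_potential(V_gravity, 3.0)  # compare with -m * g
print(F_spring, -k * 0.1)
print(F_gravity, -m * g)
```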

### Lagrange's equation in Cartesian Coordinates

For simplicity, let's first deduce Lagrange's equation in Cartesian coordinates for a single particle.

Because we want to deduce the laws of motion from the mechanical energy of the particle, note that the time derivative of the linear momentum, written in terms of the kinetic energy as in Eq. (11), equals the mass times the acceleration, which by Newton's second law is the force acting on the particle:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\bigg(\frac{\partial T}{\partial \dot x}\bigg) = m\ddot x \end{equation}

We saw that a conservative force can also be expressed in terms of the potential energy of the particle, cf. Eq. (12); substituting the right side of the equation above by this expression, we have:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\bigg(\frac{\partial T}{\partial \dot x}\bigg) = -\frac{\partial V}{\partial x} \end{equation}

Using the fact that:

\begin{equation} \frac{\partial T}{\partial x} = 0 \quad \text{and} \quad \frac{\partial V}{\partial \dot x} = 0 \end{equation}

We can write:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\bigg(\frac{\partial (T-V)}{\partial \dot x}\bigg) - \frac{\partial (T-V)}{\partial x} = 0 \end{equation}

Defining the Lagrange or Lagrangian function, $L$, as the difference between the kinetic and potential energy in the system:

\begin{equation} L = T - V \end{equation}

We have Lagrange's equation in Cartesian coordinates for a conservative force acting on a particle:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\bigg(\frac{\partial L}{\partial \dot x}\bigg) - \frac{\partial L}{\partial x} = 0 \end{equation}

If there are $N$ independent particles in a three-dimensional space, there will be $3N$ equations like above for the system.
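Lagrange's equation in Cartesian coordinates can be checked numerically for a known motion. The sketch below assumes a mass on a linear spring (a hypothetical example with illustrative parameters), whose solution is $x(t)=A\cos(\omega t)$ with $\omega=\sqrt{k/m}$, and evaluates the left side of the equation along this trajectory using finite differences.

```python
import math

m, k, A = 2.0, 8.0, 0.7          # hypothetical mass, spring constant, amplitude
omega = math.sqrt(k / m)

def L(x, xd):
    """Lagrangian L = T - V of a mass on a linear spring."""
    return 0.5 * m * xd**2 - 0.5 * k * x**2

def x_of_t(t):
    return A * math.cos(omega * t)

def xd_of_t(t):
    return -A * omega * math.sin(omega * t)

def dL_dx(x, xd, h=1e-6):
    """Partial derivative of L with respect to the position."""
    return (L(x + h, xd) - L(x - h, xd)) / (2.0 * h)

def dL_dxd(x, xd, h=1e-6):
    """Partial derivative of L with respect to the velocity."""
    return (L(x, xd + h) - L(x, xd - h)) / (2.0 * h)

def residual(t, h=1e-4):
    """d/dt(dL/dxd) - dL/dx evaluated along the trajectory x(t)."""
    p_plus = dL_dxd(x_of_t(t + h), xd_of_t(t + h))
    p_minus = dL_dxd(x_of_t(t - h), xd_of_t(t - h))
    return (p_plus - p_minus) / (2.0 * h) - dL_dx(x_of_t(t), xd_of_t(t))

residuals = [residual(0.1 * i) for i in range(20)]
print(max(abs(r) for r in residuals))
```

The residual should be close to zero (limited by finite-difference error) at every sampled instant; a trajectory that does not satisfy the equation of motion would give a clearly nonzero residual.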

### Example: a particle moving under the influence of a conservative force

For a particle with mass $m$ moving in the 3D space under the influence of a conservative force, the Lagrangian of the particle is:

\begin{equation} L(x,y,z,\dot x,\dot y,\dot z) = \frac{1}{2}m(\dot x^2+\dot y^2+\dot z^2) - V(x,y,z) \end{equation}

The equations of motion for the particle are found by applying Lagrange's equation to each coordinate.

For the x coordinate:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial L}{\partial \dot{x}}} 
\right) = \frac{\partial L}{\partial x } \end{equation}

And the derivatives are given by:

\begin{equation} \frac{\partial L}{\partial x} = -\frac{\partial V}{\partial x}, \quad \frac{\partial L}{\partial \dot{x}} =m\dot{x}, \quad \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial L}{\partial \dot{x}}} \right) = m\ddot{x} \end{equation}

hence:

\begin{equation} m\ddot{x} = -\frac{\partial V}{\partial x} \end{equation}

and similarly for the $y$ and $z$ coordinates.

For instance, if the conservative force is due to the gravitational field near Earth's surface, with the $y$ axis pointing upward $(V = mgy)$:

\begin{equation} \begin{array}{rcl}
m\ddot{x} &=& -\frac{\partial (mgy)}{\partial x} = 0 \\
m\ddot{y} &=& -\frac{\partial (mgy)}{\partial y} = -mg \\
m\ddot{z} &=& -\frac{\partial (mgy)}{\partial z} = 0 
\end{array} \end{equation}
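The equations of motion for the gravitational example can be explored numerically. The sketch below assumes $V=mgy$ with the $y$ axis pointing upward, estimates the acceleration from the gradient of $V$ by central differences, and integrates the motion with the velocity-Verlet scheme (one possible choice of integrator). Mass, initial conditions, and function names are hypothetical.

```python
m, g = 0.5, 9.81  # hypothetical mass (kg) and gravitational acceleration (m/s^2)

def V_gravity(r):
    """Potential energy V = m g y, with the y axis pointing upward."""
    return m * g * r[1]

def acceleration(V, r, h=1e-6):
    """a_i = -(1/m) dV/dx_i, with the gradient estimated by central differences."""
    acc = []
    for i in range(3):
        r_plus, r_minus = list(r), list(r)
        r_plus[i] += h
        r_minus[i] -= h
        acc.append(-(V(r_plus) - V(r_minus)) / (2.0 * h) / m)
    return acc

def simulate(r0, v0, t_end, n=200):
    """Velocity-Verlet integration of m * a = -grad V."""
    dt = t_end / n
    r, v = list(r0), list(v0)
    a = acceleration(V_gravity, r)
    for _ in range(n):
        r = [ri + vi * dt + 0.5 * ai * dt**2 for ri, vi, ai in zip(r, v, a)]
        a_new = acceleration(V_gravity, r)
        v = [vi + 0.5 * (ai + an) * dt for vi, ai, an in zip(v, a, a_new)]
        a = a_new
    return r, v

acc0 = acceleration(V_gravity, [1.0, 2.0, 3.0])   # compare with [0, -g, 0]
r_end, v_end = simulate([0.0, 0.0, 0.0], [3.0, 4.0, 0.0], t_end=1.0)
r_exact = [3.0, 4.0 - 0.5 * g, 0.0]               # parabolic trajectory at t = 1 s
print(acc0)
print(r_end, r_exact)
```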

## Generalized coordinates

The direct application of Newton's laws to mechanical systems results in a set of equations of motion in terms of the Cartesian coordinates of each of the particles that make up the system. In many cases, this is not the most convenient coordinate system to solve the problem or describe the movement of the system. For example, in problems involving many particles, it may be convenient to choose a system that includes the coordinates of the center of mass. Another example is a serial chain of rigid links, such as a limb of the human body or a robot manipulator, for which it may be simpler to describe the position of each link by the angles between links.  

Coordinate systems such as these are referred to as [generalized coordinates](https://en.wikipedia.org/wiki/Generalized_coordinates). Generalized coordinates uniquely specify the positions of the particles in a system. Although there may be several sets of generalized coordinates to describe a system, usually a judicious choice of generalized coordinates provides the minimum number of independent coordinates that define the configuration of a system (which is the number of <a href="https://en.wikipedia.org/wiki/Degrees_of_freedom_(mechanics)">degrees of freedom</a> of the system), making the problem simpler to solve.  

In problems where it is desired to use generalized coordinates, one can write Newton's equations of motion in terms of Cartesian coordinates and then transform them into generalized coordinates. However, it would be desirable and convenient to have a general method that directly establishes the equations of motion in terms of a set of convenient generalized coordinates. In addition, general methods for writing, and perhaps solving, the equations of motion in terms of any coordinate system would also be desirable. [Lagrangian mechanics](https://en.wikipedia.org/wiki/Lagrangian_mechanics) is such a method.

## Lagrange's equation in generalized coordinates

When describing a system of particles using any set of generalized coordinates, $q_1,\dotsc,q_{3N}$, these are related to the Cartesian coordinates by:
 
\begin{equation} q_i =q_i (x_1,\dotsc,x_{3N} ) \quad i=1,\dotsc,3N \end{equation}

\begin{equation} x_i =x_i (q_1,\dotsc,q_{3N} ) \quad i=1,\dotsc,3N \end{equation}

The Cartesian components of velocity as a function of generalized coordinates are:

\begin{equation}
\dot{x}_i =\frac{\mathrm d x_i (q_1, q_2,\dotsc,q_{3N} 
)}{\mathrm d t}=\sum\limits_{j=1}^{3N} {\frac{\partial x_i }{\partial q_j }} 
\frac{\mathrm d q_j }{\mathrm d t}
\end{equation}

where for simplicity we omitted the explicit mention of the temporal dependence of each coordinate.

That is, any Cartesian component of the particle velocity as a function of generalized coordinates is a function of all the components of position and velocity in the generalized coordinates:

\begin{equation} \dot{x}_i = \dot{x}_i (q_1,\dotsc,q_{3N} ,\dot{q}_1,\dotsc,\dot{q}_{3N} ) \quad i=1,\dotsc,3N \end{equation}
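To make the transformation of velocities concrete, the sketch below uses plane polar coordinates, $q=(r,\theta)$ with $x=r\cos\theta$ and $y=r\sin\theta$, as a hypothetical example of generalized coordinates. It computes the Cartesian velocity from the sum over $\partial x_i/\partial q_j\,\dot q_j$, with the partial derivatives estimated by central differences, and compares it with the analytical expression.

```python
import math

def cartesian(q):
    """Map polar coordinates q = [r, theta] to Cartesian [x, y]."""
    r, theta = q
    return [r * math.cos(theta), r * math.sin(theta)]

def jacobian(q, h=1e-6):
    """J[i][j] = d x_i / d q_j, estimated by central differences."""
    J = [[0.0, 0.0], [0.0, 0.0]]
    for j in range(2):
        q_plus, q_minus = list(q), list(q)
        q_plus[j] += h
        q_minus[j] -= h
        x_plus, x_minus = cartesian(q_plus), cartesian(q_minus)
        for i in range(2):
            J[i][j] = (x_plus[i] - x_minus[i]) / (2.0 * h)
    return J

q = [1.2, 0.8]     # hypothetical r and theta
qd = [0.5, 2.0]    # hypothetical r-dot and theta-dot
J = jacobian(q)
xd_chain = [sum(J[i][j] * qd[j] for j in range(2)) for i in range(2)]

r, th = q
rd, thd = qd
xd_exact = [rd * math.cos(th) - r * thd * math.sin(th),
            rd * math.sin(th) + r * thd * math.cos(th)]
print(xd_chain, xd_exact)
```

Note that each Cartesian velocity component depends on both generalized coordinates and both generalized velocities, as stated above.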

Let's now deduce Lagrange's equation in generalized coordinates. In fact, Lagrange's equation in generalized coordinates will be similar to the equation in Cartesian coordinates; the only difference is that we will use $q_i$ instead of the Cartesian coordinate.

In analogy to Newtonian mechanics, cf. Eq. (6), one may try to obtain the equations of motion by equating a force, $F_i$, associated with each generalized coordinate to the time rate of change of the corresponding generalized momentum, $p_i$:

\begin{equation} F_i =\frac{\mathrm d p_i }{\mathrm d t} \end{equation}

In the formula above, let's substitute the quantity $p_i$ by its definition in terms of the kinetic energy:

\begin{equation} \frac{\mathrm d p_i }{\mathrm d t} =\frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial T}{\partial \dot{q}_i }} \right)=\frac{\mathrm d }{\mathrm d t}\left( {\sum\limits_{j=1}^{3N} {m_j \dot{x}_j \frac{\partial \dot{x}_j }{\partial \dot{q}_i }} } \right) \end{equation}

where we used:

\begin{equation} \frac{\partial T}{\partial \dot{q}_i }=\sum\limits_{j=1}^{3N} 
{\frac{\partial T}{\partial \dot{x}_j }\frac{\partial \dot{x}_j 
}{\partial \dot{q}_i }}  \end{equation}

Using the [product rule](https://en.wikipedia.org/wiki/Product_rule), the derivative of the product in Eq. (29) is:

\begin{equation} \frac{\mathrm d p_i }{\mathrm d t}=\sum\limits_{j=1}^{3N} {m_j \ddot{x}_j \frac{\partial \dot{x}_j }{\partial \dot{q}_i }} +\sum\limits_{j=1}^{3N} {m_j \dot{x}_j \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial \dot{x}_j }{\partial \dot{q}_i }} \right)}  \end{equation}

But:

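\begin{equation} \frac{\partial \dot{x}_i }{\partial \dot{q}_j }=\frac{\partial x_i }{\partial q_j } \quad \text{because} \quad \frac{\partial \dot{x}_i }{\partial \dot{q}_j }=\frac{\partial }{\partial \dot{q}_j }\left( \sum\limits_{k=1}^{3N} {\frac{\partial x_i }{\partial q_k }\dot{q}_k } \right)=\frac{\partial x_i }{\partial q_j} \end{equation}

The last equality holds because the coefficients $\partial x_i / \partial q_k$ in Eq. (26) depend only on the generalized coordinates, not on the generalized velocities.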

Then:

\begin{equation} \frac{\mathrm d p_i }{\mathrm d t}=\sum\limits_{j=1}^{3N} {m_j \ddot{x}_j \frac{\partial x_j }{\partial q_i }} +\sum\limits_{j=1}^{3N} {m_j \dot{x}_j \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial x_j }{\partial q_i }} \right)}  \end{equation}

Since $m_j \ddot{x}_j$ is the $j$-th Cartesian force component (Newton's second law), the first term on the right side of the equation above is the sum of the force components weighted by $\partial x_j / \partial q_i$, which we define as the generalized force, $Q_i$. But, differently from Newtonian mechanics, the time rate of change of the generalized momentum is equal to the generalized force plus another term, which we will investigate now. The time derivative in this second term can be expanded as:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial x_j }{\partial q_i }} \right) = \sum\limits_{k=1}^{3N} {\frac{\partial }{\partial q_k }\left( {\frac{\partial x_j }{\partial q_i }} \right)\frac{\mathrm d q_k }{\mathrm d t}} =\sum\limits_{k=1}^{3N} {\frac{\partial^2 x_j }{\partial q_k \partial q_i }\dot{q}_k }  \end{equation}

where we used the [chain rule](https://en.wikipedia.org/wiki/Chain_rule) for the differentiation of a function of several time-dependent variables:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\Big( {f\big({g_1(t),\dotsc,g_n(t)}\big)}\Big) = \sum\limits_{k=1}^{n} \frac{\partial f}{\partial g_k}\frac{\mathrm d g_k}{\mathrm d t} \end{equation}

But if we look at Eq. (26), we see that the last term of Eq. (34) can also be obtained by:

\begin{equation} \frac{\partial \dot{x}_j }{\partial q_i } = \frac{\partial }{\partial q_i }\left(\sum\limits_{k=1}^{3N} \frac{\partial x_j }{\partial q_k }\dot{q}_k \right) = \sum\limits_{k=1}^{3N} {\frac{\partial^2 x_j }{\partial q_k \partial q_i }\dot{q}_k }  \end{equation}

Comparing Eq. (34) and Eq. (36), we have:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial x_j }{\partial q_i }} \right) = \frac{\partial }{\partial q_i}\left( {\frac{\mathrm d x_j }{\mathrm d t }} \right) = \frac{\partial \dot{x}_j }{\partial q_i } \end{equation}
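The two identities used so far in this derivation, $\partial \dot{x}_j/\partial \dot{q}_i = \partial x_j/\partial q_i$ and $\frac{\mathrm d}{\mathrm d t}(\partial x_j/\partial q_i) = \partial \dot{x}_j/\partial q_i$, can be checked numerically. The sketch below again assumes plane polar coordinates and a hypothetical path $r(t)=1+0.5t$, $\theta(t)=2t$; all derivatives are estimated by central differences.

```python
import math

def x_of_q(q):
    """Cartesian [x, y] from polar q = [r, theta]."""
    r, th = q
    return [r * math.cos(th), r * math.sin(th)]

def xdot(q, qd):
    """Cartesian velocity from polar coordinates and their rates."""
    r, th = q
    rd, thd = qd
    return [rd * math.cos(th) - r * thd * math.sin(th),
            rd * math.sin(th) + r * thd * math.cos(th)]

def shifted(vec, j, h):
    """Copy of vec with component j displaced by h."""
    out = list(vec)
    out[j] += h
    return out

def diff(f_plus, f_minus, h):
    """Component-wise central difference."""
    return [(a - b) / (2.0 * h) for a, b in zip(f_plus, f_minus)]

def dx_dq(q, j, h=1e-5):
    return diff(x_of_q(shifted(q, j, h)), x_of_q(shifted(q, j, -h)), h)

def dxdot_dqd(q, qd, j, h=1e-5):
    return diff(xdot(q, shifted(qd, j, h)), xdot(q, shifted(qd, j, -h)), h)

def dxdot_dq(q, qd, j, h=1e-5):
    return diff(xdot(shifted(q, j, h), qd), xdot(shifted(q, j, -h), qd), h)

def path(t):
    """Hypothetical path: r(t) = 1 + 0.5 t, theta(t) = 2 t."""
    return [1.0 + 0.5 * t, 2.0 * t]

def ddt_dx_dq(t, j, h=1e-4):
    """Time derivative of dx/dq_j along the path."""
    return diff(dx_dq(path(t + h), j), dx_dq(path(t - h), j), h)

t0 = 0.4
q, qd = path(t0), [0.5, 2.0]   # qd is the time derivative of the path above
max_diff_velocity = max(abs(a - b) for j in range(2)
                        for a, b in zip(dxdot_dqd(q, qd, j), dx_dq(q, j)))
max_diff_time = max(abs(a - b) for j in range(2)
                    for a, b in zip(ddt_dx_dq(t0, j), dxdot_dq(q, qd, j)))
print(max_diff_velocity, max_diff_time)
```

Both differences should be small, limited only by the finite-difference error.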

On the other hand, it is possible to relate the term $\partial \dot{x}_j / \partial q_i$ to the derivative of kinetic energy with respect to the coordinate $q_i$:

\begin{equation} \frac{\partial T}{\partial q_i }=\frac{\partial }{\partial q_i }\left( 
{\sum\limits_{j=1}^{3N} {\frac{1}{2}m_j \dot{x}_j^2} } 
\right)=\sum\limits_{j=1}^{3N} {m_j \dot{x}_j } \frac{\partial 
\dot{x}_j }{\partial q_i } \end{equation}

where once again we used the chain rule for the differentiation.

Using Eq. (37), Eq. (38) becomes

\begin{equation} \frac{\partial T}{\partial q_i }=\sum\limits_{j=1}^{3N} {m_j 
\dot{x}_j } \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial x_j }{\partial q_i }} 
\right) \end{equation}

Returning to Eq. (33), it can be rewritten as:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial T}{\partial \dot{q}_i }} \right) = Q_i + \frac{\partial T}{\partial q_i } \end{equation}

and

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial T}{\partial \dot{q}_i }} \right) - \frac{\partial T}{\partial q_i } = Q_i \end{equation}
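The extra term $\partial T/\partial q_i$ can be seen at work in a simple case. The sketch below assumes a free particle ($Q_i=0$) moving on a straight line, described in plane polar coordinates, where $T=\frac{1}{2}m(\dot r^2+r^2\dot\theta^2)$. For the coordinate $r$ the equation above reads $m\ddot r - mr\dot\theta^2=0$, and for $\theta$ it reads $\frac{\mathrm d}{\mathrm d t}(mr^2\dot\theta)=0$. The path and the mass are hypothetical values, and derivatives are estimated by finite differences.

```python
import math

m = 1.2   # hypothetical mass

def r_of_t(t):
    """Radial coordinate of a free particle with x = 1 + 0.5 t, y = 2 t."""
    return math.hypot(1.0 + 0.5 * t, 2.0 * t)

def theta_of_t(t):
    """Angular coordinate of the same particle."""
    return math.atan2(2.0 * t, 1.0 + 0.5 * t)

def d1(f, t, h=1e-4):
    """First time derivative by central differences."""
    return (f(t + h) - f(t - h)) / (2.0 * h)

def d2(f, t, h=1e-4):
    """Second time derivative by central differences."""
    return (f(t + h) - 2.0 * f(t) + f(t - h)) / h**2

def residual_r(t):
    """d/dt(dT/d r-dot) - dT/dr = m r-ddot - m r theta-dot^2."""
    return m * d2(r_of_t, t) - m * r_of_t(t) * d1(theta_of_t, t)**2

def p_theta(t):
    """Generalized momentum dT/d theta-dot = m r^2 theta-dot."""
    return m * r_of_t(t)**2 * d1(theta_of_t, t)

def residual_theta(t):
    """d/dt(dT/d theta-dot) - dT/d theta, where dT/d theta = 0."""
    return d1(p_theta, t)

t0 = 0.8
print(residual_r(t0), residual_theta(t0))
print(m * d2(r_of_t, t0))   # nonzero although no force acts on the particle
```

The last line shows that $m\ddot r$ alone is not zero for force-free motion; only the combination with $\partial T/\partial r$ vanishes.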

Now let's look at $Q_i$, the generalized force. It can be decomposed into two terms.

The first term is composed of the conservative forces, i.e., forces that can be written as the negative gradient of a potential:

\begin{equation} Q_{C,i} =-\frac{\partial V}{\partial q_i } \quad , \quad V=V\left( {q_1,\dotsc,q_{3N} } \right) \end{equation}

An example of a conservative force is the gravitational force.

The second term encompasses all non-conservative forces, such as the frictional force. For now, we will only consider conservative forces.  

Considering only conservative forces, Eq. (41) becomes

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial T}{\partial \dot{q}_i }} 
\right)-\frac{\partial T}{\partial q_i }=-\frac{\partial V}{\partial q_i} \end{equation}

Rearranging, we have:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial \left( {T-V} \right)}{\partial 
\dot{q}_i }} \right)-\frac{\partial \left( {T-V} \right)}{\partial q_i} = 0 \end{equation}

This is possible because:

\begin{equation} \frac{\partial V}{\partial \dot{q}_i} = 0 \end{equation}

Defining:

\begin{equation} L \equiv L(q_1,\dotsc,q_{3N} ,\dot{q}_1,\dotsc,\dot{q}_{3N} ) = T - V \end{equation}

as the Lagrange or Lagrangian function, we have Lagrange's equation:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial L}{\partial \dot{q}_i }} 
\right)-\frac{\partial L}{\partial q_i } = 0 \quad i=1,\dotsc,3N \end{equation}

To be more specific, the equation above is known as the Euler–Lagrange equation or Lagrange's equation of the second kind, but this detail is beyond the scope of this notebook.
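For a system with a single generalized coordinate and a Lagrangian without explicit time dependence, Lagrange's equation can be solved for the acceleration. Expanding the time derivative with the chain rule gives $\ddot q = \big(\partial L/\partial q - (\partial^2 L/\partial \dot q\,\partial q)\,\dot q\big) / (\partial^2 L/\partial \dot q^2)$. The sketch below evaluates this expression with finite differences; the function `accel_from_lagrangian` and the two test Lagrangians are hypothetical illustrations, not a general-purpose solver.

```python
import math

def accel_from_lagrangian(L, q, qd, h=1e-4):
    """Solve Lagrange's equation for q-ddot given L(q, qd) (one coordinate)."""
    L_q = (L(q + h, qd) - L(q - h, qd)) / (2.0 * h)
    L_qdqd = (L(q, qd + h) - 2.0 * L(q, qd) + L(q, qd - h)) / h**2
    L_qdq = (L(q + h, qd + h) - L(q + h, qd - h)
             - L(q - h, qd + h) + L(q - h, qd - h)) / (4.0 * h**2)
    return (L_q - L_qdq * qd) / L_qdqd

m, k, g, ell = 2.0, 8.0, 9.81, 0.8   # hypothetical parameters

def L_spring(q, qd):
    """Mass on a linear spring: L = m qd^2 / 2 - k q^2 / 2."""
    return 0.5 * m * qd**2 - 0.5 * k * q**2

def L_pendulum(q, qd):
    """Simple pendulum: L = m ell^2 qd^2 / 2 + m g ell cos(q)."""
    return 0.5 * m * ell**2 * qd**2 + m * g * ell * math.cos(q)

qdd_spring = accel_from_lagrangian(L_spring, 0.3, -1.1)     # compare with -k q / m
qdd_pendulum = accel_from_lagrangian(L_pendulum, 0.6, 1.7)  # compare with -(g/ell) sin(q)
print(qdd_spring, -k * 0.3 / m)
print(qdd_pendulum, -(g / ell) * math.sin(0.6))
```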

### Constraints
 
An important class of problems in mechanics, for which the Lagrangian equations are particularly useful, is that of constrained systems. A constraint is a restriction on the freedom of movement of a particle or a system of particles. A rigid body and a pendulum are examples of constrained systems of particles. It can be shown, in a similar way, that Lagrange's equation, deduced here for a system of free particles, is also valid for a system of particles under the action of constraints. Lagrange's equation for a system of $N$ particles (with $3N$ Cartesian coordinates) and $k$ constraints is then:

\begin{equation} \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial L}{\partial \dot{q}_i}} \right)-\frac{\partial L}{\partial q_i } = 0 \quad i=1,\dotsc,3N-k \end{equation}

### Example: a simple pendulum under the influence of gravity

Consider a pendulum with a massless rod of length $\ell$ and a mass $m$ at its extremity, swinging in a plane and forming an angle $\theta$ with the vertical.  
The kinetic energy is:

\begin{equation} T = \frac{1}{2}m\ell^2\dot\theta^2 \end{equation}

And the potential energy, taking the height of the pivot as the zero level, is:

\begin{equation} V = -mg\ell\cos\theta \end{equation}

The Lagrangian function is:

\begin{equation} L = \frac{1}{2}m\ell^2\dot\theta^2 + mg\ell\cos\theta \end{equation}
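The expressions for $T$ and $V$ follow from writing the Cartesian position of the mass in terms of $\theta$. The sketch below assumes the origin at the pivot, the $y$ axis pointing upward, and $\theta$ measured from the downward vertical, so that $x=\ell\sin\theta$ and $y=-\ell\cos\theta$; the numerical values are hypothetical. It also checks that the constraint $x^2+y^2=\ell^2$ holds for any $\theta$.

```python
import math

m, ell, g = 1.0, 0.8, 9.81     # hypothetical mass, rod length, gravity

def cartesian(theta):
    """Position of the mass, with the pivot at the origin and y upward."""
    return ell * math.sin(theta), -ell * math.cos(theta)

def cartesian_velocity(theta, thetad):
    """Time derivative of the position above."""
    return ell * math.cos(theta) * thetad, ell * math.sin(theta) * thetad

theta, thetad = 0.6, 1.7       # hypothetical angle and angular velocity
x, y = cartesian(theta)
xd, yd = cartesian_velocity(theta, thetad)

T_cart = 0.5 * m * (xd**2 + yd**2)          # kinetic energy from Cartesian velocities
V_cart = m * g * y                          # potential energy from the height y
T_gen = 0.5 * m * ell**2 * thetad**2        # kinetic energy in terms of theta
V_gen = -m * g * ell * math.cos(theta)      # potential energy in terms of theta
constraint = x**2 + y**2 - ell**2           # zero if the rod length is preserved
print(T_cart, T_gen)
print(V_cart, V_gen)
print(constraint)
```

A single generalized coordinate is enough here because the planar motion and the fixed rod length remove two of the three Cartesian degrees of freedom of the particle.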

And the derivatives are given by:

\begin{equation} \frac{\partial L}{\partial \theta} = -mg\ell\sin\theta, \quad \frac{\partial L}{\partial \dot{\theta}} =m\ell^2\dot{\theta}, \quad \frac{\mathrm d }{\mathrm d t}\left( {\frac{\partial L}{\partial \dot{\theta}}} \right) = m\ell^2\ddot{\theta} \end{equation}

Finally, Lagrange's equation is:

\begin{equation} m\ell^2\ddot\theta + mg\ell\sin\theta = 0 \end{equation}
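The equation of motion of the pendulum has no elementary closed-form solution for large angles, but it can be integrated numerically. The sketch below rewrites it as $\ddot\theta=-(g/\ell)\sin\theta$ and uses a fourth-order Runge-Kutta step (one possible integrator, chosen here for illustration). It starts from rest at a small angle and compares the result after one small-angle period, $2\pi\sqrt{\ell/g}$ (a standard approximation valid for small oscillations), and also checks that the mechanical energy is conserved. All parameter values are hypothetical.

```python
import math

g, ell = 9.81, 1.0   # hypothetical gravity and rod length

def deriv(state):
    """Right-hand side of the first-order system (theta, omega)."""
    theta, omega = state
    return (omega, -(g / ell) * math.sin(theta))

def rk4_step(state, dt):
    """One fourth-order Runge-Kutta step."""
    k1 = deriv(state)
    k2 = deriv(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = deriv(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = deriv(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(s + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def energy_per_mass(state):
    """(T + V) / m for the pendulum."""
    theta, omega = state
    return 0.5 * ell**2 * omega**2 - g * ell * math.cos(theta)

def simulate(theta0, t_end, dt=1e-3):
    """Integrate from rest at angle theta0 up to approximately t_end."""
    state = (theta0, 0.0)
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(state, dt)
    return state

theta0 = 0.05                                   # small initial angle (rad)
T_small = 2.0 * math.pi * math.sqrt(ell / g)    # small-angle period
state_T = simulate(theta0, T_small)
energy_drift = energy_per_mass(state_T) - energy_per_mass((theta0, 0.0))
print(state_T[0], theta0)
print(energy_drift)
```

After one small-angle period the pendulum should be back very close to its initial angle; for larger initial angles the true period is longer and this agreement is lost.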

## Considerations on the Lagrangian formalism

First, it must be reiterated that the Lagrangian formalism does not constitute a new theory. The results of a Lagrangian or Newtonian analysis must be the same for any mechanical system; only the method used to obtain the results is different.  
Whereas the Newtonian formalism emphasizes an external agent acting on the body, the force, the Lagrangian method deals only with quantities associated with the body, the kinetic and potential energies.  

In fact, the concept of force does not enter into the Lagrangian formalism. This is an important property of the method. Since energy is a scalar quantity, the Lagrangian function for a system is invariant under coordinate transformations. Therefore, it is possible to move from a certain configuration space (in which the equations of motion can be somewhat complicated) to a space that can be chosen to allow maximum simplification of the problem. 

We are accustomed to thinking of mechanical systems in terms of vector quantities such as force, velocity, angular momentum, torque, etc., but in the Lagrangian formalism the equations of motion are obtained entirely in terms of scalar operations in the configuration space.  
Another important aspect of the force vs. energy analogy is that in situations where it is not possible to make explicit all the forces acting on the body, it is still possible to obtain expressions for the kinetic and potential energies.

## Problems

1. Derive Lagrange's equations for a double pendulum.

## References

- [Goldstein H (1980) Classical Mechanics, 3rd ed., Addison-Wesley](https://books.google.com.br/books?id=tJCuQgAACAAJ).  
- [Marion JB (1970) Classical Dynamics of Particles and Systems, 2nd ed., Academic Press](https://books.google.com.br/books?id=Ss43BQAAQBAJ).  
- [Synge JL (1949) Principles of Mechanics, 2nd ed., McGraw-Hill](https://books.google.com.br/books?id=qsYfENCRG5QC).  