Introduction to Rigid Body Motion

In today’s post I want to explore the rigid body motion.

When a body moves, we can divide it into two distinct types of motion, a translation and a rotation. The translation can be treated with Newton’s Laws applied to the Center of Mass, but the rotation is a topic that often eludes people.

To try to develop the theory for the rotation of bodies we have to review the concept of a point mass particle, a body with negligible dimension, which means that rotation is undefined. How can we see if a point rotates? This means that rotation is a characteristic of sets of point particles and not of individual particles. This is the way we are going to define a rigid body of non-negligible dimensions, as a set of point particles whose distances between one another is constant.

Now that we know what we are studying, let’s define correctly the center of mass, a concept very important (and handy) for the study of rotations. The center mass corresponds to the position of the body (interior to the body or not) such that if the intire mass was compressed into that point, its motion could be accounted by only our knowledge so far of Newton’s equation. Imagine you throw a stick in the air with some rotation, there will be a point in the which will describe a parabola as predicted by the equations of a throw. The remaining points can be thought of translating with the center of mass (CM) and rotating around it.

The position of the center of mass can be calculated by:

\vec R_{CM} = \sum_i \frac{\vec r_i m_i}{M_{total}}

i corresponds to a point particle of the body and we are summing over all of them.

As I said before the motion of each particle can be understood as a translation with the CM as well as a rotation around it. This means that the displacement in a particle i is given by:

d\vec r_i = d\vec R_{CM} + d\vec\phi \times \vec r'_i

Where d\vec\phi is the change in angle of the particle with respect to the CM and r'_i is the position of the particle in relation to the CM. One thing we must point out is that since the body is rigid, the same rotation will affect all particles. Only this way is the distance between them preserved.

Now let’s try to find more about rotations by writing the energy of the system, which will be the sum of the kinetic energies of all particles (\frac{d\vec\phi}{dt} = \Omega \text{ , } \frac{d\vec R_{CM}}{dt} = V_{CM} )

\sum_i \frac{1}{2}m_i (v_i)^2 = \sum_i \frac{1}{2}m_i (\vec V_{CM} + \vec \Omega\times \vec r'_i)^2
\sum_i \frac{1}{2}m_i ((V_{CM})^2 + 2\vec V_{CM}\cdot \vec \Omega\times \vec r'_i + (\vec \Omega\times \vec r'_i)^2) = \frac{M_{total}}{2}V_{CM}^2 + \sum_i m_i \vec r'_i \cdot \vec V_{CM}\times \vec \Omega + \sum_i \frac{1}{2}m_i(\vec \Omega\times \vec r'_i)^2)

However since r'_i are the distances in respect to the CM the weighted sum of all m_ir'_i will be zero, hence our expression becomes:

E_k = \frac{1}{2}M_{total}V_{CM}^2 + \sum_i \frac{1}{2}m_i(\vec \Omega\times \vec r'_i)^2)

The energy then reinforces the idea that the motion is a translation of the CM plus a rotation around the CM.

Now let’s see if by expanding the energy of rotation we can find fundamental structure.

E_{rot} = \frac{1}{2}\sum_i m_i (\vec \Omega \times \vec r'_i)^2
\vec\Omega\times\vec r'_i = (\Omega_y r'_{iz} - \Omega_z r'_{iy}) \hat x + (\Omega_z r'_{ix} - \Omega_x r'_{iz})\hat y + (\Omega_x r'_{iy} - \Omega_y r'_{ix})\hat z
E_{rot} = \frac{1}{2}\sum_i m_i ((\Omega_y r'_{iz} - \Omega_z r'_{iy})^2 + ( \Omega_z r'_{ix} - \Omega_x r'_{iz})^2 + ( \Omega_x r'_{iy} - \Omega_y r'_{ix})^2)
= \frac{1}{2}\sum_i m_i (\sum_k \Omega_k^2 (\sum_l r'^2_{il}) - \sum_{k,l} \Omega_k\Omega_l r'_{ik}r'_{il})

This last summation part needs care, if you don’t see it at first, please read it carefully. Dropping the summation sign to make the notation less cumbersome we have:

E_{rot}=\frac{1}{2}\sum_i m_i ( \Omega_k \Omega_l r'^2_{im} \delta_{kl} - \sum_{k,l} \Omega_k \Omega_l r'_{ik} r'_{il})
=\frac{1}{2}\Omega_k \Omega_l \sum_i m_i(r'^2_{im} \delta_{kl} - r'_{ik}r'_{il})

We know define a new quantity, the tensor of inertia I_{lk} where

I_{lk} = \sum_i m_i(r'^2_{im} \delta_{kl}-r'_{ik}r'_{il})

Then the rotation energy can be written simply as:

T_{rot}=\frac{1}{2}\Omega_i\Omega_l I_{ik}

And I written explicitely will be:

I = \begin{bmatrix} \sum m(y^2 + z^2) & -\sum m(xy) & -\sum m(xz)\\ -\sum m(yx) & \sum m(x^2 + z^2) & -\sum m(yz) \\ -\sum m(zx) & -\sum m(zy) & \sum(m(x^2 + y^2) \end{bmatrix}

If the body is continuous we just transform the summations into integrals over the entire volume of the body.

So now the inertial moment of a body is not only a scalar, it actually depends on the various directions of the rotation.

However the is a cool result which states that it is always possible to find a set of directions such that I is diagonal, which eases the mathematics of the problems as well as give us a better insight into the symmetries of the body. These directions are called the principal axis of the body.

The next point in our analysis will be developing an analog to the linear momentum for rotations, and then develop a analog to the Newton’s second equation but for rotations.

In the generalization of the momentum, we start by defining it as $m\vec r\times \vec v$ for a point particle. Then for a full body:

\vec L = \sum_i m_i \vec r_i \times(\vec \Omega \times \vec r_i) = \sum m_i(r^2\vec \Omega - \vec r(\vec r\cdot \vec \Omega))
= \sum_i m_i(r_l^2\Omega_k - r_kr_l\Omega_l) = \Omega_l \sum_i m_i(r_l^2\delta_{kl} - r_kr_l) =\Omega_l I_{lk} = I \vec \Omega

Deriving \vec L:

\frac{d}{dt}\vec L = \sum_i m_i( \dot{\vec r_i}\times \vec v_i + r_i \times \dot{\vec v_i}) = \sum_i m_i( \vec v_i \times \vec v_i + r_i \times \vec a_i)
= \sum_i r_i \times (m_i \vec a_i) = \sum_i r_i \times F_i = \vec \tau

Now we found a quatity that varies our Angular momentum the same way as in Newton’s second law.

We now have the basis for the rigid body motion.

References: Landau Volume 1 Mechanics


Two Timed approximation

The last post concerned about perturbation theory. A method used to approximate solution which was based on the idea of a solution we already know which has been slightly perturbed. Unfortunately this method is not always the best. Today I will present a different method called Two Timin in Strogatz’s “Nonlinear Dynamics and Chaos”

The idea of this approximation method is to make use of the different timescales of the function. To explain this idea I will use the example I will be solving afterwards.

Imagine you have damped harmonic oscillator. At first there doesn’t seem to exist two timescales, it is simply a harmonic motion whose amplitude decreases with time, however there are two time scales associated with the motion, the first gives us the timescale of the vibrations while the other gives the timescale of the exponential decrease. (more…)

Introduction to perturbation theory

In this post I will go through an introduction to perturbation theory, a method used to solve approximately differential equations that may not have solution otherwise. I will first go through the idea and then use an example, the non-linear oscillator to better explain the method.

The main idea of perturbation theory is to understand our system as a function/solution we know but which has been slightly perturbed. Imagine the case of ball rolling through a downhill straight valley. If the system was “at its best”, it would just go straight through the valley. However, if at the beginning we give it a little push to the side, it will go down, but also oscillate. We can think of the motion as the straight motion plus a small perturbation of the system.


Integration Methods (Part 4)

Yesterday I covered a different integration methods, Heun’s method and we covered the reason why Euler’s method was not very accurate.

Today I will cover another integration method Runge-Kutta, which improves on the Heun’s method. Moreover I will touch on the subject of justifying why one method is better than another.

The basic idea of the Runge-Kutta method is to improve on the Heun’s method.
The improve this method (Heun’s) brought was to take into account the derivative not only at our current point, but also at, approximately, the end point, and from there get a better estimate of the average derivative in the interval.

Using the derivative at our first test point increases the accuracy of the step since, despite not being the derivative at our desired point, it will be a good approximation to it. So by averaging it with the the derivative at our start point we get a better approximation of the average value since we’ll take into some consideration the increase or decrease of the derivative in the time step.

As we saw this simple idea brought a great improvement on our method, however we can think how we can try to squeeze more information out of our system. Runge-Kutta method does just that.


Integration Methods (Part 1)

I would like to have my first post with a topic I find very interesting, integration methods.Since the advent of computers they have been put to use to solve scientific problems. One of the biggest uses is to do numerical computations, either massive calculations or using computers to approximate solutions. This second comes as very important, specially in differential equations when we are faced with systems which do not have an analytic solution or closed formula, like:

  • \ddot \theta + k\sin(\theta) = 0 – The system of a simple pendulum
  • \begin{array} {l} \dot x = -x + ay + x^2y\\ \dot y = b -ay - x^2y \end{array} – Biological processes like glycolisis
  • \begin{array} {l} \dot x = x(3-x-2y)\\ \dot y = y(2-x-y) \end{array} – Growth models of species

The study of these systems (despite existing a lot of techniques to deal with them) depend on our ability to approximate the solution from the differential equations.

It is for this reason that integration methods are very important. We need methods which can give us a very approximate solution with the minimum effort.

You may ask “but if it is the computers that do the calculations, why do we need to minimize effort?” The answer is simple, computers aren’t invincible, they have limitations, and if we ask too much from them they will take a long time to give us an answer, and sometimes we do not have that time, whether is just a scientist that needs results to continue his work or a company that needs the information as soon as possible to know what course they need to follow.