You’ve probably heard the word ‘tensor’ before, but if you’re like me, you’ve gone through your entire physics degree without knowing what they are. So today, I’m going to express the relativistic wave equation in the language of tensors and Lagrangians, and show just how powerful these things can be.
Let’s first begin with an example we all know, the dot product. Now, given any two vectors, we can calculate their dot product, . Let’s do some fancy renaming and let
Let’s make some notes about .
- It is a function of 2 vectors
- It returns a real number
- As we know, since is just the dot product, it is bi-linear, meaning it is linear in both arguments
This is in fact an example of what we call a (0, 2) tensor, as it takes as its arguments 2 vectors and returns a real number. This has a matrix representation, namely the 2 by 2 identity matrix. However, in polar co-ordinates the metric would have matrix representation with components given and with the rest of the entries zero. We can denote any element of of the matrix, and hence elements of the tensor, as .
Now, you may be thinking, why is it a (0,2) tensor, what does that zero stand for? Well, for that we have to introduce the notion of a covector.
A covector is function which maps vectors into the real numbers, for those of you who are snazzy with their linear algebra, these are elements of the dual space of a vector space. As an example, consider the some useless function which maps every element in our vector space to , this is a covector, it maps our vectors into the real numbers. Now, we will make an important distinction in notation, and this will come back in a huge way later, for now, memorise it. We represent the components of a vector with raised indices, and components of a covector with lowered indices, . It is important that you know this, say goodbye to arrows and tildes indicating vectors, we know use the position of the index, up is a vector, down is a covector.
Now, an important relationship between vectors and covectors is that a vector can be mapped into a covector, using the Minkowski metric of minkowski space, the metric that has components , , and all other entries zero. Using our notation this is
What on earth does this mean? This is a summation over the index, known as the Einstein summation convention, we say that the indices contract. This is true whenever you have the same index appearing lowered and raised. What is important to note is that the index was lowered, it has become a covector. In much of what will follow, we will use the metric to raise and lower indices as we please. We note that an index can be raised by using the inverse of the Minkowski metric, .
Now, back to the zero in (0,2) tensor; it states that our tensor takes 0 covectors as arguments. A (1,1) tensor, lets call it , would take 1 covector and 1 vector as arguments and return a real number. We can represent the components of this tensor by , where represents the index of the covector and and represents the index of the vector. If was instead a (2,1) tensor, it would be indexed by . In general, a (r,s) tensor will have upper indices representing the number of covectors the tensor takes as input and lower indices representing the number of vector inputs. It is important to note that any real valued function is a (0,0) tensor. So really, tensors have been in front of your eyes this entire time, you just haven’t realised it.
Now, I told you the notation was going to come back in a huge way, and here it is. Say we have a (1,1) tensor . And say we wish to input it a vector, , by our notation and the summation convention, we can simply write
and note that our indices contract naturally. The lowered on which represents the vector input is contracted with the raised on a vector. The same is true of covectors. With this notation now, if you make a mistake, you will know it, and almost always you can just guess how to put things together just by looking at how the indices should contract! It’s powerful stuff.
Now let’s see this stuff in action by finding the equation of motion for the following Lagrangian density
Note that the summation convention is applied to the indices on the partial derivatives. Now the Euler Lagrange equation for fields is
where the summation convention occurs again over the bold indices, one is raised and one is lowered by being in the denominator of the fraction.
Clearly, . Now for the fun part.
Now, we can see that in order for our lagrange equation of motion to make since, this is the only possible position of index we could have gotten, because now, the summation convection holds for when we put in the final partial derivative from the Euler Lagrange equation of motion, and we get
which can be re-written as
Now, for the physics buffs, since is a function of (t,x,y,z) we have that is just the d’Alembert operator, which is the Laplace operator in Minkowski spacetime. So now, the Euler Lagrange equation which we just derived should be recognised as the Klein-Gordon equation.