Cardano’s formula

A quadratic equation is relatively easy to solve, and the Babylonians were aware of methods to solve them in 1600 BC. The cubic formula is significantly harder to solve and it took much longer to find the solution to the general form of the cubic equation: In the 16th century, Italian mathematicians Scipione del Ferro and Niccolo Tartaglia found methods to solve the general cubic equation. Gerolamo Cardano was the first one to publish this method, and the formula is still best known by his name.

Somewhat surprisingly, it is possible to derive Cardano’s formula by using a method that is a three-dimensional analogue of completing the square. I present the derivation here, which is based on Mieke Jansen’s master thesis. To fully understand this article, some understanding of roots of unity is necessary. However, the lion’s share will be understandable even for those without understanding of roots of unity.

Before considering the cubic formula, we’ll have a look at the method that is called ‘completing the square’.

Completing the square

Suppose we have a quadratic equation of the form

Any quadratic equation can be written in this form. If we now add some expression, to both sides so that the left side is a perfect square, we can take the square root of both sides. Indeed, this is possible, since we have

So if we add from both sides, the left side is now the square of a linear factor, and we obtain

which is quite easy to work out. Assuming that and are positive for the moment, we can gain some geometric intuition for the procedure by using this figure:

Square, divided into four sections.

The blue square has width and height . The red rectangles are chosen so that their area adds up to , which means that they have one side with length and one side with length . The green square has area and represents the quantity we added.

Solving the cubic equation

It is not easy to attack the general cubic equation with the same strategy. To make the equation simpler, we use substitution to reduce it to the form

This form is called the depressed cubic. I now show that the general cubic equation can be reduced to this form. This means that solving cubics of this form means that we are also able to solve cubics of the general form.

Start with an equation in the general form

If we substitute , we obtain

For simplicity, put and to end up with

So, if we have cubic equation in the form , we can compute and , and try to find the solution of . If we have found we can use to find .

Completing the cube

Consider a depressed cubic of the form

Now, consider the following three-dimensional analogue of the figure in the last section:

Cube, divided into five sections.

Let be the length of the side of the large cube that is formed by the blue cube, the red boxes, and the green cube. Let . Now, the green cube has side and the blue cube has side . The three red rectangular boxes have sides with lengths , , and , and the big cube that is formed by all the other shapes has side length .

Analogously to the completion of the square, we demand that the volume of the blue cube and the red boxes to add up to . Then, since all these boxes have the same volume as the volume of the big cube (with sides ) minus the volume of the small cube (with sides ), we have . Since the blue cube has a volume of , each of the three red boxes has volume . Since each red rectangular box has a volume of , we find . It follows that .

Setting , we obtain the equations and . In other words, we are looking for two numbers which have sum and product . Note that this is equivalent to solving the quadratic equation , since exactly when the sum of and is and their product is . The solutions are and . Note that the expressions are symmetric in and , so we can choose freely which expression we pick for and which one for (as long as we pick as one of them and as the other).

Using that and , we find

Note that stays the same, regardless of how we pick the signs in the expression for and . Using , we find

Where we have and .

Summary

Algebraically, we can summarize the steps we took as follows:

Take the equation

Substitute

We now have the following polynomial equation in :

Introduce that satisfy and . From this, we can derive

Now substitute

So that we have

These equations have the following solution, where and are interchangeable:

If we substitute back and try to express and in , and , we find

where

and

You might have noticed that we find only one solution. It is actually possible to find the other two solutions with a slight change in the formula, which I’ll explain in the last section. If you like a good challenge, try to find out why we don’t find all solutions, and try to modify the formula to provide all complex solutions of the cubic formula (if you need a hint, take a look at the second paragraph).

Simplifying the result

In retrospect, it would be easier to use variables that equal and instead. Using this, and some other simplifications, we find the solution

where and

for the equation

What about the other solutions?

As we know from the fundamental theorem of algebra, a cubic equations has three (complex) solutions. Yet the formula we found only provides a formula for one of those roots. Indeed we missed some solutions. Specifically, when we took the cube root of and , we only considered the single real solution. Instead, there are three solutions, which can be deduced from the fundamental theorem of algebra. Specifically, for a real number , the formula

has the solutions , and . We missed the last two solutions. However, if we consider all those solutions in and , we obtain 8 new solutions, which is more than expected. Not all of these are indeed solutions of the cubic equation. While all our solutions satisfy , the stronger condition is not always satisfied. This condition can be satisfied by making sure that we pick solutions and for and (so that ).

Statement of the final result

Theorem: Consider the equation . Let , , and a cube root of unity. Substituting in

gives all three solutions of the equation. Moreover, if , the solution for is real.

Proof: Substitute the expression in , simplify, and observe that the result is 0. If and , the expression involves only real numbers, so the result will be real as well.