I know the formulas for solving cubic equation, but when I try to use them in both Cardano's method and https://en.wikipedia.org/wiki/Cubic_function#Algebraic_solution, I usually encounter something like Cube root of numbers such as $2+11i$ or nested radicals, where square root is inside of cube root. According to hypergeometric's answer to my previous question, I would need to solve another cubic equation to trisects the argument angle to find the cube root and there is similiar problem with the denesting the cube root. So is there some algorithm to analytically solve cubic equation without guessing and rounding (I want to solve it symbolically)? How do the computer algebra systems like Wolfram Alpha do it?
-
9What is unsatisfactory about a result such as $\sqrt[3]{2+11i}$? – JMoravitz Jul 02 '18 at 18:15
-
3The general way to solve them is the Cardano formula (which was discovered by Tartaglia). If a simpler method would exist, likely it had been already found. There are significant simplifications for many specific cases. Before the discovery of the Cardano-formula, the solution was considered generally unlikely to exist, and many math contests had the tasks of solving 3rd-grade equations with ad hoc, improvised methods. These may be even today useful, because too many nested radicals may be unfeasible. – peterh Jul 02 '18 at 18:19
-
3See the Wikipedia page Casus irreducibilis and this google search. For a lot of literature references and historical information, see my 26 September 2005 sci.math post. – Dave L. Renfro Jul 02 '18 at 18:44
-
@JMoravitz Because it is not in the simplest form. But sometimes are the obtained results even worse, for example nested radicals. When I try to solve $x^3+x^2+x-3=0$, one root is obviously 1. But from the method on Wikipedia I get $-\frac{1}{3}(1+\sqrt[3]{18\sqrt{6}-44}-\sqrt[3]{18\sqrt{6}+44})$. So how do the computer algebra systems get the nice results they give? – Martin Schmied Jul 03 '18 at 07:35
-
1So how do the computer algebra systems get the nice results they give? --- Probably by methods/algorithms for denesting radicals. Of course, when the root is rational (and the equation has integer coefficients), as is the case with your example, the standard high school rational roots theorem will give you the desired result. – Dave L. Renfro Jul 03 '18 at 11:24
-
@DaveL.Renfro Nevertheless don't I have to solve another cubic equation for denesting cube root? I know about the rational roots theorem, but I am looking for the most general solving algorithm. However it seems I will have to use it in some cases like checking if the case with three real roots is really irreducible and maybe in some cases with nested radicals as well. Or is there a more general method? – Martin Schmied Jul 03 '18 at 12:13
-
4Some of these Stack Exchange questions/answers might be of interest: How to check if a quadratic surd is a perfect cube? AND Simplification of expressions containing radicals AND How does one evaluate $\sqrt[3]{x + iy} + \sqrt[3]{x - iy}$? AND others under "Linked" (see right side) at this places. – Dave L. Renfro Jul 03 '18 at 15:39
-
@DaveL.Renfro Thanks for help. – Martin Schmied Jul 04 '18 at 14:49
-
The mathoverflow question algorithm for finding radical expressions of all conjugates of an arbitrary algebraic number expressed in radicals is somewhat relevant, although most of what's there is more advanced than the other things I've cited. There's also the 2015 book The Unattainable Attempt to Avoid the Casus Irreducibilis for Cubic Equations by Sara Confalonieri, which I only learned about today so I haven't seen it. – Dave L. Renfro Jul 07 '18 at 08:12
-
1Bear in mind that $r:=\sqrt{a^2+b^2},,\theta:=\operatorname{atan2}(y,,x)$ implies $\sqrt[3]{x+yi}=\sqrt[3]{r}\exp \frac{i\theta}{3}\omega^n$ with $\omega:=\exp\frac{2\pi i}{3},,n\in{0,,1,,2}$. The point of Cardano's method isn't to avoid cube roots; it's to reduce the problem of solving arbitrary cubics to the problem of solving $z^3-w=0$. – J.G. Jul 08 '18 at 08:09
1 Answers
There are a number of similar questions, for example
Cubic roots and Cardano formula
Cube root of numbers such as $2+11i$
The core of all questions seems to be an uneasy feeling about cubic roots of complex numbers because "they are not really computable".
What is the difference between a purely real expression like $\sqrt[3]{2}$ and a complex expression like $\sqrt[3]{2+11i}$?
$\sqrt[3]{2}$ seems to be very familiar whereas one might be a little skeptical about $\sqrt[3]{2+11i}$ - how to compute the latter?
But is it really "easier"to compute $\sqrt[3]{2}$ than $\sqrt[3]{2+11i}$? I believe the answer is "no". If you want to compute $\sqrt[3]{2}$ you need an algorithm producing a sequence of rational approximations $x_n$ converging to $\sqrt[3]{2}$ plus a concrete estimate of $\lvert \sqrt[3]{2} - x_n \rvert$ so that you know when to stop the computation. There are very simple algorithms to compute cubic roots of real numbers, but there are also algorithms to compute cubic roots of complex numbers. These are somewhat more complicated, but there is no philosophical difference concerning the iterative approach.
Let me close with some remarks concerning the Cardano formula. In any cubic equation $$x^3 + a_2x^2 +a_1x + a_0 = 0 \tag{1}$$ we can substitute $x = y - \frac{1}{3}a_2$ and obtain the cubic equation $$y^3 +ay = b \tag{2}$$ where $a = a_1 - \frac{1}{3}a_2^2$ and $b = -\frac{2}{27}a_2^3 + \frac{1}{3}a_1a_2 - a_0$.
The solutions of $(2)$ are given as follows. Set
$$R = \frac{b^2}{4} + \frac{a^3}{27} , \tag{3}$$
$$w_+ = \sqrt[3]{\frac{b}{2} + \sqrt{R}}. \tag{4}$$
Although at first glance $w_+$ seems to be uniquely determined, it involves two choices: The square root has two values, the cubic root three values in $\mathbb{C}$. Let us adopt the following conventions:
For $x \ge 0$ we let $\sqrt{x}$ denote the nonnegative square root of $x$, for $x <0$ we define $\sqrt{x} = i\sqrt{-x}$.
For any $x \in \mathbb{R}$ we let $\sqrt[3]{x}$ denote the real cubic root of $x$.
For a non-real $z \in \mathbb{C}$ we do not get explicit about the choice of $\sqrt[3]{z}$ among the the three complex (non-real) cubic roots of $z$.
This gives a standard interpretation of $\sqrt[3]{\frac{b}{2} + \sqrt{R}}$ as a real number for $R \ge 0$ and leaves a single choice (of a complex cubic root) for $R < 0$.
Note that $w_+ = 0$ if and only $\sqrt{R} = -\frac{b}{2}$, i.e. $b \le 0$ and $a = 0$. This case (in which $y^3 = b$) is trivial and could also be omitted. Observe that $\frac{b}{2} - \sqrt{R} = b$ in this case.
Now define
$$w_- = \begin{cases} -\dfrac{a}{3w_+} & w_+ \ne 0 \\ \phantom{..} \sqrt[3]{b} & w_+ = 0 \end{cases} \tag{5}$$
It is easily verified that $w_-$ is a cubic root of $\frac{b}{2} - \sqrt{R}$. If $w_+$ is real (which happens precisely when $R \ge 0$), then also $w_-$ is real so that it is the standard interpretation of $\sqrt[3]{\frac{b}{2} - \sqrt{R}}$ as a real number.
So far we specified cubic roots $w_\pm$ of $\frac{b}{2} \pm \sqrt{R}$. These cubic roots are not independent, but satisfy the relation $$3w_+w_- + a = 0 . \tag{6}$$
The solutions of $(2)$ are then $$y_0 = w_+ + w_- \tag{7}$$ $$y_1 = \zeta_1 w_+ + \zeta_2 w_- \tag{8}$$ $$y_2 = \zeta_2 w_+ + \zeta_1 w_- \tag{9}$$ where $\zeta_1 = -\frac{1}{2} + \frac{\sqrt{3}}{2}i, \zeta_2 = -\frac{1}{2} - \frac{\sqrt{3}}{2}i = \zeta_1^2 = \overline{\zeta_1}$ are the two complex third unit roots. This is easily verified by inserting into $(2)$. With $\zeta_0 = 1$ we may also write $$y_k = \zeta_k w_+ + \zeta_k^2 w_- = \zeta_k w_+ + \overline{\zeta_k} w_- \tag{10}$$ Note that the $\zeta_kw_+$ are the three complex cubic roots of $\frac{b}{2} + \sqrt{R}$ and the $\zeta_kw_-$ are the three complex cubic roots of $\frac{b}{2} - \sqrt{R}$. Thus all solutions of $(2)$ are sums of certain complex cubic roots of $\frac{b}{2} + \sqrt{R}$ and $\frac{b}{2} - \sqrt{R}$.
The Cardano formula is frequently reported in the sloppy form that $$\sqrt[3]{\frac{b}{2} + \sqrt{R}} + \sqrt[3]{\frac{b}{2} - \sqrt{R}} \tag{11}$$ is a solution of $(2)$. This is misleading since it suggests that
there is always a standard interpretation of $\sqrt[3]{\frac{b}{2} \pm \sqrt{R}}$,
two separate computations for $\sqrt[3]{\frac{b}{2} \pm \sqrt{R}}$ must be performed.
But 1. is true only for $R \ge 0$, and in that case $(11)$ is in fact a solution of $(2)$. However, there are two more solutions described by $(10)$ and which involve other values of the cubic roots of $\frac{b}{2} \pm \sqrt{R}$. And, more important, 2. is not true because $w_-$ can easily be derived from $w_+$ via $(5)$. This is always easier than computing $\sqrt[3]{\frac{b}{2} - \sqrt{R}}$ ab ovo, even for $R \ge 0$. For an even easier alternative in case $R < 0$ see the discussion below.
Let us now consider three cases for $R$.
If $R > 0$, then we get one real solution $y_0$ (which may be expressed by $(11)$ in the standard interpretation) and two non-real solutions $y_1, y_2$ which are complex conjugate. All solutions can be expressed by using only square and cubic roots of real numbers.
If $R = 0$ we obtain the three real solutions $y_0 = 2\sqrt[3]{\frac{b}{2}}$ and $y_1 = y_2 = -\sqrt[3]{\frac{b}{2}}$.
The case $R < 0$ (casus irreducibilis) is most interesting. Note that it can only occur when $a < 0$. We have three distinct real solutions, but $w_+, w_-$ are non-real. They are cubic roots of the complex conjugate numbers $\frac{b}{2} \pm i \sqrt{-R}$, but recall that these cubic roots cannot be chosen independently because they are subject to $(6)$. If we make any choice for $w_+$ as a cubic root of $\frac{b}{2} + i \sqrt{-R}$, then clearly $\overline{w_+}$ is a cubic root of $\overline{\frac{b}{2} + i \sqrt{-R}} = \frac{b}{2} - i \sqrt{-R}$. We have $\lvert w_+ \rvert^3 = \lvert \frac{b}{2} + i \sqrt{-R} \rvert = \sqrt{-a^3/27}$, i.e. $\lvert w_+ \rvert = \sqrt{-a/3}$. This implies $3w_+\overline{w_+} + a = 3\lvert w_+ \rvert^2 + a = 0$ which means $$w_- = \overline{w_+} . \tag{12}$$ This is even simpler than $(5)$ and it moreover seems completely natural to choose complex conjugate cubic roots for the complex conjugate numbers $\frac{b}{2} \pm i \sqrt{-R}$.
Anyway, we get $$y_k = \zeta_k w_+ + \overline{\zeta_k} w_- = \zeta_k w_+ + \overline{\zeta_k} \overline{w_+} = \zeta_k w_+ + \overline{\zeta_k w_+} = 2 Re(\zeta_k w_+) . \tag{13}$$
Note that $\zeta_k w_+$, $k = 0,1,2$, are the three complex cubic roots of $\frac{b}{2} + i \sqrt{-R}$.
It is known that in general it is impossible to express any of these three real solutions in terms of roots of real numbers. See
https://en.wikipedia.org/wiki/Casus_irreducibilis
Those who are historically interested and read German may also look at
Hölder, Otto. "Über den Casus irreducibilis bei der Gleichung dritten Grades." Mathematische Annalen 38.2 (1891): 307-312.
Summarizing, the recipe to solve $(2)$ is the following: Compute $w_+$ as a cubic root of $\frac{b}{2} + \sqrt{R}$, then if $R \ge 0$ determine $w_-$ via $(5)$ and insert in $(10)$, and if $R < 0$ insert in $(13)$.
Remark 1:
The special case $a < 0, b = 0$ is interesting. The equation $y^3 + ay = 0$ has the obvious solution $y_1 = 0$. This reduces the problem to $y^2 + a = 0$ and gives $y_{2,3} = \pm \sqrt{-a}$.
Nevertheless we are in the casus irreducibilis: We have $R = \frac{a^3}{27} < 0$. This gives $\sqrt{R} = i \sqrt{\frac{-a^3}{27}}$ and $w_+ = \eta \sqrt{ \frac{-a}{3}} = \eta \frac{\sqrt{-a}}{\sqrt 3}$, where $\eta$ is one of the three complex cubic roots of $i$. Taking $\eta = -i$, we get $Re(w_+) = 0$, thus $y = 0$ is a solution found by Cardano's formula. The solutions $y = \pm \sqrt{-a}$ are obtained by taking $\eta = \pm \frac{\sqrt{3}}{2} + \frac{i}{2}$.
Remark 2:
Cardano mentioned that his formula $(11)$ fails in the casus irreducibilis, certainly because he did not know the general concept of complex numbers (although he had an idea about square roots of negative numbers). Rafael Bombelli (1526 - 1572) seems to be the first who really used complex numbers (although be did not introduce a proper concept of them). For the equation $x^3 = 15x + 4$ - which is casus irreducibilis - he obtained the solution (written in modern terms) $$x = \sqrt[3]{2 + \sqrt{-121}} + \sqrt[3]{2 - \sqrt{-121}}$$ and showed that a cubic root of $2 \pm \sqrt{-121}$ is given by $w_\pm = 2 \pm \sqrt{-1}$. Thus he obtained $x = 4$ which is in fact an exact real solution found by Cardano's formula. However, in a a sense he had good luck since his choices of cubic roots satisfy $(12)$.
Bombelli's example shows that in the casus irreducibilis it is sometimes possible to find a simple explicit expression for $w_+$ which allows to find the exact values of the three real solutions.
Remark 3:
To use Cardano's formula it is essential to determine the values $\sqrt[3]{\frac{b}{2} \pm \sqrt{R}}$. In Remarks 1 and 2 we have done this for some special cases. A more general approach can be found in When does $r + \sqrt s$ with $r,s \in \mathbb Q$ have a cubic root of the form $u \pm \sqrt v$ with $u,v \in \mathbb Q$?
Note, however, that the explicit computation of $\sqrt[3]{\frac{b}{2} \pm \sqrt{R}}$ requires that we know a solution of $(2)$ by applying the Rational root theorem. To put it pointedly: In some cases we can determine $\sqrt[3]{\frac{b}{2} \pm \sqrt{R}}$ by making finitely many trials.

- 76,394
- 12
- 43
- 125
-
1+1 For the casus irreducibilis wikipedia might be more accessible than the 1891 reference in German. https://en.wikipedia.org/wiki/Casus_irreducibilis – Ethan Bolker Jul 07 '18 at 13:05
-
-
@EthanBolker I changed my answer, thank you again for your comment. – Paul Frost Jul 08 '18 at 10:48