Monodromy Groups and the Insolvability of Transcendental Equations in Quadratures
A. Allemand, A. Ya. Belov, R. V. Zaytsev
Abstact.
This paper presents the authors’ results in applying Arnold’s method to compute the monodromy groups of certain trigonometric complex equations, and also provides a survey of other results in this area.
Our primary goal in this paper is to compute the monodromy groups for several equations of interest. The transcendental equations investigated here are not arbitrary; they are motivated by their appearance in mathematical physics and spectral theory. Equations of the type are canonical in this regard: they arose in Fourier’s 19th-century analysis of the heat equation, where their roots correspond to the eigenvalues of a Sturm-Liouville operator, as noted by F. Klein [5]. A similar result was obtained by Fourier, but here we present a new perspective and proof using modern methods, which interestingly leads to the same conclusion. A. Elishev demonstrated this fact in modern terms [2], and we include his argument.
Modern analogues, such as the equation studied by Heifetz for acoustic wave propagation, further highlight the physical importance of both the real roots (propagating modes) and the complex roots (resonances) [3]. We include a full summary of his results, as the original was published in a difficult-to-access Soviet journal. This motivates a deeper study of the complete root structure of such equations under parameter variation, which is what their monodromy groups describe.
Our primary tool is the topological Galois theory developed by V. I. Arnold and later by A. G. Khovanskii [4]. This theory connects the solvability of an equation to the algebraic properties of its monodromy group. The notion of solvability can be made precise through a hierarchy of function classes. Starting with elementary functions (built from rational functions, , , and algebraic operations), one can define functions solvable in quadratures (allowing integration), and recursively, in -quadratures (meaning functions obtained by applying up to nested integrations). The union of all such classes forms the class of functions solvable in generalized quadratures.
The main theorem of topological Galois theory, in its strong form due to Khovanskii, addresses this entire hierarchy:
Theorem.
If the solution of an equation , considered as a function of the parameter , is representable in generalized quadratures, then the monodromy group of the equation is solvable.
In this paper, we use the contrapositive of this theorem. By showing these monodromy groups to be unsolvable (e.g., containing or ), we prove that these equations are not solvable even in the broad class of generalized quadratures.
Our approach follows the constructive spirit of Arnold’s method. The core idea is to analyze the behavior of the roots of the equation as the parameter traverses a closed loop in the complex plane. The roots themselves trace continuous paths, which may not be closed, resulting in a permutation of their initial positions. This permutation defines an element of the monodromy group. Non-trivial permutations are generated when these loops encircle the critical values of — the images of points where the derivative vanishes. Examples of application of this method with illustrations can be found in our previous paper [1].
Therefore, our work in the subsequent sections will be to identify these critical values and deliberately construct paths around them to generate a set of permutations sufficient to prove the unsolvability of the corresponding monodromy group.
2 The Case of a Self-Adjoint Function
Let the function have the following properties:
1.
The function is self-adjoint, i.e., .
2.
The critical points of (where ) have multiplicity 3.
3.
The equation for a real has a unique real root.
From complex analysis, we know that traversing a critical point must permute three roots in a cycle. Since the function is self-adjoint, its set of roots consists of one real root and pairs of complex conjugate roots. Consequently, when traversing a critical point, the unique real root and a pair of complex conjugate roots must be permuted.
By traversing one critical point, we obtain the permutation of roots , where denotes the unique real root. By traversing another critical point, we get the permutation . Together, these two permutations generate the alternating group . Thus, the monodromy group of the equation is unsolvable, which implies that the equation is not solvable in elementary functions.
It remains for us to verify that the following equations satisfy these initial conditions, from which their insolvability will immediately follow. Note that although we do not provide the proof here, the same method is applicable to simpler equations like and .
2.1 Insolvability of
Let us prove the insolvability of the equation
where the function is the difference between a composition of sines and the identity function. For simplicity, we will show that conditions 1–3 are met for the simple composition
as the proof for more complex compositions is completely analogous. First, let us find the critical points of the function:
Clearly, both and must be simultaneously equal to either or . The latter is impossible, because at the points , the sine is zero, and the cosine of zero is one. Consequently, the critical points of the function will be
At these points,
We find that our points have multiplicity 3. Furthermore, a real root exists for any real and is unique due to the strict monotonicity of the function, since
at all points except . The other roots will come in complex conjugate pairs because the composition of two self-adjoint functions is self-adjoint.
2.2 Insolvability of
Now we will show that conditions 1–3 are satisfied for
Let us find the critical points and check their multiplicity:
Furthermore,
In this case, there is a unique real root because
at all points except . As before, the other roots will come in complex conjugate pairs because the composition of two self-adjoint functions is self-adjoint.
3 Insolvability of
Let us prove the insolvability of the equation
First, we find the critical points:
Let us check the multiplicity of these points:
A consequence of Rouché’s theorem is that the equation has only real roots (a detailed proof is given in [6]). Consequently, if we let the parameter in the equation approach zero, three roots will merge into one point.
The function is self-adjoint, which implies that its roots must come in complex conjugate pairs. In this case, the function is strictly monotonic on each interval , which means that the equation has a unique root on each such interval. The situation is similar to the previous cases, except that the real root is not unique. Nevertheless, due to the multiplicity of 3, in the vicinity of a critical point, one real root and a pair of complex conjugate roots will also be permuted. We will construct a path for the parameter in such a way that a real root and a complex conjugate root from the upper half-plane are exchanged.
We modify the method used in the previous section. Let us fix the value and denote two complex roots as and , where is in the upper half-plane. Let us denote the real roots as , where as .
We start with the configuration of real roots:
Consider a path in the parameter plane of that starts at and moves left along the real axis, bypassing each critical value along a small semicircle (we choose the one that causes an exchange between a real root and ).
When reaches the -th critical point, it loops around it in a small circle and returns along the same path. Let us examine what happens to the roots.
First, swaps places with , then with , and so on.
By the time we approach the -th critical point, the arrangement of roots becomes:
The root moves into the upper half-plane. Traversing the critical point causes a cyclic permutation . Repeating this traversal twice, we get:
Now is in the upper half-plane, and is in the lower. As we move back, all previous permutations are reversed: and swap places, then and , and so on.
As a result, we get the configuration:
where returns to its original place, while remains below.
If we now repeat the same operation for the -th critical point, we get the permutation:
and both complex roots will return to their places. This corresponds to the cycle .
Such 3-cycles for all generate the infinite alternating group, denoted . By analogy with , this group is a group of finitely supported permutations. More precisely, is the subgroup of consisting of all even permutations on the countable set of roots.
Since any 3-cycle is an even permutation, and any finite alternating group is generated by 3-cycles, our method allows us to construct any permutation from for any . As the group is unsolvable for , the group which contains all of them is also unsolvable.
4 Insolvability of an Equation related to
We now turn to another classical transcendental equation, . While a full analysis of its monodromy group is possible, for the purpose of demonstrating the method with simpler derivatives, we will analyze the related, but distinct, equation . It is important to note that these two equations are not equivalent. The analysis below pertains strictly to .
Let us find the critical points of :
In this case,
Thus, the critical points have multiplicity 2. The function of a real argument is strictly monotonic because
which means the equation has a unique real root.
First, for simplicity, let us consider the starting point .
Let us consider a path that leads from this real root to the -th critical point, then makes a semi-loop around it, swapping the roots, after which the new root returns along the same path.
Now let us derive the trajectory described by the parameter while the real root moves along the specified path. On the first segment of the path:
where is the real root (i.e., ), so
It is important to note that the loops bypass the "dangerous" points , since at :
Since (which is easy to prove), and the real part of is , the point is to the left of . This also shows that at the end of the first path segment, is located to the left of .
Let us denote . Then the second segment of the path is given by the equation:
Thus, also moves along a straight line parallel to the real axis.
On the third segment of the path, after the permutation of the roots, the parameter describes a loop around , which follows from the general theory. This trajectory is homotopically equivalent to a simple loop around the corresponding point.
Since a plane with a discrete set of points removed is homotopically equivalent to a wedge of circles, any loop around these points can be decomposed into a product of simple loops like the one shown above.
This means that any permutation can be realized by successive transpositions of the real root with the others. On the other hand, the permutation group is transitive, which is true in the general case.
Let us consider the decomposition of a permutation that maps to the real root (let us number the roots with natural numbers so that is the real root). As shown above, it will have the form:
Obviously, the transposition must be part of this product, otherwise the root would remain in its place. Consequently, the group contains all transpositions , which generate the group of all permutations of the countable set of roots that affect only a finite number of elements, known as the group of finitely supported permutations. is countable and contains all finite symmetric groups . The unsolvability of for implies the unsolvability of .
References
[1]A. Belov-Kanel, A. Malistov, and R. Zaytsev (2020)Solvability of equations in elementary functions.
Journal of Knot Theory and Its Ramifications 29 (02), pp. 2040005.
Cited by: §1.
[2]A. Elishev (2020)A brief comment on the paper "solvability of equations in elementary functions" by kanel-belov, malistov and zaytsev.
Journal of Knot Theory and Its Ramifications 29 (02), pp. 2040007.
Cited by: §A.2,
§1.
[3]A. I. Heifetz (1985)On the roots of a transcendental equation in the problem of shallow sea acoustics.
Akusticheskii Zhurnal 31 (2), pp. 258–263.
Note: (in Russian). English translation in: Soviet Physics, Acoustics, 31(2), 1985
Cited by: §A.1,
§1.
[4]A. G. Khovanskii (2008)Topological galois theory. solvability and unsolvability of equations in finite terms.
MCCME, Moscow.
Note: (in Russian)
Cited by: §1.
[5]F. Klein (1989)Lectures on the development of mathematics in the 19th century.
Vol. 1, Nauka, Moscow.
Note: (in Russian)
Cited by: §1.
[6]R. V. Zaytsev (2021)Topological theory of solvability of equations in elementary functions.
Master’s thesis, Higher School of Economics.
Note: (in Russian)
Cited by: §3.
×A. Belov-Kanel, A. Malistov, and R. Zaytsev (2020)Solvability of equations in elementary functions.
Journal of Knot Theory and Its Ramifications 29 (02), pp. 2040005.
Cited by: §1.
×A. Elishev (2020)A brief comment on the paper "solvability of equations in elementary functions" by kanel-belov, malistov and zaytsev.
Journal of Knot Theory and Its Ramifications 29 (02), pp. 2040007.
Cited by: §A.2,
§1.
×A. I. Heifetz (1985)On the roots of a transcendental equation in the problem of shallow sea acoustics.
Akusticheskii Zhurnal 31 (2), pp. 258–263.
Note: (in Russian). English translation in: Soviet Physics, Acoustics, 31(2), 1985
Cited by: §A.1,
§1.
×A. G. Khovanskii (2008)Topological galois theory. solvability and unsolvability of equations in finite terms.
MCCME, Moscow.
Note: (in Russian)
Cited by: §1.
×F. Klein (1989)Lectures on the development of mathematics in the 19th century.
Vol. 1, Nauka, Moscow.
Note: (in Russian)
Cited by: §1.
×R. V. Zaytsev (2021)Topological theory of solvability of equations in elementary functions.
Master’s thesis, Higher School of Economics.
Note: (in Russian)
Cited by: §3.
Appendix A Related Work in Functional Analysis
The equations we have studied are not merely mathematical curiosities; they arise in various physical and analytical contexts. In this appendix, we review two such connections.
A.1 Asymptotic Solutions
For the reader’s convenience and due to the difficulty of accessing the original Soviet-era source [3], this section provides a detailed summary of Heifetz’s work on the roots of a dispersion equation. The formulas obtained are used for numerical calculations and asymptotic investigation of the roots.
Heifetz considered a point acoustic source located in a homogeneous liquid layer on a homogeneous liquid half-space. This model, despite its simplicity, proves to be very useful in some cases (e.g., shallow sea) and has been the subject of many studies. The calculation of the eigenvalues of the corresponding boundary value problem reduces to solving the transcendental equation
where , and are the densities of the liquid layer and the lower half-space, respectively, , is the layer thickness, is the wave number of the layer, is the angular frequency of the source, is the speed of sound in the layer, is the speed of sound in the bottom, . Throughout, it is assumed that , , and the wave number for the bottom is real. Similar equations arise in other problems as well.
Depending on the method of calculating the integral representing the sound pressure field, it is necessary to consider either only the real roots lying on the so-called physical sheet of the Riemann surface of the radical (they generate normal, or propagating modes, and it is these roots that correspond to the eigenvalues of the problem), or also the complex roots lying on the non-physical sheet of the Riemann surface (virtual modes or resonances). The roots of the transcendental equation were previously investigated by Heifetz using asymptotic and numerical methods, and its parametric solution is known. However, the development of algorithms for the numerical solution of this equation, in particular for calculating complex roots, continued to attract the attention of researchers. This work is devoted to refining the location of the roots of the equation, deriving explicit formulas for all roots, and their investigation. Heifetz considered the case and . The case can be studied similarly.
Let us describe the location of the roots of the equation. It is obvious that there are no roots on the rays and , so to isolate the single-valued branches of the radical, it is convenient to make cuts along these rays. The complex plane thus cut is denoted by . We denote by the single-valued and continuous branches of the radical in , defined by the conditions , and consider in two holomorphic functions . The roots of lie on the physical sheet, and the roots of on the non-physical sheets of the Riemann surface.
To solve the two equations in , we transform the original transcendental equation to the form
Thus, we replace our original equation with a countable set of equations
where
It is easy to see that on the rays , , the original equation also has no roots. We denote the region , cut along these rays, by , and select the branch of the logarithm in by the condition (the choice of any other branch of the logarithm only leads to a renumbering of the roots). In the simply connected region , are single-valued analytic functions, the number of whose roots can be found by the argument principle. We find that on the physical sheet, the functions for have one real root , with and , , while for , have no roots. In other words, the original physical problem has propagating modes. (Here denotes the integer part of the number .)
On the non-physical sheet, the functions for have one real root , with , , and for , have two complex-conjugate roots , , with , , . The function has three roots for : and symmetric purely imaginary roots , , and for – five roots: and , . Finally, the function , where (the case of is analogous), also has two roots, which for but close to , are complex-conjugate, approach the real axis as increases, and at some value of the parameter merge to form a double "non-physical" real root , . With a further increase in , it splits into two simple real roots, the larger of which moves towards the edge of the cut – the point , reaches the edge at and then passes to the physical sheet – thus a new propagating mode is born. For the values and , Heifetz indicated that one can write down transcendental equations, which were not detailed there.
To obtain explicit formulas for the roots, another method is used: we consider only the roots of the functions , ; formulas for all other roots are derived analogously. Regardless of whether is complex or real (for and – see above), it is a root of a quadratic trinomial with real coefficients. Consider the integrals
where is a closed contour in containing the point inside. By calculating these integrals, first, by the residue theorem and, second, by deforming the contour so that it coincides with the boundary of the region , we obtain a system of two linear equations with respect to and , from which , where
The coefficients are defined through integrals:
where
where
These integrals converge slowly, but changes of variables or transform them into integrals with an exponentially decaying integrand, the calculation of which on a computer poses no difficulties. This method does not require the choice of an initial approximation and provides the same accuracy for roots with any numbers.
Table 1: Values of resonances and absorption coefficients
6
6.342
5.90
6.40
7
16.6009
16.7
16.8
8
25.2224
25.2
25.7
The table presents the values of the first three resonances of the considered model, calculated using the formulas above and rounded to , for the parameter values m, , m/s, m/s, , Hz. They correspond to the complex roots of equation (1) with numbers . We denote by the absorption coefficient of the -th mode. For comparison, the table also shows and – the corresponding values of the same quantities, calculated by another method in other works.
In addition to numerical calculations, the formulas obtained allow for the study of the asymptotic behavior of the roots. Along with the known asymptotics of large-modulus complex roots , , we obtain, for example, for the roots corresponding to propagating modes, the relation as and for a fixed , , as well as an asymptotic for complex roots useful in the case of a shallow sea, when there are no normal modes, as and for a fixed : , with the estimate being uniform in .
A.2 Sturm-Liouville Operators
In a related work, A. Elishev [2] described another way to prove the reality of the roots of the equation . We summarize his argument here, with the author’s permission.
First, it is necessary to prove that all zeros of are in fact given by the square roots of the eigenvalues of some regular Sturm-Liouville operator. Indeed, let us define the space as the space of complex-valued square-integrable functions defined on the unit interval , with the inner product given by
Suppose that is the subspace of consisting of twice-differentiable functions for which and are square-integrable, and such that
and
Then the operator , defined on is a positive semi-definite operator corresponding to a regular Sturm-Liouville problem with separated boundary conditions. Consequently, by a fundamental result of Sturm-Liouville theory, together with positive semi-definiteness, the eigenvalues of form an increasing sequence.
Note that each eigenvalue has multiplicity one, and also that is self-adjoint — this can be checked directly by integration by parts. We establish its positive semi-definiteness.
The explicit form of the eigenfunctions (unique up to a scalar factor for each eigenvalue) is a trivial exercise. First, we obtain for
Then, for , , the boundary value problem is
The general solution is given by . The first boundary condition implies , while the second yields the equation
or (if )
Consequently, the spectrum of is given by the set
Now, since is self-adjoint, its spectrum is real, which means that any solution to
is either real or purely imaginary, . The purely imaginary case leads to the equation for :
which has no (real) solutions other than . Thus, the proof of the lemma is complete.
Elishev notes that each positive eigenvalue corresponds to two solutions of the equation, given by and . This is expected since is an odd function.
The situation described in this comment is typical for the realization of the zeros of meromorphic functions as spectra of self-adjoint operators. Often they correspond to the Hamiltonians of quantum-mechanical systems; several other transcendental equations can be handled in this way.
The example provided here by Elishev (along with other easily obtainable examples of meromorphic functions whose zeros are spectra of self-adjoint operators) is quite straightforward; at the same time, on the opposite end of the complexity scale lies, as is well known, the conjectured spectral realization of the non-trivial zeros of the Riemann zeta function. In fact, as Elishev notes, there is a significant amount of evidence in favor of a spectral approach to the Riemann hypothesis, developed, in particular, in the deep and far-reaching work of Alain Connes and his school. Simple examples, such as the one considered in this comment, can, in Elishev’s opinion, serve as illustrations of the powerful methods of operator theory and its potential applications to long-standing open problems.