Preprint
Article

On Some Specific Order Estimates for the Mertens Function and Its Relation to the Non-trivial Zeros of $\zeta(s)$

Altmetrics

Downloads

239

Views

104

Comments

0

This version is not peer-reviewed

Submitted:

25 July 2024

Posted:

26 July 2024

You are already at the latest version

Alerts
Abstract
The primary purpose of this article is to provide a brief overview about the notion of the \textit{Mertens Function} $M(n)$ and \textit{Redheffer Matrices} $\mathbb{A}_{n}$ and, to derive specific order estimates for $M(n)$, in order to analyze the location of the non-trivial zeros of the \textit{Riemann Zeta function} $\zeta(s)$. A priori using the above concepts, the article aims at establishing a necessary and sufficient condition for the \textit{Riemann Hypothesis} to hold true.
Keywords: 
Subject: Computer Science and Mathematics  -   Algebra and Number Theory

MSC:  Primary 11-02, 11A25, 11N37, 11N56, 15A18; Secondary 15-02; 11Y70

1. Introduction & Motivation

Arguably one of the most intriguing problems in the field of pure mathematics, especially Number Theory to be more precise, it is by no means, a surprise that the famous "Riemann Hypothesis" is still considered as an open problem, and has drawn tremendous interest from mathematicians and even scholars from other areas of science, for example, Quantum Physics, Computer Science etc. since more than two centuries.
Among all the equivalent versions of the statement of the Riemann Hypothesis, its extremely rare that one intends to approach the problem using concepts of Advanced Linear Algebra and Advanced Matrix Theory. Although every possible headway in this daunting endeavour is extremely appreciable, its absolutely correct to say that, over the years, there have been several instances of outstanding claims made by pioneers in this field, which gave their successors enough motivation to observe this problem from a different perspective altogether.
This article is devoted to provide an exposition on one such method proposed by Redheffer [48] in the late 1900’s, when he defined formally a special kind of non-symmetricmatrices, which was eventually named after him as Redheffer Matrices, usually denoted as A n . These matrices were "special" in a sense that, apart from its unique spectral properties, Redheffer claimed and proved that,
A necessary and sufficient condition for the statement of Riemann Hypothesis to be true is , d e t ( A n ) = O n 1 2 + ϵ , for every ϵ > 0 .
Roesler [54] also introduced another type of non-symmetric matrices named after him, with similar properties (Section 5) like Redheffer Matrices. Furthermore, he also established a condition which is necessary and sufficient for Riemann Hypothesis to be true.
One can, in fact, observe that, in terms of determinant, these two matrices are very closely related to the Mertens function  M ( n ) , which, by definition, can also be interpreted as a summatory function of the M o ¨ bius Function μ ( n ) (2.2.1) (pp. 91 [59]).
As simple as it might seem, to be honest, understanding the true essence of these results and their implications in Linear Algebra and Number Theory takes a humoungous effort from any motivated indivudual. As a prerequisite for studying these results in detail, we shall familiarize ourselves with the notion of Mertens Function and Redheffer Matrices in the earlier sections of 3 to 5, along with deriving the relation between these two.
In addition to studying the Spectral Properties for A n in section 5, our aim is to focus on trying to estimate the order of M ( n ) and even for the extended Mertens Function M ( x ) (2.2.1), a significant application of which is to analyze the location of zeros of the Riemann Zeta Function ζ ( s ) . [ section 6]
Another important use of order estimates of M ( n ) is to find counterexamples in support of disproving the statement of the famous Mertens Hypothesis [ref. (3) ]. In section 7, we shall discuss several numerical evidence and theoretical justifications in the form of conjectures proposed by experts in this field, where our objective is to learn the approach opted by researchers in order to assess the optimal value of n for which the first such violation to the statement of Mertens Hypothesis occurs. Although the deductions are far from acceptable, and realistically not completely verifible by rigorous computation, thus it is still considered to be an active area of research. Although almost all the results and notations pertinent to the contents of this paper has been explained properly to the best of one’s ability, the paper contains several references, which should be resourceful to any motivated indivual interested in pursuing further studies or maybe research in any of these topics.

Notations and Abbreviations

  • a | b             b is divisible by a
  • a b            b is not divisible by a
  • x            Greatest Integer less than or equal to x
  • x            Least Integer greater than or equal to x
  • R e ( s )          Real part of a complex number s C
  • RH            Riemann Hypothesis
  • MH            Mertens Hypothesis

2. A Survey on Mertens Function

2.1. Notion of Arithmetic Functions

We start with some standard definitions in Analytic Number Theory.
Definition 2.1.1. (Arithmetic Function) These are real or complex valued functions defined on the set of natural numbers N .
In this section, we shall define some specific examples of arithmetic functions pertinent to the concept of Mertens Function.
Definition 2.1.2. (M o ¨ bius Function) The M o ¨ bius Function μ : N { 0 , ± 1 } is defined as follows:
μ ( n ) : = ( 1 ) k if n = i = 1 k p i a i such that , g c d ( p i , p j ) = 1 i j 1 if , n = 1 0 otherwise .
One can in fact use definition (2.1.2) to deduce the following property regarding the M o ¨ bius Function.
Theorem 2.1.1. 
d | n μ ( d ) = 1 n = 1 i f , n = 1 0 o t h e r w i s e .
With all the neccessary analytic tools already discussed in this section, we can formally introduce the notion of a specific arithemtic function proposed by F. Mertens in his paper [9].

2.2. Formal Definition of M ( n )

Definition 2.2.1. (Mertens Function) The Mertens Function M : N Z is defined as,
M ( n ) : = d = 1 n μ ( d )
Remark 2.2.1. In general, we can define the Extended Mertens Function as,
M ( x ) : = 1 n x μ ( n ) , x R
In his paper [9], Mertens conjectured that, for all M ( n ) with 1 n 10 4 , we shall have,
| M ( n ) | < n
This is also known as the Mertens Hypothesis. [ Interested readers can refer to [2] [Theorem 14.28, pp. 374]]
Extending Mertens’ results further upto n = 5 × 10 6 , Sterneck [10] conjectured that,
| M ( n ) | < 1 2 n , n > 200 .
The primary objective for Mertens behind introducing the function M ( x ) (As defined in (2.2.1)) was its underlying relation to the location of the zeros of the Riemann Zeta Function ζ ( s ) , the reason being largely due to it’s consequences for the distribution of the primes, also hailed as one of the most important unsolved problems in Analytic Number Theory.

3. The Riemann Hypothesis Revisited

In this section, we introduce our readers to the famous Riemann Zeta Function, first introduced by Leonhard Euler, later extended by Bernhard Riemann, which is the integral part of the Riemann Hypothesis, and also has applications in the field of Modular Forms, Class Field Theory etc.

3.1. Riemann Zeta Function ζ ( s )

Definition 3.1.1. (Riemann Zeta Function) The Riemann Zeta Function  ζ ( s ) is defined as,
ζ ( s ) : = n = 1 1 n s w h e r e s C , R e ( s ) > 1 .
We can, in fact extend ζ ( s ) to the whole complex plane using the Analytic Continuation Property of this function defined for R e ( s ) > 1 . Interested readers may refer to the book written by L. Ahlfors [6].
Using the Euler Product Formula, we can pursue an alternative approach to define ζ ( s ) as,
Definition 3.1.2. 
Riemann Zeta Function has the following Euler Product Representation,
ζ ( s ) = n = 1 1 n s = p = p r i m e s 1 1 p s
Where, the product on the R.H.S. is taken over all primes p, and converges for R e ( s ) > 1 .

3.2. Analytic Properties of ζ ( s )

We can indeed summarize some of the analytic properties of ζ ( s ) in a certain way as mentioned below.
Proposition 3.2.1. 
The following are true for ζ ( s ) :-
  • ζ ( s ) is meromorphic on C .
  • ζ ( s ) has a simple pole at s = 1 with residue 1.
  • ζ ( 1 ) is a Harmonic Series that diverges to + .
  • ζ ( 2 ) = π 2 6 . (Also known as the “Basel Problem”)
Using Definition (2.1.2) and expanding the R.H.S. in Definition (3.1.2) of ζ ( s ) , we can conclude,
Proposition 3.2.2. 
1 ζ ( s ) = n = 1 μ ( n ) n s
The Functional Equation of ζ ( s ) further allows us to assert that, the trivial zeros of ζ ( s ) occurs at negative even integers, i.e., at s = 2 k , ∀ k N .
Also, Riemannconjectured in his paper [5] that, all the non-trivial zeros of ζ ( s ) must lie on the critical strip, { s C : 0 < R e ( s ) < 1 } , moreover, they are symmetrically located about the real axis and the critical line, R e ( s ) = 1 2 .
Theorem 3.2.3.(Riemann Hypothesis) All the non-trivial zeroes of the Riemann Zeta Function ζ ( s ) lie on the critical line, R e ( s ) = 1 2 .
RH still remains one of Number Theory’s most elusive unsolved problems (also well-known as Problem 8 in Hilbert’s listof 23 unsolved problems, and also known as one of the Millenium Prize Problems), and it is still open till date.
However, one of the most important consequences of RH in Number Theory is it’s connection to the prime counting function  π ( x ) .
Definition 3.2.1. 
For each x 0 , we define,
π ( x ) : = The number of primes x .
In particular, it has been established by von Koch [11] in 1901 that, RH is equivalent to,
π ( x ) = L i ( x ) + O ( x ln x ) .
To be more specific, Schoenfield [12] proved in 1976 that, under RH,
| π ( x ) L i ( x ) | 1 8 π x ln x , x 2657 .
where, we define,
L i ( x ) : = 2 x 1 ln t d t .
It is important to observe that, we already have an explicit formula for π ( x ) and if we observe closely the structural similarity of this equation to the function L i ( x ) , thus approaching Riemann hypothesis using concepts of Linear Algebra, especially the notion of Redheffer Matrices ( which we shall discuss in the next section ) seems to be a promising direction.

4. Redheffer Matrices: An Introduction

4.1. Definition

Definition 4.1.1. (Redheffer Matrix) Often denoted by A n , Redheffer Matrix was first introduced by mathematician Redheffer (1977), and is defined to be a square ( 0 , 1 ) matrix of the form, A n : = ( a i j ) n × n such that,
a i j : = 1 i f , i | j 0 o t h e r w i s e .
for every 1 i , j n .
A priori given the invertibility of the Redheffer Matrices being relatively complicated due to the existence of the initial column of 1’s in the matrix, A n is often expressed as,
A n : = C n + D n
where, C n : = ( c i j ) n × n and, D n : = ( d i j ) n × n are the ( 0 , 1 ) matrices defined as,
c i j : = 1 i f f , i 1 , j = 1 . 0 o t h e r w i s e .
d i j : = 1 i f f , i | j 0 o t h e r w i s e .
for every i = 1 , 2 , 3 , . . . , n and, j = 1 , 2 , 3 , . . . , n .
For example, a 4 × 4  Redheffer Matrix can be represented in the following manner:
A 4 = 1 1 1 1 1 1 0 1 1 0 1 0 1 0 0 1 = 0 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 + 1 1 1 1 0 1 0 1 0 0 1 0 0 0 0 1
Here, one can observe that,
C 4 = 0 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0
And,
D 4 = 1 1 1 1 0 1 0 1 0 0 1 0 0 0 0 1

4.2. Relation with Mertens Function M ( n )

Theorem 4.2.1.(Determinant of A n ) We have, for the Redheffer Matrix A n ,
d e t ( A n ) = M ( n ) .
Proof. 
Proof uses the following result.
Lemma 4.2.2. 
The inverse of the matrix D n defined in (9) is defined as, D n 1 = δ i j n , where,
δ i j = μ j i i f , i | j 0 o t h e r w i s e .
Using Lemma (4.2.2), we obtain, d e t ( D n 1 ) = 1 . Hence, d e t ( A n ) = d e t ( D n 1 A n ) = d e t ( D n 1 C n + I n ) .Hence, D n 1 C n is lower triangular, so is D n 1 C n + I n , which implies that, d e t D n 1 C n + I n is the product of its diagonal entries.
Consequently,
d e t D n 1 C n + I n = k = 2 n μ ( k ) + 1 = M ( n ) .
   □

4.3. Characteristic Polynomial of A n

We start with stating a formal definition of a Directed Graph .
Definition 4.3.1. (Directed Graph) A directed graph, also called a digraph, is a graph in which each of the edges has a direction.
One method to derive the characteristic polynomial of A n is to use the concept of directed graph D ( M n ) corresponding to the matrix, M n : = x I n A n .
By definition (4.3.1), D ( M n ) has vertex set, Λ = { 1 , 2 , . . . . . , n } and a directed edge from vertex i to vertex jiff, m i j 0 . In other words, D ( M n ) will have a directed edge from i to jiff, i | j or, j = 1 . We denote it by the ordered pair ( i , j ) .
Definition 4.3.2. (Cycle) A set of directed edges { ( i 1 , j 1 ) , ( i 2 , j 2 ) , . . . . . , ( i k , j k ) } is called a cycle of length k if, j r = i r + 1 , ∀ r = 1 ( 1 ) k 1 , and j k = i 1 , and i 1 , i 2 , . . . . . , i k are distinct.
Moreover, suppose c = { ( i 1 , i 2 ) , ( i 2 , i 3 ) , . . . . . , ( i k , i 1 ) } is a cycle in D ( M n ) .
We set, P [ c ] = m i 1 i 2 m i 2 i 3 . . . . . m i k i 1 . Also, for any I Λ , we define M n [ I ] as the Principal Submatrixof M n with rows and columns in I . Assume C 1 to be the set of cycles in D ( M n ) containg the vertex 1, and for every c C 1 , denote its length by l ( c ) . Also, suppose c be the complement of the vertices connected by c in Λ .
Using ([16] Theorem 2, pp. 503), we can assert that,
d e t ( M n ) = c C 1 ( 1 ) l ( c ) + 1 . P [ c ] . d e t ( M n [ c ] ) .
We can further claim that, the principal submatrix M n [ c ] is upper-triangular [ a priori since, each c C 1 contains the vertex 1] having determinant equal to ( x 1 ) n l ( c ) . Moreover, for l ( c ) 2 , P [ c ] doesn’t contain any diagonal entry and precisely is equal to ( 1 ) l ( c ) . Since, the only cycle of length 1 is, c = { ( 1 , 1 ) } , hence, P [ c ] = x 1 .
Therefore, we obtain the characteristic equation of A n as,
d e t ( M n ) = ( x 1 ) n c C 1 l ( c ) 2 ( x 1 ) n l ( c )
which can be rewritten as,
d e t ( M n ) = ( x 1 ) n k = 2 n ( x 1 ) n k v n , k 1 .
where, v n , k denotes the number of cycles starting at vertex 1, of length k + 1 , in the graph D ( M n ) . In other words, if a k-list in Λ is a list { 1 , i 1 , i 2 , . . . . . , i k } with i r | i r + 1 , ∀ r = 1 ( 1 ) k 1 and 1 < i 1 < i 2 < . . . . . < i k n , then, v n , k shall denote the number of k-lists in Λ .
Remark 4.3.1. Similar computation for a particular case A 6 can be found in [17].
Remark 4.3.2. It has been deduced that,
v n , 1 = n 1 , and , v n , 2 = k = 2 n n k 1 .
Remark 4.3.3. Also, we can conclude that, each v n , k is positive and bounded above, as evident from the relation,
v n , k < n . ( log n ) k 1 ( k 1 ) !

4.4. Eigenvalues of A n

Theorem 4.4.1. 
The Redheffer Matrix A n has eigenvalue 1 with multiplicity n log 2 n 1 .
Proof. 
Here, instead of computing the eigenvalues using the characteristic polynomial of A n given by,
p n ( x ) = d e t ( x I n A n )
We work with the following polynomial,
q n ( x ) = d e t ( x I n + ( A n I n ) )
Observation: p n ( x ) and q n ( x ) are very closely related polynomials.
Thus using properties of determinants, we obtain,
p n ( x ) = ( 1 ) n d e t { ( 1 x ) I n + ( A n I n ) } = ( 1 ) n q n ( 1 x ) .
Implying that, each eigenvalue α j of A n can be expressed as ( 1 α j ) , where, α j is a root of q n ( x ) , ∀ j = 1 , 2 , 3 , . . . . , n .
Hence it only suffices to compute roots of q n ( x ) in order to deduce the eigenvalues of A n .
Using properties of directed graphs, we can associate a directed graph with any given square matrix, say, B : = ( b i j ) n × n , all of whose entries are either 0 or 1, by assigning a vertex to each row ( or column equivalently ) and including an edge between the i t h and the j t h vertices iff, b i j = 1 , ∀ i , j = 1 , 2 , 3 , . . . . , n .
Therefore, corresponding to the matrix ( A n I n ) , we can assert from (),
q n ( x ) = x n + k = 1 n ( 1 ) k 1 c ( n , k ) x n k
Where, c ( n , k ) denotes the cycles of length k in the directed graph corresponding to the matrix ( A n I n ) .
Now, for the proof of Theorem (4.4.1), observe that, in the directed graph of ( A n I n ) , the longest cycle is,
1 2 2 2 . . . . . . . . 2 log 2 n 1 .
      (Since, each entry in the cycle must be a non-trivial multiple of the previous one, and multiplying by 2 each time increases the cycle’s elements as slowly as possible . )
The cycle contains log 2 n + 1 elements, which yields, c ( n , k ) = 0 for k > log 2 n + 1 .
The lowest power of x occuring in q n ( x ) is, x n log 2 n 1 . Thus, q n has a root 0 with multiplicity n log 2 n 1 , implying that, p n has a root 1 with multiplicity n log 2 n 1 .
   □
With this information, we can make further deductions about the spectral radius of A n .

4.5. Spectral Radius of A n

Definition 4.5.1. (Spectral Radius) Spectral Radius of a matrix is defined to be the maximum absolute value of its eigenvalues.
Remark 4.5.1. A n has a positive spectral radius.
( A priori given the “Perron-Frobenius Theorem”, we can deduce that, A n clearly satisfies the irreducibility criterion mentioned in the theorem, hence, the statement follows. )
Remark 4.5.2. Barrett, Forcade and Follington [1] deduced the upper bounds for the co-efficients of the characteristic polynomial of A n . If ρ n denotes the spectral radius of A n , then using these bounds, they proved the following asymptotic relation for ρ n .
Theorem 4.5.3. 
lim n ρ n n = 1 .
Barrett proposed several interesting conjectures based on numerical evidence regarding the other eigenvalues of A n . Interested readers can find the details about this in [13].
One of the significant among them was,
Theorem 4.5.4. 
A n has a negative (real) eigenvalue of magnitude n , and the remaining eigenvalues are bounded and are close to the origin.
In ([15] Theorem 2, pp. 147), the existence of such a negative real eigenvalue has been proven. Furthermore, we can conclude that it is asymptotically equal to n as n , and also that, the remaining eigenvalues of A n have magnitude O n log n .
Again, given that most of the eigenvalues of A n are 1, we can comment on their corresponding eigenspaces.

4.6. Eigenspaces of A n corresponding to its Eigenvalues

Theorem 4.6.1. 
The eigenspace of A n corresponding to the eigenvalue 1 has dimension n 2 1 . In particular, A n is non-diagonalizable for n 5 .
Proof. 
Follows by obtaining the row-reduced echelon form of the matrix ( A n I n ) and then determining its nullity.
   □
A priori from the fact that, the eigenspaces of A n and A n T are isomorphic, we can comment further about the eigenspaces corresponding to the eigenvalues of A n that are not equal to 1 .
Theorem 4.6.2. 
Every eigenspace of A n with eigenvalue not equal to 1 has dimension 1 .
Proof. 
As for the proof, we use the following result.
Lemma 4.6.3. 
( w 1 , w 2 , . . . . . . , w n ) is an eigenvector of A n T with λ as an eigenvalue iff,
  • d | n w d = λ w n
  • j = 1 n w j = λ w 1
Observe that, if λ 1 be an eigenvalue of A n T , then, given a value for w 1 , we can uniquely determine the remaining w j ’s using Lemma (4.6.3) and the following recursive relation,
w j = 1 ( λ 1 ) d | j d j w d
Hence the result follows.    □
As a result of the above concept, we can introduce another arithmetic function in the next section.

4.7. The Arithmetic Function “ ν λ

4.7.1. Definition

Definition 4.7.1. 
The function ν λ : N R is defined for arbitrary λ 1 using the following recursive formula,
ν λ ( n ) : = 1 i f , n = 1 1 ( λ 1 ) d | n d n ν λ ( d ) o t h e r w i s e .
Remark 4.7.1. Definition (4.7.1) allows us to conclude that, λ is an eigenvalue of A n iff,
k = 1 n ν λ ( k ) = λ .

4.7.2. Dirichlet Series Representation

Given the definition of ν λ , we can obtain the following estimate regarding it’s corresponding Dirichlet Series.
V λ ( s ) = n = 1 ν λ ( n ) n s
As a consequence, we obtain,
Proposition 4.7.2. 
V λ ( s ) = n = 1 ν λ ( n ) n s = λ 1 λ ζ ( s )
Where, ζ ( s ) stands for the Riemann Zeta Function .
Proof. 
Using partial summation formula,
V λ ( s ) ζ ( s ) = n = 1 d | n ν λ ( d ) 1 n s = n = 1 λ ν λ ( n ) n s λ + 1 [ Sin ce , d | n ν λ ( d ) = λ ν λ ( n ) for every n N { 1 } ] = λ V λ ( s ) λ + 1 .
Solving for V λ gives us the result.
   □
Definition 4.7.2. 
If a k ( n ) denotes the number of ways of expressing a natural number n as a product of k factors in some particular order, then, we have the following recursive definition given by,
a k ( n ) = 1 i f , k = 0 , n = 1 0 i f , k = 0 , n 1 d | n a k 1 n d i f , k > 0 .
Using Definition (4.7.2), we can have another expression for ν λ .
Theorem 4.7.3. 
For any n N , we have,
ν λ ( n ) = 1 1 λ k = 0 a k ( n ) λ k
Where, λ > 1 .
Remark 4.7.4. The Redheffer Matrix does provide an alternative approach on different notions of divisibility. Being closely related to the Riemann Zeta Functions, it effectively bundles together the divisibility properties of many numbers into a single term, which can then be studied using techniques of Linear Algebra .

5. Order Estimates for M ( x ) and its relation to the non-trivial zeros of ζ ( s )

As mentioned previously in the introduction, among the various statements of the famous Riemann Hypothesis proposed by mathematicians over the years which are equialent to each other, only few of them have been approached using concepts of Linear Algebra and Advanced Matrix Theory.
Mertens [9] introduced the notion of the extended Mertens Function, M ( x ) and suggested a completely different method to analyze the zeros of the location of the Riemann Zeta Function ζ ( s ) . A priori for R e ( s ) > 1 , one can observe from Proposition (3.2.2),
1 ζ ( s ) = n = 1 μ ( n ) n s = n = 1 M ( n ) M ( n 1 ) n s = n = 1 1 n s 1 ( n + + 1 ) s = s 1 M ( x ) x s + 1 d x
If we assume, M ( x ) = O ( x ϵ ) for some ϵ > 0 , then the integral on the R.H.S. of (21) represents an analytic function in the half-plane, R e ( s ) > ϵ . Thus, 1 ζ ( s ) shall also be analytic in that region. Consequently, we can in fact conclude, ζ ( s ) 0 , which, in turn will imply [18],
π ( x ) = L i ( x ) + O x ϵ log x
Clearly the above relation justifies the significance of studying the order of M ( x ) .
Remark 5.0.5. In particular, it seems obvious that, RH follows from the statement of MH [(3)], which can be generalized as follows:
M ( x ) = O x 1 2
A slightly more rigorous computation of the statement in Remark (5.0.5) yields [2, Theorem 14.25C, pp.37],
Theorem 5.0.6. 
A necessary and sufficient condition for the Riemann Hypothesis to be true is,
M ( x ) = O x 1 2 + ϵ
for every ϵ > 0 .
Remark 5.0.7. Given the definition (4.1.1) of A n and the underlying relation between the redheffer matrices and M ( n ) [ See (4.2.1) ], we can relate to the result proved by Redheffer [48] that the statement of RH is true iff,
d e t ( A n ) = O n 1 2 + ϵ
for every ϵ > 0 .
Roesler also defined a special kind of matrix, B n : = ( b i j ) 2 i , j n such that,
b i j : = i 1 i f , i | j 1 o t h e r w i s e .
for every 2 i , j n .
Known as the Roesler Matrices, using these matrices, he established that in a similar manner as for Redheffer Matrices, RH holds true iff,
d e t ( B n ) = O n ! n 1 2 + ϵ
for every ϵ > 0 . It is important to observe that, both A n and B n are extensively related to M ( n ) via their determinants. It can be checked that, these matrices are not symmetric, and one has to compute many of their eigenvalues to estimate their respective determinants.

6. Conjectures based on the Order of M ( x )

This section provides a survey on various important numerical estimates on the bounds of M ( x ) , and conjectures proposed by pioneers in this field of research during a timeline spanning over more than half a century. Significant to observe that, many of these theories were enough to refute the validity of Mertens Hypothesis (2.3).
Neubauer deduced four isolated value of M ( x ) in his paper [20] for which von Sterneck’s conjecture () doesn’t hold true. One of the smallest such values of n is, M ( 7760000000 ) = 47465 .
Later on, Cohen and Dressfurther claimed and proved [21] that, the first violation of von Sterneck’s conjecture in the positive direction is, M ( 7725038629 ) = 43947 , whereas, credit for deriving the first violation of the theorem in the negative direction belongs to Dress[22], as he showed, M ( 330486258610 ) = 287440 .
Towards the end of the 20th century, Odlyzko and te Riele [23] astonishingly came up with the following bounds which which was sufficient to conclude tha, the statemnt of MH is indeed false along both the directions.
lim inf x | M ( x ) | x < 1.009
lim sup x | M ( x ) | x > 1.06
Although, Pintz established in his paper [24], that the violation occurs for some, x e 3 . 21 × 10 64 10 1 . 4 × 10 64 . As for further research in this area, interested readers can refer to the works of Jurkat [25,26], Spira [27], Jurkat and Peyerimhoff [28], te Riele [29], M o ¨ ller [30] and Anderson [31].
Many claim that, a priori assuming RH to be true, (5.4) is satisfied, although the statement of the generalized Mertens Hypothesis (5.3) is indeed false. One of the conjectures which supports this claim is as follows.
lim sup x | M ( x ) | x log log x = C
where one can obtain estimates for C in the form of, C = 6 2 π 2 ( L e ´ vy in a comment to Saffari [49] ), and C = 12 π ( Good and Churchhouse [32] ).
El Marraki [33] did provide one of the strongest unconditional order estimate for M ( x ) . He claimed,
M ( x ) = O x log 236 75 x
A similar result can be found in Walfisz’s article [34],
M ( x ) = O x exp A ( log x ) 3 5 ( log log x ) 1 5
for some costant A > 0 . One valid estimate for A can be obtained as, A = 0 . 2098 . ( Ford [35] )
There have been significant progress in the direction of obtaing estimates for M ( x ) of the form, | M ( x ) | < x K , for some constant K > 0 and ∀ x > x 0 for some x 0 R . Interested readers can read from the papers of Hackel [36], MacLeod [37], dress [38], Diamond and McCurley [39], Costa Pereira [40] and Dress and El Marraki [41].
Some more article pertinent to this topic can be cited as Landau [42,43] , te Riele [44,45] and, Odlyzko and te Riele [46].
Some of the authors even opted for rigorous computation of the function M ( n ) and the ratio M ( n ) n with high hopes of finding possible counterexample to MH. For refernce, one can study articles by Mertens [9], von Sterneck [10,50,51], Neubauer [20], Yorinaga [52], Cohen and Dress [21], dress [22], Lioen and van de Lune [53].

7. Research Prospects in Mertens Hypothesis

In section 7, we have encountered enough theoretical justification and numerical data which clearly suggests that anyhow Mertens Hypothesis isn’t true. Furthermore, looking at the already derived estimates for n N at which the first violation occurs for MH, one should spontaneously come up with this obvious question,
I s   i t   j u s t i f i a b l e   t o   s t u d y   t h e   M e r t e n s   H y p o t h e s i s ,   a n d   i f   s o ,   w h y ?
As ludacris as it may sound, the appropriate answer to this question is, YES. Even though MH have been disproved convincingly, the everlasting desire among many mathematicians in this era to pursue research in this area using different techniques is truly commendable. One of the most important among all the problems in this field of study is to estimate the least possible value of n for which first violation of MH occurs.
Recall that, Pintz provided a proper bound for n satisfying such condition as, n < 10 1 . 4 × 10 64 . An extensive study of the conjectures of Good and Churchhouse, and also that of L e ´ vy, enables us to conclude, n 10 and, n 48 respectively. Naturally, these bounds are not preferable, as we can obtain by hands on computation, n > 10 14 .
In their article [46], Odlyzko and te Riele claimed that, the first violation for MH won’t occur for n < 10 20 , and probably not even for n < 10 30 . We recall the following theorem from Titchmarsh ([2] Theorem 14.27, pp. 372).
Theorem 7.0.8. 
A priori assuming RH, suppose we denote the zeros of ζ ( s ) on the critical line, R e ( s ) = 1 2 by, ρ = 1 2 + i t . Further consider all the zeros to be simple. Then, ∃ a sequence, T k , satisfying, k T k k + 1 , such that,
M ( x ) = 2 lim k 0 < t < T k R e x ρ ρ ζ ( ρ ) + O ( 1 )
Kotnik and van de Lune [47] illustrated an experimentdepending on the evaluation of partial sums of the series on the R.H.S. of (7.1). Thus, eventually, we can obtain,
lim sup x | M ( x ) | x log log log x = C
where, a valid estimate for C can be derived as, C 0 . 5 . As a consequence, we deduce the following estimate for n as, n 10 2 . 3 × 10 23 for the occurance of the first violation of MH.
Remark 7.0.9. This approximation for n is significantly better than the bound derived by Pintz, although being too large makes it difficult for mathematicians to verify all the details by direct computation. It is the very reason why people are still motivated as ever to investigate such a violation of MH by computing M ( n ) consistently.

Acknowledgments

I’ll always be grateful to Prof. Ritumoni Sarma ( Associate Professor, Department of Mathematics, IIT Delhi, India ) for inspiring me in order to pursue research on this topic. His kind guidance helped me immensely in detailed understanding of this subject.

Data Availability Statement

I as the sole author of this aarticle confirm that the data supporting the findings of this study are available within the article [and/or] its supplementary materials.

Conflicts of Interest

I as the author of this article declare no conflicts of interest.

References

  1. W.W. Barrett, R.W. Forcade, A.D. Follington, On the Spectral Radius of a (0,1) matrix related to Mertens Function, Linear Algebra and it’s applications, 107(1987), 151-159.
  2. E. C. Titchmarsh, The Theory of the Riemann Zeta-function, Oxford University Press, 1951. Second edition revised by D. R. Heath-Brown, published by Oxford University Press, 1986.
  3. Will Dana, Eigenvalues of the Redheffer Matrix and their relation to the Mertens Function, 2015.
  4. Jean-Paul Cardinal, Symmetric matrices related to the Mertens function, URL: arXiv:0811.3701v4 .
  5. Bernhard Riemann, Ueber die Anzahl der Primzahlen unter einer gegebenen Grösse, Monatsberichte der Berliner Akademie, Nov. 1859 . URL: https://www.claymath.org/sites/default/files/ezeta.pdf.
  6. L Ahlfors, Complex Analysis, McGraw-Hill Education , 3rd edition , Jan. 1, 1979 .
  7. Serge Lang, Algebra, Graduate Texts In Mathematics, Springer, 3rd edition , Jan. 8, 2002 .
  8. Joram Soch, Linear Algebraic Number Theory, Part-I : Foundations, URL: arXiv:1709.05959v1 .
  9. F. Mertens, Über eine zahlentheoretische Funktion, Sitzungsber. Akad. Wiss. Wien 106(IIa) (1897) 761-830 .
  10. R. D. von Sterneck, Die zahlentheoretische Funktion σ(n) bis zur Grenze 5000000, Sitzungsber. Akad. Wiss. Wien 121(IIa) (1912) 1083-1096 .
  11. H. von Koch, Sur la distribution des nombres premiers, Acta Mathematica, vol. 24, iss. 1, pp. 159-182; DOI: 10.1007/BF02403071.
  12. L. Schoenfeld, Sharper bounds for the Chebyshev Functions θ(x) and ψ(x). II, Mathematics of Computation, vol. 30, no. 134, pp. 337-360; DOI: 10.2307/2005976.
  13. T. J. Jarvis, An Investigation into the Eigenvalues of a Matrix of Redheffer, Undergraduate Thesis, Brigham Young Univ., 1989 .
  14. A. Ivić, The Riemann Zeta-Function, Wiley, New York, 1985 .
  15. T. J. Jarvis, A Dominant Negative Eigenvalue of a Matrix of Redheffer, Linear Algebra and its Applications, Volume 142, Dec. 1990, Pages 141-152 . URL: https://doi.org/10.1016/0024-3795(90)90262-B.
  16. J. S. Maybee, D. D. Olesky, P. van den Driessche, G. Wiener, Matrices, digraphs, and determinants, SIAM J. Matrix Anal. Appl. 10(4):500-519, Oct. 1989 .
  17. Wayne W. Barrett, Tyler J. Jarvis, Spectral Properties of a Matrix of Redheffer, Linear Algebra and its Applications, Volume 162-164, Feb. 1992, Pages 673-683 . URL: . [CrossRef]
  18. A. E. Ingham, The distribution of prime numbers, Cambridge University Press, 1932. Reprinted by Stechert-Hafner, 1964, and (with a foreword by R. C. Vaughan) by Cambridge University Press, 1990.
  19. J. E. Littlewood, Sur la distribution des nombres premiers, C. R. Acad. Sci. 158 (1917) 1869-1872.
  20. G. Neubauer, Eine empirische Untersuchung zur Mertensschen Funktion, Numer. Math. 5 (1963) 1-13.
  21. H. Cohen, F. Dress, Calcul numérique de M(x), In Rapport de l’ATP A12311 "Informatique 1975", pp. 11-13, CNRS, 1979.
  22. F. Dress, Fonction sommatoire de la fonction de Möbius. 1, Majorations experimentales. Exp. Math. 2 (1993) 89-98.
  23. A. M. Odlyzko, H. J. J. te Riele, Disproof of the Mertens conjecture, J. reine angew. Math. 357 (1985) 138-160.
  24. J. Pintz, An effective disproof of the Mertens conjecture, Asterisque 147-148 (1987) 325-333.
  25. W. B. Jurkat, Eine Bemerkung zur Vermutung von Mertens, Nachr. Österr. Math. Ges., Sondernummer Österr. Mathematikerkongres (1961) 11.
  26. W. B. Jurkat, On the Mertens conjecture and related general Ω-theorems, In H. Diamond, editor, Analytic Number Theory, pp. 147-158, American Mathematical Society, 1973.
  27. R. Spira , Zeros of sections of the zeta function. II, Math. Comp. 22 (1966) 163-173.
  28. W. B. Jurkat, A. Peyerimhoff, A constructive approach to Kronecker approximation and its applications to the Mertens conjecture, J. reine angew. Math. 286-287 (1976) 332-340.
  29. H. J. J. te Riele, Computations concerning the conjecture of Mertens J. reine angew. Math. 311-312 (1979) 356-360.
  30. H. Möller, Zur Numerik der Mertens’schen Vermutung, PhD thesis, Univ. Ulm, 1987.
  31. R. J. Anderson, On the Mobius sum function, Acta Arith. 59 (1991) 205-213.
  32. I. J. Good, R. F. Churchhouse, The Riemann hypothesis and pseudo-random features of the Möbius function, Math. Comp. 22 (1968) 857-861.
  33. M. El Marraki, Majorations effectives de la fonction sommatoire de la fonction de Möbius, PhD Thesis, Univ. Bordeaux, 1991.
  34. A. Walfisz, Weylsche Exponentialsummen in der neueren Zahlentheorie, VEB Deutscher Verlag, 1963.
  35. K. Ford, Vinogradov’s integral and bounds for the Riemann zeta function, Proc. Lond. Math. Soc. 85 (2002) 565-633.
  36. R. Hackel, Zur elementaren Summierung gewisser zahlentheoretischer Funktionen Sitzungsber. Akad. Wiss. Wien 118(IIa) (1909) 1019-1034.
  37. R. A. MacLeod, A new estimate for the sum M ( x ) = n x μ ( n ) , Acta Arith. 13 (1967) 49-59. Erratum, ibid. 16 (1969) 99-100.
  38. F. Dress, Majorations de la fonction sommatoire de la fonction de Möbius, Bull. Soc. Math. Fr., Suppl., Mem. 49-50 (1977) 47-52.
  39. H. G. Diamond, K. S. McCurley, Constructive elementary estimates for M(x), In M. I. Knopp, editor, Analytic Number Theory, Lecture Notes in Mathematics 899, pp. 239-253, Springer, 1982.
  40. N. Costa Pereira, Elementary estimate for the Chebyshev function ψ(x) and the Möbius function M(x), Acta Arith. 52 (1989) 307-337.
  41. F. Dress, M. El Marraki, Fonction sommatoire de la fonction de Mobius. 2, Majorations asymptotiques elementaires, Exp. Math. 2 (1993) 99-112.
  42. E. Landau, Handbuch der Lehre von der Verteilung der Primzahlen, Vol. 2 (of 2), Teubner, 1909. Reprinted by Chelsea, 1953.
  43. E. Landau, Vorlesungen über Zahlentheorie, Vol. 2 (of 3), Hirzel-Verlag, 1927.
  44. H. J. J. te Riele, Some historical and other notes about the Mertens conjecture and its recent disproof, Nieuw Arch. Wisk. 3(1V) (1985) 237-243.
  45. H. J. J. te Riele, On the history of the function M ( x ) / x since Stieltjes, In G. van Dijk, editor, Thomas Jan Stieltjes, Collected Papers, Vol. 1, pp. 69-79, Springer, 1993.
  46. A. M. Odlyzko, H. J. J. te Riele, Disproof of the Mertens conjecture, J. reine angew. Math. 357 (1985) 138-160.
  47. T. Kotnik, J. van de Lune, On the order of the Mertens function, experimental mathematics, Vol. 13 (2004), pp. 473-481.
  48. R. M. Redheffer, Eine explizit lösbare Optimierungsaufgabe, Internat. Schriftenreihe Numer. Math. 36 (1977).
  49. B. Saffari, Sur la fausseté de la conjecture de Mertens. Avec une observation par Paul Lévy, C. R. Acad. Sci. 271A(A) (1970) 1097-1101.
  50. R. D. von Sterneck, Empirische Untersuchung über den Verlauf derzahlentheoretischen Funktion σ(n) im intervalle von 0 bis 150000, Sitzungsber. Akad. Wiss. Wien 106(IIa) (1897) 835-1024.
  51. R. D. von Sterneck, Empirische Untersuchung über den Verlauf derzahlentheoretischen Funktion σ(n) im intervalle 150000 bis 500000, Sitzungsber. Akad. Wiss. Wien 106(IIa) (1901) 1053-1102.
  52. M. Yorinaga, numerical investigation of sums of the Möbius function, Math. J. Okayama Univ. 21 (1979) 41-47.
  53. W. M. Lioen, J. van de Lune, Systematic computations on Mertens’ conjecture and Dirichlet’s divisor problem by vectorized sieving, In K. Apt, L. Schrijver, and N. Temme, editors, From Universal Morphisms to Megabytes: A Baayen Space Odyssey, pp. 421-432, CWI, Amsterdam, 1994.
  54. F. Roesler, Riemann’s Hypothesis as an Eigenvalue Problem, Linear Algebra and its Applications. 81 (1986) 153-198.
  55. F. Roesler, Riemann’s Hypothesis as an Eigenvalue Problem II, Linear Algebra and its Applications. 92 (1987) 45-73.
  56. F. Roesler, Riemann’s Hypothesis as an Eigenvalue Problem III, Linear Algebra and its Applications. 141 (1990) 1-46.
  57. R. C. Vaughan, On the Eigenvaluesof a Redheffer’s matrix I, in: Proc. Conf. BYU, Marcel Dekker, 1993, pp. 283-296.
  58. R. C. Vaughan, On the Eigenvaluesof a Redheffer’s matrix II, J. Austral. Math. Soc. (Series A.) 60 (1996) 260-273.
  59. T. M. Apostol, Introduction to Analytic Number Theory, Springer, 1976.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated