Preprint
Article

Collatz Conjecture

Altmetrics

Downloads

899

Views

275

Comments

0

This version is not peer-reviewed

Submitted:

06 February 2024

Posted:

07 February 2024

Read the latest preprint version here

Alerts
Abstract
This paper presents an analysis of the number of zeros in the binary representation of natural numbers. The primary method of analysis involves the use of the concept of the fractional part of a number, which naturally emerges in the determination of binary representation. This idea is grounded in the fundamental property of the Riemann zeta function, constructed using the fractional part of a number. Understanding that the ratio between the fractional and integer parts of a number, analogous to the Riemann zeta function, reflects the profound laws of numbers becomes the key insight of this work. The findings suggest a new perspective on the interrelation between elementary properties of numbers and more complex mathematical concepts, potentially opening new directions in number theory and analysis.
Keywords: 
Subject: Computer Science and Mathematics  -   Analysis

1. Introduction

We will use the following well-known fact: if, for the members of the Collatz sequence, zeros predominate in their binary representation, then these members will lead to a decrease in the subsequent members according to the Collatz rule. A striking example is when the initial number in the Collatz sequence is equal to 2 n . Let’s write the solution of the equation n = 2 x in the form x = { x } + [ x ] and note that the smaller x, the more zeros in the corresponding binary representation for n. Developing this idea, we come to the following steps.
  • Analysis of the binary representation of simple cases of natural numbers.
  • Creation of a process for decomposing an arbitrary natural number into powers of two.
  • Analysis of the proximity of the process to binary decomposition at the completion of decomposition at each stage.
  • Calculation of the number of zeros in the binary decomposition of a natural number.
  • Estimation of the Collatz sequence members depending on the number of ones in the binary decomposition.

2. Results

This document reveals a comprehensive solution to the Collatz Conjecture, as first proposed in [1]. The Collatz Conjecture, a well-known unsolved problem in mathematics, questions whether iterative application of two basic arithmetic operations can invariably convert any positive integer into 1. It deals with integer sequences generated by the following rule: if a term is even, the subsequent term is half of it; if odd, the next term is the previous term tripled plus one. The conjecture posits that all such sequences culminate in 1, regardless of the initial positive integer. Named after mathematician Lothar Collatz, who introduced the concept in 1937, this conjecture is also known as the 3n + 1 problem, the Ulam conjecture, Kakutani’s problem, the Thwaites conjecture, Hasse’s algorithm, or the Syracuse problem. The sequence is often termed the hailstone sequence due to its fluctuating nature, resembling the movement of hailstones. Paul Erdős and Jeffrey Lagarias have commented on the complexity and mathematical depth of the Collatz Conjecture, highlighting its challenging nature. Consider an operation applied to any positive integer:
  • Divide it by two if it’s even.
  • Triple it and add one if it’s odd.
This operation is mathematically defined as:
f ( n ) = n 2 , if n 0 mod 2 , 3 n + 1 , if n 1 mod 2 .
A sequence is formed by continuously applying this operation, starting with any positive integer, where each step’s result becomes the next input. The Collatz Conjecture asserts that this sequence will always reach 1 Recent substantial advancements in addressing the Collatz problem have been documented in works [2]. Now let’s move on to our research, which we will conduct according to the announced plan. For this, we will start with the following
Theorem 1.
Let
M N , [ α j ] [ α j + 1 ] = δ j > 0 , ϵ 1 < 0.65 , | F j ( x ) | < | x | , α j = [ α j ] + ϵ j , ϵ j < 1 , σ j = 1 ϵ j .
M = i = 1 j 1 2 [ α i ] + 2 α j , M = i = 1 j 2 [ α i ] + 2 α j + 1 ,
Then for δ j = 1
σ j = 2 1 σ j + 1 1 σ j + 1 ln 2 2 + F j σ j + 1 3 12 ,
and for δ j > 1
σ j = 2 δ j σ j + 1 + 1 2 δ j 2 2 δ j + 1 ln 2 2 2 δ j σ j + 1 2 ln 2 4 + 2 2 δ j R j ln 2 2 σ j + 1 3 8 .
Proof. 
Consider
M M = 0 = i = 1 j 2 [ α i ] + 2 α j + 1 i = 1 j 1 2 [ α i ] + 2 α j = 2 [ α j ] + 2 α j + 1 2 α j 2 α j = 2 [ α j ] + 2 α j + 1 = 2 [ α j ] + 2 [ α j + 1 ] [ α j ] + [ α j ] + ϵ j + 1 .
Next, we move to functional relations between σ j and σ j + 1 :
2 ϵ j = 2 δ j + ϵ j + 1 + 1 2 1 σ j = 2 δ j + 1 σ j + 1 + 1 ln ( 2 1 σ j ) = ln 2 σ j ln 2 = ln ( 2 δ j + 1 σ j + 1 + 1 ) .
Calculating for δ j = 1 , we get:
ln ( 2 δ j + 1 σ j + 1 + 1 ) | δ j = 1 = ln ( 2 σ j + 1 + 1 ) = ln 2 + ln 1 σ j + 1 ln 2 2 + σ j + 1 2 ln 2 2 4 + F j σ j + 1 3 12 .
Continuing calculations for δ j > 1 , we get:
ln ( 2 δ j + 1 σ j + 1 + 1 ) = ln 1 + 2 δ j + 1 2 δ j + 1 σ j + 1 ln 2 2 + 2 δ j + 1 F j σ j + 1 2 + 2 δ j + 1 = 2 δ j 2 2 δ j + 1 2 δ j σ j + 1 ln 2 2 + 2 2 δ j F j σ j + 1 2 .
Thus, we obtain the final formulas. □
Theorem 2.
Let
M = 3 n = 2 [ α ] + { α } = i = 1 n * γ i 2 i ,
1 { α } > 0.55 , n * = n ln ( 3 ) ln ( 2 ) ,
then
γ i = 0 1 n * 2 .
Proof. 
Let
3 n = 2 α α = n ln ( 3 ) / ln ( 2 ) 3 n = 2 [ α ] + { α } .
Using Theorem 1, we create a sequence
ϵ i , m i , ϵ 1 = { α } ,
2 ϵ 1 = k = 0 i 1 2 [ α k ] α 1 + 2 α i α 1 .
Suppose the binary decomposition process, according to formula (1), stopped at the j-th step. It then follows that the other terms of the decomposition are zeros, and we immediately reach the validity of the theorem’s statement. Therefore, let’s consider the case where the generation of decomposition according to formula (1) does not stop, and j reaches n. This means all σ j > 0 , j < n .
Let’s conduct a more detailed analysis of the number of zeros and ones in our binary representation. We introduce the following notations:
l - the number of zeros in the binary representation.
m - the number of ones in the binary representation.
n - the digit length of the binary decomposition, then
n=l+m.
δ j = 1 , α j = 0 , β j = ( 1 ln 2 δ j + 1 2 ) / 2 + F j σ j + 1 2 12 1
δ j > 1 , α j = 2 δ j 1 2 δ j 2 2 δ j + 1 ln 2 + 2 δ j R j ln 2 2 σ j + 1 3 8 + 2 2 δ j + 1 ln 2 , β j = 2 δ j
To solve the following equations
σ j + 1 = α j + β j σ j
we introduce the notations λ k - the number of ones after the appearance of α k > 0 and until the next appearance of zero in the binary decomposition and
γ k = m = k + 1 k + 1 + λ k β m , α k + 1 > 0
Let’s conduct a series of transformations to understand the following steps.
σ j + 1 = α j + β j γ j σ j
σ j + 1 = α j + β j γ j ( α j 1 + β j 1 γ j 1 )
continuing the transformations we get
σ n + 1 = α n + β 1 γ 1 α 1 β 1 γ 1 k = 0 n 2 γ n m β n m + m = 1 n 2 γ n m β n m α n m β n m γ n m k = 0 m 1 β n k γ n k + σ 1 k = 0 n 1 β n k γ n k
σ n + 1 = α n + α 1 β 1 γ 1 + k = 0 n 1 β n k γ n k + m = 1 n 2 α n m β n m γ n m k = 0 m β n k γ n k + σ 1 k = 0 n 1 β n k γ n k
Introduce the following notations:
α * = inf 0 i n α i β i
α * = sup 0 i n α i β i
A ( m ) = k = 1 , δ j = 1 m ln 2 ( β j ) + k = 1 , δ j > 1 m ln 2 ( β j ) = A 1 ( m ) + A 2 ( m )
Note that δ k , σ k occur at coordinates x ( δ k ) , x ( σ k ) , x ( δ k ) = x ( σ k ) and by definition of α i
1 < α * < α * < 1.3
Thus, all possible variants with L zeros will be determined by all possible combinations of
( δ 1 , δ 2 . . . . δ n )
With corresponding coordinates
( x ( δ 1 ) , x ( δ 2 ) . . . . x ( δ n ) )
m * = i = 1 , δ i > 1 n δ i
Rewrite formula (5)
σ n + 1 k = 0 n 1 β n k γ n k = α n k = 0 n 1 β n k γ n k + α 1 β 1 γ 1 + m = 1 n 1 α n m β n m γ n m 1 k = m n 1 β n k γ n k + σ 1
σ 1 σ n 2 A ( n ) α * 2 A ( n ) i = 1 n 1 2 A ( i ) / 2
To calculate the sum in the last inequality, we use the equalities
2 k = 1 + i = 0 k 1 2 i , 2 k + 2 l = 2 l 1 + i = 0 k l 1 2 i = 2 l + i = 0 k l 1 2 i + l = 2 l + i = l k 1 2 i
It is important to note that here k,l also have their coordinates x ( k ) , x ( l ) and all i , l < i < k , have coordinates x ( i ) which are built on a uniform grid Thanks to these simple formulas and corresponding coordinates, we can calculate the sums using integrals.
I = i = 1 n 2 A ( i )
I ( γ ) = 0 n 2 γ x d x = I + R ( n ) , γ = m * + ( ln 2 + ϵ ) l n >
where R(n) is the residual term which can be neglected for large n
where L = n l < m * - the set level of the number of zeros.
Calculating, we get the following equalities
I ( γ ) = 1 2 A ( n ) γ ln 2
α * 1 2 A ( n ) 2 ln 2 ( 1 + ln 2 + ϵ ) σ 1 , d I ( γ ) d γ < 0
Note that the smaller γ the greater I ( γ ) , therefore to achieve the set level L is only possible with the corresponding σ 1 , and to achieve the level L = n / 2 , it is necessary to choose
0.55 = 1.3 2 ln 2 ( 1 + ln 2 + ϵ ) < σ 1
L n / 2 .
The statement of the theorem is correct. □
Theorem 3.
Let
a n = i = 0 n γ i 2 i , n > 1000 , γ i { 0 , 1 } ,
then
j * { 0 , 1 } , and a 4 n j * < a n .
Proof. 
Introduce operators defined as follows:
P f = f 2 , T f = 3 f + 1 , Z f = 3 f ,
T i { P , T } , R i { Z , P } .
Consider all possible scenarios of Collatz sequence behavior, which can be written in the following form:
a n + n = T 1 T 2 T n a n ,
We need to estimate each 2 n -th term of the Collatz sequence based on the number of applied operators P , T , Z during n steps.
a n + n = T n T n 1 T 1 a n ,
Let a n have m ones in its binary representation, then we count the number of applications of operator Z using the following formula:
m = R i = Z , i n 1 ,
and the number of applications of operator P using the following formula:
R i = P , i n 1 = m + n m = n .
Since each application of Z is accompanied by operator P, and the number of applications of operator P corresponds to the number of zeros in a n , which equals n m . According to the rules of Collatz, after n steps we have:
a n + n = 3 m 2 n a n + T n T n 1 T 1 1 = 3 m 2 n a n + B n ,
B n 2 n + m j = 1 m 3 j 2 j a n < 2 n + m · 3 m / 2 m · a n 2 2 n + 1 · 3 m · a n .
According to the last formula, we see that the growth of each term of the sequence depends on the number of ones in the binary representation. Next, we will show that a large number of ones at the 2 n -th step leads to an increase in the number of zeros at the 3 n -th step for binary representation according to the previous theorems, from which it follows that subsequent terms of the sequence decrease:
a 2 n = 3 m a n · 2 n + B n = 3 m + 3 m ( a n 2 n ) + B n ,
Repeating the reasoning of Theorem 2, consider the equation
2 x = a 2 n = 3 m a n · 2 n + B n = 3 m + 3 m ( a n 2 n ) · 2 n + B n ,
x ln 2 = m ln ( 3 ) + ln 1 + ( a n 2 n ) · 2 n + B n · 3 m ,
From the last equation, to apply the results of theorem 2, we need σ 1 > 1 2 ln 2 . To satisfy the last inequality, consider m j = m j , θ = ( a n 2 n ) · 2 n ,
{ x } = min j < 10 ( m j ) ln ( 3 ) ln ( 2 ) + ln ( 1 + θ ) ln 2 + F j 1 2 n ln 2 ,
Consider p = ( m j ) ln 3 ln 2 = ( 2 k + l ) 1.5849625007 , ϵ = 1.5849625007 1.5 , we get
p = ( 2 k + l ) ( 1.5 + ϵ + ln ( 1 + θ ) ln 2 ) = 3 k + ( 2 k + l ) · ϵ + ln ( 1 + θ ) ln 2 ,
{ p } = { 1.5 · l + ( 2 k + l ) · ϵ + ln ( 1 + θ ) ln 2 } = { 1.5 · l + { ( 2 k + l ) · ϵ + ln ( 1 + θ ) ln 2 } } ,
Choosing l from even numbers less than 10, if inequalities 0 { ( 2 k ) · ϵ + ln ( 1 + θ ) ln 2 } 0.5 , are true
{ p } = { 2 k · ϵ + ln ( 1 + θ ) ln 2 } = { 2 k · ϵ + ln ( 1 + θ ) ln 2 } ,
Choosing l from odd numbers less than 10, if inequalities 0.5 < { 2 k · ϵ + ln ( 1 + θ ) ln 2 } < 1 , are true
{ p } = { 2 k · ϵ + ln ( 1 + θ ) ln 2 } = { 0.5 + ( 2 k + l ) · ϵ } ,
Using ϵ < 0.1 , also satisfy the condition σ 1 = 1 { x } > 1 2 ln 2 .
m * number of non - zero γ i ,
According to theorem 2 we get
m * n / 2 + ( n j * ) · ln 3 / ln 2 / 2 ,
According to our application of Collatz rules, we have an element a 4 n j * , and the order of its binary representation is
n 2 = n + ( n j * ) · ln 3 / ln 2 / 2 ,
After 3 n j * steps of applying Collatz rules we have
a 4 n j * = 3 m * 2 2 n j * a 2 n + T 3 n j * T 3 n 1 j * T 1 1 = 3 m * 2 2 n a 2 n + B 3 n ,
a 4 n j * = 3 m * 2 2 n a 2 n + T 3 n j * T 3 n j * 1 T 1 1 = 3 m * 2 2 n 3 m 2 n j * a n + B n + B 3 n j * ,
a 4 n j * = 3 m * + m · 2 3 n j * a n + 3 m * · 2 2 n j * B n + B 3 n j * ,
a 4 n j * q 1 · a n ,
By definition of m * , l * , B n we get
q 1 < 1 ,
Using n > 1000 , it follows that q 1 < 1 a 4 n j * < a n . □
Theorem 4.
Let
a n = i = 0 n γ i 2 i , n > 1000 , γ i { 0 , 1 } ,
then
j * < 0.1 n , and a 4 n j * < a n .
Proof. 
Let’s introduce operators defined by the formulas
P f = f 2 , T f = 3 f + 1 , Z f = 3 f ,
T i { P , T } , R i { Z , P } .
Consider all possible scenarios of the behavior of the Collatz sequence, which can be written in the following form:
a n + n = T 1 T 2 T n a n ,
It is necessary to calculate an estimate for each 2 n -th member of the Collatz sequence based on the number of P , T , Z operators applied during n steps.
a n + n = T n T n 1 T 1 a n ,
Let a n have m units in its binary representation, then calculate the number of applications of the Z operator by the following formula:
m = R i = Z , i n 1 ,
and calculate the number of applications of the P operator by the following formula:
R i = P , i n 1 = m + n m = n .
Since each application of Z is accompanied by the P operator, and the number of applications of the P operator corresponds to the number of zeros in a n , which is equal to n m . According to the rules of Collatz after n steps, we have:
a n + n = 3 m 2 n a n + T n T n 1 T 1 1 = 3 m 2 n a n + B n ,
B n 2 n + m j = 1 m 3 j 2 j a n < 2 n + m · 3 m / 2 m · a n 2 2 n + 1 · 3 m · a n .
According to the last formula, we see that the growth of each member of the sequence depends on the number of units in the binary representation. Next, we will show that a large number of units on the 2 n -th step leads to an increase in the number of zeros in the 3 n -th step for the binary representation according to previous theorems, hence the reduction of subsequent members of the sequence:
a 2 n = 3 m a n · 2 n + B n = 3 m + 3 m ( a n 2 n ) + B n ,
Repeating the reasoning of Theorem 2, consider the equation
2 x = a 2 n = 3 m a n · 2 n + B n = 3 m + 3 m ( a n 2 n ) · 2 n + B n ,
x ln 2 = m ln ( 3 ) + ln 1 + ( a n 2 n ) · 2 n + B n · 3 m ,
From the last equation, in order to apply the results of theorem 2, we need σ 1 = 1 { x } > 0.5 . To fulfill the last inequality, consider m j = m j , θ = ( a n 2 n ) · 2 n ,
{ x } = min j { 0 , 1 } ( m j ) ln ( 3 ) ln ( 2 ) + ln ( 1 + θ ) ln 2 + F j 1 2 n ln 2 ,
Consider p = ( m j ) ln 3 ln 2 = ( 2 k + l ) 1.5849625007 , ϵ = 1.5849625007 1.5 , we get
p = ( 2 k + l ) ( 1.5 + ϵ + ln ( 1 + θ ) ln 2 ) = 3 k + ( 2 k + l ) · ϵ + ln ( 1 + θ ) ln 2 ,
{ p } = { 1.5 · l + ( 2 k + l ) · ϵ + ln ( 1 + θ ) ln 2 } = { 1.5 · l + { ( 2 k + l ) · ϵ + ln ( 1 + θ ) ln 2 } } ,
Choosing l = 0 , if the inequalities 0 { ( 2 k ) · ϵ + ln ( 1 + θ ) ln 2 } 0.5 are true,
{ p } = { 2 k · ϵ + ln ( 1 + θ ) ln 2 } = { 2 k · ϵ + ln ( 1 + θ ) ln 2 } ,
Choosing l = 1 , if the inequalities 0.5 < { 2 k · ϵ + ln ( 1 + θ ) ln 2 } < 1 are true,
{ p } = { 2 k · ϵ + ln ( 1 + θ ) ln 2 } = { 0.5 + ( 2 k + l ) · ϵ } ,
Using ϵ < 0.1 , we also satisfy the condition σ 1 = 1 { x } > 0.51 .
m * is the number of non - zero γ i ,
According to theorem 2 we get
m * n / 2 + ( n j * ) · ln 3 / ln 2 / 2 ,
According to our application of the Collatz rules, we have the element a 4 n j * , and the order of its binary representation is
n 2 = n + ( n j * ) · ln 3 / ln 2 / 2 ,
After 3 n j * steps of applying the Collatz rules, we have
a 4 n j * = 3 m * 2 2 n j * a 2 n + T 3 n j * T 3 n 1 j * T 1 1 = 3 m * 2 2 n a 2 n + B 3 n ,
a 4 n j * = 3 m * 2 2 n a 2 n + T 3 n j * T 3 n j * 1 T 1 1 = 3 m * 2 2 n 3 m 2 n j * a n + B n + B 3 n j * ,
a 4 n j * = 3 m * + m · 2 3 n j * a n + 3 m * · 2 2 n j * B n + B 3 n j * ,
a 4 n j * q 1 · a n ,
By definition of m * , l * , B n we get
q 1 < 1 ,
Using n > 1000 , implies q 1 < 1 a 4 n j * < a n . □
Theorem 5.
Let
a n = i = 0 n γ i 2 i , n > 1000 , γ i { 0 , 1 } ,
then for a n the Collatz conjecture is true.
Proof. 
The proof follows from Theorems 1-3. □
Proof. Proof follows from theorem 1-3

6. Conclusions

Our assertion proves that after 3n steps, a sequence with an initial binary length of n arrives at a number strictly smaller than the initial one, from which the solution to the Collatz conjecture follows. This is because by applying this process n times, we are guaranteed to arrive at 1.

References

  1. O’Connor, J.J.; Robertson, E.F. (2006). "Lothar Collatz". St Andrews University School of Mathematics and Statistics, Scotland.
  2. Tao, Terence (2022). "Almost all orbits of the Collatz map attain almost bounded values". Forum of Mathematics, Pi. 10: e12. arXiv:1909.03562. arXiv:1909.03562.ISSN 2050-5086. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated