Preprint
Article

On the Raleigh-Ritz Variational Method. Non-orthogonal Basis Set

Altmetrics

Downloads

74

Views

20

Comments

0

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

14 May 2024

Posted:

15 May 2024

You are already at the latest version

Alerts
Abstract
We overview the main equations of the Rayleigh-Ritz variational method and discuss their connection with the problem of simultaneous diagonalization of two Hermitian matrices.
Keywords: 
Subject: Physical Sciences  -   Atomic and Molecular Physics

1. Introduction

The Rayleigh-Ritz variational method (RR) is one of the approximate methods most commonly used in the study of the electronic structure of atoms and molecules [1,2]. One of its main advantages is that it provides increasingly accurate upper bounds to all the eigenvalues of the Hamiltonian operator of the system [3,4]. In this paper we provide a comprehensible overview of the approach and illustrate some of its relevant points by means of a simple problem.

2. The Rayleigh-Ritz Variational Method

The starting point of our analysis is a linearly independent set of vectors V = f 1 , f 2 , . Clearly, the only solution to the vector equation
i = 1 N a i f i = 0 ,
is a i = 0 for all i = 1 , 2 , , N . If we apply the bras f j , j = 1 , 2 , , N , to this equation from the left we obtain
i = 1 N S j i a i = 0 , j = 1 , 2 , , N ,
where S i j = f i f j . We have an homogeneous system of N linear equations with N unknowns a i with the only solution a i = 0 . Consequently, S 0 where S = S i j i , j = 1 N is an N × N Hermitian matrix and . . . stands for determinant. Note that S i j = S j i * so that S = S where † stands for adjoint. The matrix S is commonly called overlap matrix[1].
Let v be an eigenvector of S with eigenvalue s, Sv = s v , then v Sv = s v v . If v i , i = 1 , 2 , , N , are the elements of the N × 1 column vector v then
v Sv = i = 1 N v i f i j = 1 N v j f j > 0 ,
and we conclude that s > 0 . In other words, the overlap matrix S is positive definite.
We are interested in the eigenvalue equation
H ψ n = E n ψ n , n = 1 , 2 , , E 1 E 2 , ψ i ψ j = δ i j ,
for an Hermitian operator H. In order to solve it approximately we propose and ansatz of the form
φ = j = 1 N c j f j ,
where V = f 1 , f 2 , is not only assumed to be linearly independent but also complete.
The RR variational method consists of minimizing the integral
W = φ H φ φ φ ,
with respect to the expansion coefficients c j
W c j = 0 , j = 1 , 2 , , N .
This equation leads to the so-called secular equation[1,2]
j = 1 N H i j W S i j c j = 0 , i = 1 , 2 , , N ,
where, H i j = f i H f j . There are nontrivial solutions c j , j = 1 , 2 , , N , provided that the secular determinant vanishes
H W S = 0 ,
where H = H i j i , j = 1 N is an N × N Hermitian matrix.
For each of the roots of the secular determinant (9), W 1 W 2 W N , we derive an approximate solution; for example, when W = W k we have
φ k = j = 1 N c j k f j ,
and the secular equation (8) can be rewritten
j = 1 N H i j c j k = j = 1 N W k S i j c j k = j = 1 N m = 1 N S i j W m δ m k c j m .
If we define the N × N matrices W = W i δ i j i , j = 1 N and C = c i j i , j = 1 N then this equation can be rewritten in matrix form as
HC = SCW ,
which is equivalent to
C 1 S 1 HC = W ,
and the procedure reduces to the diagonalization of the matrix S 1 H by means of the invertible matrix C . Note that S 1 exists because S is positive definite as argued above.
In order to determine the coefficients c j k completely, we require that φ i φ j = δ i j that leads to
φ i φ j = k = 1 N m = 1 N c k i * c m j f k f m = δ i j ,
that in matrix form reads
C SC = I ,
where I is the N × N identity matrix. It follows from equations (15) and (12) that
C HC = W .
It is clear that there exists an invertible matrix ( C ) that transforms two Hermitian matrices ( H and S ), one of them positive definite ( S ), into diagonal form. This procedure is well known in the mathematical literature[5]. However, it is most important to note that equations (15) and (16) are not what we commonly know as matrix diagonalization. In fact, the eigenvalues of S are not unity and the eigenvalues of H are not the RR eigenvalues W i . We will illustrate this point in Section 3 by means of a simple example. It is also worth noting that that we cannot obtain C neither from (15) or (16). One obtains the matrix C in the process of diagonalizing S 1 H as in equation (13) and the remaining undefined matrix elements c i j from equation (15).
Since S is positive definite, we can define S 1 / 2 . The matrix U = S 1 / 2 C is unitary as shown by
U U = C S 1 / 2 S 1 / 2 C = I .
On substituting C = S 1 / 2 U into equation (16) we obtain
U S 1 / 2 HS 1 / 2 U = W .
This equation is just the standard diagonalization of the Hermitian matrix S 1 / 2 HS 1 / 2 .
If the basis set is orthonormal, f i f j = δ i j , then S = I , C = C 1 and the secular equation (13) becomes
C HC = W .
In this particular case, the eigenvalues of the matrix H are the RR eigenvalues W i . Note that equations (16) and (19) look identical but were derived under different assumptions (they agree only when S = I ).

3. Simple Example

As a simple example we consider the dimensionless eigenvalue equation
H ψ = E ψ , H = 1 2 d 2 d x 2 + λ x , ψ ( 0 ) = ψ ( 1 ) = 0 .
In order to illustrate the RR variational method with a non-orthogonal basis set we choose f i ( x ) = x i ( 1 x ) , i = 1 , 2 , , that satisfy the boundary conditions at x = 0 and x = 1 .
A straightforward calculation shows that
S i j = 2 i + j + 1 i + j + 2 i + j + 3 ,
and
H i j = i j i + j i + j + 1 i + j 1 + 2 λ i + j + 2 i + j + 3 i + j + 4 .
Table 1 and Table 2 show the RR eigenvalues W i , i = 1 , 2 , 3 , 4 , for λ = 0 and λ = 1 , respectively. We appreciate that the approximate eigenvalues converge from above as expected[3,4].
In what follows, we illustrate some of the general results of Section 2 for the simplest case N = 2 when λ = 0 . The matrices are
S = 1 60 2 1 1 4 7 , H = 1 12 2 1 1 4 5 ,
and we obtain
C 1 S 1 HC = W = 5 0 0 21 , C = 30 1 7 0 2 7 .
One can easily verify that these matrices already satisfy equations (15) and (16). On the other hand, the symmetric matrices S and H can be diagonalized in the usual way by orthogonal matrices that we call U S and U H , respectively.
U S SU S = 1 420 9 74 0 0 9 + 74 , U S = 1 2 5 174 148 1 2 + 5 174 148 1 2 + 5 174 148 1 2 5 174 148 , U H HU H = 1 60 7 34 0 0 70 + 34 , U H = 1 2 3 34 68 1 2 + 3 34 68 1 2 + 3 34 68 1 2 3 34 68
We clearly see that the eigenvalues of S are not unity and those of H are not the RR eigenvalues W i as argued in Section 2.
Using equation (25) one can easily obtain
S 1 / 2 = 233 8880 + 7 7 8880 21 2960 7 7 8880 21 2960 7 7 8880 151 62160 + 7 7 8880 .

4. Conclusions

We have shown that the main equations of the Rayleigh-Ritz variational method [1,2] lead to the mathematical problem of diagonalization of two Hermitian matrices[5]. Although equations (15) and (16) are discussed in some textbooks on quantum chemistry, the latter does not appear to be correctly interpreted[1].

References

  1. F. L. Pilar, Elementary Quantum Chemistry, McGraw-Hill, New York, (1968).
  2. A. Szabo and N. S. Ostlund, Modern Quantum Chemistry, Dover Publications, Inc., Mineola, New York, (1996).
  3. J. K. L. MacDonald, Successive approximations by the Rayleigh-Ritz variation method, Phys Rev. 43 (1933) 830-833.
  4. F. M. Fernández, On the Rayleigh-Ritz variational method, 2022. arXiv:2206.05122 [quant-ph].
  5. R. Benedetti and P. Cragnolini, On simultaneous diagonallzation of one Hermitlan and one symmetric form, Lin Algebra Appl. 57 (1984) 215-226.
Table 1. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for λ = 0
Table 1. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for λ = 0
N E 1 E 2 E 3 E 4
4 4.934874810 19.75077640 51.06512518 100.2492235
5 4.934802217 19.75077640 44.58681182 100.2492235
6 4.934802217 19.73923669 44.58681182 79.99595777
7 4.934802200 19.73923669 44.41473408 79.99595777
8 4.934802200 19.73920882 44.41473408 78.97848206
9 4.934802200 19.73920882 44.41322468 78.97848206
10 4.934802200 19.73920880 44.41322468 78.95700917
11 4.934802200 19.73920880 44.41321981 78.95700917
12 4.934802200 19.73920880 44.41321981 78.95683586
13 4.934802200 19.73920880 44.41321980 78.95683586
14 4.934802200 19.73920880 44.41321980 78.95683521
15 4.934802200 19.73920880 44.41321980 78.95683521
16 4.934802200 19.73920880 44.41321980 78.95683520
17 4.934802200 19.73920880 44.41321980 78.95683520
18 4.934802200 19.73920880 44.41321980 78.95683520
19 4.934802200 19.73920880 44.41321980 78.95683520
20 4.934802200 19.73920880 44.41321980 78.95683520
Table 2. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for λ = 1
Table 2. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for λ = 1
N E 1 E 2 E 3 E 4
4 5.432678349 20.25175971 51.56499993 100.7505620
5 5.432608286 20.25141191 45.08766430 100.7488422
6 5.432607868 20.23989706 45.08714181 80.49674963
7 5.432607855 20.23989074 44.91514957 80.49606992
8 5.432607855 20.23986309 44.91512224 79.47878520
9 5.432607855 20.23986306 44.91361487 79.47871372
10 5.432607855 20.23986304 44.91361453 79.45724985
11 5.432607855 20.23986304 44.91360967 79.45724783
12 5.432607855 20.23986304 44.91360967 79.45707467
13 5.432607855 20.23986304 44.91360966 79.45707465
14 5.432607855 20.23986304 44.91360966 79.45707400
15 5.432607855 20.23986304 44.91360966 79.45707400
16 5.432607855 20.23986304 44.91360966 79.45707400
17 5.432607855 20.23986304 44.91360966 79.45707400
18 5.432607855 20.23986304 44.91360966 79.45707400
19 5.432607855 20.23986304 44.91360966 79.45707400
20 5.432607855 20.23986304 44.91360966 79.45707400
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated