On the Raleigh-Ritz Variational Method. Non-orthogonal Basis Set

Francisco Fernández

doi:10.20944/preprints202405.1006.v1

Submitted:

14 May 2024

Posted:

15 May 2024

You are already at the latest version

Abstract

We overview the main equations of the Rayleigh-Ritz variational method and discuss their connection with the problem of simultaneous diagonalization of two Hermitian matrices.

Keywords:

Rayleigh-RItz

;

variational method

;

matrix diagonalization

;

Hermitian matrix

;

eigenvalues

;

convergence

Subject:

Physical Sciences - Atomic and Molecular Physics

1. Introduction

The Rayleigh-Ritz variational method (RR) is one of the approximate methods most commonly used in the study of the electronic structure of atoms and molecules [1,2]. One of its main advantages is that it provides increasingly accurate upper bounds to all the eigenvalues of the Hamiltonian operator of the system [3,4]. In this paper we provide a comprehensible overview of the approach and illustrate some of its relevant points by means of a simple problem.

2. The Rayleigh-Ritz Variational Method

The starting point of our analysis is a linearly independent set of vectors

V =

\{f_{1}, f_{2}, \dots\}

. Clearly, the only solution to the vector equation

\sum_{i = 1}^{N} a_{i} f_{i} = 0,

(1)

is

a_{i} = 0

for all

i = 1, 2, \dots, N

. If we apply the bras

〈f_{j}|

,

j = 1, 2, \dots, N

, to this equation from the left we obtain

\sum_{i = 1}^{N} S_{j i} a_{i} = 0, j = 1, 2, \dots, N,

(2)

where

S_{i j} = 〈f_{i}| f_{j}〉

. We have an homogeneous system of N linear equations with N unknowns

a_{i}

with the only solution

a_{i} = 0

. Consequently,

|S| \neq 0

where

S = {(S_{i j})}_{i, j = 1}^{N}

is an

N \times N

Hermitian matrix and

|. . .|

stands for determinant. Note that

S_{i j} = S_{j i}^{*}

so that

S^{†} = S

where † stands for adjoint. The matrix

S

is commonly called overlap matrix[1].

Let

v

be an eigenvector of

S

with eigenvalue s,

Sv = s v

, then

v^{†} Sv = s v^{†} v

. If

v_{i}

,

i = 1, 2, \dots, N

, are the elements of the

N \times 1

column vector

v

then

v^{†} Sv = 〈\sum_{i = 1}^{N} v_{i} f_{i}| \sum_{j = 1}^{N} v_{j} f_{j}〉 > 0,

(3)

and we conclude that

s > 0 .

In other words, the overlap matrix

S

is positive definite.

We are interested in the eigenvalue equation

\begin{matrix} H ψ_{n} & = & E_{n} ψ_{n}, n = 1, 2, \dots, \\ E_{1} & \leq & E_{2} \leq \dots, 〈ψ_{i} |ψ_{j}〉 = δ_{i j}, \end{matrix}

(4)

for an Hermitian operator H. In order to solve it approximately we propose and ansatz of the form

φ = \sum_{j = 1}^{N} c_{j} f_{j},

(5)

where

V = \{f_{1}, f_{2}, \dots\}

is not only assumed to be linearly independent but also complete.

The RR variational method consists of minimizing the integral

W = \frac{〈φ| H |φ〉}{〈φ| φ〉},

(6)

with respect to the expansion coefficients

c_{j}

\frac{\partial W}{\partial c_{j}} = 0, j = 1, 2, \dots, N .

(7)

This equation leads to the so-called secular equation[1,2]

\sum_{j = 1}^{N} (H_{i j} - W S_{i j}) c_{j} = 0, i = 1, 2, \dots, N,

(8)

where,

H_{i j} = 〈f_{i}| H |f_{j}〉

. There are nontrivial solutions

c_{j}

,

j = 1, 2, \dots, N

, provided that the secular determinant vanishes

|H - W S| = 0,

(9)

where

H = {(H_{i j})}_{i, j = 1}^{N}

is an

N \times N

Hermitian matrix.

For each of the roots of the secular determinant (9),

W_{1} \leq W_{2} \leq \dots \leq W_{N}

, we derive an approximate solution; for example, when

W = W_{k}

we have

φ_{k} = \sum_{j = 1}^{N} c_{j k} f_{j},

(10)

and the secular equation (8) can be rewritten

\sum_{j = 1}^{N} H_{i j} c_{j k} = \sum_{j = 1}^{N} W_{k} S_{i j} c_{j k} = \sum_{j = 1}^{N} \sum_{m = 1}^{N} S_{i j} W_{m} δ_{m k} c_{j m} .

(11)

If we define the

N \times N

matrices

W = {(W_{i} δ_{i j})}_{i, j = 1}^{N}

and

C = {(c_{i j})}_{i, j = 1}^{N}

then this equation can be rewritten in matrix form as

HC = SCW,

(12)

which is equivalent to

C^{- 1} S^{- 1} HC = W,

(13)

and the procedure reduces to the diagonalization of the matrix

S^{- 1} H

by means of the invertible matrix

C

. Note that

S^{- 1}

exists because

S

is positive definite as argued above.

In order to determine the coefficients

c_{j k}

completely, we require that

〈φ_{i} |φ_{j}〉 = δ_{i j}

that leads to

〈φ_{i} |φ_{j}〉 = \sum_{k = 1}^{N} \sum_{m = 1}^{N} c_{k i}^{*} c_{m j} 〈f_{k}| f_{m}〉 = δ_{i j},

(14)

that in matrix form reads

C^{†} SC = I,

(15)

where

I

is the

N \times N

identity matrix. It follows from equations (15) and (12) that

C^{†} HC = W .

(16)

It is clear that there exists an invertible matrix (

C

) that transforms two Hermitian matrices (

H

and

S

), one of them positive definite (

S

), into diagonal form. This procedure is well known in the mathematical literature[5]. However, it is most important to note that equations (15) and (16) are not what we commonly know as matrix diagonalization. In fact, the eigenvalues of

S

are not unity and the eigenvalues of

H

are not the RR eigenvalues

W_{i}

. We will illustrate this point in Section 3 by means of a simple example. It is also worth noting that that we cannot obtain

C

neither from (15) or (16). One obtains the matrix

C

in the process of diagonalizing

S^{- 1} H

as in equation (13) and the remaining undefined matrix elements

c_{i j}

from equation (15).

Since

S

is positive definite, we can define

S^{1 / 2}

. The matrix

U = S^{1 / 2} C

is unitary as shown by

U^{†} U = C^{†} S^{1 / 2} S^{1 / 2} C = I .

(17)

On substituting

C = S^{- 1 / 2} U

into equation (16) we obtain

U^{†} S^{- 1 / 2} {HS}^{- 1 / 2} U = W .

(18)

This equation is just the standard diagonalization of the Hermitian matrix

S^{- 1 / 2} {HS}^{- 1 / 2}

.

If the basis set is orthonormal,

〈f_{i}| f_{j}〉 = δ_{i j}

, then

S = I

,

C^{†} = C^{- 1}

and the secular equation (13) becomes

C^{†} HC = W .

(19)

In this particular case, the eigenvalues of the matrix

H

are the RR eigenvalues

W_{i}

. Note that equations (16) and (19) look identical but were derived under different assumptions (they agree only when

S = I

).

3. Simple Example

As a simple example we consider the dimensionless eigenvalue equation

H ψ = E ψ, H = - \frac{1}{2} \frac{d^{2}}{d x^{2}} + λ x, ψ (0) = ψ (1) = 0 .

(20)

In order to illustrate the RR variational method with a non-orthogonal basis set we choose

f_{i} (x) = x^{i} (1 - x)

,

i = 1, 2, \dots

, that satisfy the boundary conditions at

x = 0

and

x = 1

.

A straightforward calculation shows that

S_{i j} = \frac{2}{(i + j + 1) (i + j + 2) (i + j + 3)},

(21)

and

H_{i j} = \frac{i j}{(i + j) (i + j + 1) (i + j - 1)} + \frac{2 λ}{(i + j + 2) (i + j + 3) (i + j + 4)} .

(22)

Table 1 and Table 2 show the RR eigenvalues

W_{i}

,

i = 1, 2, 3, 4

, for

λ = 0

and

λ = 1

, respectively. We appreciate that the approximate eigenvalues converge from above as expected[3,4].

In what follows, we illustrate some of the general results of Section 2 for the simplest case

N = 2

when

λ = 0

. The matrices are

S = \frac{1}{60} (\begin{matrix} 2 & 1 \\ 1 & \frac{4}{7} \end{matrix}), H = \frac{1}{12} (\begin{matrix} 2 & 1 \\ 1 & \frac{4}{5} \end{matrix}),

(23)

and we obtain

C^{- 1} S^{- 1} HC = W = (\begin{matrix} 5 & 0 \\ 0 & 21 \end{matrix}), C = \sqrt{30} (\begin{matrix} 1 & \sqrt{7} \\ 0 & - 2 \sqrt{7} \end{matrix}) .

(24)

One can easily verify that these matrices already satisfy equations (15) and (16). On the other hand, the symmetric matrices

S

and

H

can be diagonalized in the usual way by orthogonal matrices that we call

U_{S}

and

U_{H}

, respectively.

\begin{matrix} U_{S}^{†} {SU}_{S} & = & \frac{1}{420} (\begin{matrix} 9 - \sqrt{74} & 0 \\ 0 & 9 + \sqrt{74} \end{matrix}), \\ U_{S} & = & (\begin{matrix} \sqrt{\frac{1}{2} - \frac{5 \sqrt{174}}{148}} & \sqrt{\frac{1}{2} + \frac{5 \sqrt{174}}{148}} \\ - \sqrt{\frac{1}{2} + \frac{5 \sqrt{174}}{148}} & \sqrt{\frac{1}{2} - \frac{5 \sqrt{174}}{148}} \end{matrix}), \\ U_{H}^{†} {HU}_{H} & = & \frac{1}{60} (\begin{matrix} 7 - \sqrt{34} & 0 \\ 0 & 70 + \sqrt{34} \end{matrix}), \\ U_{H} & = & (\begin{matrix} \sqrt{\frac{1}{2} - \frac{3 \sqrt{34}}{68}} & \sqrt{\frac{1}{2} + \frac{3 \sqrt{34}}{68}} \\ - \sqrt{\frac{1}{2} + \frac{3 \sqrt{34}}{68}} & \sqrt{\frac{1}{2} - \frac{3 \sqrt{34}}{68}} \end{matrix}) \end{matrix}

(25)

We clearly see that the eigenvalues of

S

are not unity and those of

H

are not the RR eigenvalues

W_{i}

as argued in Section 2.

Using equation (25) one can easily obtain

S^{1 / 2} = (\begin{matrix} \sqrt{\frac{233}{8880} + \frac{7 \sqrt{7}}{8880}} & \sqrt{\frac{21}{2960} - \frac{7 \sqrt{7}}{8880}} \\ \sqrt{\frac{21}{2960} - \frac{7 \sqrt{7}}{8880}} & \sqrt{\frac{151}{62160} + \frac{7 \sqrt{7}}{8880}} \end{matrix}) .

(26)

4. Conclusions

We have shown that the main equations of the Rayleigh-Ritz variational method [1,2] lead to the mathematical problem of diagonalization of two Hermitian matrices[5]. Although equations (15) and (16) are discussed in some textbooks on quantum chemistry, the latter does not appear to be correctly interpreted[1].

References

F. L. Pilar, Elementary Quantum Chemistry, McGraw-Hill, New York, (1968).
A. Szabo and N. S. Ostlund, Modern Quantum Chemistry, Dover Publications, Inc., Mineola, New York, (1996).
J. K. L. MacDonald, Successive approximations by the Rayleigh-Ritz variation method, Phys Rev. 43 (1933) 830-833.
F. M. Fernández, On the Rayleigh-Ritz variational method, 2022. arXiv:2206.05122 [quant-ph].
R. Benedetti and P. Cragnolini, On simultaneous diagonallzation of one Hermitlan and one symmetric form, Lin Algebra Appl. 57 (1984) 215-226.

Table 1. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for

λ = 0

Table 1. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for

λ = 0

N	$E_{1}$	$E_{2}$	$E_{3}$	$E_{4}$
4	4.934874810	19.75077640	51.06512518	100.2492235
5	4.934802217	19.75077640	44.58681182	100.2492235
6	4.934802217	19.73923669	44.58681182	79.99595777
7	4.934802200	19.73923669	44.41473408	79.99595777
8	4.934802200	19.73920882	44.41473408	78.97848206
9	4.934802200	19.73920882	44.41322468	78.97848206
10	4.934802200	19.73920880	44.41322468	78.95700917
11	4.934802200	19.73920880	44.41321981	78.95700917
12	4.934802200	19.73920880	44.41321981	78.95683586
13	4.934802200	19.73920880	44.41321980	78.95683586
14	4.934802200	19.73920880	44.41321980	78.95683521
15	4.934802200	19.73920880	44.41321980	78.95683521
16	4.934802200	19.73920880	44.41321980	78.95683520
17	4.934802200	19.73920880	44.41321980	78.95683520
18	4.934802200	19.73920880	44.41321980	78.95683520
19	4.934802200	19.73920880	44.41321980	78.95683520
20	4.934802200	19.73920880	44.41321980	78.95683520

Table 2. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for

λ = 1

Table 2. Convergence of the Rayleigh-Ritz variational method with a non-orthogonal basis set for

λ = 1

N	$E_{1}$	$E_{2}$	$E_{3}$	$E_{4}$
4	5.432678349	20.25175971	51.56499993	100.7505620
5	5.432608286	20.25141191	45.08766430	100.7488422
6	5.432607868	20.23989706	45.08714181	80.49674963
7	5.432607855	20.23989074	44.91514957	80.49606992
8	5.432607855	20.23986309	44.91512224	79.47878520
9	5.432607855	20.23986306	44.91361487	79.47871372
10	5.432607855	20.23986304	44.91361453	79.45724985
11	5.432607855	20.23986304	44.91360967	79.45724783
12	5.432607855	20.23986304	44.91360967	79.45707467
13	5.432607855	20.23986304	44.91360966	79.45707465
14	5.432607855	20.23986304	44.91360966	79.45707400
15	5.432607855	20.23986304	44.91360966	79.45707400
16	5.432607855	20.23986304	44.91360966	79.45707400
17	5.432607855	20.23986304	44.91360966	79.45707400
18	5.432607855	20.23986304	44.91360966	79.45707400
19	5.432607855	20.23986304	44.91360966	79.45707400
20	5.432607855	20.23986304	44.91360966	79.45707400

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

On the Raleigh-Ritz Variational Method. Non-orthogonal Basis Set

Abstract

Keywords:

Subject:

1. Introduction

2. The Rayleigh-Ritz Variational Method

3. Simple Example

4. Conclusions

References

MDPI Initiatives

Important Links

Subscribe