1. Introduction
The Density Functional Theory (DFT) is one of the most widely used methods for calculations of the structure of inhomogeneous many-body systems including atoms, molecules, liquids, solids, and surfaces [
1,
2] (for a pedagogic introduction see [
3]). The theory, which finds its earliest roots in the Thomas-Fermi-Dirac model, was first introduced in its modern form by Hohenberg and Kohn who showed that the ground state of an electron gas in an external potential can be uniquely characterized by the electron density [
4] and by Kohn and Sham who showed how to include the effects of exchange and correlations [
5]. The implications of these ideas were soon extended to finite temperatures in the context of the grand canonical framework [
6] (see also [
7,
8,
9] and references therein) and to the statistical mechanics of non-uniform classical fluids, such as the liquid-vapor interface [
10,
11,
12,
13] (see also [
14] for more references).
In previous work we derived the classical DFT as an application of the method of maximum entropy [
15]. A central concept is the use of entropy itself as a tool to generate optimal approximations to probability distributions [
17] in terms of those variables that capture the relevant physical information namely, the particle density
. We showed that the entropic DFT (eDFT) approach directly leads to Evans’ variational principle of the classical DFT [
11].
In this paper we are concerned with the very foundations of the DFT framework and our main goal is to extend the entropic DFT (eDFT) formalism to the quantum domain. We emphasize that our goal is neither to derive an alternative to DFT nor to develop improvements to the approximations that are inevitably necessary to the successful implementation of DFT in practical applications.
In
Section 2 we review the use of relative entropy as a tool to update density operators in response to new information and we extend the use of entropy as a tool to derive optimal approximations from the classical context [
17] to the quantum domain. In
Section 3 we construct the entropic DFT formalism and prove a form of the Hohenberg-Kohn theorem at finite temperature within the canonical (fixed number of particles) framework. In
Section 4, as an illustration of the eDFT formalism, we discuss the Kohn-Sham model in the local density approximation. Finally in
Section 5 we summarize our conclusions.
2. Preliminaries
The realization that a fundamental theory such as thermodynamics should be interpreted as an application of a general scheme for inference on the basis of information codified into constraints can be traced to Brillouin and Jaynes [
18,
19,
20,
21,
22]. According to Jaynes – as motivated by the Shannon’s axioms [
23] – entropy is interpreted as the amount of information that is missing in a probability distribution. The preferred probability distribution is that which agrees with what we know — the information codified into the constraints — but is maximally ignorant about everything else. Thus, one is led to maximize the entropy subject to constraints, a procedure dubbed the MaxEnt method.
A drawback of this approach is that the interpretation of entropy as an amount of missing information is not completely satisfactory. To address this problem Shore and Johnson [
24] proposed that one could avoid invoking questionable measures of information by directly axiomatizing the method for updating probabilities through a variational principle that involved maximizing an entropy functional satisfying certain desirable properties. The question of why should one adopt a variational principle was later clarified by Skilling [
25] who proposed a simple ranking strategy: in order to select an optimal distribution (he had in mind the more general case of positive additive distributions which include
e.g. intensities in an image) one proceeds by ranking the distributions according to some preference criteria and then choosing the one which ranks the highest. The ranking scheme is naturally implemented by associating a real number – the entropy – to each distribution with the preference criteria fixed through the axioms of Shore and Johnson. In later work the nature of the method of maximum entropy was further streamlined as a scheme
designed to update probabilities when confronted with new information. In this approach the question “what is information?” receives a very simple answer.
Information is just the constraints we decide to impose on our beliefs, and there is no need to define “amounts” of information. The motivation behind the design criteria was clarified and their number reduced from five to two [
26,
27,
28,
29] (reviewed in [
31] and [
39]).
2.1. The Quantum MaxEnt Method
The task of extending the method of maximum entropy to the quantum domain as a method to update density operators was carried out by Vanslette [
29]. The goal is to update a prior density operator
when provided with new information in the form of the expected value of some self-adjoint operators
. Vanslette showed that the Umegaki relative entropy [
32],
provides the
unique criterion to rank density operators
relative to the prior
.
The maximization of
subject to the constraints
and normalization,
leads to the posterior density operator
where
Substituting
back into eq.(
1) gives the value of the maximized entropy,
It is widely known that the classical MaxEnt method leads to a mathematical formalism characterized by a contact structure (see
e.g., [
35,
36]). In a parallel development the use of Legendre transforms in the context of DFT has also been widely explored [
7,
33,
34]. These results can be extended to the quantum domain leading to a similar contact structure (see
e.g., [
16]). The significance of these results is that the physical content of the formalism is preserved under Legendre transformations quite independently of restrictions to thermal equilibrium and of the physical significance of the so-called “free energies” or Massieu functions.
2.2. Optimal Approximations of Density Operators
The last prerequisite for the construction of the DFT formalism is a systematic method of approximation for density operators. The method we adopt is an extension of the technique developed by Tseng and Caticha in the classical context [
17]. The problem is that the exact probability distributions
Q obtained using the MaxEnt method are often too intractable to be useful in actual calculations. The solution is to consider a family of more tractable trial distributions
dependent on some parameters
. The goal is to select the trial distribution
that best approximates the exact distribution
Q. In [
17] it was argued that the criterion to select the optimal parameters
is again provided by the method of maximum entropy: The optimal
is that which is “closest” to the exact
Q in the sense that it maximizes the relative entropy
.
Next, we extend this approximation technique to the quantum domain. We consider a family of tractable density operators
parametrized by parameters
. The member of the trial family
that best approximates the exact density operator
is the one which maximizes the entropy of
relative to
,
As an example, consider the special case where
and
take the exponential form,
the Gibbs inequality,
reduces to the Bogolyubov inequality,
where
Thus, the argument above shows the popular approximation method based on the Bogolyubov inequality (see
e.g., [
37]) is a special case of the more general approximation method based on entropy maximization.
3. Density functional formalism
The goal of the DFT formalism is to find tractable approximations to study the structure of matter. The first crucial step is to recognize that the quantity that captures the desired structural information is the electron density . We wish to design a formalism in which the central role played by the electron density is explicitly displayed.
In the absence of magnetic fields the time independent Schrödinger equation for an electron gas of
N particles is
where
and
is an antisymmetrized product of
N two-spinor orbitals. The potential
describes interparticle interactions and the potential
describes interactions with nuclei and other external potentials.
3.1. Introducing density as the relevant variable
We are interested in the thermal properties of an inhomogeneous electron fluid and therefore we need trial states that describe both thermal equilibrium and inhomogeneity. The former is imposed by a constraint on the expected value of energy and the latter is incorporated by constraints on the expected value
of the electron density
. The continuous density function
plays a role analogous to the discrete parameters
in equations (
6-
10).
Adopting a uniform prior, the relevant trial states are obtained by maximizing the entropy
subject to the constraints
where
To be clear, throughout this work the trace is taken over the Hilbert space of a fixed number
N of particles and in this respect our formalism
resembles the canonical ensemble approach. Indeed, all states
in the Hilbert space are eigenstates of the number operator,
but they need not be eigenstates of the density operators
. Our formalism
differs from the canonical formalism in that eq.(16) represents an additional infinite number of constraints — one constraint on the expected density function
at each point in space. Due to (
18) the expected density function
is not arbitrary; it is constrained to obey (
18).
Proceeding to the MaxEnt analog of eq.(
3) we find the trial density operator
where
and where
and the infinite number of Lagrange multipliers
are implicitly determined by
with the additional constrain (
17),
The notation
indicates that
Z is a function of
and a functional of
and depends on
through the Hamiltonian
. At this point in the argument there is no implication that the trial states
are in any way more computationally tractable than the exact state
obtained from (
19) by setting
to zero.
Next we calculate the entropy of
relative to the uniform prior to define the trial entropy,
An important symmetry of the DFT formalism, which is what makes the whole DFT formalism work, arises from the fact that the dependence of
and
on
and
occurs only through the particular combination
The reason for the subscript ‘int’, which denotes ‘intrinsic’, will become clear later in eq.(
56). This DFT symmetry implies that a change in the potential
can be compensated by a suitable change in the multiplier
in such a way that
and the expected density
remain unaffected. From (
12) and (
24) we find that (
20) can be written as
so that eqs.(
19) and (
21) become
and
3.2. The entropic DFT variational principle
The exact canonical density operator
is found by maximizing (
13) subject to (
14) and (15). The result can be read off eq.(
19) by setting
,
(We use a star * to denote exact canonical quantities.) The goal is to approximate
by the best matching member of the family
with all density operators referring to the same
and
N. This involves maximizing the entropy of
relative to
,
From (
19) and (
28) we find
Introducing a Lagrange multiplier
to enforce the constraint on
N we have,
From the construction above one might expect that the optimal
coincides with the exact
. We can check that this is indeed the case. Substituting eq.(
30) into (
31) we find
The LHS vanishes by eq.(
21). Therefore, the optimal
is achieved for
. From (
19), (
28) and (
30) we see that
which means that imposing the
N constraint was unnecessary: the optimal density reproduces the exact density
whether the variations
preserve the total
N or not.
We conclude that the entropic DFT variational principle,
leads to an optimal
which coincides with the exact canonical
in eq.(
28),
Thus, at this point our “approximation” scheme is (trivially) exact: by explicit construction we have demonstrated the existence of a functional of the density
,
and
N — the relative entropy
— that assumes its maximum value at the exact density
. At this point, however, we have not yet shown that this variational principle is equivalent to the thermal DFT principle derived by Mermin [
6]. This, we show next.
3.3. The DFT Theorem
Equations (
23) and (
30) allows us to write
where we have introduced the “free energy” functional
The new functional
,
allows us to rewrite the entropic variational principle (
31) as
The optimal density
is found by minimizing
at fixed
and
N. Furthermore, from (
37) the multipliers
are obtained from
From eq.(
34),
, we obtain
which has been called the “core integro-differential equation of DFT” [
11].
To proceed further, substitute (
12), (15), into (
36) to find
so that
where we have introduced
We are now ready to state the finite temperature DFT theorem.
The Density Functional Theorem:
The density functional is independent of the external potential ,
This result justifies dropping the index
v,
and referring to
as the
intrinsic density functional. (The term ‘intrinsic’ indicates that
is independent of the external potential
.)
Proof: The crucial observation behind the entropic DFT formalism is that
and
depend on the external potential
and the Lagrange multiplier function
only through the particular combination
defined in (
24). Substitute (
23), (
24) and (
25) into (
43) to get
Then the derivative
at fixed
and
is
Eq.(
26) shows that keeping
fixed is achieved by keeping
fixed and vice versa, therefore
which implies (
44) and concludes the proof.
Equations (
19) and (
40) suggest that (up to an additive constant) the multiplier
plays a role
analogous to that of a chemical potential. Let us then use eq.(
39) to introduce
which we shall call the “
local chemical potential.” The core equation (
40) has a natural interpretation: the condition for neighboring volume elements to be in equilibrium is that the local chemical potential be uniform,
The optimal value of
is
From eq.(
42) we have
while eq.(
49) gives
Subtracting these two equations gives
which shows that
can be interpreted as the
local intrinsic chemical potential,
with
Evaluating at
gives the
equilibrium intrinsic chemical potential,
(The term ‘intrinsic’ reminds us that both
and
are independent of the external potential
.)
We mentioned earlier that the multiplier
plays a role
analogous to that of a chemical potential. We can now be more explicit. Let
be the actual chemical potential and the intrinsic chemical potential at location
x,
Equilibrium among different volume elements is achieved when
Then eqs.(
49)-(
51) lead us to identify
and
We can express the eDFT variational principle in terms of
F. Using eqs.(
38) and (
42) we find
To summarize, we have reproduced the foundational theorem behind the thermal DFT formalism as an application of maximum entropy methods. This is the main result of this paper. The treatment, so far, has been exact. In the next section, as an illustration of the method, we adapt the well-known Kohn-Sham model to the entropic DFT approach.
4. The Kohn-Sham approximation scheme
The exact calculation of requires calculating . Unfortunately, this is just as difficult as calculating the original canonical partition function which was precisely what we wanted to avoid. An analogous problem arises in the standard many-body theory: even for relatively small particle numbers the calculation of the N-particle wave function becomes impractically difficult because the wave function lives in a -dimensional configuration space. The DFT framework attempts to evade this problem by focusing attention on the hopefully easier problem of calculating the density which is a function that lives in a mere 3 dimensions. Unfortunately, the problem is not solved, but merely transferred to the calculation of the functional . Not all is lost, however, because the reformulation in terms of the density suggests new useful approximations.
The discussion below parallels closely the ground state formulation of Kohn and Sham [
5]. It differs from the grand canonical thermal DFT of Mermin [
6] in that here we remain within the canonical framework of fixed particle number. In common with the Hartree-Fock approximation the Kohn-Sham model reduces an interacting many-particle Schrödinger equation to that of a single particle in the presence of an effective potential that includes exchange and correlation effects. An important advantage is that, unlike Hartree-Fock, the Kohn-Sham framework can
in principle be exact. In practice, however, the success of the model hinges on whether the approximations for exchange and correlations are sufficiently simple and accurate. Fortunately, the “local density approximation,” which is exact for a uniform electron gas, and should remain valid for slowly varying potentials, has turned out to be quite successful for the prediction of bond lengths and molecular structures even when these involve inhomogeneities at the atomic scale.
Referring to eq.(
43) the idea is that
can be split into three terms,
The first term
represents the intrinsic free energy of a gas of
non-interacting and
uncorrelated particles at the same temperature and density. The second term
is the classical Coulomb interaction,
that represents the dominant contribution from the interparticle potential term
in (
43). The third
is a correction that accounts for all additional exchange and correlations effects. To the extent that we can define
to be the difference
equation (
64) is trivially exact.
We are now ready to substitute (
64) into the eDFT variational principle (
63). The result is
where we introduced
So far this is exact. However, to make further progress we note that although exchange correlations are intrinsically non-local, for a thermal system we can assume that entanglement effects are appreciable only over short distances. Therefore it might not be unreasonable to approximate
by a sum over independent volume elements. Accordingly, we adopt the so-called local density approximation,
where the function
is assumed known: it is the exchange correlation free energy per particle for a uniform electron gas with density
n. The corresponding potential
is therefore also known.
To find the optimal density
that solves the variational equation (
67) we can use the same trick introduced by Kohn and Sham. They noticed that their variational equation for the ground state — the analogue of our eq.(
67) — is exactly of the form one obtains for a gas of non-interacting and uncorrelated particles moving in an effective single-particle potential. This leads us to rewrite (
67) as
where
Thus, the problem of
N interacting particles has been translated into the problem of a single particle moving in an density-dependent effective potential created by all the other particles. This shows that we can adopt the same iterative procedure followed with the Hartree self-consistent potential. If
is the density at the
iteration, use (
72) to construct the potential
, and solve the single-particle equation,
Then construct the density
for the next iteration as the thermal average,
where the cutoff
is such the occupation of orbitals with
can be neglected and
is found by imposing
. The process is repeated until convergence to the optimal
is achieved.
Just as in the standard Kohn-Sham model neither the single particle potential , nor the wave functions and energies are to be given any real physical interpretation. They are auxiliary quantities whose only purpose is the calculation of the physical density .
5. Conclusion
To summarize our conclusions:
We have produced a
reconstruction of DFT that makes explicit how DFT fits within an ongoing research program that places the concepts of entropy and information at the very foundation for all of physics (see
e.g., [
31]). This includes statistical mechanics [
18,
19,
20,
21,
22], quantum mechanics [
38,
39], and as we have shown in this work, also the main techniques to study structure — variational principles including mean field methods and DFT.
We extended the use of entropy as a systematic method to generate optimal approximations from the classical to the quantum domain. This allowed an entropic reconstruction of quantum DFT. This process involves a family of trial density operators parametrized by the particle density. The optimal density operator is found by maximizing the quantum entropy relative to the exact canonical density operator. This approach reproduces the variational principle of DFT and allows a proof of the Hohenberg-Kohn theorem at finite temperature that is simpler in that it evades some of the subtleties of the ground state formalism. Our formalism differs from previous approaches in that (i) the central role of entropy is explicit, and (ii) we remain with the canonical ensemble formalism.
Acknowledgments
We are thankful to Oleg Lunin, Carlo Cafaro, Herbert Fotso, and Daniel Robins, for their reading and insightful comments as the committee members for Yousefi’s thesis; from which this paper is extracted.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Kohn, W. Noble Lecture: Electronic structure of matter—wave functions and density functionals. Rev. Mod. Phys. 1999, 71, 1253. [Google Scholar] [CrossRef]
- Jones, R.O. Density functional theory: Its origins, rise to prominence, and future. Rev. Mod. Phys. 2015, 87, 897. [Google Scholar] [CrossRef]
- Argaman, M.; Makov, G. Density functional theory: an introduction. Am. J. Phys. 2000, 68, 69. [Google Scholar]
- Hohenberg, P.; Kohn, W. Inhomogeneous electron gas. Phys. Rev. 1964, 136, B864. [Google Scholar] [CrossRef]
- Kohn, W.; Sham, L.J. Self-consistent equations including exchange and correlation effects. Phys. Rev. 1965, 140, A1133. [Google Scholar] [CrossRef]
- Mermin, D. Thermal properties of inhomogeneous electron gas. Phys. Rev. 1965, 137, A1441. [Google Scholar] [CrossRef]
- Eschrig, H. T>0 ensemble-state density functional theory via Legendre transform. Phys. Rev. B 2010, 205120. [Google Scholar]
- Pribram-Jones, A.; Pittalis, S.; Gross, E.; Burke, K. Thermal Density Functional Theory in Context. In Frontiers and Challenges in Warm Dense Matter, Lecture Notes in Computational Science and Engineering; Graziani, F., et al., Eds.; Springer International Publishing, 2014; Volume 96, pp. 25–60. [Google Scholar]
- Burke, K.; Smith, J.C.; Grabowski, P.E.; Pribram-Jones, A. Exact conditions on the temperature dependence of density functionals. Phys. Rev. B 2016, 93, 195132. [Google Scholar] [CrossRef]
- Ebner, C.; Saam, W.F.; Stroud, D. Density-functional theory of simple classical fluids. I. Surfaces. Phys. Rev. A 1976, 14, 2264. [Google Scholar] [CrossRef]
- Evans, R. The nature of the liquid-vapor interface and other topics in the statistical mechanics of non-uniform classical fluids. Advances in Physics 1979, 28, 143–200. [Google Scholar] [CrossRef]
- Tarazona, P. Free-energy density functional for hard spheres. Phys. Rev. A 1985, 31, 2672. [Google Scholar] [CrossRef] [PubMed]
- Rosenfeld, Y. Free-energy model for the inhomogeneous hard-sphere fluid mixture and density-functional theory of freezing. Phys. Rev. Lett. 1989, 63, 980. [Google Scholar] [CrossRef] [PubMed]
- Evans, R.; Oettel, M.; Roth, R.; Kahl, G. New developments in classical density functional theory. J. Phys.: Condens. Matter 2016, 28, 240401. [Google Scholar] [CrossRef] [PubMed]
- Yousefi, A.; Caticha, A. An entropic approach to classical density functional theory. Phys. Sci. Forum 2021, arXiv:2108.015943, 13. [Google Scholar]
- Yousefi, A. Entropic density functional theory: entropic inference and the equilibrium state of inhomogeneous fluids; State University of New York at Albany ProQuest Dissertation Publishing, 2021; p. 28863510. [Google Scholar]
- Tseng, C.-Y.; Caticha, A. Using relative entropy to find optimal approximations: an application to simple fluids. Physica A 2008, arXiv:0808.4160v1387, 6759. [Google Scholar] [CrossRef]
- Brillouin, L. Science and Information Theory; Academic Press: New York, 1952. [Google Scholar]
- Brillouin, L. The negentropy principle of information. J. Appl. Phys. 1953, 24, 1152. [Google Scholar] [CrossRef]
- Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620. [Google Scholar] [CrossRef]
- Jaynes, E.T. Information theory and statistical mechanics II. Phys. Rev. 1957, 108, 171. [Google Scholar] [CrossRef]
- Jaynes, E.T. Where do we stand on maximum entropy? In The Maximum Entropy Principle; Levine, R.D., Tribus, M., Eds.; MIT Press, 1979. [Google Scholar]
- Shannon, C.E. A mathematical theory of communication. The Bell System Technical Journal 1948, 27, 3. [Google Scholar] [CrossRef]
- Shore, J.; Johnson, R. Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy. IEEE Transactions on Information Theory 1980, 26, 26–37. [Google Scholar] [CrossRef]
- Skilling, J. The Axioms of Maximum Entropy. Maximum-Entropy and Bayesian Methods in Science and Engineering 1988, 173–187. [Google Scholar]
- Caticha, A. Relative Entropy and Inductive Inference. AIP Conf. Proc. 2004, arXiv:physics/0311093707, 75. [Google Scholar]
- Caticha, A.; Giffin, A. Updating Probabilities. AIP Conf. Proc 2006, arXiv:physics/0608185872, 31. [Google Scholar]
- Caticha, A. Information and Entropy. AIP Conf. Proc. 2007, arXiv:0710.1068954, 11. [Google Scholar]
- Vanslette, K. Entropic Updating of Probabilities and Density Matrices. Entropy 2017, arXiv:1710.0937319, 664. [Google Scholar] [CrossRef]
- Caticha, A. Entropy, Information, and the Updating of Probabilities. Entropy 2021, arXiv:2107.0452923, 895. [Google Scholar] [CrossRef] [PubMed]
- Caticha, A. Entropic Physics: Probability, Entropy and the Foundations of Physics. Available online: https://www.arielcaticha.com/ (accessed on 20 November 2023).
- Umegaki, H. Conditional expectation in an operator algebra. IV. Entropy and information. Kodai Math Sem. Rep. 1962, 14, 59–85. [Google Scholar] [CrossRef]
- Lieb, E.H. Density functionals for Coulomb systems. Int. J. Quantum Chem. 1983, 24, 243. [Google Scholar] [CrossRef]
- Fukuda, R.; Kotani, T.; Suzuki, Y.; Yokojima, S. Density functional theory through Legendre transformation. Prog. Theor. Phys. 1994, 92, 833–862. [Google Scholar] [CrossRef]
- Rajeev, S.G. A Hamilton-Jacobi formalism for thermodynamics. Annals of Physics 2008, arXiv:0711.4319323, 2265–2285. [Google Scholar] [CrossRef]
- Balian, R.; Valentin, P. Hamiltonian structure of thermodynamics with gauge. Eur. Phys. J. B 2001, 21, 269–282. [Google Scholar] [CrossRef]
- Feynman, R.P. Statistical mechanics: a set of lectures; W. A. Benjamin: Reading, MA, USA, 1972. [Google Scholar]
- Caticha, A. The Entropic Dynamics Approach to Quantum Mechanics. Entropy 2019, arXiv:1908.0469321, 943. [Google Scholar] [CrossRef]
- Caticha, A. Quantum Mechanics as Hamilton-Killing Flows on a Statistical Manifold. Phys. Sci. Forum 2021, arXiv:2107.085023, 12. [Google Scholar]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).