up


LDA+U Method: Ver. 1.0

Taisuke Ozaki, ISSP, the Univ. of Tokyo

1 Total energy

In conjunction with on-site terms of the unrestricted Hartree-Fock theory, the total energy of a LDA+U method [1] within the collinear spin treatment could be defined by

ELDA+U=ELDA+EU (1)

with

EU = 12σiplUipl[Tr(Niplσ)-Tr(NiplσNiplσ)], (2)
= 12σsUs[Tr(Nsσ)-Tr(NsσNsσ)],

where i is a site index, l an angular momemtum quantum number, p a multiplicity number of radial basis functions, σ a spin index, and s an organized index of (ipl). N is an diagonalized occupation matrix. U is the effective Coulomb electron-electron interaction energy. Considering the rotational invariance of total energy with respect to each subshell s, Eq. (2) can be transformed as follows:

EU = 12σsUs[Tr(AsNsσAs)-Tr(AsNsσAsAsNsσAs)], (3)
= 12σsUs[Tr(nsσ)-Tr(nsσnsσ)],
= 12σsUs[mnsmmσ-m,mnsmmσnsmmσ].

In the Eq. (3), although off-diagonal occupation terms in each subshell s are taken into account, however, those between subshells are neglected. This treatment is consistent with their rotational invariant functional by Dudarev et al. [2], and is a simple extension of the rotational invariant functional for the case that a different U-value is given for each basis orbital indexed with s(ipl). In this simple extension, we can not only include multiple d-orbitals as basis set, but also can easily derive the force on atoms in a simple form as discussed later on.

The ELDA+U can be expressed in terms of the Kohn-Sham eigenenergies ενσ as follows:

ELDA+U = ELDA+EU, (4)
= Eband+[Eee+Ecc+Exc-σ,νψνσ|v^LDAσ|ψνσ]+[EU-σ,νψνσ|v^Uσ|ψνσ],
= Eband+ΔELDA+12σsUsm,mnsmmσnsmmσ,
= Eband+ΔELDA+ΔEU,

where ΔELDA and ΔEU are the double couting corrections of LDA- and U-energies, respectively.

2 Occupation number

The occupation number n may be defined by

nsmmσ = νfνψνσ|n^smmσ|ψνσ, (5)

where, to count the occupation number n, we define three occupation number operators given by
on-site

n^smmσ=|smσ~smσ~|, (6)

full

n^smmσ=|smσsmσ|, (7)

dual

n^smmσ=12(|smσ~smσ|+|smσsmσ~|), (8)

where |smσ~ is the dual orbital of a original non-orthogonal basis orbital |smσ, and is defined by

|smσ~=smSsm,sm-1|smσ (9)

with the overlap matrix S between non-orthogonal basis orbitals. Then, the following bi-orthogonal relation is verified:

smσ~|smσ=δsmσ,smσ. (10)

The on-site and full occupation number operators have been proposed by Eschrig et al. [3] and Pickett et al. [4], respectively. It is noted that these definitions do not satisfy a sum rule that the trace of the occupation number matrix is equivalent to the total number of electrons, while only the dual occupation number operator fulfills the sum rule as follows:

σTr(nσ)=σ12{Tr(Sρσ)+Tr(ρσS)}=Nele, (11)

where ρσ is the density matrix defined by

ρsm,smσ = νfνψνσ|ρ^sm,smσ|ψνσ, (12)
= νfνcsmσ,*csmσ

with a density operator:

ρ^sm,smσ=|smσ~smσ~|. (13)

The notes limit the discussion to non-Bloch wave functions for simplicity, but the extension is straightforward. For three definition of occupation number operators, on-site, full, and dual, the occupation numbers are given by
on-site

nsmmσ=ρsm,smσ, (14)

full

nsmmσ=tn,tnρtn,tnσStn,smSsm,tn, (15)

dual

nsmmσ=12tn{Ssm,tnρtn,smσ+ρsm,tnσStn,sm}. (16)

3 Effective potential

The derivative of the total energy Eq. (1) with respect to LCAO coefficient cν,tnσ is given by

ELDA+Ucν,tnσ,* = ELDAcν,tnσ,*+EUcν,tnσ,*, (17)
= ELDAcν,tnσ,*+smmEUnsmmσnsmmσcν,tnσ,*
= ELDAcν,tnσ,*+smmUs(12δmm-nsmmσ)nsmmσcν,tnσ,*
= ELDAcν,tnσ,*+smmvU,smmσnsmmσcν,tnσ,*

with
on-site

nsmmσcν,tnσ,*=δstδmncν,smσ (18)

full

nsmmσcν,tnσ,*=tnStn,smSsm,tncν,tnσ (19)

dual

nsmmσcν,tnσ,*=12{δstδmntnSsm,tncν,tnσ+cν,smσStn,sm} (20)

Substituting Eqs. (18)-(20) for the second term of Eq. (17), we see
on-site

smmvU,smmσnsmmσcν,tnσ,*=tntnσ|[smm|smσ~vU,smmσsmσ~|]|tnσcν,tnσ, (21)

full

smmvU,smmσnsmmσcν,tnσ,*=tntnσ|[smm|smσvU,smmσsmσ|]|tnσcν,tnσ, (22)

dual

smmvU,smmσnsmmσcν,tnσ,*=tntnσ|12smm[|smσ~vU,smmσsmσ|+|smσvU,smmσsmσ~|]|tnσcν,tnσ, (23)

Therefore, the effective projector potentials v^Uσ can be expressed by
on-site

v^Uσ=smm|smσ~vU,smmσsmσ~|, (24)

full

v^Uσ=smm|smσvU,smmσsmσ|, (25)

dual

v^Uσ=12smm[|smσ~vU,smmσsmσ|+|smσvU,smmσsmσ~|]. (26)

It is clear that the effective potentials of on-site and full are Hermitian. Also, it is verified that the effective potential of dual is Hermitian as follows:

tnσ|v^Uσ|tnσ = 12mvU,tnmσStm,tn+12mStn,tmvU,tmnσ, (27)
= tnσ|v^Uσ|tnσ. (28)

It should be noted that in the full and dual the vUσ of the site i can affect the different sites by the projector potentials by Eqs. (25) and (26) because of the overlap.

4 Force on atom

The derivative of the total energy with respect to atomic coordinates τk consists of two contributions:

ELDA+Uτk = ELDAτk+EUτk. (29)

The first term can be evaluated in the same way as in the LDA. The second term is given by

EUτk = σ,smmEUnsmmσnsmmστk, (30)
= σ,smmvU,smmσnsmmστk
= σ,νtn,tn{cν,tnσ,*τktnσ|v^Uσ|tnσcν,tnσ+cν,tnσ,*tnσ|v^Uσ|tnσcν,tnστk+cν,tnσ,*cν,tnσtnσ|v^Uσ|tnστk}.

Considering Hcν=ενScν and cSc=I, the first and second terms in Eq. (30) can be transformed into derivatives of the overlap matrix. The third term in Eq. (30) is analytically differentiated, since it contains just two-center integrals.

5 Enhancement of orbital polarization

The LDA+U functional can possess multiple stationary points due to the degree of freedom in the configuration space of occupation ratio for degenerate orbitals. If electrons are occupied with a nearly same occupancy ratio in degenerate orbitals at the first stage of SCF steps, the final electronic state often converges a stationary minimum with non-orbital polarization after the SCF iteration. Also, it is often likely that electrons are disproportionately occupied in some of degenerate orbitals due to the exchange interaction, which is so-called ’orbital polarization’. As an example of the multiple minima, we can point out a cobalt oxide (CoO) bulk in which d-orbitals of the cobalt atom are split to t2g and eg states, and the five of seven d-electrons are occupied in t2g and eg states of the majority spin, and remaining two d-electrons are occupied in the t2g state of the minority spin. Then, it depends on the initial occupancy ratios for the t2g states of the minority spin how the remaining two d-electrons are occupied in three t2g states. If the initial occupancy ratios are uniform, we may arrive at the non-orbital polarized state. In fact, unless any special treatment is considered for the initial occupancy ratios, we see the non-orbital polarized state of the CoO bulk. In order to explore the degree of freedom for the orbital occupation, therefore, it is needed to develop a general method which explicitly induces the orbital polarization. To induce the orbital polarization, a polarized redistribution scheme is proposed as follows:

diagonalize dsσ=VnsσV  dsσ:ascending order (31)
summation D=m=12l+1dsmσ (32)
redistribution d2l+1=1, (33)
d2l=1,
,
dm=D-(2l+1-m),
dm-1=0,.
where  D=mdm (34)
backtrasform nsσ=VdmσV (35)

After diagonalizing each subshell matrix consisting of occupation numbers, we introduce a polarized redistribution scheme given by Eq. (33) while keeping Eq. (34). Then, by a back transformation Eq. (35), we can obtain a polarized occupation matrix for each subshell. This polarized redistribution scheme is applied during the first few SCF steps, and then no modification is made during subsequent SCF steps. This proposed scheme maybe applicable to a general case: any crystal field, any number of electrons in the subshell, and any orbitals: p,d,f,…

6 Orbital optimization within LDA+U

In the orbital optimization within LDA+U, let us assume that the effective U-potential in the LDA+U method is applied to the primitive basis orbital χ instead of the optimized basis orbital ϕ, which is more natural in a physical sense than the opposite assumption.

A Kohn-Sham (KS) orbital ψμ in the orbital optimization method is expressed by a linear combination of primitive orbitals χ:

|ψνσ = iαcμ,iασ|ϕiα, (36)
= iαcμ,iασ{qaiαq|χiη},
= iαqcμ,iασaiαq|χiη,
= iη{pcμ,iασaiαq}|χiη,
= iηbμ,iησ|χiη,

where α(plm), η(qlm), c and b are LCAO coefficients for contracted and primitive orbitals, respectively, and a contraction coefficients. For simplicity we consider an non-Bloch expression of the one-particle wave functions, but the extention of the below description to Bloch wave functions is straightforward. Assuming that the occupation number operators defined by Eqs. (6)-(8) are constructed by the primitive orbitals, we have the occupation numbers for the on-site, full, and dual given by
on-site

nsmmσ=ϱsm,smσ, (37)

full

nsmmσ=tn,tnϱtn,tnσStn,smSsm,tn, (38)

dual

nsmmσ=12tn{Ssm,tnϱtn,smσ+ϱsm,tnσStn,sm}, (39)

where ϱσ is the primitive density matrix defined by

ϱsm,smσ = νfνψνσ|ϱ^sm,smσ|ψνσ, (40)
= νfνbsmσ,*bsmσ

with a primitive density operator:

ϱ^sm,smσ = |χ~smσχ~smσ|. (41)

Moreover, by defining a contracted density operator:

ρ^sm,smσ = |ϕ~smσϕ~smσ|, (42)

we have the contracted density matrix ρσ given by

ρsm,smσ = νfνψνσ|ρ^sm,smσ|ψνσ, (43)
= νfνcsmσ,*csmσ.

Then, the primitive density matrix ϱ is written by the contracted density matrix ρ as follows:

ϱiqlm,iqlmσ = p,paiplmqaiplmqρiplm,iplmσ. (44)

Considering the variation of the total energy Eq. (1) with respect to b, we find the effective potentials of the LDA+U method with respect to the primitive basis orbital. They are given by the same expression as Eqs. (24)-(26), while the occupation number is given by Eqs. (37)-(39). After the Hamiltonian matrix with respect to the primitive basis orbital χ is constructed, it is transformed to that of the optimized basis orbital ϕ as follows:

ϕiplm|H^|ϕiplm = q,qaiplmqaiplmqχiqlm|H^|χiqlm. (45)

The Hamiltonian matrix with respect to the contracted basis orbital is diagonalized. The procedure is summarized as follows:

  1. 1.

    diagonalize the contracted Hamiltonian ϕiplm|H^|ϕiplm

  2. 2.

    calculate the contracted density matrix by Eq. (43)

  3. 3.

    calculate the primitive density matrix by Eq. (44)

  4. 4.

    calculate the occupation number by Eq. (37), (38), or (39)

  5. 5.

    construct the Hamitonian by Eq. (24), (25), or (26)

  6. 6.

    contract the Hamitonian by Eq. (45)

  7. 7.

    return 1

Although the optimization procedure of the contracted coefficients a is not discussed here, it can be easily verified that the same procedure as in the LDA method is derived. Thus, the orbital optimization can be performed within the LDA+U method as well as the LDA method.

References

  • [1] M. J. Han, T. Ozaki, and J. Yu, Phys. Rev. B 73, 045110 (2006).
  • [2] S. L. Dudarev, G. A. Botton, S. Y. Savrasov, C. J. Humphreys, and A. P. Sutton, Phys. Rev. B 57, 1505 (1998).
  • [3] H. Eschrig, K. Koepernik, and I. Chaplygin, J. Solid State Chem. 176, 482 (2003).
  • [4] W. E. Pickett, SC. Erwin, E. C. Ethridge, Phy. Rev. B 58, 1201 (1998).