N-electron valence state perturbation theory
In quantum chemistry, n-electron valence state perturbation theory is a perturbative treatment applicable to multireference CASCI-type wavefunctions. It can be considered as a generalization of the well-known second-order Møller–Plesset perturbation theory to multireference Complete Active Space cases. The theory is directly integrated into many quantum chemistry packages such as MOLCAS, Molpro, DALTON and ORCA.
The research performed into the development of this theory led to various implementations. The theory here presented refers to the deployment for the Single-State NEVPT, where the perturbative correction is applied to a single electronic state.
Research implementations has been also developed for Quasi-Degenerate cases, where a set of electronic states undergo the perturbative correction at the same time, allowing interaction among themselves. The theory development makes use of the quasi-degenerate formalism by Lindgren and the Hamiltonian multipartitioning technique from Zaitsevskii and Malrieu.
Theory
Let be a zero-order CASCI wavefunction, defined as a linear combination of Slater determinantsobtained diagonalizing the true Hamiltonian inside the CASCI space
where is the projector inside the CASCI space.
It is possible to define perturber wavefunctions in NEVPT as zero-order wavefunctions of the outer space where electrons are removed from the inactive part and added to the valence part. At second order of perturbation. Decomposing the zero-order CASCI wavefunction as an antisymmetrized product of the inactive part and a valence part
then the perturber wavefunctions can be written as
The pattern of inactive orbitals involved in the procedure can be grouped as a collective index, so to represent the various perturber wavefunctions as, with an enumerator index for the different wavefunctions. The number of these functions is relative to the degree of contraction of the resulting perturbative space.
Supposing indexes and referring to core orbitals, and referring to active orbitals and and referring to virtual orbitals, the possible excitation schemes are:
- two electrons from core orbitals to virtual orbitals
- one electron from a core orbital to a virtual orbital, and one electron from a core orbital to an active orbital
- one electron from a core orbital to a virtual orbital, and one electron from an active orbital to a virtual orbital
- two electrons from core orbitals to active orbitals
- two electrons from active orbitals to virtual orbitals
- one electron from a core orbital to a virtual orbital, and an internal active-active excitation
- one electron from a core orbital to an active orbital, and an internal active-active excitation
- one electron from an active orbital to a virtual orbital, and an internal active-active excitation
Totally Uncontracted Approach
The full dimensionality of these spaces can be exploited to obtain the definition of the perturbers, by diagonalizing the Hamiltonian inside them
This procedure is impractical given its high computational cost: for each space, a diagonalization of the true Hamiltonian must be performed. Computationally, is preferable to improve the theoretical development making use of the modified Dyall's Hamiltonian. This Hamiltonian behaves like the true Hamiltonian inside the CAS space, having the same eigenvalues and eigenvectors of the true Hamiltonian projected onto the CAS space. Also, given the decomposition for the wavefunction defined before, the action of the Dyall's Hamiltonian can be partitioned into
stripping out the constant contribution of the inactive part and leaving a
subsystem to be solved for the valence part
The total energy is the sum of and the energies of the orbitals involved in the definition of the inactive part. This introduces the possibility to perform a single diagonalization of the valence Dyall's Hamiltonian on the CASCI zero-order wavefunction and evaluate the perturber energies using the property depicted above.
Strongly Contracted Approach
A different choice in the development of the NEVPT approach is to choose a single function for each space, leading to the Strongly Contracted scheme. A set of perturbative operators are used to produce a single function for each space, defined as the projection inside each space of the application of the Hamiltonian to the contracted zero order wavefunction. In other words,where is the projector onto the subspace. This can be equivalently written as the application of a specific part of the Hamiltonian to the zero-order wavefunction
For each space, appropriate operators can be devised. We will not present their definition, as it could result overkilling. Suffice to say that the resulting perturbers are not normalized, and their norm
plays an important role in the Strongly Contracted development. To evaluate these norms, the spinless density matrix of rank not higher than three between the functions are needed.
An important property of the is that any other function of the space which is orthogonal to do not interact with the zero-order wavefunction through the true Hamiltonian. It is possible to use the functions as a basis set for the expansion of the first-order correction to the wavefunction, and also for the expression of the zero-order Hamiltonian by means of a spectral decomposition
where are the normalized.
The expression for the first-order correction to the wavefunction is therefore
and for the energy is
This result still misses a definition of the perturber energies, which can be defined in a computationally advantageous approach by means of the Dyall's Hamiltonian
leading to
Developing the first term and extracting the inactive part of the Dyall's Hamiltonian it can be obtained
with equal to the sum of the orbital energies of the newly occupied virtual orbitals minus the orbital energies of the unoccupied core orbitals.
The term that still needs to be evaluated is the bracket involving the commutator. This can be obtained developing each operator and substituting. To obtain the final result it is necessary to evaluate Koopmans matrices and density matrices involving only active indexes. An interesting case is represented by the contribution for the case, which is trivial and can be demonstrated identical to the Møller–Plesset second-order contribution
NEVPT2 can therefore be seen as a generalized form of MP2 to multireference wavefunctions.
Partially Contracted Approach
An alternative approach, named Partially Contracted is to define the perturber wavefunctions in a subspace of with dimensionality higher than one. To define this subspace, a set of functions is generated by means of the operators, after decontraction of their formulation. For example, in the case of the operatorThe Partially Contracted approach makes use of functions and. These functions must be orthonormalized and purged of linear dependencies which may arise. The resulting set spans the space.
Once all the spaces have been defined, we can obtain as usual a set of perturbers from the diagonalization of the Hamiltonian inside this space
As usual, the evaluation of the Partially Contracted perturbative correction by means of the Dyall Hamiltonian involves simply manageable entities for nowadays computers.
Although the Strongly Contracted approach makes use of a perturbative space with very low flexibility, in general it provides values in very good agreement with those obtained by the more decontracted space defined for the Partially Contracted approach. This can be probably explained by the fact that the Strongly Contracted perturbers are a good average of the totally decontracted perturbative space.
The Partially Contracted evaluation has a very little overhead in computational cost with respect to the Strongly Contracted one, therefore they are normally evaluated together.
Properties
NEVPT is blessed with many important properties, making the approach very solid and reliable. These properties arise both from the theoretical approach used and on the Dyall's Hamiltonian particular structure:- Size consistency: NEVPT is size consistent. Briefly, if A and B are two non-interacting systems, the energy of the supersystem A-B is equal to the sum of the energy of A plus the energy of B taken by themselves. This property is of particular importance to obtain correctly behaving dissociation curves.
- Absence of intruder states: in perturbation theory, divergencies can occur if the energy of some perturber happens to be nearly equal to the energy of the zero-order wavefunction. This situation, which is due to the presence of an energy difference at the denominator, can be avoided if the energies associated to the perturbers are guaranteed to be never nearly equal to the zero-order energy. NEVPT satisfies this requirement.
- Invariance under active orbital rotation: The NEVPT results are stable if an intraclass active-active orbital mixing occurs. This arises both from the structure of the Dyall Hamiltonian and the properties of a CASSCF wavefunction. This property has been also extended to the intraclass core-core and virtual-virtual mixing, thanks to the Non Canonical NEVPT approach, allowing to apply a NEVPT evaluation without performing an orbital canonization
- Spin purity is guaranteed: The resulting wave functions are guaranteed to be spin pure, due to the spin-free formalism.
- Efficiency: although not a formal theoretical property, computational efficiency is highly important for the evaluation on medium-size molecular systems. The current limit of the NEVPT application is largely dependent on the feasibility of the previous CASSCF evaluation, which scales factorially with respect to the active space size. The NEVPT implementation using the Dyall's Hamiltonian involves the evaluation of Koopmans' matrices and density matrices up to the four-particle density matrix spanning only active orbitals. This is particularly convenient, given the small size of currently used active spaces.
- Partitioning into additive classes: The perturbative correction to the energy is additive on eight different contributions. Although the evaluation of each contribution has a different computational cost, this fact can be used to improve performance, by parallelizing each contribution to a different processor.