Mixed-precision ab initio tensor network state methods adapted for NVIDIA Blackwell technology via emulated FP64 arithmetic

Cole Brower, Samuel Rodriguez Bernabeu, Jeff Hammond, John Gunnels, Sotiris S. Xanthea, Martin Ganahl, Andor Menczer, Örs Legeza

公開日: 2025/10/6

Abstract

We report cutting-edge performance results via mixed-precision spin adapted ab initio Density Matrix Renormalization Group (DMRG) electronic structure calculations utilizing the Ozaki scheme for emulating FP64 arithmetic through the use of fixed-point compute resources. By approximating the underlying matrix and tensor algebra with operations on a modest number of fixed-point representatives (``slices''), we demonstrate on smaller benchmark systems and for the active compounds of the FeMoco and cytochrome P450 (CYP) enzymes with complete active space (CAS) sizes of up to 113 electrons in 76 orbitals [CAS(113, 76)] and 63 electrons in 58 orbitals [CAS(63, 58)], respectively, that the chemical accuracy can be reached with mixed-precision arithmetic. We also show that, due to its variational nature, DMRG provides an ideal tool to benchmark accuracy domains, as well as the performance of new hardware developments and related numerical libraries. Detailed numerical error analysis and performance assessment are also presented for subcomponents of the DMRG algebra by systematically interpolating between double- and pseudo-half-precision. Our analyis represents the first quantum chemistry evaluation of FP64 emulation for correlated calculations capable of achieving chemical accuracy and emulation based on fixed-point arithmetic, and it paves the way for the utilization of state-of-the-art Blackwell technology in tree-like tensor network state electronic structure calculations, opening new research directions in materials sciences and beyond.

Mixed-precision ab initio tensor network state methods adapted for NVIDIA Blackwell technology via emulated FP64 arithmetic | SummarXiv | SummarXiv