- The paper's main contribution is the development of the AdsorbML algorithm that integrates ML relaxations with DFT evaluations, achieving significant computational speedups.
- Methodologically, the study leverages a new, densely sampled dataset (OC20-Dense) and several graph neural networks to predict energies and forces, with DFT benchmarks ensuring accuracy.
- The results demonstrate that combining ML with selective DFT verification can accelerate catalyst screening by over 2200x while maintaining accuracy sufficient for practical use.
Evaluation of AdsorbML for Efficient Adsorption Energy Calculations
The paper "AdsorbML: A Leap in Efficiency for Adsorption Energy Calculations using Generalizable Machine Learning Potentials" presents a comprehensive paper on using ML to accelerate the calculation of adsorption energies in heterogeneous catalyst design. This paper harnesses the computational power of ML to efficiently screen vast material spaces, addressing the traditionally expensive and time-consuming Density Functional Theory (DFT)-based methods. The authors propose and evaluate a hybrid approach, combining ML potentials with DFT, demonstrating notable computational savings while maintaining accuracy.
Methodology and Approach
The paper introduces a new dataset, OC20-Dense, which expands on the Open Catalyst 2020 (OC20) dataset by providing a much denser sampling of adsorbate-surface configurations. This dataset serves as a rigorous benchmark for evaluating how well ML potentials predict adsorption energies. The researchers propose the AdsorbML algorithm, which runs ML relaxations across a large set of candidate configurations and then uses DFT evaluations on the most promising candidates to identify the lowest-energy configuration.
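To make the workflow concrete, the following is a minimal Python sketch of the rank-then-verify loop described above. It is an illustration of the idea, not the authors' released implementation: the callables `ml_relax` and `dft_single_point`, the function name `adsorbml_energy`, and the default top-k budget are placeholder names chosen here for clarity.

```python
from typing import Any, Callable, Iterable, Tuple

def adsorbml_energy(
    configs: Iterable[Any],
    ml_relax: Callable[[Any], Tuple[Any, float]],
    dft_single_point: Callable[[Any], float],
    k: int = 5,
) -> float:
    """Return the lowest DFT-verified energy among the k best ML candidates."""
    # Relax every enumerated adsorbate-surface configuration with the ML
    # potential; each call returns (relaxed_structure, ml_energy).
    ml_results = [ml_relax(config) for config in configs]

    # Rank candidates by their ML-predicted energy and keep only the k best.
    top_k = sorted(ml_results, key=lambda pair: pair[1])[:k]

    # Verify only those k structures with DFT (single points here; a full
    # DFT relaxation from each structure could be substituted instead).
    dft_energies = [dft_single_point(structure) for structure, _ in top_k]

    # The lowest verified energy is taken as the predicted adsorption energy.
    return min(dft_energies)
```

The key design point is that expensive DFT is spent only on the k configurations the ML model already considers most promising, rather than on the full enumeration.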
Two verification strategies are explored: DFT single-point evaluations on ML-relaxed structures (ML+SP) and full DFT relaxations started from ML-relaxed structures (ML+RX). The authors test several graph neural networks (GNNs), such as SchNet, DimeNet++, and GemNet-OC, trained on the OC20 dataset to predict energies and forces, enabling efficient relaxation of atomic structures.
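As an illustration of the ML-relaxation step, the snippet below drives an ASE structural relaxation with a trained ML potential. The import `my_ml_package`, the `MLCalculator` class, the checkpoint and trajectory filenames, and the convergence settings are hypothetical placeholders for whichever ASE-compatible potential and files a user actually has; only the ASE calls (`read`, `LBFGS`, `get_potential_energy`) are standard.

```python
from ase.io import read
from ase.optimize import LBFGS

# `MLCalculator`, its import path, and the checkpoint filename are
# hypothetical stand-ins for an ASE-compatible ML potential
# (e.g., a GemNet-OC or eSCN checkpoint wrapped as a calculator).
from my_ml_package import MLCalculator

atoms = read("adsorbate_slab_initial.traj")            # one enumerated configuration
atoms.calc = MLCalculator("gemnet_oc_checkpoint.pt")   # hypothetical checkpoint

# ML-driven relaxation: forces come from the GNN, geometry updates from LBFGS.
opt = LBFGS(atoms, trajectory="ml_relaxed.traj")
opt.run(fmax=0.02, steps=300)   # force threshold in eV/Å; values are illustrative

ml_energy = atoms.get_potential_energy()
# ML+SP: recompute the energy of this final geometry with one DFT single point.
# ML+RX: start a full DFT relaxation from this geometry instead.
```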
Results and Performance
The evaluation highlights the significant computational advantages of the AdsorbML approach. For instance, the best-performing model, eSCN-MD-Large, achieved a success rate of 87.77% with a more than 2200x speedup compared to exhaustive DFT-only calculations. The paper notes that ML+SP delivers the largest speedups at a slight cost in accuracy, while ML+RX improves accuracy at higher computational cost.
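For reference, success here means the ML-guided search recovers an adsorption energy at or below the best exhaustively computed DFT energy, up to a small tolerance. The helpers below sketch that comparison; the 0.1 eV default tolerance and the function names are assumptions made for illustration and should be checked against the paper's exact definition.

```python
def is_success(ml_best_energy: float, dft_best_energy: float, tol: float = 0.1) -> bool:
    # A system counts as a success if the ML-guided search recovers an energy
    # within `tol` eV of, or below, the best exhaustively computed DFT energy.
    return ml_best_energy <= dft_best_energy + tol

def success_rate(results) -> float:
    # `results` is an iterable of (ml_best_energy, dft_best_energy) pairs,
    # one per adsorbate-surface system in the benchmark.
    hits = [is_success(ml_e, dft_e) for ml_e, dft_e in results]
    return 100.0 * sum(hits) / len(hits)
```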
The research emphasizes rigorous validation of ML predictions using DFT single-point calculations to ensure reliability. This hybrid verification guards against spuriously low energy estimates that can arise from ML inaccuracies.
Implications and Future Directions
The implications of this work are significant for the field of computational catalysis. By drastically reducing the computational resources required to predict adsorption energies, the AdsorbML framework can accelerate the discovery of efficient catalysts. This approach is particularly beneficial for scenarios demanding high-throughput screening, such as evaluating materials for CO₂ reduction reactions.
The paper opens pathways for further optimization of ML models to improve inference speed and accuracy, pointing toward future refinements in GNN architectures and training methodologies. Moreover, the authors suggest exploring global optimization strategies that could replace brute-force enumeration in the search for optimal adsorbate-surface configurations.
Conclusion
This paper demonstrates an effective synergy between machine learning and first-principles methods, offering an efficient pathway for catalytic materials discovery. The proposed methodologies show how modern ML potentials can address longstanding computational challenges in heterogeneous catalysis. Future work could extend these strategies to more diverse chemical systems and refine ML architectures specialized for these applications.