An Equivariant Pretrained Transformer for Unified 3D Molecular Representation Learning (2402.12714v2)
Abstract: Pretraining on large numbers of unlabeled 3D molecules has shown superiority in various scientific applications. However, prior efforts typically focus on pretraining models within a single domain, either proteins or small molecules, missing the opportunity to leverage cross-domain knowledge. To bridge this gap, we introduce the Equivariant Pretrained Transformer (EPT), an all-atom foundation model that can be pretrained on 3D molecules from multiple domains. Built upon an E(3)-equivariant transformer, EPT not only processes atom-level information but also incorporates block-level features (e.g., residues in proteins). Additionally, we employ a block-level denoising task, rather than conventional atom-level denoising, as the pretraining objective. To pretrain EPT, we construct a large-scale dataset of 5.89M entries, comprising small molecules, proteins, protein-protein complexes, and protein-molecule complexes. Experimental evaluations on downstream tasks, including ligand binding affinity prediction, protein property prediction, and molecular property prediction, show that EPT significantly outperforms previous state-of-the-art methods on the first task and achieves competitive performance on the remaining two. Furthermore, we demonstrate the potential of EPT in identifying small-molecule drug candidates targeting the 3CL protease, a critical target in the replication of SARS-CoV-2. Among 1,978 FDA-approved drugs, EPT ranks 7 of the 8 known anti-COVID-19 drugs within the top 200, indicating high recall. Using Molecular Dynamics (MD) simulations, EPT further discovers 7 novel compounds whose binding affinities are higher than that of the top-ranked known anti-COVID-19 drug, showcasing its powerful capabilities in drug discovery.
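To make the block-level denoising idea concrete, below is a minimal PyTorch sketch of one plausible formulation: instead of adding independent Gaussian noise to every atom, each block (e.g., a residue) receives a single rigid perturbation, a small rotation about its centroid plus a translation, and a denoiser would be trained to recover that per-block noise. The function names (`block_level_noise`, `skew`), the rigid-perturbation parameterization, and the noise scales `sigma_t`/`sigma_r` are illustrative assumptions, not the paper's actual implementation.

```python
import torch


def skew(v):
    """Map (B, 3) axis-angle vectors to (B, 3, 3) skew-symmetric matrices."""
    zero = torch.zeros_like(v[:, 0])
    return torch.stack([
        torch.stack([zero, -v[:, 2], v[:, 1]], dim=-1),
        torch.stack([v[:, 2], zero, -v[:, 0]], dim=-1),
        torch.stack([-v[:, 1], v[:, 0], zero], dim=-1),
    ], dim=-2)


def block_level_noise(coords, block_ids, sigma_t=0.2, sigma_r=0.1):
    """Apply one rigid perturbation per block and return the noisy
    coordinates plus the per-block noise a denoiser would predict.

    coords:    (N, 3) atom coordinates
    block_ids: (N,)   integer block index per atom (e.g., residue id)
    """
    num_blocks = int(block_ids.max()) + 1
    # Per-block centroids via scatter-add followed by normalization.
    counts = torch.zeros(num_blocks).index_add_(0, block_ids, torch.ones(len(coords)))
    centroids = torch.zeros(num_blocks, 3).index_add_(0, block_ids, coords)
    centroids = centroids / counts[:, None]
    # Sample rigid noise per block: a translation and a small axis-angle rotation.
    t = sigma_t * torch.randn(num_blocks, 3)
    w = sigma_r * torch.randn(num_blocks, 3)
    R = torch.linalg.matrix_exp(skew(w))  # (B, 3, 3) rotation matrices
    # Rotate each atom about its block centroid, then translate the whole block.
    centered = coords - centroids[block_ids]
    noisy = torch.einsum('nij,nj->ni', R[block_ids], centered)
    noisy = noisy + centroids[block_ids] + t[block_ids]
    return noisy, t, w


# Usage: 10 atoms grouped into 3 blocks; a denoising loss would regress (t, w)
# (or the induced atomic displacements) from the noisy structure.
coords = torch.randn(10, 3)
block_ids = torch.tensor([0, 0, 0, 0, 1, 1, 1, 2, 2, 2])
noisy, t, w = block_level_noise(coords, block_ids)
```

One appeal of this formulation is that it preserves local block geometry (bond lengths and angles within a residue stay intact), so the denoising target emphasizes inter-block arrangement rather than chemically implausible per-atom jitter; whether EPT uses exactly this rigid parameterization is an assumption here.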