Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Uni-Mol Docking V2: Towards Realistic and Accurate Binding Pose Prediction (2405.11769v1)

Published 20 May 2024 in q-bio.BM, cs.LG, and physics.bio-ph

Abstract: In recent years, ML methods have emerged as promising alternatives for molecular docking, offering the potential for high accuracy without incurring prohibitive computational costs. However, recent studies have indicated that these ML models may overfit to quantitative metrics while neglecting the physical constraints inherent in the problem. In this work, we present Uni-Mol Docking V2, which demonstrates a remarkable improvement in performance, accurately predicting the binding poses of 77+% of ligands in the PoseBusters benchmark with an RMSD value of less than 2.0 {\AA}, and 75+% passing all quality checks. This represents a significant increase from the 62% achieved by the previous Uni-Mol Docking model. Notably, our Uni-Mol Docking approach generates chemically accurate predictions, circumventing issues such as chirality inversions and steric clashes that have plagued previous ML models. Furthermore, we observe enhanced performance in terms of high-quality predictions (RMSD values of less than 1.0 {\AA} and 1.5 {\AA}) and physical soundness when Uni-Mol Docking is combined with more physics-based methods like Uni-Dock. Our results represent a significant advancement in the application of artificial intelligence for scientific research, adopting a holistic approach to ligand docking that is well-suited for industrial applications in virtual screening and drug design. The code, data and service for Uni-Mol Docking are publicly available for use and further development in https://github.com/dptech-corp/Uni-Mol.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (13)
  1. Uni-mol: A universal 3d molecular representation learning framework. In The Eleventh International Conference on Learning Representations, 2023.
  2. Highly accurate quantum chemical property prediction with uni-mol+, 2023.
  3. Autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of computational chemistry, 31(2):455–461, 2010.
  4. Comparative assessment of scoring functions: the casf-2016 update. Journal of chemical information and modeling, 59(2):895–913, 2018.
  5. Posebusters: Ai-based docking methods fail to generate physically valid poses or generalise to novel sequences. Chemical Science, 2024.
  6. Generalized biomolecular modeling and design with rosettafold all-atom. bioRxiv, 2023.
  7. Anonymous. Diffdock-pocket: Diffusion for pocket-level docking with sidechain flexibility. In Submitted to The Twelfth International Conference on Learning Representations, 2023. under review.
  8. Structure prediction of protein-ligand complexes from sequence information with umol. bioRxiv, pages 2023–11, 2023.
  9. Isomorphic Labs Team and Google DeepMind AlphaFold Team. A glimpse of the next generation of alphafold.
  10. UMD-fit: Generating realistic ligand conformations for distance-based deep docking models. In NeurIPS 2023 Generative AI and Biology (GenBio) Workshop, 2023.
  11. Binding moad (mother of all databases). Proteins: Structure, Function, and Bioinformatics, 60(3):333–340, 2005.
  12. Synergistic application of molecular docking and machine learning for improved protein-ligand binding pose prediction. ChemRxiv, 2023.
  13. Posebusters: Ai-based docking methods fail to generate physically valid poses or generalise to novel sequences, 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Eric Alcaide (8 papers)
  2. Zhifeng Gao (37 papers)
  3. Guolin Ke (43 papers)
  4. Yaqi Li (18 papers)
  5. Linfeng Zhang (160 papers)
  6. Hang Zheng (42 papers)
  7. Gengmo Zhou (4 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.