New computational methods enable screening of 100-billion-molecule libraries for drug discovery
Researchers developed CombiDOCK and MINT-Dock, two computational frameworks that can exhaustively screen libraries of over 100 billion drug candidate molecules in weeks rather than decades. The methods combine traditional molecular docking with generative AI and Monte Carlo Tree Search to accelerate the discovery of promising drug compounds. This advancement addresses a major bottleneck in drug discovery by making it computationally feasible to search vast chemical spaces that were previously impractical to explore.
Researchers at bioRxiv have presented two complementary computational approaches to screen extremely large libraries of potential drug molecules. CombiDOCK, a combinatorial docking framework, can exhaustively screen 100 billion compounds in approximately 40 days using standard computing resources, compared to the 50+ years required by conventional methods. MINT-Dock, a generative framework that integrates CombiDOCK with Monte Carlo Tree Search, further accelerates the search by intelligently navigating the chemical space. When benchmarked against 46 diverse drug targets, CombiDOCK matched the accuracy of traditional full-molecule docking, while MINT-Dock achieved a 4,800-fold enrichment over random selection. In prospective validation studies, the methods identified higher hit rates and more potent ligands compared to prior billion-scale screening campaigns, with some predictions confirmed by cryo-EM structural analysis.
What's missing
The preprint does not discuss potential limitations of the docking predictions for certain protein classes or chemical scaffolds, computational resource requirements for researchers without access to high-performance computing clusters, or timeline for validation of additional prospective predictions beyond those mentioned.
What different sources said
- bioRxivCenter
Combinatorial docking and molecular generation to navigate over 100-billion molecules for prospective ligand discovery
Related
Study reveals spermatogonial stem cell clones don't follow random drift patterns in zebrafish
Researchers used CRISPR barcoding to track how spermatogonial stem cells (SSCs) contribute to sperm production across a zebrafish's lifetime, finding that clonal dynamics deviate significantly from neutral drift models. The study developed a mathematical framework showing that larger clones tend to drift at higher rates, suggesting non-random selection pressures. These findings have implications for understanding allele transmission and male fertility across the reproductive lifespan.
Researchers identify synthetic lethal TYMS inhibitor effective against ATRX-deficient cells
Scientists developed a phenotype-first screening approach that identified PP12, a covalent fragment that selectively kills ATRX-deficient cells by inhibiting thymidylate synthase (TYMS). The study combines covalent fragment screening, chemoproteomics, and genetic analysis to link drug phenotypes to specific cellular targets. This work establishes a generalizable methodology for discovering synthetic lethal drug candidates in cancer cells with specific genetic deficiencies.
Study Reveals How Cells Maintain Balance During Fatty Acid Overload Through Membrane Sensing
Researchers identified adaptive cellular pathways that help hepatocytes maintain homeostasis when overwhelmed with fatty acids, even when their normal triglyceride storage capacity is exceeded. The response involves two coordinated mechanisms: activating fatty acid breakdown while simultaneously suppressing fatty acid synthesis. The findings suggest that endoplasmic reticulum membrane fluidity acts as a sensor that regulates these metabolic processes, with potential implications for understanding metabolic diseases.