AI System Achieves Automated Molecular Structure Determination from NMR Spectroscopy
Researchers developed a deep learning framework that can determine molecular structures from one-dimensional NMR spectra for molecules up to 40 non-hydrogen atoms, a task previously considered intractable due to the combinatorial explosion of possible structures. The system uses a transformer-based architecture inspired by natural language processing and achieves 60.4% accuracy in identifying the correct molecule within its top 15 predictions. This advancement could significantly accelerate drug discovery and chemical analysis by automating a time-consuming step in structure elucidation.
A new deep learning approach demonstrates that automated de novo structure generation from NMR spectroscopy is feasible for molecules spanning the drug-like chemical space. The researchers' transformer-based model successfully handles molecules with up to 40 non-hydrogen atoms across diverse elements (C, N, O, H, P, S, Si, B, and halogens), overcoming the combinatorial challenge where possible structures for 36-atom molecules range from 10^20 to 10^60. Using only ¹H and ¹³C NMR spectra as input, the system predicts the correct structure with 60.4% accuracy within its top 15 predictions. The architecture leverages insights from natural language processing, treating molecular structure prediction similarly to sequence-to-sequence translation tasks. The authors note the model is extensible to experimental data through fine-tuning, suggesting practical applicability beyond training datasets.
Limitations & open questions
The study does not discuss computational requirements (training time, hardware needs), comparison with existing automated structure elucidation methods, or performance on experimentally-derived spectra versus synthetic training data. The limitations of the 60.4% top-15 accuracy metric for practical chemistry workflows and failure modes on ambiguous or novel molecular classes are not detailed.
What different sources said
- arXiv cs.LGCenter
Pushing the limits of one-dimensional NMR spectroscopy for automated structure elucidation using artificial intelligence
Related
Study Identifies D-Retro-Inverso Peptides as Potential Treatments for Cardiac Amyloidosis
Researchers designed four peptide candidates using D-amino acids to inhibit Serum Amyloid A (SAA) fibril formation, which may contribute to complications following heart attacks. The study, conducted using molecular dynamics simulations in a mouse model, identified two peptides—DRI-R5S and DRI-H6A—as promising drug candidates. This work could lead to new therapeutic approaches for cardiac amyloidosis, a serious post-infarction complication.
Scientists Discover Previously Unknown Branch of Tryptophan Metabolism in Humans
Researchers identified a new enzymatic step in human tryptophan catabolism, showing that the protein ASPDH acts as a 2-aminomuconate reductase to produce a previously unknown amino acid. This discovery fills a gap in the kynurenine pathway, one of the body's major metabolic routes for processing the amino acid tryptophan. The finding expands understanding of human metabolism and may have implications for understanding NAD cofactor production and related metabolic diseases.

Japanese researchers find Adelie penguins use group behavior to locate new foraging grounds
A Japanese research team discovered that Adelie penguins in Antarctica follow colony members to find new feeding areas after unsuccessful foraging attempts. The study analyzed data from 116 penguins fitted with activity recorders in Lutzow-Holm Bay and was published in Proceedings of the Royal Society B. The findings demonstrate how group living provides foraging advantages and may help explain the evolution of social behavior in animals.