Graph-Based Machine Learning Models Improve Prediction of Drug-Induced Liver Injury and Cellular Toxicity
Researchers developed a QSAR modeling pipeline using graph neural networks and graph transformers to predict chemical toxicity, including caspase-3/7 activation, mitochondrial membrane potential disruption, and FDA-defined drug-induced liver injury (DILI). The pipeline benchmarked classical machine learning against advanced graph-based models across multiple PubChem assay datasets. The work advances computational toxicology by improving DILI prediction accuracy and identifying specific chemical substructures linked to apoptosis initiation.
A team of researchers published a preprint on bioRxiv describing a comprehensive quantitative structure-activity relationship (QSAR) modeling pipeline designed to predict in vitro and in vivo chemical toxicity. The pipeline integrates data preprocessing, molecular fingerprints, molecular graph representations, and a range of machine learning models—including classical approaches, graph neural networks (GNNs), and graph transformers (GTs)—benchmarked against PubChem assay data for caspase-3/7 activation and mitochondrial membrane potential (MMP) disruption. For FDA Drug-Induced Liver Injury (DILI) prediction, the full consensus model achieved an AUC of 0.69 and Graphormer achieved an F1 score of 0.79, both substantially surpassing the previous best reported AUC of 0.63 and F1 of 0.65. The study found that graph-based models excelled when active compound counts were large, while classical models and GTs performed better on highly imbalanced datasets with few active compounds. Mechanistic analysis identified phenolic compounds with a para-hydroxyphenyl motif and lipophilic long-chain alkyl compounds as capable of collapsing mitochondrial membrane potential and activating caspases-3/7. Cell-line-specific structural motifs were also identified, including 1,1-dichloroethane and chlorobenzene for HEK293 cells, an epoxide fragment for SK-N-SH neuroblastoma cells, and tetramethylcyclohexene and acetaldehyde fragments for H-4-II-E rat hepatoma cells. The authors plan to incorporate biological activity data, toxicity literature, large language models, and agentic AI to further refine the framework.
What's missing
As a bioRxiv preprint, this work has not yet undergone formal peer review, so findings should be interpreted with caution. The study does not report external prospective validation of the DILI model on held-out real-world compounds beyond the FDA gold standard dataset, and the generalizability of identified structural motifs to broader chemical spaces remains untested.
What different sources said
- bioRxivCenter
A Graph-based QSAR Modeling Pipeline for Predicting In vitro PubChem Assays and In vivo Human Hepatotoxicity: Mechanistic Analysis of Caspase-3/7 Activation
Related
Multiscale Brain Model Predicts Novel Propofol Anesthesia Biomarker Without Training on Clinical Data
Researchers developed a mechanistic computational model of thalamocortical brain circuits that successfully predicted a previously unnoticed dose-dependent biomarker of propofol anesthesia. The model, driven solely by GABA-A receptor modulation, reproduced empirical data from both macaques and humans without being fitted to any anesthesia-specific data. The findings suggest that simulation-first approaches could accelerate biomarker discovery in neuropharmacology without requiring large clinical datasets.
Green-Synthesized Zinc Oxide Nanoparticles from Mimosa pudica Show Biocompatibility with Bone Marrow Stem Cells in Lab Study
Researchers synthesized zinc oxide nanoparticles using Mimosa pudica leaf extract and tested their effects on human bone marrow mesenchymal stromal cells, finding the nanoparticles preserved cell viability, structure, and bone-forming capacity. The plant-derived nanoparticles outperformed both the raw plant extract and conventionally synthesized zinc oxide in maintaining cell metabolic activity over five days. The findings suggest these bioactive nanomaterials could be candidates for musculoskeletal tissue engineering, though the research remains at an early in vitro stage.
Study Compares Genetic Modeling Approaches for Dyadic Social Interactions in Animals
A new preprint study compared two statistical modeling approaches for analyzing the genetic basis of social interactions in animals, finding that dyadic models outperform marginal models that aggregate individual-level data. The research used pig aggression data from 797 finishing pigs across 59 social groups as a test case. The findings have implications for how animal geneticists model and interpret the heritable components of social behavior.