Publications, Jun 12, 78% confidence

PeptiDIA: Machine Learning Framework Improves Peptide Identification in Fast-Gradient Proteomics

Center 100%

1 source

Researchers have developed PeptiDIA, a machine learning framework that improves peptide identification in fast-gradient data-independent acquisition (DIA) proteomics without changing how samples are acquired. The tool trains gradient-boosted decision tree models on paired fast- and long-gradient data from the same samples, then applies those models to recover peptides missed in the faster, lower-depth runs. This addresses a longstanding trade-off between throughput and depth in proteomics, and could let high-throughput studies reach identification depths closer to those of slower, more resource-intensive workflows.

PeptiDIA is a machine learning framework designed to close the analytical gap between fast-gradient and long-gradient data-independent acquisition (DIA) mass spectrometry in proteomics. Fast chromatographic gradients allow higher sample throughput but typically identify fewer peptides than longer gradients. PeptiDIA addresses this by training gradient-boosted decision tree models on paired acquisitions from identical samples, using long-gradient identifications as reference labels. The framework takes DIA-NN output generated at relaxed false discovery rate (FDR) thresholds to expand the candidate peptide pool, combines DIA-NN scoring features with engineered peptide descriptors, and applies isotonic regression to calibrate confidence probabilities. Tested on human and murine datasets across six tissue types acquired on an Orbitrap Exploris 480 mass spectrometer, PeptiDIA increased peptide identifications by 25–34% at a 1% target reference-discordance rate (RDR) and expanded the number of protein groups with at least one recovered peptide by 15–17%. These gains are purely computational and require no changes to experimental acquisition strategies. The tool is publicly available on GitHub as both a web application and a command-line tool, lowering the barrier to adoption for proteomics laboratories.
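The workflow described above (train on paired runs, calibrate, then accept candidates at a target discordance rate) can be illustrated with a minimal Python sketch. This is not PeptiDIA's code: scikit-learn's GradientBoostingClassifier stands in for whichever gradient-boosting library the authors use, the features are synthetic placeholders rather than DIA-NN scores or peptide descriptors, all function names are hypothetical, and the discordance rate is assumed here to mean the fraction of accepted fast-gradient candidates that the long-gradient reference does not confirm.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.isotonic import IsotonicRegression
from sklearn.model_selection import train_test_split


def make_synthetic_candidates(n=4000, seed=0):
    """Hypothetical stand-in for fast-gradient candidates kept at a relaxed FDR."""
    rng = np.random.default_rng(seed)
    # Label 1 means the candidate was also identified in the paired long-gradient run.
    y = rng.integers(0, 2, size=n)
    # Three made-up features; reference-confirmed candidates score higher on average.
    X = rng.normal(size=(n, 3)) + y[:, None] * np.array([1.5, 1.0, 0.5])
    return X, y


def fit_calibrated_model(X, y, seed=0):
    """Fit boosted trees on one split, then fit isotonic calibration on the other."""
    X_fit, X_cal, y_fit, y_cal = train_test_split(
        X, y, test_size=0.3, random_state=seed, stratify=y
    )
    model = GradientBoostingClassifier(random_state=seed).fit(X_fit, y_fit)
    raw = model.predict_proba(X_cal)[:, 1]
    calibrator = IsotonicRegression(y_min=0.0, y_max=1.0, out_of_bounds="clip")
    calibrator.fit(raw, y_cal)
    return model, calibrator


def score(model, calibrator, X):
    """Calibrated probability that a candidate is confirmed by the reference."""
    return calibrator.predict(model.predict_proba(X)[:, 1])


def accept_at_discordance(probs, y_ref, target=0.01):
    """Indices of the largest top-ranked set whose discordance rate is <= target.

    Assumption: discordance = share of accepted candidates with y_ref == 0.
    """
    order = np.argsort(-probs, kind="stable")
    discordant = np.cumsum(1 - y_ref[order])
    rate = discordant / np.arange(1, len(order) + 1)
    ok = np.nonzero(rate <= target)[0]
    n_accept = ok[-1] + 1 if ok.size else 0
    return order[:n_accept]
```

In practice the acceptance threshold would be chosen on samples held out from both training and calibration, so that the reported rate is not optimistic.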

What's missing

The study is a preprint and has not yet undergone formal peer review. Key open questions include how well the framework generalizes to instruments other than the Orbitrap Exploris 480, whether paired fast- and long-gradient acquisitions from the same samples are always feasible in practice, and whether the reference-discordance rate captures false discovery behavior as fully as conventional FDR approaches do. The study also does not report computational runtime or resource requirements, both of which matter for large-scale adoption.

What different sources said

bioRxiv (Center)
PeptiDIA: A Machine Learning Framework for Enhanced Peptide Identification in Fast-Gradient Data-Independent Acquisition Proteomics

Publications

Gut Bacteria Enzyme Found to Break Down Heat-Processed Food Compounds, Producing Novel Biogenic Amines

Researchers have discovered that an enzyme in common gut bacteria can degrade N-epsilon-carboxymethyllysine (CML), a compound formed during thermal food processing, producing previously unknown biogenic amines. The enzyme, ornithine decarboxylase SpeC from enterobacteria, acts on CML and related modified lysine derivatives through a low-level 'underground' catalytic activity. This finding suggests a previously unrecognized communication axis between thermally processed dietary compounds and gut microbial physiology, with potential implications for host health.

1 source, Jun 13

Publications

Full-Length Gene Sequencing Reveals Two Distinct Bacterial Communities in Black-Legged Ticks Expanding Into Canada

Researchers used Oxford Nanopore full-length 16S rRNA gene sequencing to characterize the microbiome of Ixodes scapularis black-legged ticks collected in Nova Scotia, Canada, distinguishing between tick-adapted bacteria and environmentally acquired bacteria. The study comes as I. scapularis — the primary vector of Lyme disease — is rapidly expanding northward into Canada due to climate change. The findings suggest that environmentally derived bacteria in tick microbiomes are not mere contamination, which has implications for how tick microbiome data is collected and interpreted across surveillance studies.

1 source, Jun 13

Publications

Study Identifies Metabolic Link Between Cell Envelope Stress and Biofilm Formation in Bacteria

Researchers have discovered that the metabolite acetyl-CoA directly inhibits enzymes that degrade the bacterial signaling molecule c-di-GMP, connecting cell envelope biosynthesis stress to biofilm formation in Pseudomonas aeruginosa. The study found that sub-inhibitory concentrations of antibiotics targeting early peptidoglycan biosynthesis — but not other antibiotic classes — elevate c-di-GMP levels by reducing phosphodiesterase activity, with acetyl-CoA competing for the enzyme active site. Because the relevant enzyme domain is broadly conserved across bacterial species, this checkpoint mechanism may be widespread and could have implications for understanding antibiotic-induced biofilm responses.

1 source, Jun 13
