Publications | Jun 12 | 83% confidence

TAB-PO: New Method Improves AI Model Performance on Structured Data Tasks

1 source

Researchers have introduced Token-Adaptive Barrier Preference Optimization (TAB-PO), a training objective designed to improve large language models' ability to generate structured outputs, such as JSON, that conform to predefined schemas. Standard Direct Preference Optimization (DPO) struggles with structured generation because preferred and rejected outputs often differ in only a few critical tokens, so most of the gradient signal is spent on formatting tokens that do not distinguish the two. TAB-PO addresses this by selectively anchoring learning to low-confidence, schema-critical tokens. The authors report an average 11.59% improvement over supervised fine-tuning (SFT) and a 14.71% margin over frontier models on a scientific information extraction benchmark.
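To make the "few critical tokens" point concrete, the following is a minimal sketch, not taken from the paper. It assumes a toy whitespace tokenizer and a hypothetical relation record (the field names and the labels USED-FOR and PART-OF are illustrative), and it simply counts how many token positions differ between a preferred and a rejected JSON output.

```python
import json


def toy_tokenize(obj):
    """Crude stand-in for a real tokenizer: split serialized JSON on whitespace."""
    return json.dumps(obj).split()


def differing_positions(preferred_tokens, rejected_tokens):
    """Return the indices at which two equal-length token lists disagree."""
    if len(preferred_tokens) != len(rejected_tokens):
        raise ValueError("this sketch only compares equal-length sequences")
    return [
        i
        for i, (p, r) in enumerate(zip(preferred_tokens, rejected_tokens))
        if p != r
    ]


# Hypothetical preference pair: same structure, one wrong relation label.
preferred = {"head": "BERT", "relation": "USED-FOR", "tail": "parsing"}
rejected = dict(preferred, relation="PART-OF")

pref_tokens = toy_tokenize(preferred)
rej_tokens = toy_tokenize(rejected)
diff = differing_positions(pref_tokens, rej_tokens)

# Only the label token separates the two outputs; every other token is
# shared serialization (keys, quotes, braces).
print(f"{len(diff)} of {len(pref_tokens)} tokens differ, at positions {diff}")
```

With a real subword tokenizer the counts would differ, but the pattern is the same: a sequence-level preference signal is spread over many tokens that are identical in both outputs.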

The paper identifies two core failure modes of applying DPO to ontology-driven structured prediction: gradient dilution, where learning signal is spread across non-critical serialization tokens, and token erosion, where the likelihood of rare but important schema tokens is inadvertently reduced. To construct better training data, the authors develop a confusion-aware preference-construction strategy that combines expert-curated ambiguity patterns with empirically observed structured errors from validation-set predictions, generating minimally perturbed, schema-valid negative examples. TAB-PO then introduces a confidence-gated token-level barrier during post-SFT training that applies supervised anchoring specifically to under-confident schema tokens.

Experiments on the public SciERC scientific information extraction benchmark, using Llama and Qwen models ranging from 1.5B to 70B parameters, show TAB-PO winning 100% of head-to-head comparisons against the strongest token-level and sequence-level DPO variants on ontology-critical metrics. The method also surpasses leading frontier models by 14.71% on semantic-label and relational-linking metrics while maintaining strong textual grounding performance.
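The summary does not give the exact TAB-PO objective, so the following is only a rough sketch of what a confidence-gated token-level barrier could look like when added to a standard sequence-level DPO loss. Everything beyond the DPO term is an assumption made for illustration: the function names, the probability threshold tau, the weight lam, and the choice of mean negative log-likelihood as the anchoring term.

```python
import math


def dpo_loss(policy_chosen, policy_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Standard sequence-level DPO loss on summed log-probabilities."""
    margin = beta * ((policy_chosen - ref_chosen) - (policy_rejected - ref_rejected))
    # -log(sigmoid(margin)) written as log(1 + exp(-margin))
    return math.log1p(math.exp(-margin))


def gated_barrier(token_logps, critical_mask, tau=0.5):
    """Assumed anchoring term: mean NLL over schema-critical tokens whose
    policy probability is below tau. Confident or non-critical tokens add 0."""
    gated = [
        -lp
        for lp, critical in zip(token_logps, critical_mask)
        if critical and math.exp(lp) < tau
    ]
    return sum(gated) / len(gated) if gated else 0.0


def tab_po_style_loss(chosen_token_logps, critical_mask, policy_rejected,
                      ref_chosen, ref_rejected, beta=0.1, lam=1.0, tau=0.5):
    """Hypothetical combination: DPO term plus lam times the gated barrier."""
    policy_chosen = sum(chosen_token_logps)
    preference = dpo_loss(policy_chosen, policy_rejected, ref_chosen, ref_rejected, beta)
    return preference + lam * gated_barrier(chosen_token_logps, critical_mask, tau)


# Toy numbers: three tokens in the preferred output, the second is a
# schema-critical label that the policy assigns only 20% probability.
chosen_logps = [math.log(0.9), math.log(0.2), math.log(0.8)]
mask = [False, True, False]
barrier = gated_barrier(chosen_logps, mask)
total = tab_po_style_loss(chosen_logps, mask, policy_rejected=-2.5,
                          ref_chosen=-2.0, ref_rejected=-2.4)
print(f"barrier={barrier:.4f} total={total:.4f}")
```

The gate is the design point this sketch tries to capture: tokens the model already predicts confidently, and tokens that are pure serialization, contribute nothing to the anchoring term, so the extra supervised pressure lands only on under-confident schema tokens.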

What's missing

The study evaluates exclusively on the SciERC benchmark; generalization to other structured generation domains (e.g., medical, legal, or general-purpose JSON extraction) is not demonstrated.

What different sources said

arXiv cs.CL (Center)
TAB-PO: Preference Optimization with a Token-Level Adaptive Barrier for Token-Critical Structured Generation

Publications

Gut Bacteria Enzyme Found to Break Down Heat-Processed Food Compounds, Producing Novel Biogenic Amines

Researchers have discovered that an enzyme in common gut bacteria can degrade N-epsilon-carboxymethyllysine (CML), a compound formed during thermal food processing, producing previously unknown biogenic amines. The enzyme, ornithine decarboxylase SpeC from enterobacteria, acts on CML and related modified lysine derivatives through a low-level 'underground' catalytic activity. This finding suggests a previously unrecognized communication axis between thermally processed dietary compounds and gut microbial physiology, with potential implications for host health.

1 source | Jun 13

Publications

Full-Length Gene Sequencing Reveals Two Distinct Bacterial Communities in Black-Legged Ticks Expanding Into Canada

Researchers used Oxford Nanopore full-length 16S rRNA gene sequencing to characterize the microbiome of Ixodes scapularis black-legged ticks collected in Nova Scotia, Canada, distinguishing between tick-adapted bacteria and environmentally acquired bacteria. The study comes as I. scapularis — the primary vector of Lyme disease — is rapidly expanding northward into Canada due to climate change. The findings suggest that environmentally derived bacteria in tick microbiomes are not mere contamination, which has implications for how tick microbiome data is collected and interpreted across surveillance studies.

1 source | Jun 13

Publications

Study Identifies Metabolic Link Between Cell Envelope Stress and Biofilm Formation in Bacteria

Researchers have discovered that the metabolite acetyl-CoA directly inhibits enzymes that degrade the bacterial signaling molecule c-di-GMP, connecting cell envelope biosynthesis stress to biofilm formation in Pseudomonas aeruginosa. The study found that sub-inhibitory concentrations of antibiotics targeting early peptidoglycan biosynthesis — but not other antibiotic classes — elevate c-di-GMP levels by reducing phosphodiesterase activity, with acetyl-CoA competing for the enzyme active site. Because the relevant enzyme domain is broadly conserved across bacterial species, this checkpoint mechanism may be widespread and could have implications for understanding antibiotic-induced biofilm responses.

1 source | Jun 13
