Publications | Jun 11 | 83% confidence

Researchers Propose Compute-Aware Framework for Evaluating Adversarial Robustness in Language Models

Center 100%

1 source

Researchers have proposed a compute-aware evaluation framework for assessing adversarial robustness in large language models, replacing fixed-query attack success rates with metrics based on cumulative floating-point operations (FLOPs). The study evaluated ten models across three families and four training stages using three attack strategies on two jailbreak benchmarks. The work reveals that current robustness metrics can be misleading, and that alignment training has complex, non-monotonic effects on how hard a model is to compromise.

A new arXiv preprint introduces a framework for evaluating how difficult it is to jailbreak large language models (LLMs). It measures adversarial effort as computational cost, specifically cumulative floating-point operations (FLOPs), rather than as the conventional attack success rate (ASR) under a fixed query budget. The authors argue that existing metrics obscure true attacker effort because attack strategies differ in cost by orders of magnitude. Their framework introduces 'risk-compute curves' and two summary metrics that map compute budgets to attack risk across models.

Key findings include that alignment training has non-monotonic effects on robustness in compute space, that scaling model size reduces vulnerability to expensive gradient-based attacks but not to cheaper template-based ones, and that gradient-based attacks can transfer from surrogate to target models, lowering attacker costs. The study also found that the computational cost of a successful attack varies by up to roughly a factor of five across harm categories within a single model, and that safety-aligned reinforcement learning raises aggregate attack cost while leaving certain harm categories disproportionately accessible.

The researchers have released their framework publicly to support more rigorous, compute-aware safety evaluations.
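To make the idea of a risk-compute curve concrete, here is a minimal Python sketch. It is not the authors' released framework. It assumes one plausible reading of the curve: for each compute budget, the fraction of prompts on which an attack succeeded using no more than that many cumulative FLOPs. The function name `risk_compute_curve` and all numbers are hypothetical, chosen only for illustration.

```python
from bisect import bisect_right


def risk_compute_curve(flops_to_success, budgets):
    """Return, for each budget, the fraction of prompts compromised within it.

    flops_to_success: one entry per prompt, holding the cumulative FLOPs
        spent when the attack first succeeded, or None if it never did.
    budgets: compute budgets (in FLOPs) at which to evaluate risk.
    """
    if not flops_to_success:
        raise ValueError("need at least one prompt")
    # Sort the successful attacks by cost so each budget is a binary search.
    successes = sorted(f for f in flops_to_success if f is not None)
    total = len(flops_to_success)
    return [bisect_right(successes, b) / total for b in budgets]


# Hypothetical per-prompt costs for two attack styles on the same model.
cheap_attack = [1e12, 3e12, None, 2e12, None]    # e.g. template-based
costly_attack = [5e14, 8e14, 2e15, None, 9e14]   # e.g. gradient-based
budgets = [1e12, 1e13, 1e15, 1e16]

cheap_curve = risk_compute_curve(cheap_attack, budgets)
costly_curve = risk_compute_curve(costly_attack, budgets)
print(cheap_curve)
print(costly_curve)
```

In this made-up data both attacks compromise 60% of prompts at a budget of 1e15 FLOPs, but only the cheap attack succeeds at all at 1e13 FLOPs. A single success rate would hide that difference, which is the kind of gap the article says compute-aware metrics are meant to expose.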

What's missing

It is unclear how well FLOPs, as a proxy for adversarial effort, capture real-world attacker constraints beyond compute, such as API rate limits or financial cost. As a preprint, the paper has not yet undergone peer review. Whether risk-compute curves generalize to future model architectures or novel attack strategies remains an open question.

What different sources said

arXiv cs.AI | Center
Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

