TellWell
← Back to feed
Publications3d ago83% confidenceConfidence 83% — the share of independent, credible sources corroborating the core facts.

Withdrawn Paper Argues Cosine Similarity Validity Depends on Embedding Normalization

Center 100%
1 source

A machine learning paper argues that cosine similarity is geometrically valid when embeddings are constrained to the unit sphere, contrary to recent warnings against its use. The paper responds to prior work showing that cosine similarity can be made arbitrary through diagonal matrix transformations in unnormalized embeddings. The paper has since been withdrawn by its author and was noted as an incomplete blog post draft.

Bouhsine's paper, now withdrawn, challenged conclusions from Steck, Ekanadham, and Kallus that cautioned against using cosine similarity on learned embeddings from matrix factorization models. The prior work demonstrated that cosine similarity could be rendered arbitrary through a diagonal gauge matrix transformation. Bouhsine argued this critique conflates a problem with incompatible training objectives rather than cosine similarity itself. The paper proved that when embeddings are constrained to the unit sphere during or after training, the gauge matrix ambiguity disappears and cosine distance becomes equivalent to half the squared Euclidean distance, ensuring identical neighbor rankings. The author noted in comments that the paper was a blog post companion draft requiring updates before proper preprint submission, and subsequently withdrew it.

What's missing

The paper does not discuss practical implications for existing deployed systems using unnormalized embeddings, nor does it address computational costs of enforcing unit sphere constraints during training.

What different sources said

  • In Defense of Cosine Similarity: Normalization Eliminates the Gauge Freedom

Related

PublicationsConfidence 78% — the share of independent, credible sources corroborating the core facts.

Gut Bacteria Enzyme Found to Break Down Heat-Processed Food Compounds, Producing Novel Biogenic Amines

Researchers have discovered that an enzyme in common gut bacteria can degrade N-epsilon-carboxymethyllysine (CML), a compound formed during thermal food processing, producing previously unknown biogenic amines. The enzyme, ornithine decarboxylase SpeC from enterobacteria, acts on CML and related modified lysine derivatives through a low-level 'underground' catalytic activity. This finding suggests a previously unrecognized communication axis between thermally processed dietary compounds and gut microbial physiology, with potential implications for host health.

1 source50m ago
PublicationsConfidence 78% — the share of independent, credible sources corroborating the core facts.

Full-Length Gene Sequencing Reveals Two Distinct Bacterial Communities in Black-Legged Ticks Expanding Into Canada

Researchers used Oxford Nanopore full-length 16S rRNA gene sequencing to characterize the microbiome of Ixodes scapularis black-legged ticks collected in Nova Scotia, Canada, distinguishing between tick-adapted bacteria and environmentally acquired bacteria. The study comes as I. scapularis — the primary vector of Lyme disease — is rapidly expanding northward into Canada due to climate change. The findings suggest that environmentally derived bacteria in tick microbiomes are not mere contamination, which has implications for how tick microbiome data is collected and interpreted across surveillance studies.

1 source50m ago
PublicationsConfidence 78% — the share of independent, credible sources corroborating the core facts.

Study Identifies Metabolic Link Between Cell Envelope Stress and Biofilm Formation in Bacteria

Researchers have discovered that the metabolite acetyl-CoA directly inhibits enzymes that degrade the bacterial signaling molecule c-di-GMP, connecting cell envelope biosynthesis stress to biofilm formation in Pseudomonas aeruginosa. The study found that sub-inhibitory concentrations of antibiotics targeting early peptidoglycan biosynthesis — but not other antibiotic classes — elevate c-di-GMP levels by reducing phosphodiesterase activity, with acetyl-CoA competing for the enzyme active site. Because the relevant enzyme domain is broadly conserved across bacterial species, this checkpoint mechanism may be widespread and could have implications for understanding antibiotic-induced biofilm responses.

1 source50m ago