Publications · Jun 10 · 78% confidence

Researchers Release Largest McGurk Illusion Dataset to Standardize Audiovisual Speech Research

Source bias: Center 100%

1 source

Scientists have published the McGurk Illusion Dataset (MID), the largest publicly available collection of McGurk stimuli to date. It comprises 1,440 auditory, visual, and audiovisual speech samples from 80 Mandarin speakers, validated across nearly 361,000 behavioral trials. The McGurk illusion, in which conflicting audio and visual speech signals produce a third, fused percept, has been a key tool for studying how the brain integrates sensory information for five decades, but its usefulness has been limited by wide variability across individuals and speakers. The standardized dataset is intended to address those reliability problems and to support broader research into lip-reading, speaker normalization, and speech perception across diverse populations.

The McGurk Illusion Dataset (MID), introduced in a new bioRxiv preprint, represents the largest publicly available stimulus set for studying audiovisual speech integration, containing 400 auditory, 400 visual, and 640 audiovisual speech stimuli derived from 80 Mandarin-speaking participants. The dataset was validated through approximately 360,900 behavioral trials, providing a robust empirical foundation for characterizing how acoustic and facial articulatory properties relate to illusion susceptibility. The study replicated well-documented findings of substantial variability in how strongly individuals and speakers experience the McGurk effect—where, for example, hearing 'ba' while watching lip movements for 'ga' produces the perception of 'da'—and investigated the acoustic and visual factors underlying that variability. Researchers also examined associations between illusion rates and unisensory perception quality, audiovisual correspondence, and speaker-specific characteristics, while systematically comparing the reliability of different McGurk-based measures of audiovisual integration. The authors argue that MID fills a critical gap in the field by offering a standardized, large-scale resource that can support cross-population comparisons, including studies of clinical groups or developmental differences in multisensory processing. By making the dataset publicly available, the team hopes to enable more reproducible and comparable findings across laboratories worldwide.

What's missing

As a preprint, this work has not yet undergone formal peer review, so its methods and conclusions should be interpreted with caution. The dataset is drawn exclusively from Mandarin speakers, which may limit generalizability to other languages and phonological systems. The study does not yet report how well MID-derived measures predict audiovisual integration ability in clinical populations (e.g., autism, dyslexia, hearing loss), which is a key stated motivation. Additionally, the ecological validity of lab-generated McGurk stimuli versus naturalistic face-to-face communication remains an open question in the field.

What different sources said

bioRxiv · Center
Hearing Lips and Seeing Voices After Fifty Years: A Large-Scale McGurk Illusion Dataset for Audiovisual Speech Research

Publications

Gut Bacteria Enzyme Found to Break Down Heat-Processed Food Compounds, Producing Novel Biogenic Amines

Researchers have discovered that an enzyme in common gut bacteria can degrade N-epsilon-carboxymethyllysine (CML), a compound formed during thermal food processing, producing previously unknown biogenic amines. The enzyme, ornithine decarboxylase SpeC from enterobacteria, acts on CML and related modified lysine derivatives through a low-level 'underground' catalytic activity. This finding suggests a previously unrecognized communication axis between thermally processed dietary compounds and gut microbial physiology, with potential implications for host health.

1 source · Jun 13

Publications

Full-Length Gene Sequencing Reveals Two Distinct Bacterial Communities in Black-Legged Ticks Expanding Into Canada

Researchers used Oxford Nanopore full-length 16S rRNA gene sequencing to characterize the microbiome of Ixodes scapularis black-legged ticks collected in Nova Scotia, Canada, distinguishing between tick-adapted bacteria and environmentally acquired bacteria. The study comes as I. scapularis — the primary vector of Lyme disease — is rapidly expanding northward into Canada due to climate change. The findings suggest that environmentally derived bacteria in tick microbiomes are not mere contamination, which has implications for how tick microbiome data is collected and interpreted across surveillance studies.

1 source · Jun 13

Publications

Study Identifies Metabolic Link Between Cell Envelope Stress and Biofilm Formation in Bacteria

Researchers have discovered that the metabolite acetyl-CoA directly inhibits enzymes that degrade the bacterial signaling molecule c-di-GMP, connecting cell envelope biosynthesis stress to biofilm formation in Pseudomonas aeruginosa. The study found that sub-inhibitory concentrations of antibiotics targeting early peptidoglycan biosynthesis — but not other antibiotic classes — elevate c-di-GMP levels by reducing phosphodiesterase activity, with acetyl-CoA competing for the enzyme active site. Because the relevant enzyme domain is broadly conserved across bacterial species, this checkpoint mechanism may be widespread and could have implications for understanding antibiotic-induced biofilm responses.

1 source · Jun 13
