Researchers Release Largest McGurk Illusion Dataset to Standardize Audiovisual Speech Research
Scientists have created the McGurk Illusion Dataset (MID), the largest publicly available collection of audiovisual speech stimuli, containing 400 auditory, 400 visual, and 640 audiovisual speech samples from 80 Mandarin speakers validated across 360,900 trials. The McGurk illusion—where conflicting audio and visual speech cues create a perceptual illusion—has been used for 50 years to study how the brain integrates sensory information, but previous research showed high variability in susceptibility across individuals and speakers. This standardized dataset addresses a key limitation in audiovisual speech research by enabling more reliable measurement of how people integrate sight and sound in communication.
Researchers have introduced the McGurk Illusion Dataset (MID), a large-scale collection of audiovisual speech stimuli designed to improve the study of how humans integrate auditory and visual information during face-to-face communication. The dataset comprises 400 auditory stimuli, 400 visual stimuli, and 640 audiovisual stimuli generated from 80 Mandarin speakers, with behavioral validation across 360,900 trials. The McGurk illusion—a phenomenon where incongruent audio and visual speech cues cause listeners to perceive a different sound than what is actually spoken—has been a standard tool in audiovisual speech research for five decades. However, previous studies showed substantial variability in how susceptible different people and speakers are to the illusion, limiting its reliability as a measure of audiovisual integration ability. Using the MID, researchers characterized acoustic and facial articulatory properties of McGurk stimuli, confirmed inter-participant and inter-speaker variabilities, and identified associations between illusion susceptibility and unisensory perception, audiovisual correspondences, and speaker characteristics. The dataset is intended to serve as a standardized resource for investigating audiovisual speech integration across different populations and to support related research on speaker normalization, lip-reading, and speech perception.
Limitations & open questions
The study does not discuss potential limitations of using Mandarin speakers exclusively, which may affect generalizability to other languages with different phonetic and articulatory properties. Additionally, the paper does not specify demographic characteristics of the 360,900 trial participants or discuss whether findings apply equally across different age groups, hearing abilities, or language backgrounds.
What different sources said
- bioRxivCenter
Hearing Lips and Seeing Voices After Fifty Years: A Large-Scale McGurk Illusion Dataset for Audiovisual Speech Research
Related
Profilin-1 Deficiency Activates Immune Response Against Breast Cancer in Preclinical Study
Researchers found that removing the Profilin-1 protein from breast cancer cells triggers DNA damage and activates an immune pathway called STING, which recruits cancer-fighting T cells and causes tumor regression in mice. The study used CRISPR gene-editing technology to deplete Profilin-1 and observed that the resulting genomic instability paradoxically strengthens anti-tumor immunity. The findings suggest targeting Profilin-1 could be a new strategy to enhance immunotherapy effectiveness in breast cancer.
Computational Study Explores How Magnetic Fields May Affect Tomato Plant Ion Channels
Researchers used molecular dynamics simulations to investigate how static magnetic fields affect the CNGC6 ion channel in tomato plants, finding that magnetic fields may alter the channel's structure in specific ways. The study was motivated by observations that magnetic treatment of tomato seeds appears to speed germination and improve plant development, though the underlying cellular mechanisms remain unclear. The findings provide a computational foundation for future experimental work, though the authors emphasize this is a preliminary exploratory study requiring validation.
New Algorithm Simplifies Evolutionary Network Reconstruction for Hybridized Species
Researchers developed NetCS, a fast algorithm for reconstructing evolutionary networks in hybridized species that avoids expensive computational bottlenecks. The method works well when given accurate intermediate data but reveals that the real challenge in network inference lies in an earlier reconstruction step. This finding could enable phylogenetic analyses of larger datasets while identifying where future improvements are needed.