TellWell
Tech · 4h ago · Confidence 96% (the share of independent, credible sources corroborating the core facts)

Google Introduces DiffusionGemma: Experimental Model Achieving 4x Faster Text Generation

2 sources

Google DeepMind has released DiffusionGemma, an experimental open model that generates text up to 4 times faster than traditional autoregressive models by producing entire blocks of text in parallel rather than one token at a time. The 26-billion-parameter Mixture of Experts model uses diffusion-based generation similar to image models and activates only 3.8 billion parameters during inference; when quantized, it fits on high-end consumer GPUs. The model is designed for speed-critical applications like real-time editing and local inference, though it gives up some output quality compared to standard Gemma 4 models.

Google DeepMind introduced DiffusionGemma, an experimental open model released under an Apache 2.0 license that changes how text generation works. Traditional autoregressive language models generate text token by token, from left to right. DiffusionGemma instead uses a diffusion-based approach similar to image generation models, producing entire blocks of up to 256 tokens simultaneously.

The model reaches more than 700 tokens per second on an NVIDIA GeForce RTX 5090 and more than 1,000 tokens per second on an NVIDIA H100, approximately 4x faster inference than comparable autoregressive Gemma models. It is a 26-billion-parameter Mixture of Experts model that activates only 3.8 billion parameters during inference, and when quantized it fits within the 18GB VRAM limits of high-end consumer GPUs.

Parallel generation also enables bidirectional attention: every token in a block can attend to all the others, not only to the tokens before it. That provides advantages for non-linear tasks like code infilling and mathematical problem-solving. Google acknowledges that DiffusionGemma prioritizes speed over output quality and recommends standard Gemma 4 models for applications requiring maximum quality.
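To put the quoted figures in perspective, here is a rough back-of-envelope sketch in Python. It uses only the numbers reported above (26 billion total parameters, 3.8 billion active, 700 tokens per second, 256-token blocks, roughly 4x speedup). The bit-widths are illustrative assumptions, since the sources do not say which quantization format is used, and the function names are hypothetical helpers, not part of any Google tooling. The memory estimate covers weights only and assumes that all experts stay resident in memory, which is typical for Mixture of Experts models but is not confirmed by the sources.

```python
# Back-of-envelope arithmetic on the figures quoted in the article.
# Assumptions (not from the sources): the bit-widths tried below, and that
# all 26B parameters must be held in memory even though only 3.8B are active.

TOTAL_PARAMS = 26e9      # total parameters reported
ACTIVE_PARAMS = 3.8e9    # parameters active per inference step, as reported
BLOCK_TOKENS = 256       # maximum tokens generated per block, as reported


def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Memory for the weights alone, in decimal gigabytes (no activations or cache)."""
    return num_params * bits_per_param / 8 / 1e9


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Time to emit num_tokens at a steady throughput."""
    return num_tokens / tokens_per_second


def implied_baseline_tps(tokens_per_second: float, speedup: float) -> float:
    """Throughput of the slower model implied by a quoted speedup factor."""
    return tokens_per_second / speedup


if __name__ == "__main__":
    for bits in (16, 8, 4):  # hypothetical precisions
        print(f"{bits}-bit weights: {weight_memory_gb(TOTAL_PARAMS, bits):.1f} GB")

    print(f"active share of parameters: {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")

    fast = 700.0                              # RTX 5090 figure from the article
    slow = implied_baseline_tps(fast, 4.0)    # implied by the "approximately 4x" claim
    print(f"one {BLOCK_TOKENS}-token block at {fast:.0f} tok/s: "
          f"{generation_seconds(BLOCK_TOKENS, fast):.2f} s")
    print(f"same block at the implied {slow:.0f} tok/s baseline: "
          f"{generation_seconds(BLOCK_TOKENS, slow):.2f} s")
```

Under these assumptions, 4-bit weights come to about 13 GB, which is consistent with the reported 18GB budget once some headroom is left for activations. At 16-bit the weights alone would need about 52 GB, which is why quantization matters for consumer hardware.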

What's missing

The sources do not provide information about how DiffusionGemma's output quality compares quantitatively to autoregressive Gemma 4 models, specific benchmark results on standard evaluation metrics, or detailed performance comparisons across different hardware configurations beyond the two GPU examples mentioned.

What different sources said

  • DiffusionGemma: 4x Faster Text Generation

  • Google's latest DiffusionGemma open AI model comes with a 4x speed boost
