Tech1h ago72% confidence

Researchers Develop Low-Cost Foundation Model Training Method at $1,500

1 source

Researchers at Sapient Intelligence developed HRM-Text, a foundation language model that can be trained from scratch for approximately $1,500, compared to the millions typically required. The approach uses a Hierarchical Recurrent Model architecture that trains on instruction-response pairs rather than raw text, achieving performance competitive with much larger models on industry benchmarks. This development could democratize AI model training for enterprises that currently cannot afford the computational costs of building foundation models.

Sapient Intelligence researchers have created HRM-Text, a 1-billion-parameter foundation model trained for roughly $1,500—a dramatic reduction from the millions typically spent on foundation model development. The key innovation is replacing standard Transformer architectures with a Hierarchical Recurrent Model (HRM) that decouples computation into strategic and execution layers, allowing the model to train exclusively on instruction-response pairs rather than raw internet text. This approach addresses what the researchers identify as computational inefficiency in current large language models, which spend significant resources on tasks like reconstructing prompts that are already known at inference time. According to Sapient CEO Guan Wang, the method addresses enterprise pain points: expensive training, heavy infrastructure requirements, and slow experimentation cycles. The model achieved competitive performance on key industry benchmarks while using a fraction of the tokens required by standard LLMs, potentially enabling organizations to develop proprietary reasoning models without relying on external frontier models or sending sensitive data to third parties.

What's missing

The article does not provide independent verification of the claimed $1,500 training cost or performance benchmarks, nor does it specify which industry benchmarks were used for comparison. Details about the model's actual capabilities, limitations, or how it compares quantitatively to specific open-source models are absent. The practical scalability and real-world deployment results of HRM-Text are not discussed.

What different sources said

VentureBeatCenter
Researchers say they trained a foundation model from scratch for about $1,500

Tech

AI Is Changing Asia's Workplaces, According to Financial Leaders

Financial executives from major companies in Hong Kong, including Baidu's CFO and BlackRock's Asia-Pacific head, have shared insights on how artificial intelligence is transforming workplaces across the region. The article features perspectives from established financial institutions adapting to AI integration. These observations reflect broader trends of AI adoption reshaping employment and business operations in Asia.

1 source4m ago

Tech

GPT-5.5 Tops New AI Benchmark for Professional Workflows, Beating Claude Fable 5

OpenAI's GPT-5.5 achieved a 24.0% pass rate on the Agents' Last Exam (ALE), a new benchmark designed to measure AI performance on real-world professional tasks, narrowly beating Anthropic's Claude Fable 5 at 22.0%. The benchmark, developed by UC Berkeley researchers and 300+ domain experts, tests AI agents across 55 industries using authentic workflows from professional practitioners rather than isolated coding puzzles. The results highlight that current advanced AI models still fundamentally struggle with complex, long-horizon professional work despite recent progress.

1 source4m ago

Tech

Critical Infrastructure Protection Evolving Beyond Traditional Security as Drone Threats Multiply

Governments and security experts are grappling with how to protect critical infrastructure from drone threats, which can bypass traditional ground-based security measures. Recent conflicts have demonstrated that sensitive sites now face three-dimensional threats, prompting countries like China to tighten drone regulations and others to implement Remote ID mandates and geofencing. Experts argue that regulation alone is insufficient without detection capabilities and supply chain vetting to prevent circumvention of security frameworks.

1 source4m ago

Researchers Develop Low-Cost Foundation Model Training Method at $1,500

What's missing

What different sources said

Related

AI Is Changing Asia's Workplaces, According to Financial Leaders

GPT-5.5 Tops New AI Benchmark for Professional Workflows, Beating Claude Fable 5

Critical Infrastructure Protection Evolving Beyond Traditional Security as Drone Threats Multiply