Anthropic Releases Powerful Claude Model with Safety Guardrails to Broader Audience

Anthropic has made its most advanced AI model, Claude Fable 5, available to a wider audience, with built-in safety restrictions that block its use for cyberattacks and other risky applications. The move follows an April announcement that an earlier version of the model could identify thousands of software vulnerabilities; access to that version had been restricted to about 200 organizations, including the U.S. government. The release reflects Anthropic's effort to balance capability with safety as it competes with OpenAI.
Anthropic has launched Claude Fable 5, its most powerful AI model to date, with guardrails designed to prevent misuse in sensitive areas such as cybersecurity. The model will refuse requests to help find vulnerabilities in software and will hand such queries to a less capable model (Opus 4.8). Previously, access to the model's vulnerability-detection capabilities was limited to approximately 200 organizations, including U.S. government agencies, following April's disclosure that the model could uncover thousands of software flaws. The company says it has tested extensively to ensure users cannot bypass the safety guidelines. Pricing is set at $10 per million input tokens and $50 per million output tokens; the company notes that although per-token prices are higher, the overall cost of a task may be lower because the model uses fewer tokens. Anthropic plans to expand access gradually through a "systematic trusted-access program."
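As a rough illustration of the pricing arithmetic, the sketch below computes the cost of a single task from the per-million-token prices quoted above. The function name `task_cost` and the token counts in the example are hypothetical and chosen only for illustration. They are not figures from Anthropic or the article.

```python
# Prices as quoted in the article, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = 10
OUTPUT_PRICE_PER_MILLION = 50


def task_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the dollar cost of one task at the quoted per-token prices."""
    total = (input_tokens * INPUT_PRICE_PER_MILLION
             + output_tokens * OUTPUT_PRICE_PER_MILLION)
    return total / 1_000_000


# Hypothetical task: 200,000 input tokens and 30,000 output tokens.
print(task_cost(200_000, 30_000))  # 3.5
```

Because output tokens cost five times as much as input tokens, a model that finishes a task with fewer output tokens could cost less overall even at a higher per-token price, which is the point the company makes.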
What's missing
The article does not specify what the actual differences are between Claude Fable 5 and the earlier Mythos model beyond guardrails, nor does it clarify the relationship between the names 'Mythos' and 'Fable 5' or explain why the naming changed. Additionally, details about the specific safety mechanisms that prevent guardrail bypass are limited.
What different sources said
- NDTV (Center)
Anthropic Opens Up Its Most Powerful AI Model To All, But There's A Catch