

Anthropic has abruptly walked back a controversial, unannounced policy that degraded the performance of its latest model, Claude Fable 5.
The reversal follows intense backlash from the machine learning community, which criticized the company for a lack of transparency and anti-competitive behavior, according to a Wired report.
The controversy began earlier this week with the release of Claude Fable 5, a version of Anthropic’s highly sophisticated Mythos system equipped with specialized national security guardrails. While the company openly said it would reroute hazardous prompts regarding cybersecurity, biology, and chemistry to less advanced models, it did not disclose a separate restriction: silently throttling requests tied to frontier LLM development.
AI researchers quickly noticed that when Fable 5 was tasked with training competing LLMs, debugging AI code, or optimizing neural architecture, the model would covertly fail or degrade its output without notifying the user. This hidden mechanism drew immediate fire from developers who complained they were burning expensive API tokens on a deliberately crippled system.
Many in the tech sector viewed the stealth restrictions as a hostile maneuver designed to prevent rivals from using Anthropic’s proprietary data to build competing systems. The move severely bruised Anthropic’s public image, as the company has long positioned itself as a more ethical, researcher-friendly, and safety-oriented alternative to rivals like OpenAI.
“Degrading performance on ML research without telling the user is shockingly hostile and a terrible look,” said Dean W. Ball, a prominent research fellow, on social media platform X.
In statements issued to Wired and Business Insider, Anthropic acknowledged the misstep and announced immediate changes to make the guardrails entirely visible.
“We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” an Anthropic spokesperson said. “We made the wrong trade-off, and we apologize for not getting the balance right.”
Under the revised policy, the safety restrictions remain in place, but the stealth element has been eliminated. Starting this week, flagged API requests will explicitly return a reason for refusal, and standard user queries will visibly fall back to the older Claude Opus 4.8 model rather than degrading silently.
Anthropic defended the core existence of the safeguards, emphasizing that the underlying Mythos framework possesses unprecedented capabilities in advanced reasoning, cyber-operations, and scientific research. The strict barriers are meant to prevent foreign adversaries from accelerating weapons research or gaining a dangerous technological edge.
The company reassured developers that the vast majority of standard coding and machine learning applications remain entirely unaffected by these national security parameters. While the full Mythos system remains restricted to vetted government entities, the public-facing Fable 5 will now operate with the transparency the developer community demands.
from DevOps.com https://ift.tt/1lmn4U5
Comments
Post a Comment