It blocked us at 'hello!' Anthropic Fable 5 refusing innocuous prompts
Hyper-vigilant safety classifiers turn Fable into cautionary tale
IT/기술 · "FABLE" · 총 44건
필터 보기현재 지수
49.4
0 = 부정 우세
50 = 중립
100 = 긍정 우세
최근 7일 기준 77,748건을 분석한 결과, 뉴스 심리지수는 49.4(균형)입니다. 긍정 9,432건(12.1%)·중립 56,170건(72.2%)·부정 12,146건(15.6%)이며, 중립 비중이 뚜렷하게 높습니다. 성향 지수는 종합 21.1(보수 경향)입니다.
Hyper-vigilant safety classifiers turn Fable into cautionary tale
Anthropic just released Claude Fable 5, calling it the most powerful AI model it has ever made widely available and praising its skills in biology, among others. But the model won't answer basic biology questions - the kind you'd expect a high schooler to handle. Instead, it hands off the query to the former flagship […]

Das KI-Modell Mythos sei zu mächtig für die Öffentlichkeit, hieß es. Jetzt gibt es eine angeblich sichere Version. Wer sie testet, stößt schnell an Grenzen.
A paragraph buried in Fable 5's 319-page system card revealed the model would silently downgrade its responses for certain AI development work — without telling users.

Anthropic released Claude Fable, its first Mythos-class AI model, yesterday and it's already causing concerns inside Microsoft. Sources tell me that Microsoft is limiting the use of Claude Fable 5 for employees because of Anthropic's new data retention requirements. While Microsoft quickly rolled out Claude Fable 5 to its GitHub Copilot and Foundry customers, I'm […]

Cybersecurity researchers are complaining that Anthropic's new model Fable has guardrails that are too strict for any cybersecurity work.
Après avoir freiné la publication de son modèle par prudence, la société a présenté mardi 9 juin Fable 5, une version plus limitée. Un système loin d’être infaillible, alerte Arthur Grimonpont, membre du Centre de la sécurité de l’IA.
Die KI-Firma Anthropic hat ihr angeblich zu mächtiges „Mythos“-Modell in Ketten gelegt und lässt es unter dem Namen „Fable 5“ auf Normalnutzer los. Das Ziel ist klar: Die Fantasie der Anleger beflügeln, bevor das Unternehmen an die Börse geht.

Anthropic has released Claude Fable 5, a public version of its most powerful AI model, two months after restricting an equivalent system over cybersecurity fears.
_0_1781080275.jpg)
Anthropic has released Claude Fable 5, a public version of its most powerful AI model, two months after restricting an equivalent system over cybersecurity fears.
_0_1781080278.jpg)
Anthropic is rolling out a public version of its Mythos AI model, but with guardrails barring its use in risky areas such as cybersecurity, after an earlier preview this year sent shockwaves globally for its ability to find software flaws. The new Claude Fable 5 is the most powerful model Anthropic has ever made for wider use, the startup said on Tuesday, touting its performance in software engineering and analytics. Anthropic has so far limited its access to a group of about 200 organisations, including the US government under the Glasswing program, after announcing in April that Mythos had uncovered thousands of software vulnerabilities. Offering its capabilities more widely may allow the $965 billion company to extend the momentum that has powered its valuation above rival OpenAI just as the two startups at the centre of the AI industry race to go public. The company said it had done extensive testing to ensure that users could not manipulate the new model to bypass its guidelines and perform restricted actions. “Let’s say I’m a college student asking the model like help me find cyber vulnerabilities on X package or code. The model would refuse and Fable 5 will fall back to Opus 4.8 for a response,” Dianne Penn, Anthropic’s head of product management, research and labs, told Reuters. Fable 5 will be a more expensive model, but it accomplishes tasks with lower token usage, bringing the overall cost per task down, according to early customer feedback, Penn said. Anthropic also said users who had access to the preview version of Claude Mythos, the version without guardrails, would be able to upgrade to the new Claude Mythos 5. The company said it planned to expand access over time through a more “systematic trusted-access programme”. Pricing on both models is $10 per million input tokens and $50 per million output tokens, the company said.
Anthropic has launched Claude Fable 5, a powerful AI model now publicly available. This release follows extensive safety measures to prevent misuse in high-risk fields. Fable 5, designed for complex tasks, is priced competitively and offers enhanced token efficiency, marking a significant step in making advanced AI more accessible.
The broad safeguards built into Anthropic's Claude Fable 5, a Mythos-class AI model, blocks some mundane requests on cybersecurity and biology.
Anthropic offers an unrestricted Claude Mythos 5 alongside Fable 5, the first from its Mythos class, its most advanced AI lineup.

AI company restricted access to Fable 5, its most powerful Mythos model, for months over cybersecurity concerns Anthropic, the maker of the Claude artificial intelligence (AI) models, made a new version of its technology available to the general public on Tuesday while restricting its use in sensitive areas. Dubbed Fable 5, the model is the first to be made widely available from the company’s new Mythos class – its most advanced lineup of AI technology, unveiled in April but restricted to a small set of partner institutions for months over cybersecurity concerns. Continue reading...
Anthropic is launching the next iteration of its powerful artificial intelligence models to the general public – but with guardrails that cap its ability to wreak havoc in areas like cybersecurity and biological research. Anthropic said in a statement Tuesday that the new model, called Claude Fable 5, will allow users to query Mythos –...

Comments
Anthropic is rolling out a public version of its Mythos AI model, but with guard rails barring its use in risky areas such as cybersecurity, after a preview earlier this year sent shock waves globally with its ability to find software flaws. The new Claude Fable 5 is the most powerful model Anthropic has ever made for wider use, the start-up said on Tuesday, touting its performance in software engineering and analytics. Anthropic has so far limited its access to a group of about 200...

Anthropic's Claude Fable 5 is going to be a big hit with the web's vibe coders.