BleepingComputer0 Hot0 bình luận3 phút đọc2 giờ trước

Claude Fable relaunch disappoints users with nerfed performance

Claude Fable 5, Anthropic's most powerful model, has been relaunched for all users after a government-imposed ban was lifted, but the reception has been disappointing. Users report that the restored model frequently falls back to Opus 4.8 due to overly aggressive safety guardrails, even for benign tasks like searching for dead code or working with C/C++/Rust. Triggers appear to include security-adjacent language in prompts or project files. Anthropic has not nerfed the model itself, but its updated safety systems use a large 'safety margin' that causes excessive false positives. Additionally, Fable 5 usage is capped at 50% of weekly limits on subscription plans, with a full transition to pay-per-use credits expected after July 7.

Đọc bài gốc

#llm #claude #anthropic #claude-code #ai-safety

Nguồn: https://www.bleepingcomputer.com/news/artificial-intelligence/claude-fable-relaunch-disappoints-users-with-nerfed-performance. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Baeldung1 Hot9 phút7 giờ trướcAI

Building LLM-as-a-Judge Using Recursive Advisors in Spring AI

Bài viết hướng dẫn từng bước triển khai mô hình LLM-as-a-Judge trong Spring AI bằng cách sử dụng recursive advisors, nơi LLM thứ hai đánh giá và cho điểm phản hồi của LLM sinh ra dựa trên tiêu chí rubric, sau đó phản hồi phê bình được đưa trở lại prompt để tinh chỉnh. Quá trình lặp lại cho đến khi đạt ngưỡng chất lượng hoặc giới hạn số lần thử tối đa.

Làm việc với LLM-as-a-Judge trong Spring AI giúp tối ưu hóa chất lượng phản hồi của AI bằng cách kết hợp đánh giá tự động và phản hồi lặp đi lặp lại, giảm thiểu sai sót và tăng hiệu suất cho các ứng dụng tự động hóa.

#java

Claude Fable relaunch disappoints users with nerfed performance

Đề xuất cho bạn

Building LLM-as-a-Judge Using Recursive Advisors in Spring AI

Please stop the AI Confidence Theater

Better Models: Worse Tools

LangChain is great for prototypes. Here’s why I didn’t use it in production.

Cross-Model Experiments — The Recursion Institute

Why Specialization Is Inevitable

Is Open Source Dead?

I ditched my productivity stack for Claude Code, and it does everything paid tools used to do