
GPT-5.5 outperforms Claude Fable 5 in AI's toughest real-world test
A new benchmark from UC Berkeley measures AI's ability to handle complex, long-horizon professional tasks. GPT-5.5 leads with a 24% pass rate, exposing sharp limitations in even top-tier models.