Open-source AI agent outperforms Google's model on terminal benchmarks
A newly released open-source AI agent achieved a 65.2% score on TerminalBench, surpassing Google's 47.8% and the proprietary Junie CLI's 64.3%, with no reported cheating mechanisms.