Kimi K2.7-Code claims 30% fewer thinking tokens and a drop-in API swap path, but independent benchmarks show kernel ...
This report follows KushoAI's earlier launch of APIEval-20, the industry's first open benchmark for evaluating AI agents on ...
M3 demonstrates that the next phase of agent development will be driven not just by larger datasets but by efficient ...
SAN FRANCISCO, April 8, 2026 /PRNewswire/ -- KushoAI, an AI-native platform for API testing and software reliability, has introduced APIEval-20, an open benchmark designed to evaluate how effectively ...
MiniMax M3 launched on June 1, 2026, with a 1-million-token context window and company-reported SWE-Bench Pro scores that edge ...
Following a record-breaking sweep of global embedding leaderboards, Octen debuts its proprietary distributed search engine, achieving 60ms latency and 1M+ QPS to power search in the age of AI. The ...