benchmarks
-
AI
Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks
The Allen Institute for AI (Ai2) recently released what it calls its most powerful model family to date, Olmo 3.…
Read More » -
AI
Gemini 3 Pro scores 69% trust in blinded testing up from 16% for Gemini 2.5: The case for evaluating AI on real-world trust, not academic benchmarks
Just a few weeks ago, Google debuted its Gemini 3 model and claims it scores a leading position in multiple…
Read More » -
AI
Google unveils Gemini 3 claiming the lead in math, science, multimodal and agentic AI benchmarks
After more than a month of rumors and feverish speculation — including Polymarket wagering on the release date — Google…
Read More » -
AI
Moonshot's Kimi K2 Thinking emerges as leading open source AI, outperforming GPT-5, Claude Sonnet 4.5 on key benchmarks
Zelfs als bezorgdheid en scepsis groeit dankzij de uitbouwstrategie en hoge uitgavenverplichtingen van de Amerikaanse AI-startup OpenAI, Chinese open source…
Read More » -
AI
Nvidia says its Blackwell chips lead benchmarks in training AI LLMs
NVIDIA is rolling out its AI chips into data centers and what the AI factories all over the world calls,…
Read More » -
AI
The US is reviewing Benchmark’s investment into Chinese AI startup Manus
Manus AI is one of the most popular AI-agent startups and recently raises $ 75 million in a rating of…
Read More » -
AI
OpenAI launches program to design new ‘domain-specific’ AI benchmarks
Like many AI Laboratories, AI -Benchmarks are broken, just like many AI Laboratories. It says that it wants to repair…
Read More » -
AI
Did xAI lie about Grok 3’s benchmarks?
Debates about AI -Benchmarks – and how they are reported by AI Labs – spill out in the public image.…
Read More »
