Interconnects.ai coverage reviews rapid open AI model launches including Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, and GLM-5.1 against CAISI V4 assessments
The analysis highlights disagreements between CAISI and Epoch AI Research on the open versus closed model gap while noting that evaluations remain incomplete for frontier systems.
new artifacts!
i also comment on the open<>closed model gap, where US CAISI and @EpochAIResearch disagree, arguing that both are incomplete: for an assessment of the very frontier, we must elicit the best performance by tuning prompts and harnesses with the models

Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment. An eventful month with one flagship release after another https://www.interconnects.ai/p/latest-open-artifacts-21-open-model
sonnet 5, gpt 5.6, gemini 3.5 all next week.
are you starting to feel the acceleration yet chat?
this week on agi wars, anthropic, google, and openai set to release their latest and greatest models in a bid to retake the 'current best sota' award.
xai continue to bleed staff because the type of people that can make the agi aren't the typical type of people elon hires, and his hammer drives them away.
meta continue to pretend ohhhhh boyyy, this is just the starter baby, oh boy, oh you wait. you'll have asi in some glasses real real soon, boy howdy yeah.
gemini after briefly appearing to turn the ship with its code red is still frozen in the chatbot era. sigh.
sorry, i was just going to say, three big models next week and got a little carried away.
codex mobile is insane btw.
typing this out on my phone and noticed my laptop in the corner of the room start doing some work.
clearly magic.