Google's Gemini 3.5 Flash records 1656 Elo on GDPval-AA matching M4o 2.5 Flash while trailing GPT-5 and Claude-Opus 4.7

INDRAJEET@indrajeet877TECH

@scaling01 Good 👍

Sundar Pichai@sundarpichaiTECH

Just off stage at #GoogleIO, some highlights from this morning 🧵 Gemini 3.5 Flash is available today for everyone in @antigravity and across our products and APIs. Compared to 3.1 Pro, 3.5 Flash is better across almost all benchmarks with huge progress in coding. It’s also comparable to the best models but very fast (4x faster tokens/ second than other frontier models). And when looking at the intelligence versus output speed, it’s in a league of its own in the top right quadrant.

Demis Hassabis@demishassabisTECH

Gemini 3.5 Flash is amazing! - Performs better than 3.1 Pro on coding & agentic tasks - 4x faster than other frontier models - 12x faster in @antigravity - 800 tokens/sec! - Often at less than half the cost And Pro to come… Try it in @antigravity, @GeminiApp & more - enjoy! https://x.com/demishassabis/status/2056904067406860545/photo/1

Logan Kilpatrick@OfficialLoganKTECH

Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now! https://x.com/OfficialLoganK/status/2056792266514329914/photo/1

Theo - t3.gg@theoTECH

Oh my god it scored worse than Composer 2! Not even 2.5! And it cost 4x more to run!!! This might be the worst major lab model drop of all time. Llama 4 tier. Insane. https://x.com/theo/status/2056949041850913054/photo/1 https://twitter.com/mntruell/status/2056940715201220906

GoogleTECH

Meet Gemini 3.5 Flash — our strongest agentic and coding model yet. It delivers frontier-level performance at 4x the speed of comparable frontier models — often at less than half the cost. Generally available, starting today. 🧵 #GoogleIO

GoogleTECH

Gemini 3.5 Flash is built to help you execute complex, agentic workflows. 3.5 Flash rivals flagship models to deliver frontier performance for agents and coding, at the lightning speeds you expect from the Flash series.

Google DeepMind@GoogleDeepMindTECH

Introducing Gemini 3.5: our newest family of models combining frontier intelligence with real-world action. The first release is 3.5 Flash, our strongest model yet for agents and coding 🧵

Google's Gemini 3.5 Flash records 1656 Elo on GDPval-AA matching M4o 2.5 Flash while trailing GPT-5 and Claude-Opus 4.7

Related Stories

Commentary on X

Digg Deeper

How does the newly revealed pricing compare to its predecessors and current competitors?

What can one infer from this pricing ?

Worth it or nah?