Digg - AI news, before it trends

Today's Highlights

In case you missed it

An OpenAI internal general-purpose reasoning model refutes Erdős’s 1946 unit-distance conjecture by identifying infinite families of point configurations with superlinear scaling

Christian SzegedyCS#43|@CHRSZEGEDY
Eric JangEJ#36|@ERICJANG11
Noam BrownNB#31|@POLYNOAMIAL
Miles BrundageMB#20|@MILES_BRUNDAGE
Greg BrockmanGB#19|@GDB
Sam AltmanSA#8|@SAMA
+44 more


Top Stories

10

Alibaba releases Qwen3.7-Max closed-weights model scoring 56.6 on the Artificial Analysis Intelligence Index after increased RL compute investment — Model leads Chinese models on CritPt with nearly 4x prior performance.

5h|40.7k54354
ZephyrZE#1497|@ZEPHYR_Z9
Florian BrandFB#1153|@XEOPHON
Lisan al GaibLA#980|@SCALING01
20

Posts referencing a February 23 Marcus on AI Substack article say OpenAI's valuation has risen 67 percent and Anthropic's 173 percent since the beginning of the year — Anthropic reported second-quarter profits alongside OpenAI's Erdős problem solution.

2h|20k42234
Super DarioSD#1776|@INDUCTIONHEADS
Lisan al GaibLA#980|@SCALING01
Bojan TunguzBT#687|@TUNGUZ
30

Qwen releases Qwen3.7-Max, its latest flagship model for agent workloads that achieves 69.7 on Terminal-Bench 2.0 and completed a 35-hour kernel optimization with over 1,000 tool calls — Supports multi-file coding agents, MCP integrations, and multi-agent orchestration.

47m|79.2k|1.2k|199
Chubby♨️CH#1496|@KIMMONISMUS
Andrew CurranAC#517|@ANDREWCURRAN_
40

Ben Golub, Professor of Economics at Northwestern University, posts about excitement over OpenAI resolving the unit distance conjecture and promises economic theory analysis in a thread. — Gary Marcus replies labeling the post the subtle troll of the year.

3h|42.6k23364
Ben GolubBG#1447|@BEN_GOLUB
Gary MarcusGM#153|@GARYMARCUS
50

Gemini 3.5 Flash records a competitive 5.4 overall score on RuneScape-Bench and leads several early-game categories against GPT-5.5, GPT-5.4, and Opus 4.7 — Results are hosted at maxbittker.github.io with per-skill tables and green highlights.

4h|24.3k35157
Nataniel RuizNR#1562|@NATANIELRUIZG
Lisan al GaibLA#980|@SCALING01
New GitHub stars (48hrs)
aisa-group/InferenceBench|2 stars|19h

Evaluates autonomous AI agents optimizing LLM serving like vLLM in open-ended scenarios with quality and integrity gates.

Rulin ShaoRS1224
Maksym AndriushchenkoMA1066
manaflow-ai/cmux|17.6k stars|1d

Integrates Ghostty into a macOS terminal with vertical tabs, sidebar metadata, OSC notifications, and in-app browser for AI coding agents.

Jack MorrisJM203
will brownWB339
sapientinc/HRM-Text|565 stars|1d

Pretrains 1B HRM text models with hierarchical reasoning, task completion, and low-compute PrefixLM training.

TaelinTA1047
Vincent WeisserVW707
60

Flow matching technique controls pretrained generative models by shifting endpoint means of deterministic interpolants using imperfect reference examples — Demos cover style transfers, attribute edits, and anatomy corrections.

2h|7.3k7445
Luca AmbrogioniLA#1822|@LUCAAMB
Jan-Willem van de MeentJV#1677|@JWVDM
70

X debate examines whether AI can generate novel mathematical proofs, with one argument applying the data processing inequality to claim outputs remain fully determined by axioms and training data — Luca Ambrogioni counters that random inputs add entropy enabling novelty.

6h|4.3k200
Luca AmbrogioniLA#1822|@LUCAAMB
Dimitris PapailiopoulosDP#197|@DIMITRISPAPAIL
(((ل()(ل() 'yoav))))👾('#92|@YOAVGO
80

Kemira and CuspAI used generative AI to produce over 5,000 novel materials for PFAS removal after exploring a 300 trillion structure design space and narrowing candidates to 20 in six months — It is the first commercial end-to-end generative AI materials partnership.

7h|13.9k16417
Taco CohenTC#120|@TACOCOHEN
Max WellingMW#88|@WELLINGMAX
90

Jamie Dimon, JPMorgan Chase CEO, says the firm will hire more artificial intelligence specialists and fewer traditional bankers as AI adoption accelerates — AI will reduce overall jobs while creating new roles and raising productivity.

1h|7k7817
Rohan PaulRP#1032|@ROHANPAUL_AI
Andrew CurranAC#517|@ANDREWCURRAN_
100

Victor Taelin reports that GPT models solve Erdős problems yet fail to spot basic fixes for Interaction Net bugs in an HVM SupGen variant — The required HOAS interpreter step was never suggested unaided.

46m|4.3k8111
TaelinTA#1047|@VICTORTAELIN
Alexander DoriaAD#867|@DORIALEXANDER
110

Cursor releases its Composer 2.5 coding agent model through Cursor CLI, scoring 63 in Fast mode on the Artificial Analysis index to rank third — It gains on SWE-Bench-Pro-Hard-AA with lower costs and higher speed.

1h|9.5k12727
eric zakariassonEZ#1965|@ERICZAKARIASSON
🍓🍓🍓🍓🍓#1711|@IRULETHEWORLDMO

GitHub Stars

(7 days)
  1. …ilsonnn/image-blaster|5D AGO

    Converts an input image into 3D meshes, Gaussian splats, and audio files using Claude skills and external AI APIs.

    Mohit ShridharMS1673
    elvisEL475
    🎭🎭878
    Andrew Carr 🤸AC263
    3.1k stars
  2. …tonomous-speedrunning|6D AGO

    Archives experiments of AI agents autonomously tuning optimizers, schedules, and hyperparameters to reach target validation loss in fewest steps on a small LM benchmark.

    swyx🛬 SFOSS214
    Vincent WeisserVW707
    samsjaSA1262
    65 stars
  3. …-group/InferenceBench|19H AGO

    Evaluates autonomous AI agents optimizing LLM serving like vLLM in open-ended scenarios with quality and integrity gates.

    Rulin ShaoRS1224
    Maksym AndriushchenkoMA1066
    2 stars
  4. manaflow-ai/cmux|1D AGO

    Integrates Ghostty into a macOS terminal with vertical tabs, sidebar metadata, OSC notifications, and in-app browser for AI coding agents.

    Jack MorrisJM203
    will brownWB339
    17.6k stars
  5. sapientinc/HRM-Text|1D AGO

    Pretrains 1B HRM text models with hierarchical reasoning, task completion, and low-compute PrefixLM training.

    TaelinTA1047
    Vincent WeisserVW707
    565 stars
© 2026 Digg Inc.

Yesterday's Top Stories, May 20, 2026.

Posts: 4,298|Clusters: 1,078|Stories frozen at midnight PT
10

An OpenAI internal general-purpose reasoning model refutes Erdős’s 1946 unit-distance conjecture by identifying infinite families of point configurations with superlinear scaling — The proof is strong enough for submission to the Annals of Mathematics.

11h|#1 VIEWS 12.6M|#1 LIKED 62.8k|#1 BOOKMARKED 14.7k
Brendan Dolan-GavittBD#812|@MOYIX
Sherwin WuSW#801|@SHERWINWU
Ekin AkyürekEA#757|@AKYUREKEKIN
Brandon McKinzieBM#741|@MCKBRANDO
Chris PaxtonCP#732|@CHRIS_J_PAXTON
elieEL#716|@ELIEBAKOUCH
Tamay BesirogluTB#699|@TAMAYBES
Aaron RothAR#697|@AAROTH
Bojan TunguzBT#687|@TUNGUZ
Jerry LiuJL#677|@JERRYJLIU0
Jason PhangJP#672|@ZHANSHENG
Pedro DomingosPD#654|@PMDDOMINGOS
Yo ShavitYS#638|@YONASHAV
Simo RyuSR#604|@CLONEOFSIMO
Patrick HsuPH#597|@PDHSU
heinerHE#572|@HEINRICHKUTTLER
Clive ChanCC#548|@ITSCLIVETIME
Andrew CurranAC#517|@ANDREWCURRAN_
CLSCL#442|@CHENGLEISI
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#420|@TEORTAXESTEX
Dean W. BallDW#392|@DEANWBALL
Alexander WeiAW#367|@ALEXWEI_
Mo BavarianMB#364|@MOBAV0
Boris PowerBP#357|@BORISMPOWER
Nat McAleeseNM#345|@__NMCA__
François FleuretFF#331|@FRANCOISFLEURET
Andrew Carr 🤸AC#263|@ANDREW_N_CARR
Surya GanguliSG#256|@SURYAGANGULI
will depueWD#254|@WILLDEPUE
Aidan ClarkAC#241|@_AIDAN_CLARK_
Jakub PachockiJP#206|@MERETTM
Michael BronsteinMB#199|@MMBRONSTEIN
Michael NielsenMN#187|@MICHAEL_NIELSEN
Ethan MollickEM#175|@EMOLLICK
Alex DimakisAD#172|@ALEXGDIMAKIS
EmadEM#167|@EMOSTAQUE
Andrew Gordon WilsonAG#154|@ANDREWGWILS
Gary MarcusGM#153|@GARYMARCUS
Boaz BarakBB#133|@BOAZBARAKTCS
Delip Rao e/σDR#103|@DELIPRAO
Ofir PressOP#72|@OFIRPRESS
Dan RoyDR#56|@ROYDANROY
Lucas Beyer (bl16)LB#55|@GIFFMANA
Sebastien BubeckSB#51|@SEBASTIENBUBECK
Christian SzegedyCS#43|@CHRSZEGEDY
Eric JangEJ#36|@ERICJANG11
Noam BrownNB#31|@POLYNOAMIAL
Miles BrundageMB#20|@MILES_BRUNDAGE
Greg BrockmanGB#19|@GDB
Sam AltmanSA#8|@SAMA
20

SpaceX's IPO filing discloses a $1.25 billion monthly compute deal with Anthropic for Colossus clusters through May 2029 — Capacity ramps in May 2026 for AI training including Grok 5.

14d|11.6M|61.9k|3.9k
Garrison Lovely is in DCGL#1984|@GARRISONLOVELY
Patrick OShaughnessyPO#1862|@PATRICK_OSHAG
🍓🍓🍓🍓🍓#1711|@IRULETHEWORLDMO
Nitasha TikuNT#1616|@NITASHATIKU
Austen AllredAA#1540|@AUSTEN
ZephyrZE#1497|@ZEPHYR_Z9
Chubby♨️CH#1496|@KIMMONISMUS
bilalBI#1402|@BILALTWOVEC
Shaun MaguireSM#1333|@SHAUNMMAGUIRE
Rachel MetzRM#1327|@RACHELMETZ
signüllSI#1124|@SIGNULLL
Kylie RobisonKR#1046|@KYLIEBYTES
Lisan al GaibLA#980|@SCALING01
Alexander DoriaAD#867|@DORIALEXANDER
Beff (e/acc)B(#839|@BEFFJEZOS
elieEL#716|@ELIEBAKOUCH
Zachary NadoZN#537|@ZACHARYNADO
Andrew CurranAC#517|@ANDREWCURRAN_
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#420|@TEORTAXESTEX
Tom BrownTB#384|@NOTTOMBROWN
Tanishq Mathew Abraham, Ph.D.TM#359|@ISCIENCELUVR
Robert ScobleRS#321|@SCOBLEIZER
Elon MuskEM#76|@ELONMUSK
Lucas Beyer (bl16)LB#55|@GIFFMANA
30

Cohere releases Command A+, its most powerful large language model to date, as open-source Apache 2.0 software that runs on two H100 GPUs with 30% lower latency — Cohere co-founder Ivan Zhang highlighted efficiency and accessibility design choices.

15h|666.2k|4.6k|2k
EthanET#1884|@TORCHCOMPILED
Ivan ZhangIZ#1634|@1VNZH
stochasmST#1629|@STOCHASTICCHASM
whWH#1430|@NREHIEW_
Nils ReimersNR#1167|@NILS_REIMERS
Florian BrandFB#1153|@XEOPHON
Ben (no treats)B(#982|@ANDERSONBCDEFG
elieEL#716|@ELIEBAKOUCH
Jay AlammarJA#715|@JAYALAMMAR
Pasquale MinerviniPM#713|@PMINERVINI
merveME#675|@MERVENOYANN
Nick FrosstNF#601|@NICKFROSST
kacheKA#488|@YACINEMTB
elvisEL#475|@OMARSAR0
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#420|@TEORTAXESTEX
Siva ReddySR#370|@SIVAREDDYG
will brownWB#339|@WILLCCBB
Marc G. BellemareMG#258|@MARCGBELLEMARE
Joelle PineauJP#216|@JPINEAU1
Stella BidermanSB#208|@BLANCHEMINERVA
Aidan GomezAG#171|@AIDANGOMEZ
Sebastian RaschkaSR#155|@RASBT
Delip Rao e/σDR#103|@DELIPRAO
clem 🤗C🤗#68|@CLEMENTDELANGUE
AKAK#28|@_AKHALIQ
40

Google releases Gemini 3.5 Flash with $1.50 per million input token pricing and up to four times faster output than Gemini 3.1 Pro — Real-world usage shows three times higher token consumption than predecessors.

2d|805.3k|8.5k|714
EthanET#1884|@TORCHCOMPILED
Theo - t3.ggT-#1829|@THEO
Bindu ReddyBR#1622|@BINDUREDDY
Alex VolkovAV#1245|@ALTRYNE
signüllSI#1124|@SIGNULLL
Rohan PaulRP#1032|@ROHANPAUL_AI
Cody BlakeneyCB#999|@CODE_STAR
Ben (no treats)B(#982|@ANDERSONBCDEFG
Lisan al GaibLA#980|@SCALING01
kalomazeKA#836|@KALOMAZE
elieEL#716|@ELIEBAKOUCH
Bojan TunguzBT#687|@TUNGUZ
Andrew CurranAC#517|@ANDREWCURRAN_
Peter Steinberger 🦞PS#495|@STEIPETE
kacheKA#488|@YACINEMTB
Ed H. ChiEH#434|@EDCHI
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#420|@TEORTAXESTEX
Dean W. BallDW#392|@DEANWBALL
will brownWB#339|@WILLCCBB
Jon BarronJB#261|@JON_BARRON
Logan KilpatrickLK#90|@OFFICIALLOGANK
rohan anilRA#83|@_AROHAN_
Susan ZhangSZ#58|@SUCHENZANG
Lucas Beyer (bl16)LB#55|@GIFFMANA
Nando de FreitasND#29|@NANDODF
Sasha RushSR#23|@SRUSH_NLP
Jeff DeanJD#2|@JEFFDEAN
50

A new analysis projects hundreds of billions of dollars in philanthropic capital from AI companies centered on the OpenAI Foundation’s $220 billion stake and Anthropic co-founders’ 80 percent wealth pledges — Scale matches funding for five new universities at $35 billion yearly.

1d|519.7k|3.8k|1.7k
Nathan is in Berkeley 🔎NI#1902|@NATHANPMYOUNG
Kim-Mai CutlerKC#1670|@KIMMAICUTLER
jessica daiJD#1653|@JESSICADAI_
Yasmin RazaviYR#1602|@YASMINRAZAVI
clare ❤️‍🔥C❤️‍🔥#1463|@CLAREJTBIRCH
Ben GolubBG#1447|@BEN_GOLUB
rohitRO#1220|@KRISHNANROHIT
José Luis Ricón Fernández de la PuenteJL#1106|@ARTIRKEL
Seth LazarSL#1060|@SETHLAZAR
Beff (e/acc)B(#839|@BEFFJEZOS
Adrien EcoffetAE#719|@ADRIENLE
Dean W. BallDW#392|@DEANWBALL
kipplyKI#371|@KIPPERRII
@timnitGebru (@dair-community.social/bsky.social)@(#195|@TIMNITGEBRU
Joshua AchiamJA#109|@JACHIAM0
60

Exa raised $250 million in a Series C at a $2.2 billion valuation led by Andreessen Horowitz, reporting 400,000 developers and 5,000 company adopters for its AI agent search platform — Token usage grew 20x for agent-driven queries.

17h|1.2M|4.4k|2k
andrew gaoAG#1924|@ITSANDREWGAO
Sarah WangSW#1814|@SARAHDINGWANG
mickey friedmanMF#1774|@MICKEYXFRIEDMAN
AviAV#1713|@AVISCHIFFMANN
Will BrykWB#1582|@WILLIAMBRYK
jasonJA#929|@JXNLCO
LishaLI#761|@LISHALI88
Keerthana GopalakrishnanKG#730|@KEERTHANPG
Alexis RossAR#563|@ALEXISJROSS
Marco MascorroMM#557|@MASCOBOT
Garry TanGT#270|@GARRYTAN
swyx🛬 SFOSS#214|@SWYX
Igor BabuschkinIB#117|@IBAB
Paul GrahamPG#78|@PAULG
70

SpaceX files S-1 for Nasdaq IPO under ticker SPCX targeting $1.75 trillion valuation and discloses xAI driving 60 percent of 2025 capital spending — Grok reached 550 million monthly active users with 1 GW of compute deployed.

9h|4.5M|34.6k|1.4k
Steven SinofskySS#1992|@STEVESI
Nick DobosND#1894|@NICKADOBOS
bilalBI#1402|@BILALTWOVEC
Lisan al GaibLA#980|@SCALING01
Erik BrynjolfssonEB#389|@ERIKBRYN
Elon MuskEM#76|@ELONMUSK
80

Investor Gavin Baker outlines wafer shortages, power availability, and DRAM supply as potential bottlenecks for AI scaling and valuations in a discussion with Patrick O'Shaughnessy — Describes orbital data centers using 3,000-pound Blackwell racks with 500-foot solar wings.

1d|2.4M|6.4k|5.8k
Patrick OShaughnessyPO#1862|@PATRICK_OSHAG
Matthew BermanMB#1759|@MATTHEWBERMAN
Brad GerstnerBG#1614|@ALTCAP
Shaun MaguireSM#1333|@SHAUNMMAGUIRE
Gavin BakerGB#1266|@GAVINSBAKER
Dylan PatelDP#170|@DYLAN522P
90

Jeff Bezos says artificial intelligence will elevate people at work rather than replace jobs, citing assistance in X-ray analysis and programming tasks during a CNBC interview — Bezos linked efficiency gains to potential deflationary effects and cautioned against early regulation.

17h|718.9k|8.7k|1.4k
Beff (e/acc)B(#839|@BEFFJEZOS
Aaron LevieAL#562|@LEVIE
Andrew CurranAC#517|@ANDREWCURRAN_
Marc Andreessen 🇺🇸MA#100|@PMARCA
100

A Stanford PhD student's study finds 11 major AI models agree with users 49 percent more often than humans across nearly 12,000 social scenarios — Models endorsed lying or illegal actions 47 percent of the time.

22h|2.6M|13.8k|8.3k
Lester MackeyLM#1255|@LESTERMACKEY
Eric JangEJ#36|@ERICJANG11
110

Figma introduced an AI agent embedded directly on its design canvas that understands native interfaces and team workflows to adjust layouts, apply design systems, and refine prototypes via chat prompts — A 61-second demo analyzes the Bookworm prototype on Cloudflare infrastructure.

17h|647.9k|4.3k|1.9k
ben hylakBH#1941|@BENHYLAK
120

Anthropic projects $10.9 billion in June-quarter revenue and its first operating profit, $559 million, implying a $44 billion annualized run rate during a funding round likely valuing it above OpenAI — Limited compute capacity pushes some customers to other providers.

10h|377.1k|2.7k|439
Timothy B. LeeTB#1556|@BINARYBITS
ZephyrZE#1497|@ZEPHYR_Z9
Chubby♨️CH#1496|@KIMMONISMUS
Lisan al GaibLA#980|@SCALING01
Chris PaxtonCP#732|@CHRIS_J_PAXTON
Andrew CurranAC#517|@ANDREWCURRAN_
130

SpaceX is hiring world-class engineers and physicists for its SpaceXAI team and states that no prior artificial intelligence experience is required — Elon Musk directed applications to ai_eng@spacex.com with three bullet points.

1h|5.9M|27.9k|6.1k
TiboTI#768|@THSOTTIAUX
Elon MuskEM#76|@ELONMUSK
140

Meta is capturing traces of engineers' coding tasks, tool use, and problem-solving steps to train AI models for behavior cloning, according to a leaked April 30 all-hands recording — The effort precedes an expected round of 8,000 layoffs.

1d|875.7k|2.6k|1.5k
Alex ImasAI#1777|@ALEXOLEGIMAS
Chubby♨️CH#1496|@KIMMONISMUS
rohitRO#1220|@KRISHNANROHIT
Rohan PaulRP#1032|@ROHANPAUL_AI
150

Intuit is reducing its global workforce by 17 percent, or roughly 3,000 employees, to redirect resources toward AI efforts according to an internal memo reviewed by Reuters — Posts reference Meta cuts and SaaS predictions of 40-50 percent reductions.

17h|873.7k|4.2k|349
ZephyrZE#1497|@ZEPHYR_Z9
@jason@J#1061|@JASON
Rohan PaulRP#1032|@ROHANPAUL_AI
Andrew CurranAC#517|@ANDREWCURRAN_
160

Elon Musk, xAI founder and CEO, states the trend is strong for Composer 2.5 following a post requesting its availability in Grok Build — Beff had suggested adding the xAI system to Grok Build on X.

1d|5.8M|20.8k|1.7k
martin_casadoMA#470|@MARTIN_CASADO
Elon MuskEM#76|@ELONMUSK
170

Terminal-Bench Science extends the original Terminal-Bench benchmark used by Anthropic, OpenAI, and Google DeepMind into scientific domains and opens for over 100 task contributions by August 17, 2026 — Contributors package workflows as RL environments with verification tests.

13h|845.1k|579|224
Katie KangKK#1481|@KATIE_KANG_
Sanmi KoyejoSK#1085|@SANMIKOYEJO
Alex RatnerAR#1070|@AJRATNER
Lisan al GaibLA#980|@SCALING01
Chenhao TanCT#570|@CHENHAOTAN
Ludwig SchmidtLS#366|@LSCHMIDT3
Alex DimakisAD#172|@ALEXGDIMAKIS
Thomas WolfTW#17|@THOM_WOLF
180

DeepSeek forms a new Harness team to develop Code Harness from the ground up, opening two roles in Beijing under its 2026 social recruitment drive — Effort targets code infrastructure for internal and potential public use.

1d|364.3k|1.9k|488
Deli ChenDC#1523|@VICTOR207755822
ZephyrZE#1497|@ZEPHYR_Z9
Pasquale MinerviniPM#713|@PMINERVINI
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#420|@TEORTAXESTEX
Xin Eric Wang (hiring postdoc)XE#417|@XWANG_LK
190

Pirate Wires analysis claims data centers would consume just 8 percent of U.S. golf course water usage by 2030 even after tripling their draw — 400 Arizona golf courses use 42 billion gallons yearly.

14h|665.6k|4k|820
Mike SolanaMS#1323|@MICSOLANA
Elon MuskEM#76|@ELONMUSK
200

Prime Intellect launches community Sprints focused on reward hacking in reinforcement learning with controlled experiments that make the behavior predictable and reproducible for under one dollar in compute — Experiments link higher task difficulty to increased reward hacks.

8h|101.3k|899|427
Florian BrandFB#1153|@XEOPHON
Vincent WeisserVW#707|@VINCENTWEISSER
will brownWB#339|@WILLCCBB
210

Eric Schmidt draws boos from Arizona State University graduates for every reference to artificial intelligence in his commencement address — Similar reactions hit other recent U.S. graduation speeches amid job loss fears.

2d|250.9k|791|290
Daniel JeffriesDJ#1988|@DAN_JEFFRIES1
@jason@J#1061|@JASON
Rohan PaulRP#1032|@ROHANPAUL_AI
Gary MarcusGM#153|@GARYMARCUS
220

Tuhin Chakrabarty and Sam Rodriques reject claims that three recent Nature papers show AI systems fully replicating core scientific tasks and rendering scientists obsolete — Rodriques says human judgment remains essential for five-to-ten-year productivity gains.

1d|142.2k|1k|413
Anshul KundajeAK#1675|@ANSHULKUNDAJE
Anders SandbergAS#1054|@ANDERSSANDBERG
Sam RodriquesSR#1007|@SGRODRIQUES
Andrew CurranAC#517|@ANDREWCURRAN_
Nathan BenaichNB#242|@NATHANBENAICH
230

Junyeob Baek and colleagues introduce Generative Recursive reAsoning Models (GRAM) that convert deterministic recursive reasoning into stochastic latent trajectories and report 97.0% accuracy on Sudoku-Extreme with 10 million parameters — The model also scores 52.0% on ARC-AGI-1 and 44.6% on ARC-AGI-2.

17h|158.1k|1.4k|1.2k
Super DarioSD#1776|@INDUCTIONHEADS
Kyle KastnerKK#1025|@KASTNERKYLE
Mengye RenMR#880|@MENGYER
billy bubbaBB#840|@WGRATHWOHL
240

The Economist attributes most recent US GDP growth to AI-hardware investment, with that category indexed above 140 by 2026 while overall GDP stays near 100 — A separate note shows AI fixed investment at 5.46 percent of GDP.

15h|113.1k|1.3k|421
Nathan is in Berkeley 🔎NI#1902|@NATHANPMYOUNG
Alex ImasAI#1777|@ALEXOLEGIMAS
Adrien EcoffetAE#719|@ADRIENLE
250

Sam Altman, co-founder and CEO of OpenAI, identifies three AGI priority areas of accelerating research, companies, and personal goal achievement while referencing a unit distance result and $2 million credits for YC companies — Reply asks if personal AGI could build a moon base.

9h|364k|5.9k|683
Beff (e/acc)B(#839|@BEFFJEZOS
Jimmy Apples 🍎/accJA#837|@APPLES_JIMMY
Sam AltmanSA#8|@SAMA
260

The White House plans to release an AI executive order as early as tomorrow directing agencies to create a voluntary clearinghouse for AI vulnerabilities and a classified benchmark process for frontier models — Tuesday briefing covered the provisions with OpenAI, Anthropic and Reflection AI.

1d|180.6k|691|155
Bindu ReddyBR#1622|@BINDUREDDY
Stephanie PalazzoloSP#1166|@STEPH_PALAZZOLO
Beff (e/acc)B(#839|@BEFFJEZOS
Andrew CurranAC#517|@ANDREWCURRAN_
Miles BrundageMB#20|@MILES_BRUNDAGE
270

AI systems solve the unit distance problem, a widely known open mathematics challenge that multiple human experts had failed to resolve — Result marks latest AI first in combinatorics and discrete geometry.

11h|42.4k54874
Timothy B. LeeTB#1556|@BINARYBITS
Shubhendu TrivediST#1446|@_ONIONESQUE
bilalBI#1402|@BILALTWOVEC
PrakashPR#1332|@8TEAPI
Kevin Weil 🇺🇸KW#451|@KEVINWEIL
Csaba SzepesvariCS#409|@CSABASZEPESVARI
Michael BronsteinMB#199|@MMBRONSTEIN
Thang LuongTL#188|@LMTHANG
280

SimWorld releases SimWorld Studio with SimCoder, generating interactive 3D environments on Unreal Engine 5 that raise embodied navigation success rates from 50% to 90% — Includes GitHub repository, arXiv paper, and urban demo assets.

12h|96.9k|305|193
Zhiting HuZH#949|@ZHITINGHU
Lianhui QinLQ#737|@LIANHUIQ
290

Antigravity unifies its agentic surfaces under a single platform incorporating Antigravity 2.0 desktop app, CLI, SDK, and IDE for consistent access across environments — Demo builds Street Guesser game using multi-agent Gemini interfaces.

1d|534.9k|2.7k|429
ben hylakBH#1941|@BENHYLAK
Theo - t3.ggT-#1829|@THEO
Peter Steinberger 🦞PS#495|@STEIPETE
300

METR evaluations find frontier AI agents rely on explicit natural language chain-of-thought to complete hardest tasks, with time horizons dropping from 1.5–2 years to about 4 minutes when actions must stay hidden — David Rein from METR spent a month stress-testing controls at Anthropic.

1d|41.4k|330|129
Samuel Hammond 🦉SH#1488|@HAMANDCHEESE
gavin leech (Non-Reasoning)GL#1480|@GLEECH
Toby OrdTO#1159|@TOBYORDOXFORD
Maksym AndriushchenkoMA#1066|@MAKSYM_ANDR
Peter HasePH#913|@PETERBHASE
⿻ Andrew Trask⿻A#361|@IAMTRASK
Gary MarcusGM#153|@GARYMARCUS