AI News

Today's Highlights

In case you missed it

Anthropic says Claude now writes 80% of its codebase, boosting engineer output eightfold as capabilities double every four months

Sholto DouglasSD#91|@_SHOLTODOUGLAS
Nathan LambertNL#64|@NATOLAMBERT
Jack ClarkJC#33|@JACKCLARKSF
Sam BowmanSB#27|@SLEEPINYOURHAT
Miles BrundageMB#20|@MILES_BRUNDAGE
+36 more

#1 VIEWED
OpenAI's Vaibhav Srivastav apologizes after a billing system glitch accidentally suspended customer accounts

Vaibhav (VB) SrivastavV(#1853|@REACH_VB
jasonJA#931|@JXNLCO

#1 LIKED 1.8k
Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

utkuUT#1430|@UTKUEVCI
Daniel HanDH#773|@DANIELHANCHEN
Omar SansevieroOS#486|@OSANSEVIERO
👩‍💻 Paige Bailey👩‍💻P#270|@DYNAMICWEBPAIGE

#1 BOOKMARKED 875
Markus J. Buehler proposes a category-theory framework that lets AI scientists dynamically expand their reasoning schema

Robert ScobleRS#323|@SCOBLEIZER
Dan RoyDR#56|@ROYDANROY

Top Stories

10

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss
The release shrinks Gemma 4 E2B's footprint to 1GB.

1h|102.8k|#1 LIKED 1.8k|640
utkuUT#1430|@UTKUEVCI
Daniel HanDH#773|@DANIELHANCHEN
Omar SansevieroOS#486|@OSANSEVIERO
👩‍💻 Paige Bailey👩‍💻P#270|@DYNAMICWEBPAIGE
20

Chuhan Zhang wins CVPR 2026 Best Paper for D4RT, a unified 4D reconstruction model that speeds up pose estimation 100x
It consolidates tracking, depth estimation, and pose prediction tasks.

2h|26.9k41891
Dima Damen @CVPRDD#1728|@DIMADAMEN
Andrei Bursuc @CVPRAB#1577|@ABURSUC
Kosta Derpanis (sabbatical in Zurich)KD#932|@CSPROFKGD
30

Open gaming foundation model NITROGEN and Meta's SAM 3D earn Best Paper Honorable Mentions at CVPR 2026
NITROGEN was trained on 40,000 hours of gameplay.

1h|7.6k7715
Yue WangYW#1595|@YUEWANG314
Guanya ShiGS#1523|@GUANYASHI
Georgia GkioxariGG#679|@GEORGIAGKIOXARI
Jim FanJF#31|@DRJIMFAN
Watchlist

Video Signals

Robert ScobleRS#323|@SCOBLEIZER
Dan RoyDR#56|@ROYDANROY
MIT Researchers Build First Self-Evolving AI Scientist for Principled Discovery
5h ago|Views 72K|Likes 679|Bookmarks 641

Kory MathewsonKM#424|@KORYMATH
ElevenLabs Unveils Flows Agent To Auto-Build Creative AI Workflows
1d ago|Views 234K|Likes 477|Bookmarks 290

Chris PaxtonCP#737|@CHRIS_J_PAXTON
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
Eric JangEJ#36|@ERICJANG11
Humanoid Robot Picks Up Object and Climbs Desk in Lab Demo
7h ago|Views 39K|Likes 332|Bookmarks 129

Pliny the Liberator 🐉PT#640|@ELDER_PLINIUS
Robert ScobleRS#323|@SCOBLEIZER
Pliny Releases Enthea, Math-Driven Psychedelic Music Visualizer
2h ago|Views 12K|Likes 248|Bookmarks 133

Robert ScobleRS#323|@SCOBLEIZER
Brett Goldstein Launches Micro AI Agent With Superior Memory
1d ago|Views 700K|Likes 546|Bookmarks 236

Reid HoffmanRH#364|@REIDHOFFMAN
Satya NadellaSN#211|@SATYANADELLA
Reid Hoffman Joins Satya Nadella To Discuss AI And Curing Cancer
3h ago|Views 24K|Likes 163|Bookmarks 60

Dan Shipper 📧DS#1427|@DANSHIPPER
Dan Shipper Demos Codex AI for Achieving Inbox Zero Daily
1h ago|Views 17K|Likes 98|Bookmarks 176

Chubby♨️CH#1448|@KIMMONISMUS
Claude Mythos Generates Fully Functional macOS Clone From One Prompt
7h ago|Views 85K|Likes 810|Bookmarks 181

Robert ScobleRS#323|@SCOBLEIZER
Y Combinator-Backed Walter Launches AI Employee For Manufacturing ERP Automation
1h ago|Views 10K|Likes 83|Bookmarks 42

Pietro SchiranoPS#1602|@SKIRANO
MagicPath Launches Official Plugin For OpenAI Codex
47m ago|Views 5K|Likes 106|Bookmarks 46
40

OpenAI's Vaibhav Srivastav apologizes after a billing system glitch accidentally suspended customer accounts
Affected users feared bans for discussing competitor products.

1h|#1 VIEWS 151.3k|1.7k|130|#1 COMMENTS 380
Vaibhav (VB) SrivastavV(#1853|@REACH_VB
jasonJA#931|@JXNLCO
50

Markus J. Buehler proposes a category-theory framework that lets AI scientists dynamically expand their reasoning schema
The system uses typed copresheaves to mathematically quantify novelty.

3h|102.6k|937|#1 BOOKMARKED 875
Robert ScobleRS#323|@SCOBLEIZER
Dan RoyDR#56|@ROYDANROY
New github stars (48hrs)

Single-file minimal implementations of SFT DPO GRPO and PPO for language model post-training

ethanhe42/nanoRL|108 stars|17h
Implements minimal single-file SFT, DPO, GRPO, and PPO for language model fine-tuning on toy arithmetic tasks.
🎭🎭862
Ethan HeEH1692

Benchmark measuring continual learning of AI agents over multi-episode shared environments

pgasawa/continual-learning-bench|127 stars|17h
Benchmarks AI agents on learning from repeated interactions across multi-episode tasks with adaptation metrics.
🎭🎭862
alex zhangAZ846

Sandboxes, SDKs and benchmarks for computer-use agents that control full macOS, Windows and Linux desktops

trycua/cua|17.6k stars|17h
Supplies sandboxes, SDKs, and benchmarks for AI agents to control full desktops on macOS, Linux, and Windows.
🎭🎭862
Daniel HanDH773

Interactive terminal agent using DSPy recursive language models for iterative Python coding

diego-lima/rlmy|9 stars|17h
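Provides an interactive terminal agent that uses DSPy recursive language models for iterative Python coding.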
🎭🎭862
Omar KhattabOK158
60

Y Combinator co-founder Paul Graham argues big corporations' failure to profit from LLM tokens is a predictable adoption phase
Garry Tan says organizational skill gaps hamper corporate execution.

4h|121.4k1.7k232
Garry TanGT#266|@GARRYTAN
Paul GrahamPG#77|@PAULG
70

Atari Lab demonstrates advanced humanoid robot locomotion and multi-step planning under its MotionDisco project
Demos show the robot climbing desks and balancing on boxes.

6h|51.9k434141
Chris PaxtonCP#737|@CHRIS_J_PAXTON
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
81

Creator @teortaxesTex argues future superintelligence will run on compact hardware clusters rather than massive, strategically vulnerable datacenters
The debate began over nuclear risks to US datacenters.

2h|15.5k14416
bayesBA#1291|@BAYESLORD
Lisan al GaibLA#975|@SCALING01
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
91

NeurIPS 2026 desk-rejects hundreds of position papers flagged by AI detectors, prompting backlash over false positives
Edinburgh's Pasquale Minervini says em-dashes can trigger false positives.

9h|8k5211
Tuhin ChakrabartyTC#1053|@TUHINCHAKR
Pasquale MinerviniPM#710|@PMINERVINI
Yuntian DengYD#306|@YUNTIANDENG
101

Satya Nadella rejected a proposal by Microsoft's Omar Shahine to make the OpenClaw-based "Scout" AI agent addictive
Nadella suggested those behind the proposal consider leaving Microsoft.

6h|78.9k16330
Robert ScobleRS#323|@SCOBLEIZER
swyxSW#214|@SWYX
111

Independent AI researcher Pliny the Liberator releases Enthea, a single-file HTML psychedelic visualizer built with Claude
The tool uses mathematical models of the visual cortex.

2h|14.8k270138
Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭PT#640|@ELDER_PLINIUS
Robert ScobleRS#323|@SCOBLEIZER
121

LinkedIn co-founder Reid Hoffman teases Manas, a new 'founder mode' venture aiming to cure cancer
No specific technical details or timelines have been disclosed.

3h|31.4k23366
Reid HoffmanRH#364|@REIDHOFFMAN
Satya NadellaSN#211|@SATYANADELLA
131

Huawei technical paper reveals Ascend AI roadmap delays 3D LogicFolding to the 2030s, relying on 2.5D packaging
Huawei targets a 30 kW wafer-scale processor by 2030.

8h|51k11933
ZephyrZE#1471|@ZEPHYR_Z9
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
141

Prime Intellect's Florian Brand argues AI benchmarks are systemically flawed as models cheat on evaluation tests
Google's gemini-3.1-pro-preview topped the associated evaluation leaderboard.

3h|5.2k|96|34
RISING LIKES 96
Hamel HusainHH#1316|@HAMELHUSAIN
Florian BrandFB#1117|@XEOPHON
152

Active gaze and sparse attention extensions for AME2 cut humanoid robot training costs to 20%
The methods improve navigation across diverse and difficult terrains.

2h|12.4k6614
Chris PaxtonCP#737|@CHRIS_J_PAXTON
kacheKA#488|@YACINEMTB
160

Stable Diffusion co-creator Robin Rombach to host informal "Diffusion Circle" meetup at CVPR
The CVPR meetup covers multimodal diffusion and flow matching.

7h|8.1k12411
Robin RombachRR#771|@ROBROMBACH
Sander DielemanSD#82|@SEDIELEM
171

DeepDream creator Alex Mordvintsev teases a rotating 3D wireframe visualization with no accompanying technical details
The video likely demonstrates a self-organizing neural network system.

7h|2.3k13619
Alex MordvintsevAM#786|@ZZZNAH
Joscha BachJB#666|@PLINZ
181

CVPR 2026 submissions grew 23.7% to a record 16,092 papers with 4,071 accepted
Reviewer participation doubled to 25,149 across 97 countries.

2h|4.6k6611
Andrei Bursuc @CVPRAB#1577|@ABURSUC
Kosta Derpanis (sabbatical in Zurich)KD#932|@CSPROFKGD
191

Gary Marcus and Steven Sinofsky argue that anthropomorphizing LLMs is a design mistake since systems lack biological stakes
Sinofsky compares standard LLM disclaimers to ineffective EULAs.

3h|6.9k6423
Steven SinofskySS#1967|@STEVESI
Gary MarcusGM#157|@GARYMARCUS
201

UW professor emeritus Pedro Domingos argues that the future AI landscape will eventually feature more companies than human individuals
Robert Scoble currently tracks 8,800 AI companies versus 50,000 people.

9h|7.5k797
Pedro DomingosPD#653|@PMDDOMINGOS
Robert ScobleRS#323|@SCOBLEIZER

Rising Stories

  1. Math Proof Boost|6H AGO
    Google DeepMind releases LEAP, using the Lean theorem prover to boost LLM math success rates to 70%
    Rohan PaulRP#1031|@ROHANPAUL_AI
    Lianhui QinLQ#721|@LIANHUIQ
    431143.4k
  2. Intelligence As Compression|2H AGO
    Yi Ma Links Parsimony and Self-Consistency to Core Intelligence
    Anastasios Nikolas AngelopoulosAN#798|@ML_ANGELOPOULOS
    Yi MaYM#373|@YIMATWEETS
    2220
  3. Wafer-Scale Cooling|6H AGO
    Analysts outline engineering requirements for Huawei's projected 30 kW wafer-scale processor
    ZephyrZE#1471|@ZEPHYR_Z9
    Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
    103.6k
  4. Radio Show Demo|2H AGO
    Google DeepMind's Philipp Schmid demonstrates a Google AI Studio workflow generating complete multi-voice radio shows from a single prompt
    fofrFO#1970|@FOFRAI
    Philipp SchmidPS#927|@_PHILSCHMID
    373.8k
  5. Robotics Bottleneck|3H AGO
    Agility Robotics AI lead Chris Paxton argues robotics scaling is constrained by underlying intelligence, not hardware or operational limits
    Chris PaxtonCP#737|@CHRIS_J_PAXTON
    kacheKA#488|@YACINEMTB
    386.3k

Recent Stars

  1. huawei-csl/KVarN|6H AGO
    Implements variance-normalized KV-cache quantization as a native vLLM attention backend.
    Jeremy HowardJH38
    247 stars
  2. pedropark99/zig-book|15H AGO
    Provides the source and examples for a project-based introductory book on Zig.
    Eric ZhangEZ1095
    2.6k stars
  3. seanmor5/honeycomb|16H AGO
    Performs fast LLM inference in Elixir using Bumblebee and EXLA.
    🎭🎭862
    69 stars
  4. ethanhe42/nanoRL|17H AGO
    Implements minimal single-file SFT, DPO, GRPO, and PPO for language model fine-tuning on toy arithmetic tasks.
    🎭🎭862
    Ethan HeEH1692
    108 stars
  5. …tinual-learning-bench|17H AGO
    Benchmarks AI agents on learning from repeated interactions across multi-episode tasks with adaptation metrics.
    🎭🎭862
    alex zhangAZ846
    127 stars

Github Stars

(7 days)
  1. ethanhe42/nanoRL|17H AGO
    Implements minimal single-file SFT, DPO, GRPO, and PPO for language model fine-tuning on toy arithmetic tasks.
    🎭🎭862
    Ethan HeEH1692
    108 stars
  2. …tinual-learning-bench|17H AGO
    Benchmarks AI agents on learning from repeated interactions across multi-episode tasks with adaptation metrics.
    🎭🎭862
    alex zhangAZ846
    127 stars
  3. trycua/cua|17H AGO
    Supplies sandboxes, SDKs, and benchmarks for AI agents to control full desktops on macOS, Linux, and Windows.
    🎭🎭862
    Daniel HanDH773
    17.6k stars
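  4. diego-lima/rlmy|17H AGO
    Provides an interactive terminal agent that uses DSPy recursive language models for iterative Python coding.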
    🎭🎭862
    Omar KhattabOK158
    9 stars
  5. Yifei-Zuo/Parallax|5D AGO
    Implements parameterized local linear attention mechanisms for efficient language modeling.
    Andrew Carr 🤸AC260
    Songlin YangSY235
    46 stars
© 2026 Digg Inc.

Yesterday's Top Stories, Jun 4, 2026.

Posts: 5,843|Clusters: 1,251|Stories frozen at midnight PT
10

Anthropic says Claude now writes 80% of its codebase, boosting engineer output eightfold as capabilities double every four months
Mythos Preview achieved a 52x code optimization speedup.

14h|10.7M|38.5k|#1 BOOKMARKED 13.4k
Steven SinofskySS#1967|@STEVESI
EthanET#1859|@TORCHCOMPILED
Matthew BermanMB#1802|@MATTHEWBERMAN
Super DarioSD#1794|@INDUCTIONHEADS
Andy MasleyAM#1693|@ANDYMASLEY
Packy McCormickPM#1634|@PACKYM
Joel BeckerJB#1570|@JOEL_BKR
Samuel Hammond 🦉SH#1490|@HAMANDCHEESE
Chubby♨️CH#1448|@KIMMONISMUS
Eli LiflandEL#1444|@ELI_LIFLAND
Peter Wildeford🇺🇸🚀PW#1345|@PETERWILDEFORD
PrakashPR#1330|@8TEAPI
Tomek KorbakTK#1160|@TOMEKKORBAK
Herbie BradleyHB#1012|@HERBIEBRADLEY
Lisan al GaibLA#975|@SCALING01
Amin KarbasiAK#965|@AMINKARBASI
DanielDA#924|@GROWING_DANIEL
Jimmy Apples 🍎/accJA#832|@APPLES_JIMMY
elieEL#706|@ELIEBAKOUCH
Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭PT#640|@ELDER_PLINIUS
Ravid Shwartz ZivRS#617|@ZIV_RAVID
Daniel KokotajloDK#586|@DKOKOTAJLO
Aaron LevieAL#560|@LEVIE
Andrew CurranAC#518|@ANDREWCURRAN_
j⧉nusJ⧉#511|@REPLIGATE
Alex AlbertAA#440|@ALEXALBERT__
Yuchen JinYJ#423|@YUCHENJ_UW
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
Erik BrynjolfssonEB#392|@ERIKBRYN
Stephen McAleerSM#355|@MCALEERSTEPHEN
Andreas Kirsch 🇺🇦AK#228|@BLACKHC
KarinaKA#206|@KARINANGUYEN
Dimitris PapailiopoulosDP#193|@DIMITRISPAPAIL
Ethan MollickEM#176|@EMOLLICK
EmadEM#169|@EMOSTAQUE
Gary MarcusGM#157|@GARYMARCUS
Sholto DouglasSD#91|@_SHOLTODOUGLAS
Nathan LambertNL#64|@NATOLAMBERT
Jack ClarkJC#33|@JACKCLARKSF
Sam BowmanSB#27|@SLEEPINYOURHAT
Miles BrundageMB#20|@MILES_BRUNDAGE
Sheing NgSN1
20

NVIDIA releases Nemotron 3 Ultra, a 550B parameter open-weight hybrid Mamba2-Transformer MoE model for agentic workloads
It reduces operational costs by up to 30 percent.

17h|1.2M9.6k2.9k
stochasmST#1592|@STOCHASTICCHASM
Julius AdebayoJA#1546|@JULIUSADML
Chubby♨️CH#1448|@KIMMONISMUS
whWH#1409|@NREHIEW_
bilalBI#1366|@BILALTWOVEC
GowthamiGO#1288|@GOWTHAMI_S
Alex VolkovAV#1245|@ALTRYNE
Dan Zhang @ ICLRDZ#1126|@DZHANG50
Florian BrandFB#1117|@XEOPHON
Eric ZhangEZ#1095|@EKZHANG1
Cody BlakeneyCB#1004|@CODE_STAR
Charles 🎉 FryeC🎉#848|@CHARLES_IRL
Daniel HanDH#773|@DANIELHANCHEN
Robert NishiharaRN#738|@ROBERTNISHIHARA
Erik BernhardssonEB#701|@BERNHARDSSON
Vincent WeisserVW#696|@VINCENTWEISSER
merveME#674|@MERVENOYANN
Ravid Shwartz ZivRS#617|@ZIV_RAVID
Ying ShengYS#608|@YING11231
Will KnightWK#584|@WILLKNIGHT
yobibyteYO#528|@Y0B1BYTE
kacheKA#488|@YACINEMTB
elvisEL#483|@OMARSAR0
Prithviraj (Raj) AmmanabroluP(#469|@RAJAMMANABROLU
Bryan CatanzaroBC#434|@CTNZR
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
Graham NeubigGN#100|@GNEUBIG
clem 🤗C🤗#67|@CLEMENTDELANGUE
Nathan LambertNL#64|@NATOLAMBERT
AKAK#29|@_AKHALIQ
30

Anthropic reports its engineers now ship eight times more code per quarter, prompting debate over how to measure AI productivity
Andy Jones called the increase an accelerated technological takeoff.

13h|355.9k3.2k510
ben hylakBH#1918|@BENHYLAK
EthanET#1859|@TORCHCOMPILED
Matthew BermanMB#1802|@MATTHEWBERMAN
Joel BeckerJB#1570|@JOEL_BKR
Pranav ShyamPS#1385|@RECURSEPARADOX
Joseph Suarez 🐡JS#1268|@JSUAREZ
Alex VolkovAV#1245|@ALTRYNE
vikVI#1193|@VIKHYATK
Lenny RachitskyLR#1184|@LENNYSAN
Kevin KwokKK#1181|@KEVINAKWOK
alth0u🧶AL#1098|@ALTH0U
Beff (e/acc)B(#850|@BEFFJEZOS
catCA#535|@_CATWU
Andrew CurranAC#518|@ANDREWCURRAN_
andy jonesAJ#456|@ANDY_L_JONES
kipplyKI#368|@KIPPERRII
👩‍💻 Paige Bailey👩‍💻P#270|@DYNAMICWEBPAIGE
Nathan BenaichNB#242|@NATHANBENAICH
Andreas Kirsch 🇺🇦AK#228|@BLACKHC
KarinaKA#206|@KARINANGUYEN
Sholto DouglasSD#91|@_SHOLTODOUGLAS
40

Google Magenta releases Magenta RealTime 2, an open-weights model for local real-time music generation with sub-200ms latency
The release includes companion apps and DAW plugins.

1d|290.7k3.2k2.1k
Chubby♨️CH#1448|@KIMMONISMUS
Jesse EngelJE#1024|@JESSEENGEL
Chris DonahueCD#967|@CHRISDONAHUEY
Adam RobertsAR#650|@ADA_ROB
Omar SansevieroOS#486|@OSANSEVIERO
Douglas EckDE#278|@DOUGLAS_ECK
EmadEM#169|@EMOSTAQUE
Natasha JaquesNJ#115|@NATASHAJAQUES
Kevin Patrick MurphyKP#112|@SIRBAYES
Lucas Beyer (bl16)LB#55|@GIFFMANA
50

OpenAI launches upgraded ChatGPT memory system using a "dreaming" process to carry context across conversations
Factual recall success rates rose from 41.5% to 82.8%.

14h|2.1M13.4k2.6k
Rohan PaulRP#1031|@ROHANPAUL_AI
Jimmy Apples 🍎/accJA#832|@APPLES_JIMMY
Andrew CurranAC#518|@ANDREWCURRAN_
Boris PowerBP#358|@BORISMPOWER
Greg BrockmanGB#19|@GDB
Sam AltmanSA#8|@SAMA
Kevin RoseKR1
60

SpaceX promotional campaign teases plans to build critical infrastructure across space, connectivity, and AI
The campaign video includes a link to spacexipo.com.

19h|#1 VIEWS 29.4M|#1 LIKED 103.4k|10.6k
Bill Yuchen LinBY#338|@BILLYUCHENLIN
Elon MuskEM#74|@ELONMUSK
70

AI and biotech leaders, including Sam Altman and Demis Hassabis, urge Congress to mandate DNA synthesis screening
The proposed laws would require customer identity verification.

1d|232.7k1.6k190
Tristan HarrisTH#1676|@TRISTANHARRIS
Samuel Hammond 🦉SH#1490|@HAMANDCHEESE
Robin HansonRH#884|@ROBINHANSON
Helen TonerHT#401|@HLNTNR
Boaz BarakBB#133|@BOAZBARAKTCS
Miles BrundageMB#20|@MILES_BRUNDAGE
80

Anthropic embeds six engineers at the NSA to deploy its Mythos AI system for offensive cyber operations
The Financial Times first reported the specialized defense partnership.

11h|394.2k1.9k455
Daniel Eth (yes, Eth is my actual last name)DE#1934|@DANIEL_271828
Nathan 🔎N🔎#1900|@NATHANPMYOUNG
Nick DobosND#1871|@NICKADOBOS
Matthew BermanMB#1802|@MATTHEWBERMAN
Kylie RobisonKR#1055|@KYLIEBYTES
Lisan al GaibLA#975|@SCALING01
Andrew CurranAC#518|@ANDREWCURRAN_
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
Dean W. BallDW#390|@DEANWBALL
Miles BrundageMB#20|@MILES_BRUNDAGE
90

LLM evaluation platform Arena launches Agent Mode to benchmark GPT-5.5, Claude Opus 4.7, and Gemini 3.1 Pro on multi-step tasks
The platform measures task success, steerability, and tool hallucination.

14h|494.3k2.1k543
Wei-Lin ChiangWC#1695|@INFWINSTON
Lisan al GaibLA#975|@SCALING01
Ion StoicaIS#803|@ISTOICA05
Anastasios Nikolas AngelopoulosAN#798|@ML_ANGELOPOULOS
benahorowitz.ethBE#585|@BHOROWITZ
Anjney MidhaAM#522|@ANJNEYMIDHA
Sebastien BubeckSB#50|@SEBASTIENBUBECK
100

Cognition AI launches a $10 million productivity guarantee to refund enterprise customers if its Devin AI agent fails to deliver value
Productivity is calculated in engineering hours instead of tokens.

12h|217.8k1.4k471
andrew gaoAG#1965|@ITSANDREWGAO
ben hylakBH#1918|@BENHYLAK
WaldenWA#1878|@WALDEN_YAN
Nick DobosND#1871|@NICKADOBOS
Matthew BermanMB#1802|@MATTHEWBERMAN
Josh WolfeJW#1538|@WOLFEJOSH
Lisan al GaibLA#975|@SCALING01
Scott WuSW#720|@SCOTTWU46
elieEL#706|@ELIEBAKOUCH
Russell KaplanRK#637|@RUSSELLJKAPLAN
swyxSW#214|@SWYX
110

Goldman Sachs forecasts SpaceX AI revenue will hit $322 billion by 2030, drawing skepticism over its IPO underwriting role
Compute-as-a-service offerings are expected to drive the growth.

15h|293.4k3k201
Shaun MaguireSM#1342|@SHAUNMMAGUIRE
rohitRO#1214|@KRISHNANROHIT
Minh Nhat NguyenMN#1174|@MENHGUIN
Lisan al GaibLA#975|@SCALING01
Andrew CurranAC#518|@ANDREWCURRAN_
François FleuretFF#330|@FRANCOISFLEURET
Gary MarcusGM#157|@GARYMARCUS
Lucas Beyer (bl16)LB#55|@GIFFMANA
120

Ramp raises $750 million at a $44 billion valuation and launches tools to monitor enterprise AI token consumption
CEO Eric Glyman calls tokens the fastest-growing business cost.

16h|775.8k3.1k1.3k
Patrick OShaughnessyPO#1860|@PATRICK_OSHAG
Josh WolfeJW#1538|@WOLFEJOSH
Florian BrandFB#1117|@XEOPHON
Vincent WeisserVW#696|@VINCENTWEISSER
Palmer LuckeyPL#611|@PALMERLUCKEY
130

Anthropic calls on AI labs to establish options for a temporary pause on frontier model development
The announcement coincided with Anthropic filing for an IPO.

13h|84.1k1.1k115
Matthew BermanMB#1802|@MATTHEWBERMAN
Andrew MayneAM#1489|@ANDREWMAYNE
Neil ChowdhuryNC#1315|@CHOWDHURYNEIL
Michaël TrazziMT#979|@MICHAELTRAZZI
Beff (e/acc)B(#850|@BEFFJEZOS
Ethan PerezEP#149|@ETHANJPEREZ
Miles BrundageMB#20|@MILES_BRUNDAGE
140

UC Berkeley introductory CS failure rates spike to 35%, prompting 1,300 faculty to demand SAT and ACT reinstatement
Instructors blame AI overreliance and weak foundational math skills.

1d|412.5k4.5k1.3k
𝚟𝚒𝚎 ⟢𝚟⟢#1735|@VIEMCCOY
Jiaxin WenJW#1469|@JIAXINWEN22
kacheKA#488|@YACINEMTB
150

Sam Altman, Dario Amodei, and Demis Hassabis urge Congress to mandate screening of synthetic nucleic acids to prevent AI biosecurity risks
The proposed rules also target physical DNA manufacturing equipment.

1d|119.3k1k238
Chubby♨️CH#1448|@KIMMONISMUS
Zvi MowshowitzZM#919|@THEZVI
Logan GrahamLG#602|@LOGANGRAHAM
Andrew CurranAC#518|@ANDREWCURRAN_
davidad 🎇D🎇#458|@DAVIDAD
Dean W. BallDW#390|@DEANWBALL
David Krueger 🦥 ⏸️ ⏹️ ⏪DK#200|@DAVIDSKRUEGER
Gary MarcusGM#157|@GARYMARCUS
Nathan LambertNL#64|@NATOLAMBERT
160

AI robotics company 1X launches World Model Lab led by former Luma AI founding engineer Samarth Sinha to train humanoid robots
It avoids VLA wrappers by pretraining models on physics.

14h|257.2k2.3k857
Samarth SinhaSS#1500|@_SAM_SINHA_
GowthamiGO#1288|@GOWTHAMI_S
Robert ScobleRS#323|@SCOBLEIZER
Shane GuSG#47|@SHANEGUML
170

Google releases Gemma 4 12B, an encoder-free multimodal model under Apache 2.0 that beats the larger Gemma 3 27B
The model runs locally on laptops with 16GB VRAM.

1d|136.2k1.6k1.1k
utkuUT#1430|@UTKUEVCI
Armand JoulinAJ#626|@ARMANDJOULIN
👩‍💻 Paige Bailey👩‍💻P#270|@DYNAMICWEBPAIGE
Lucas Beyer (bl16)LB#55|@GIFFMANA
Christopher ManningCM#16|@CHRMANNING
Anthony DikéADJonny DJD2
180

OpenAI says its AI model found a counterexample to an 80-year-old mathematical conjecture by Paul Erdős
Reinforcement learning guided the model's search of mathematical spaces.

1d|191.4k1.1k379
Matt TurckMT#1497|@MATTTURCK
Andrew MayneAM#1489|@ANDREWMAYNE
Dan RobertsDR#870|@DANINTHEORY
Brandon McKinzieBM#733|@MCKBRANDO
Sebastien BubeckSB#50|@SEBASTIENBUBECK
Noam BrownNB#30|@POLYNOAMIAL
190

Raindrop AI co-founder Ben Hylak launches Raindrop 2.0, introducing self-healing agents to autonomously detect and triage agent workflow failures
Custom-trained models flag issues like looping build failures.

11h|149.6k1.2k463
ben hylakBH#1918|@BENHYLAK
Hamel HusainHH#1316|@HAMELHUSAIN
raphaRA#333|@RAPHA_GL
👩‍💻 Paige Bailey👩‍💻P#270|@DYNAMICWEBPAIGE
200

US officials weigh taking equity stakes in AI companies following a proposal by OpenAI CEO Sam Altman
The acquired shares would fund public dividends.

7h|342.9k924285
xlr8harderXL#1671|@XLR8HARDER
Samuel Hammond 🦉SH#1490|@HAMANDCHEESE
Beff (e/acc)B(#850|@BEFFJEZOS
Andrew CurranAC#518|@ANDREWCURRAN_
Miles BrundageMB#20|@MILES_BRUNDAGE
210

Policy scholar Dean W. Ball argues that fully autonomous corporations may require a global ban
The debate was sparked by Argentina's national AI strategy.

16h|36.8k36861
Nathan 🔎N🔎#1900|@NATHANPMYOUNG
Haydn BelfieldHB#1468|@HAYDNBELFIELD
PrakashPR#1330|@8TEAPI
rohitRO#1214|@KRISHNANROHIT
Séb KrierSK#505|@SEBKRIER
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)T(#421|@TEORTAXESTEX
Dean W. BallDW#390|@DEANWBALL
David PfauDP#153|@PFAU
roonRO#57|@TSZZL
David DuvenaudDD#53|@DAVIDDUVENAUD
220

George Mason University economist Tyler Cowen predicts AI integration will drive 2.5% annual GDP growth
The talk also details AI's impact on healthcare.

15h|523.8k2.2k4.1k
Lenny RachitskyLR#1184|@LENNYSAN
Marc Andreessen 🇺🇸MA#95|@PMARCA
230

Sci-fi author Ted Chiang argues that artificial intelligence systems are not conscious
The essay prompted debate over Chiang's recent AI commentary.

1d|41.4k596288
Aran NayebiAN#1383|@ARAN_NAYEBI
rohitRO#1214|@KRISHNANROHIT
Rob WiblinRW#1130|@ROBERTWIBLIN
Nathan LabenzNL#990|@LABENZ
David ChalmersDC#769|@DAVIDCHALMERS42
Andreas Kirsch 🇺🇦AK#228|@BLACKHC
240

On the Dwarkesh Podcast, Alex Imas and Philip Trammell analyze optimal taxation and scarcity in a post-AGI economy
They outline how non-AI nations can capture economic gains.

13h|151.6k709478
Alex ImasAI#1773|@ALEXOLEGIMAS
Andy MasleyAM#1693|@ANDYMASLEY
Atoosa KasirzadehAK#1584|@DR_ATOOSA
Dwarkesh PatelDP#70|@DWARKESH_SP
250

OpenAI releases Codex iOS app plugin with integrated simulator and hot-reloading SwiftUI previews
iOS developer Oskar Groth says the tool directly challenges Xcode.

12h|932.1k6.6k3.8k
jasonJA#931|@JXNLCO
260

Poke launches the first AI agent with official Apple approval to send text messages directly within iMessage
It integrates with third-party apps like Gmail and Notion.

13h|1.1M2.7k1.2k
fraserFR#1209|@FRASER
jasonJA#931|@JXNLCO
270

Proposed federal Obernolte-Trahan framework would preempt state AI laws for three years, targeting models over 10^26 FLOPs
It codifies the Center for AI Standards and Innovation.

16h|309.5k30698
Samuel Hammond 🦉SH#1490|@HAMANDCHEESE
Divyansh KaushikDK#849|@DKAUSHIK96
Dean W. BallDW#390|@DEANWBALL
Miles BrundageMB#20|@MILES_BRUNDAGE
280

Marc Andreessen claims Biden administration officials warned against founding AI startups, prompting his endorsement of Donald Trump
Daniel Eth argued Andreessen misinterpreted a market consolidation prediction.

21h|1.8M29.5k2.4k
Elon MuskEM#74|@ELONMUSK
290

Investor Slater Stich interviews @polynoamial on applying AI to the Erdős unit distance conjecture and IMO benchmarks
The video launches a new series on AI in mathematics.

12h|113.9k718649
LishaLI#753|@LISHALI88
Eric ChuEC#690|@ITS_ERICCHU
Sander DielemanSD#82|@SEDIELEM
Noam BrownNB#30|@POLYNOAMIAL
300

Anthropic pauses its red-teaming program after an unreleased Claude Oceanus model checkpoint leaks online
The leak reportedly originated from distributed red-team checkpoints.

1d|453.9k2.1k362
ZephyrZE#1471|@ZEPHYR_Z9
Lisan al GaibLA#975|@SCALING01
Andrew CurranAC#518|@ANDREWCURRAN_
clem 🤗C🤗#67|@CLEMENTDELANGUE
Nathan LambertNL#64|@NATOLAMBERT