In case you missed it | OpenAI agrees to let the Trump administration approve GPT-5.6 preview customers on a case-by-case basis | #178 @GARYMARCUS, #113 @DELIPRAO, #84 @SUCHENZANG, #80 @NATOLAMBERT, #57 @MILES_BRUNDAGE, +37 more
#1 VIEWED | OpenAI launches limited preview of GPT-5.6 models Sol, Terra, and Luna following a US government request | #195 @LEVIE, #171 @EMOSTAQUE, #111 @JASONDEANLEE, #32 @GDB, #13 @SAMA, +45 more
FASTEST CLIMBING | CNBC says enterprise shift from token volume to cost optimization threatens projected revenue growth at OpenAI and Anthropic | #865 @ALTIMOR, #698 @HWCHASE17, #398 @BRIAN_ARMSTRONG, #178 @GARYMARCUS, #109 @CLEMENTDELANGUE
RISING LIKES 8.7k | Matthew Berman highlights contradictory claims that the Mythos model is too powerful to release despite failing to detect 20,000 attacking accounts | #1780 @MATTHEWBERMAN, #819 @STEVESI
19:49 | #1780 @MATTHEWBERMAN | OpenAI Previews GPT-5.6 Sol Frontier Model With Terra And Luna Variants | 4h ago | Views 92K, Likes 1.3K, Bookmarks 240
32:40 | #1387 @ERICVISHRIA, #1171 @YPATIL125, #866 @LM_BRASWELL, #716 @MASCOBOT, #219 @LULUMESERVEY, #153 @SATYANADELLA | Yash Patil Urges One AI Model Per Company In Nadella Interview | 9h ago | Views 95K, Likes 515, Bookmarks 401
19:53 | #361 @SUHAIL, #60 @DWARKESH_SP | Podcast Examines Next AI Training Paradigm and Lab Research Bets | 9h ago | Views 89K, Likes 750, Bookmarks 857
0:17 | #1795 @_CHENGLOU | Steam Controller Autonomously Docks and Charges Using Vision Tracking | 3h ago | Views 1.3M, Likes 0, Bookmarks 0
36:19 | #130 @SARANORMOUS, #43 @POLYNOAMIAL | OpenAI Researcher Discusses Test-Time Compute Scaling and AI Safety Risks | 6h ago | Views 24K, Likes 239, Bookmarks 299
0:17 | #100 @OFFICIALLOGANK | Google AI Studio Adds Design Variations for Instant UI Layout Exploration | 2h ago | Views 21K, Likes 291, Bookmarks 57
0:17 | #1205 @ZZZNAH | Alex Mordvintsev Teases Upcoming Release with Evolving Dot Animations | 9h ago | Views 51K, Likes 513, Bookmarks 151
42:46 | #530 @BEFFJEZOS, #350 @ASHLEEVANCE | New England Tech Scene Unveils AI Brain Platforms, Robot Submarines And Cancer Cures | 10h ago | Views 23K, Likes 106, Bookmarks 65
0:39 | #60 @DWARKESH_SP | AI Automating $200k Jobs Boosts Total GDP Despite Low Marginal Value | 4h ago | Views 22K, Likes 155, Bookmarks 77
0:08 | @TRUNGTPHAN | Trump Administration Approves Anthropic Mythos 5 Release to 100 Firms | 1h ago | Views 14M, Likes 28, Bookmarks 2
…You/AutoResearchBench | 7M AGO | Official Repo: AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery | SD159045 stars
xiaowu0162/LongMemEval | 47M AGO | Releases 500-question benchmark with timestamped histories to evaluate long-term memory abilities like multi-session reasoning and knowledge updates in chat assistants. | SD1590897 stars
Robbyant/lingbot-map | 5H AGO | Implements a geometric context transformer for feed-forward streaming 3D scene reconstruction from image sequences. | M🍥19567.4k stars
…ylinux-cuda-container | 5H AGO | Builds manylinux-based Docker images preinstalled with CUDA toolkits, cuDNN, NCCL and related libraries. | TC4544 stars
nicklashansen/mmbench2 | 7H AGO | Releases code, MMBench2 dataset, and checkpoints for training large generative world models along with hallucination predictors and mitigation methods. | XW55923 stars
THUDM/slime | 8H AGO | Integrates Megatron training with SGLang rollouts for LLM post-training and RL scaling. | 🎭1014AC484JH906.8k stars
xlang-ai/OSWorld-V2 | 8H AGO | Benchmarks computer-use agents on long-horizon real-world desktop tasks via VM environments and gated task sets. | YS397TY84316 stars
…nd-models-of-ai-tools | 17H AGO | Aggregates leaked and open-sourced system prompts plus tool definitions from AI coding agents like Cursor, Claude, and Devin. | 🎭1014SC827141.2k stars
anthropics/skills | 18H AGO | Hosts collections of self-contained skills, specs, and templates for dynamically extending Claude's capabilities on specialized tasks. | 🎭1014AG1467155.5k stars
…entX/OpenCaptchaWorld | 18H AGO | Hosts a Flask-based web platform with 20 CAPTCHA types to benchmark MLLM agents on visual reasoning and interaction tasks. | 🎭1014SZ93979 stars