In case you missed it
Anthropic's Claude Fable 5 system card reveals undisclosed safety mitigations that silently restrict frontier LLM development tasks
DP #217 | @DIMITRISPAPAIL, SW #201 | @SIMONW, GT #121 | @GARRYTAN, C🤗 #109 | @CLEMENTDELANGUE, NL #80 | @NATOLAMBERT, +27 more
antirez/ds4 | 2H AGO | Native inference engine for DeepSeek V4 Flash/PRO on Metal, CUDA and ROCm. | SD159013.5k stars
WecoAI/weco-cli | 5H AGO | Leverages LLM-guided tree search to iteratively explore, refine, and optimize code against custom metrics. | TC55554 stars
…nguz/sxt-proof-of-sql | 8H AGO | Implements a high-performance ZK prover that cryptographically verifies SQL query results against untampered data. | BT5931 stars
macrodata-labs/refiner | 16H AGO | Processes and refines large-scale ML datasets via a data framework. | EL1136NB353OS888CB167438 stars
kyutai-labs/kairos | 3D AGO | Trains 6B LLMs on temporally ordered Common Crawl data from 2018-2025 to measure recency bias and enable continual learning studies. | 🎭1014JL848JM2787 stars
vllm-project/vime | 2D AGO | Integrates vLLM rollout with Megatron training for LLM post-training and RL scaling. | 🎭1014SA1768213 stars
…eley/agents-last-exam | 2D AGO | Provisions OS sandboxes, runs agent harnesses on long-horizon tasks, and grades outputs against hidden references. | 🎭1014PL1489521 stars
datacurve-ai/deep-swe | 2D AGO | Benchmarks frontier coding agents on 113 long-horizon tasks from active open-source repos with isolated environments and program verifiers. | 🎭1014FO1853761 stars