Converts an input image into 3D meshes, Gaussian splats, and audio files using Claude skills and external AI APIs.
Archives experiments of AI agents autonomously tuning optimizers, schedules, and hyperparameters to reach target validation loss in fewest steps on a small LM benchmark.
Provides AI agent toolkit with coding CLI, unified LLM API, TUI/web UI libs and Slack bot support.
Automates articulated 3D asset creation by prompting LLMs to generate executable Python code defining parts, geometry and joints.
Streams live traces of coding agent actions including tokens and tool calls to a local UI at localhost:5899.
Structures AI coding agent workflows by enforcing TDD, task planning, and subagent reviews.
Collects and analyzes data on GitHub Actions security across Python packages for a PyCon talk.
Pretrains 1B HRM text models with hierarchical reasoning, task completion, and low-compute PrefixLM training.
Estimates rare failure probabilities of LLMs on structured math problems via cross-entropy method importance sampling and confidence bounds.
Executes JavaScript and TypeScript with a fast runtime, bundler, test runner and package manager.