· Field NotesMarch 9, 2026

An AI that runs 100 experiments while you sleep

Andrej Karpathy just released a tiny tool that lets an AI agent run experiments on its own, all night, and hand you the results by morning.

AIautomationworkflowopen-sourcevia github · @@karpathy

What happened

Andrej Karpathy — one of the people who helped build the AI we all use today — published something quietly this week. It's called autoresearch, and the idea is simple enough to sketch on a napkin.

You give an AI agent a task. The agent tries something, checks if it worked, keeps the good changes, throws out the bad ones, and tries again. Repeat. All night. You wake up and it's done about 100 rounds.

The part that stuck with us isn't the AI research angle — it's the pattern. The whole thing runs off a plain text file called program.md. That file is the agent's instruction manual. You write in plain language what you want it to do and how to judge success. The agent reads it, acts, checks the score, and loops.

Why it matters for your business

Most of us don't train AI models. But we do have repetitive processes where the goal is clear and the rules can be written down. Testing email subject lines. Trying different ways to write a product description. Reviewing supplier quotes against a checklist.

The mental model here — give an agent a written playbook and a way to measure success, then let it run — is one of the most reusable ideas we've seen in a while.

Words worth knowing

AI agent — An AI that doesn't just answer one question but takes a series of actions on its own to reach a goal, like a very focused assistant that never gets tired.

Validation metric — A score that tells you whether something worked. Like tracking which pizza on your menu gets reordered most — the number helps you decide what to keep.

Agentic loop — When an AI does something, checks the result, adjusts, and does it again. The loop keeps going until the job is done or you stop it.


Something worth sitting with: if you could write down the rules for a repetitive decision in your business — clearly enough that a smart new hire could follow them — an agent could probably run that loop for you.

Try it

Karpathy's tool uses `uv` (a fast Python package manager). About five minutes to your first run.

  1. Install uv: curl -LsSf https://astral.sh/uv/install.sh | sh
  2. Clone the repo: git clone https://github.com/karpathy/autoresearch && cd autoresearch
  3. Sync dependencies: uv sync
  4. Prepare data: uv run prepare.py
  5. Kick off an overnight experiment run and check the report in the morning
Read the install guide
Check it out →

Written by David at AC0.AI. Follow on @ac0hero

Field Notes in your inbox

The AI tools and moves I actually use to win more business. A couple a week, nothing I haven't run myself.

session0
lifetime0
Pressed0