korrel  /  open-source Python SDK

Write your agent test once.
CI today, RL fine-tune tomorrow.

Define a multi-turn agent scenario one time: a user-simulator persona, programmable mock tools, and a scoring rubric. Gate it in pytest now. Export the same definition as a verifiers or OpenEnv reinforcement-learning environment when you train.

$ pip install korrel
Scenario
├─ korrel run pytest CI gate
├─ korrel export --to verifiers RL environment
└─ korrel export --to openenv RL environment

An evaluation and a reinforcement-learning environment are the same object: a dataset, a harness, and a rubric. The CI buyer and the RL buyer want the same definition with a different runtime. Korrel is the authoring layer that carries it across both.

A single Korrel scenario reproduces tau2-bench's deterministic reward identically across the pytest gate, the verifiers environment, and the OpenEnv server. 80 frozen transcripts, exact float equality, zero drift.

Read the benchmark, rerun it yourself →