publications

Preprints and manuscripts on AI agents, agent harnesses, coding agents, and evaluation.

* for equal contribution, # for corresponding author.

  1. arXiv
    2026
  2. 2026
  3. arXiv
    2026