ZIm/crates/eval/src
Antonio Scandurra 9f6809a28d
Reuse conversation cache when streaming edits (#30245)
Release Notes:

- Improved latency when the agent applies edits.
2025-05-08 14:36:34 +02:00
examples Fix agent reading and editing files over SSH (#30144) 2025-05-07 17:07:01 +00:00
assertions.rs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
eval.rs evals: Enable Python LSP (#29987) 2025-05-06 10:28:59 +00:00
example.rs agent: Handle attempts to use hallucinated tools (#29946) 2025-05-05 19:31:11 +00:00
explorer.html eval: Add HTML overview for evaluation runs (#29413) 2025-04-25 17:49:05 +03:00
explorer.rs eval: Add HTML overview for evaluation runs (#29413) 2025-04-25 17:49:05 +03:00
ids.rs Add new action to run agent eval (#29158) 2025-04-21 21:30:21 -07:00
instance.rs Reuse conversation cache when streaming edits (#30245) 2025-05-08 14:36:34 +02:00
judge_diff_prompt.hbs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
judge_thread_prompt.hbs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
tool_metrics.rs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00