ZIm/crates/eval/src
Max Brunsfeld f125353b6f
Add tree-sitter example to the eval (#29321)
Interesting things about this example:
* It's a useful, non-trivial change I made with the agent in Tree-sitter
* It runs fast
* It frequently showcases edit file errors
* It occasionally completely errors out due to errors parsing tool call
input JSON

Release Notes:

- N/A
2025-04-23 18:46:38 -07:00
..
examples Add tree-sitter example to the eval (#29321) 2025-04-23 18:46:38 -07:00
assertions.rs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
eval.rs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
example.rs eval: New add_arg_to_trait_method example (#29297) 2025-04-23 18:46:39 +00:00
ids.rs Add new action to run agent eval (#29158) 2025-04-21 21:30:21 -07:00
instance.rs Do not log unfinished tools use that are in the middle of streaming (#29275) 2025-04-23 13:19:01 +00:00
judge_diff_prompt.hbs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
judge_thread_prompt.hbs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
tool_metrics.rs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00