ZIm/crates/eval/src/examples
Oleksiy Syvokon 3884de937b
assistant: Partial fix for HTML entities in tools params (#32148)
This problem seems to be specific to Opus 4. Eval shows improvement from
89% to 97%.

Closes: https://github.com/zed-industries/zed/issues/32060

Release Notes:

- N/A

Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>
2025-06-05 10:36:55 +00:00
..
threads agent: Overwrite files more cautiously (#30649) 2025-05-14 10:40:44 +03:00
add_arg_to_trait_method.rs evals: Configurable number of max dialog turns (#31680) 2025-05-29 10:35:29 +00:00
code_block_citations.rs evals: Configurable number of max dialog turns (#31680) 2025-05-29 10:35:29 +00:00
comment_translation.rs evals: Configurable number of max dialog turns (#31680) 2025-05-29 10:35:29 +00:00
file_search.rs evals: Configurable number of max dialog turns (#31680) 2025-05-29 10:35:29 +00:00
find_and_replace_diff_card.toml eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
grep_params_escapement.rs assistant: Partial fix for HTML entities in tools params (#32148) 2025-06-05 10:36:55 +00:00
hallucinated_tool_calls.toml agent: Handle attempts to use hallucinated tools (#29946) 2025-05-05 19:31:11 +00:00
mod.rs assistant: Partial fix for HTML entities in tools params (#32148) 2025-06-05 10:36:55 +00:00
no_tools_enabled.toml Add no_tools_enabled eval (#30537) 2025-05-12 08:52:03 +00:00
overwrite_file.rs evals: Configurable number of max dialog turns (#31680) 2025-05-29 10:35:29 +00:00
planets.rs evals: Configurable number of max dialog turns (#31680) 2025-05-29 10:35:29 +00:00
tree_sitter_drop_emscripten_dep.toml Add tree-sitter example to the eval (#29321) 2025-04-23 18:46:38 -07:00