ZIm/crates/assistant_tools/src
Agus Zubiaga ce1a674eba
eval: Fine-grained assertions (#29246)
- Support programmatic examples
([example](17feb260a0/crates/eval/src/examples/file_search.rs))
- Combine data-driven example declarations into a single `.toml` file
([example](17feb260a0/crates/eval/src/examples/find_and_replace_diff_card.toml))
- Run judge on individual assertions (previously called "criteria")
- Report judge and programmatic assertions in one combined table

Note: We still need to work on concept naming 

<img width=400
src="https://github.com/user-attachments/assets/fc719c93-467f-412b-8d47-68821bd8a5f5">

Release Notes:

- N/A

---------

Co-authored-by: Richard Feldman <oss@rtfeldman.com>
Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>
Co-authored-by: Thomas Mickley-Doyle <tmickleydoyle@gmail.com>
2025-04-22 23:58:58 -03:00
..
batch_tool Add Batch tool call for calling multiple tools (#27621) 2025-03-27 18:21:26 -04:00
code_action_tool Add code action tool and rename tool (#28453) 2025-04-09 22:38:01 -04:00
code_symbols_tool Add Code Symbols tool (#27733) 2025-03-31 05:13:13 +00:00
contents_tool Add contents_tool (#28738) 2025-04-15 00:54:25 -04:00
copy_path_tool Add copy-path tool (#27371) 2025-03-24 21:21:55 -04:00
create_directory_tool Add Create Directory Tool (#27505) 2025-03-26 11:59:03 -04:00
create_file_tool Add create-file-tool (#27381) 2025-03-25 10:56:41 -04:00
delete_path_tool Delete tool uses paths instead of globs (#26715) 2025-03-16 11:58:25 +01:00
diagnostics_tool Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
edit_file_tool Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
fetch_tool assistant_tools: Add fetch tool (#26999) 2025-03-18 16:25:51 +00:00
find_replace_tool Add code action tool and rename tool (#28453) 2025-04-09 22:38:01 -04:00
grep_tool Rename regex search tool to grep and accept an include glob pattern (#29100) 2025-04-20 00:53:30 +00:00
list_directory_tool Rename regex search tool to grep and accept an include glob pattern (#29100) 2025-04-20 00:53:30 +00:00
move_path_tool Add move_path tool (#27366) 2025-03-24 14:45:19 +00:00
open_tool Add Open Tool (#27499) 2025-03-27 18:20:59 -04:00
path_search_tool Rename regex search tool to grep and accept an include glob pattern (#29100) 2025-04-20 00:53:30 +00:00
read_file_tool Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
rename_tool Add code action tool and rename tool (#28453) 2025-04-09 22:38:01 -04:00
symbol_info_tool Add symbol info tool (#27742) 2025-03-31 00:23:03 -04:00
terminal_tool agent: Use current shell (#28470) 2025-04-09 23:38:36 -06:00
thinking_tool Add thinking tool (#26675) 2025-03-14 16:26:22 -04:00
ui agent: Refine the web search tool call UI (#29190) 2025-04-22 09:51:57 -03:00
assistant_tools.rs eval: Fine-grained assertions (#29246) 2025-04-22 23:58:58 -03:00
batch_tool.rs Rename regex search tool to grep and accept an include glob pattern (#29100) 2025-04-20 00:53:30 +00:00
code_action_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
code_symbols_tool.rs Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
contents_tool.rs Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
copy_path_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
create_directory_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
create_file_tool.rs Streaming tool calls (#29179) 2025-04-21 22:28:32 +00:00
delete_path_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
diagnostics_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
edit_file_tool.rs agent: Add small design tweaks (#29218) 2025-04-22 12:17:34 -03:00
fetch_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
grep_tool.rs Rename regex search tool to grep and accept an include glob pattern (#29100) 2025-04-20 00:53:30 +00:00
list_directory_tool.rs Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
move_path_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
now_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
open_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
path_search_tool.rs Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
read_file_tool.rs Systematically optimize agentic editing performance (#28961) 2025-04-19 02:47:59 +00:00
rename_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
replace.rs edit tool: Handle over-indentation in replace_with_flexible_indent (#29153) 2025-04-21 11:02:08 -03:00
schema.rs agent: Improve compatibility when using MCP servers with Gemini models (#28700) 2025-04-14 21:55:25 +02:00
symbol_info_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
terminal_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
thinking_tool.rs agent: Return ToolResult from run inside Tool (#28763) 2025-04-15 14:28:09 +00:00
ui.rs agent: Refine the web search tool call UI (#29190) 2025-04-22 09:51:57 -03:00
web_search_tool.rs agent: Refine the web search tool call UI (#29190) 2025-04-22 09:51:57 -03:00