agent: Handle attempts to use hallucinated tools (#29946)

This change: 1. Catches attempts to use missing tools. If this happens, we now send Agent a message listing available tools, after which Agent can gracefully recover. Prior behavior: thread would stop in a broken state. Example of a hallucinated call and a message we send back: ![image](https://github.com/user-attachments/assets/92a8f700-b192-4038-8c7e-0a74ca2e0146) 2. Adds evals for hallucinated tool use and imagined edits 3. Adds ability to configure a profile name in evals. Release Notes: - N/A
2025-05-05 22:31:11 +03:00 · 2025-05-05 22:31:11 +03:00 · 8199664a5a
commit 8199664a5a
parent 7dfbe0b908
14 changed files with 111 additions and 0 deletions
--- a/crates/eval/src/examples/add_arg_to_trait_method.rs
+++ b/crates/eval/src/examples/add_arg_to_trait_method.rs
@ -1,6 +1,7 @@
 use std::path::Path;

 use anyhow::Result;
+use assistant_settings::AgentProfileId;
 use async_trait::async_trait;

 use crate::example::{Example, ExampleContext, ExampleMetadata, JudgeAssertion, LanguageServer};
@ -19,6 +20,7 @@ impl Example for AddArgToTraitMethod {
                allow_preexisting_diagnostics: false,
            }),
            max_assertions: None,
+            profile_id: AgentProfileId::default(),
        }
    }