We're now more tolerant of invalid JSON coming back from the model
(possibly because it was incomplete and we're streaming), and if we do
end up with invalid JSON once it has all streamed back, we report what
the malformed JSON actually was:
<img width="444" alt="Screenshot 2025-04-23 at 1 49 14 PM"
src="https://github.com/user-attachments/assets/480f5da7-869b-49f3-9ffd-8f08ccddb33d"
/>
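Conceptually, the handling is something like this minimal sketch (names
hypothetical; the real parsing code differs):

```rust
use serde_json::Value;

// While streaming, a parse failure usually just means the JSON is
// incomplete, so we keep buffering; once the stream has finished, a
// failure is a real error, and we report the malformed input with it.
fn parse_streamed_json(buffer: &str, stream_done: bool) -> Result<Option<Value>, String> {
    match serde_json::from_str::<Value>(buffer) {
        Ok(value) => Ok(Some(value)),
        Err(_) if !stream_done => Ok(None), // likely incomplete; wait for more chunks
        Err(err) => Err(format!("invalid JSON from model: {err}\nreceived: {buffer}")),
    }
}
```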
Release Notes:
- N/A
* Adds a fast/cheap model to providers and defaults thread summarization
to this model. The initial motivation was that
https://github.com/zed-industries/zed/pull/29099 would cause these
requests to fail when used with a thinking model, and it doesn't seem
correct to use a thinking model for summarization.
* Skips system prompt, context, and thinking segments (a sketch of this
filtering follows below).
* If tool use is happening, allows 2 tool uses + one more agent response
before summarizing.
The downside is that there was some potential for prefix-cache reuse
before, especially for title summarization (thread summarization omitted
tool results and so would not share a prefix for those). This seems
fine, as these requests should typically be fairly small. Even for full
thread summarization, skipping all tool use / context should greatly
reduce token use.
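A hedged sketch of the filtering (the segment types here are
illustrative, not the real message model):

```rust
// Illustrative segment type; the actual thread representation is richer.
enum Segment {
    Text(String),
    Thinking(String),
    ToolUse(String),
    Context(String),
}

// Build the summarization input from plain text segments only, skipping
// system prompt, context, and thinking, as described above.
fn summarization_input(messages: &[Vec<Segment>]) -> String {
    messages
        .iter()
        .flatten()
        .filter_map(|segment| match segment {
            Segment::Text(text) => Some(text.as_str()),
            _ => None, // skip thinking, tool use, and context
        })
        .collect::<Vec<_>>()
        .join("\n")
}
```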
Release Notes:
- N/A
Looks like the required backend component of this was deployed.
https://github.com/zed-industries/monorepo/actions/runs/14541199197
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Agus Zubiaga <hi@aguz.me>
Co-authored-by: Richard Feldman <oss@rtfeldman.com>
Co-authored-by: Nathan Sobo <nathan@zed.dev>
This PR attaches the thread ID and the new prompt ID to telemetry events
for completions in the Agent panel.
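The added fields look roughly like this (event shape assumed from the
description, not the actual telemetry schema):

```rust
use serde::Serialize;

// Hypothetical event shape: each completion event now carries the thread
// and prompt it belongs to, so events can be correlated downstream.
#[derive(Serialize)]
struct AgentCompletionEvent {
    thread_id: String,
    prompt_id: String,
    // ...existing fields elided...
}
```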
Release Notes:
- N/A
---------
Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com>
This PR updates the Agent to extract the usage information from the
response headers, if they are present.
For now we just log the information, but we'll be using this soon to
populate some UI.
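Roughly this shape, hedged (the header names here are assumptions, not
the real ones):

```rust
use http::HeaderMap;

// Parse usage counts out of the response headers when present, and just
// log them for now; the header names are hypothetical.
fn log_usage(headers: &HeaderMap) {
    let parse = |name: &str| {
        headers
            .get(name)
            .and_then(|value| value.to_str().ok())
            .and_then(|value| value.parse::<u64>().ok())
    };
    if let (Some(used), Some(limit)) =
        (parse("x-model-requests-used"), parse("x-model-requests-limit"))
    {
        log::info!("model requests used: {used}/{limit}");
    }
}
```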
Release Notes:
- N/A
Release Notes:
- Add support for OpenAI o3 and o4-mini models via OpenAI API and
Copilot Chat providers.
---------
Co-authored-by: Peter Tripp <peter@zed.dev>
See #28793: the name of the field is actually `systemInstruction`, not
`systemInstructions`.
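A minimal sketch of the fix (request struct simplified):

```rust
use serde::Serialize;

#[derive(Serialize)]
struct GenerateContentRequest {
    // The rename must match Gemini's field name exactly:
    // `systemInstruction` (singular), not `systemInstructions`.
    #[serde(rename = "systemInstruction", skip_serializing_if = "Option::is_none")]
    system_instruction: Option<serde_json::Value>,
    // ...other fields elided...
}
```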
Release Notes:
- Fixed an issue where Gemini requests would fail
This PR makes it so we use more types and constants from the
`zed_llm_client` crate to avoid duplicating information.
Also updates the current usage endpoint to use limits derived from the
`Plan`.
Release Notes:
- N/A
The UI was mistakenly using the cumulative token usage for the token
counter. It will now display the last request token count, plus an
estimation of the tokens in the message editor and context entries that
haven't been sent yet.
https://github.com/user-attachments/assets/0438c501-b850-4397-9135-57214ca3c07a
Additionally, when the user edits a message, we'll display the actual
token count up to it and estimate the tokens in the new message.
Note: We don't currently estimate the delta when switching profiles. In
the future, we want to use the count tokens API to measure every part of
the request and display a breakdown.
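A minimal sketch of the counting (the names and the characters-per-token
heuristic are assumptions, not the actual implementation):

```rust
// Display the token usage reported by the last completed request, plus
// an estimate for the editor text and context that haven't been sent.
fn displayed_token_count(last_request_tokens: usize, unsent_text: &str) -> usize {
    // Crude local estimate: ~4 characters per token is a common
    // heuristic; a count-tokens API could replace this for exact numbers.
    let estimated_unsent = unsent_text.len() / 4;
    last_request_tokens + estimated_unsent
}
```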
Release Notes:
- agent: Made the token count more accurate and added back estimation of
used tokens as you type and add context.
---------
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
Co-authored-by: Danilo Leal <daniloleal09@gmail.com>
This PR adds an error message when the model requests limit has been
hit.
Release Notes:
- N/A
Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com>
When we do not have any tools, we want to set the `tools` field to
`None`.
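A hedged sketch of the shape of the fix (types simplified):

```rust
use serde::Serialize;

#[derive(Serialize)]
struct Tool {
    name: String,
}

#[derive(Serialize)]
struct Request {
    // Omitting the field entirely (rather than sending an empty array)
    // is what avoids Gemini's "Invalid argument" response.
    #[serde(skip_serializing_if = "Option::is_none")]
    tools: Option<Vec<Tool>>,
}

fn tools_field(tools: Vec<Tool>) -> Option<Vec<Tool>> {
    if tools.is_empty() { None } else { Some(tools) }
}
```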
Release Notes:
- Fixed an issue where Gemini requests would sometimes return a Bad
Request ("Invalid argument...")
Release Notes:
- Add support for OpenAI GPT-4.1 via Copilot Chat and OpenAI API
---------
Co-authored-by: Danilo Leal <daniloleal09@gmail.com>
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
Release Notes:
- agent: Show recommended models in the agent model selector and display
the provider in the model selector's trigger.
---------
Co-authored-by: Danilo Leal <daniloleal09@gmail.com>
Co-authored-by: Danilo Leal <67129314+danilo-leal@users.noreply.github.com>
Closes #27223
Merges: #27996, #26734, #27949
Release Notes:
- AWS Bedrock: Added advanced authentication strategies:
  - Short-lived credentials with Session Tokens
  - AWS Named Profile
  - EC2 Identity, Pod Identity, Web Identity
- AWS Bedrock: Added Claude 3.7 Thinking support.
- AWS Bedrock: Added Cross-Region Inference for all combinations of
regions and model availability.
- Agent Beta: Added support for AWS Bedrock.
---------
Co-authored-by: Marshall Bowers <git@maxdeviant.com>
This PR adds tool calling support for GitHub Copilot Chat models.
Currently, only the Claude family of models is supported.
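Conceptually, the gating amounts to something like this (predicate
illustrative):

```rust
// Only advertise tool support for the Claude family for now, since
// that's the only family wired up for tool calling.
fn supports_tools(model_id: &str) -> bool {
    model_id.starts_with("claude")
}
```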
Release Notes:
- agent: Added tool calling support for Claude models in GitHub Copilot
Chat.
---------
Co-authored-by: Marshall Bowers <git@maxdeviant.com>
This PR disables `parallel_tool_calls` for the models that support it,
as the Agent currently expects at most one tool use per turn.
Figuring this out took a bit of trial and error: annoyingly, OpenAI's
API returns an error if you pass `parallel_tool_calls` to a model that
doesn't support it.
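A sketch of the resulting gating (the model list is illustrative):

```rust
// Send `parallel_tool_calls: false` only to models that accept the
// parameter, and omit the field for everything else, since OpenAI
// errors if it's passed to a model that doesn't support it.
fn parallel_tool_calls(model_id: &str) -> Option<bool> {
    let supports_parameter = matches!(model_id, "gpt-4o" | "gpt-4o-mini" | "gpt-4.1");
    if supports_parameter {
        Some(false) // the Agent expects at most one tool use per turn
    } else {
        None // leave the field out of the request body entirely
    }
}
```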
Release Notes:
- N/A
This adds a "workspace-hack" crate; see
[Mozilla's](https://hg.mozilla.org/mozilla-central/file/3a265fdc9f33e5946f0ca0a04af73acd7e6d1a39/build/workspace-hack/Cargo.toml#l7)
for a concise explanation of why this is useful. For us, in practice it
means that if I run all the tests (`cargo nextest r --workspace`) and
then `cargo r`, all the deps from the previous cargo command will be
reused. Before this PR, many deps would be rebuilt because the two
commands resolved different sets of features for them, which frequently
caused long rebuilds when things "should" already have been cached.
To avoid manually maintaining our workspace-hack crate, we will use
[cargo hakari](https://docs.rs/cargo-hakari) to update the build files
when there's a necessary change. I've added a step to CI that checks
whether the workspace-hack crate is up to date, and instructs you to
re-run `script/update-workspace-hack` when it fails.
Finally, to make sure that people can still depend on crates in our
workspace without pulling in all the workspace deps, we use a `[patch]`
section, following [hakari's
instructions](https://docs.rs/cargo-hakari/0.9.36/cargo_hakari/patch_directive/index.html).
One possible follow-up task would be making guppy use our
`rust-toolchain.toml` instead of having to duplicate that list in its
config; I opened an issue for that upstream: guppy-rs/guppy#481.
TODO:
- [x] Fix the extension test failure
- [x] Ensure the dev dependencies aren't being unified by Hakari into
the main dependencies
- [x] Ensure that the remote-server binary continues to not depend on
LibSSL
Release Notes:
- N/A
---------
Co-authored-by: Mikayla <mikayla@zed.dev>
Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com>
This PR removes the `use_any_tool` method from the `LanguageModel`
trait.
It was not being used anywhere, and doesn't really fit in our new tool
use story.
Release Notes:
- N/A
This seems to improve the performance of `gemini-2.5-pro-exp-03-25`
significantly.
We now define a single `Tool` that has multiple `FunctionDeclaration`s,
instead of defining multiple `Tool`s, each with a single
`FunctionDeclaration`.
Oddly enough, the `flash` models seemed to work perfectly fine with the
multiple `Tool { ... }` definitions.
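A sketch of the change (types abbreviated from the Gemini API shape):

```rust
struct FunctionDeclaration {
    name: String,
}

struct Tool {
    function_declarations: Vec<FunctionDeclaration>,
}

// After: one `Tool` wrapping all the declarations.
fn tools(declarations: Vec<FunctionDeclaration>) -> Vec<Tool> {
    // Before, it was one `Tool` per declaration:
    // declarations.into_iter().map(|d| Tool { function_declarations: vec![d] }).collect()
    vec![Tool {
        function_declarations: declarations,
    }]
}
```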
Release Notes:
- N/A
Closes #25671
Release Notes:
- Added support for `claude-3-7-sonnet-thinking` in the assistant panel
---------
Co-authored-by: Danilo Leal <daniloleal09@gmail.com>
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Agus Zubiaga <hi@aguz.me>
This is the core change:
https://github.com/zed-industries/zed/pull/26758/files#diff-044302c0d57147af17e68a0009fee3e8dcdfb4f32c27a915e70cfa80e987f765R1052
TODO:
- [x] Use AsyncFn instead of Fn() -> Future in GPUI spawn methods
- [x] Implement it in the whole app
- [x] Implement it in the debugger
- [x] Glance at the RPC crate, and see if those boxed-future methods can
be switched over. Answer: they can't directly, as you can't make an
`AsyncFn*` into a trait object. There are ways around that, but they're
all more complex than just keeping the code as is.
- [ ] Fix platform specific code
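A hedged sketch of the signature change (GPUI's real spawn methods take
contexts and return tasks; this strips all of that away, and the
`AsyncFnOnce` version needs Rust 1.85+):

```rust
use std::future::Future;

// Before: a plain closure that returns a future.
fn spawn_before<F, Fut>(f: F)
where
    F: FnOnce() -> Fut,
    Fut: Future<Output = ()>,
{
    futures::executor::block_on(f());
}

// After: an async closure via the `AsyncFnOnce` trait. Note that
// `AsyncFn*` bounds can't be made into trait objects, which is why the
// RPC crate's boxed-future methods were left unchanged.
fn spawn_after<F>(f: F)
where
    F: AsyncFnOnce(),
{
    futures::executor::block_on(f());
}
```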
Release Notes:
- N/A
Closes#25883
This PR allows you to use Copilot Chat for the assistant without setting
Copilot as the edit prediction provider.
[copilot.webm](https://github.com/user-attachments/assets/fecfbde1-d72c-4c0c-b080-a07671fb846e)
Todos:
- [x] Remove redundant "copilot" key from settings
- [x] Do not disable copilot LSP when `edit_prediction_provider` is not
set to `copilot`
- [x] Start copilot LSP when:
- [x] `edit_prediction_provider` is set to `copilot`
- [x] Copilot sign in clicked from assistant settings
- [x] Handle the one-frame flicker after starting the LSP but before
signing in, caused by the signed-out status
- [x] Fixed this by adding an intermediate "awaiting sign-in" state to
the sign-out enum (see the sketch after this list)
- [x] Make the cancel button sign out from `copilot` (existing bug)
- [x] Make dismissing the modal sign out when not in the signed-in state
(existing bug)
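A hedged sketch of that intermediate state (variant names hypothetical;
the real status enum is larger):

```rust
// The extra variant keeps the UI from flashing "signed out" during the
// window where the LSP has started but sign-in hasn't completed yet.
enum SignInStatus {
    SignedOut,
    AwaitingSignIn, // sign-in initiated; waiting on the LSP's response
    SignedIn,
}
```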
Release Notes:
- You can now sign into Copilot from assistant settings without making
it your edit prediction provider. This is useful if you want to use
Copilot chat while keeping a different provider, like Zed, for
predictions.
- Removed the `copilot` key from `features` in settings. Use
`edit_prediction_provider` instead.