Yehowshua/ZIm - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
Fernando Freire	3077abf9cf	google_ai: Parse thought parts in Gemini responses (#31925 ) Fixes thinking Gemini models. Closes #31902 Release Notes: - Updated Google Gemini client to match the latest API	2025-06-03 10:37:06 +00:00
Umesh Yadav	59686f1f44	language_models: Add images support for Ollama vision models (#31883 ) Ollama supports vision to process input images. This PR adds support for same. I have tested this with gemma3:4b and have attached the screenshot of it working. <img width="435" alt="image" src="https://github.com/user-attachments/assets/5f17d742-0a37-4e6c-b4d8-05b750a0a158" /> Release Notes: - Add image support for [Ollama vision models](https://ollama.com/search?c=vision)	2025-06-03 11:12:59 +02:00
THELOSTSOUL	b820aa1fcd	Add tool support for DeepSeek (#30223 ) [deepseek function call api](https://api-docs.deepseek.com/guides/function_calling) has been released and it is same as openai. Release Notes: - Added tool calling support for Deepseek Models --------- Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>	2025-06-03 10:59:36 +02:00
Umesh Yadav	65e3e84cbc	language_models: Add thinking support for ollama (#31665 ) This PR updates how we handle Ollama responses, leveraging the new [v0.9.0](https://github.com/ollama/ollama/releases/tag/v0.9.0) release. Previously, thinking text was embedded within the model's main content, leading to it appearing directly in the agent's response. Now, thinking content is provided as a separate parameter, allowing us to display it correctly within the agent panel, similar to other providers. I have tested this with qwen3:8b and works nicely. ~~We can release this once the ollama is release is stable.~~ It's released now as stable. <img width="433" alt="image" src="https://github.com/user-attachments/assets/2983ef06-6679-4033-82c2-231ea9cd6434" /> Release Notes: - Add thinking support for ollama --------- Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>	2025-06-02 15:12:41 +00:00
Oleksiy Syvokon	ae219e9e99	agent: Fix bug with double-counting tokens in Gemini (#31885 ) We report the total number of input tokens by summing the numbers of 1. Prompt tokens 2. Cached tokens But Google API returns prompt tokens (1) that already include cached tokens (2), so we were double counting tokens in some cases. Release Notes: - Fixed bug with double-counting tokens in Gemini	2025-06-02 10:18:44 +00:00
Marshall Bowers	a23ee61a4b	Pass up intent with completion requests (#31710 ) This PR adds a new `intent` field to completion requests to assist in categorizing them correctly. Release Notes: - N/A --------- Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>	2025-05-29 20:43:12 +00:00
Umesh Yadav	4e7dc37f01	language_models: Remove handling of WrappedTextContent in tool result content (#31605 ) Fixes ci pipeline Release Notes: - N/A	2025-05-28 16:43:08 +00:00
Richard Feldman	00fd045844	Make language model deserialization more resilient (#31311 ) This expands our deserialization of JSON from models to be more tolerant of different variations that the model may send, including capitalization, wrapping things in objects vs. being plain strings, etc. Also when deserialization fails, it reports the entire error in the JSON so we can see what failed to deserialize. (Previously these errors were very unhelpful at diagnosing the problem.) Finally, also removes the `WrappedText` variant since the custom deserializer just turns that style of JSON into a normal `Text` variant. Release Notes: - N/A	2025-05-28 12:06:07 -04:00
Fedor Nezhivoi	998542b048	language_models: Add support for tool use to LM Studio provider (#30589 ) Closes #30004 Quick demo: https://github.com/user-attachments/assets/0ac93851-81d7-4128-a34b-1f3ae4bcff6d Additional notes: I've tried to stick to existing code in OpenAI provider as much as possible without changing much to keep the diff small. This PR is done in collaboration with @yagil from LM Studio. We agreed upon the format in which LM Studio will return information about tool use support for the model in the upcoming version. As of current stable version nothing is going to change for the users, but once they update to a newer LM Studio tool use gets automatically enabled for them. I think this is much better UX then defaulting to true right now. Release Notes: - Added support for tool calls to LM Studio provider --------- Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>	2025-05-26 13:54:17 +02:00
Ben Brandt	ef0e1cb2ba	open_ai: Make Assistant message content optional (#31418 ) Fixes regression caused by: https://github.com/zed-industries/zed/pull/30639 Assistant messages can come back with no content, and we no longer allowed that in the deserialization. Release Notes: - open_ai: fixed deserialization issue if assistant content was empty	2025-05-26 09:59:39 +00:00
Marshall Bowers	685933b5c8	language_models: Fetch Zed models from the server (#31316 ) This PR updates the Zed LLM provider to fetch the available models from the server instead of hard-coding them in the binary. Release Notes: - Updated the Zed provider to fetch the list of available language models from the server.	2025-05-23 23:00:35 +00:00
Marshall Bowers	5c0b161563	Handle new `refusal` stop reason from Claude 4 models (#31217 ) This PR adds support for handling the new [`refusal` stop reason](https://docs.anthropic.com/en/docs/test-and-evaluate/strengthen-guardrails/handle-streaming-refusals) from Claude 4 models. <img width="409" alt="Screenshot 2025-05-22 at 4 31 56 PM" src="https://github.com/user-attachments/assets/707b04f5-5a52-4a19-95d9-cbd2be2dd86f" /> Release Notes: - Added handling for `"stop_reason": "refusal"` from Claude 4 models.	2025-05-22 16:56:59 -04:00
Marshall Bowers	37047a6fde	language_models: Update default/recommended Anthropic models to Claude Sonnet 4 (#31209 ) This PR updates the default/recommended models for the Anthropic and Zed providers to be Claude Sonnet 4. Release Notes: - Updated default/recommended Anthropic models to Claude Sonnet 4.	2025-05-22 19:10:08 +00:00
Marshall Bowers	1475ace6f1	anthropic: Add support for Claude 4 (#31203 ) This PR adds support for [Claude 4](https://www.anthropic.com/news/claude-4). Release Notes: - Added support for Claude Opus 4 and Claude Sonnet 4. --------- Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: Richard Feldman <oss@rtfeldman.com>	2025-05-22 18:09:35 +00:00
Kirill Bulatov	16366cf9f2	Use `anyhow` more idiomatically (#31052 ) https://github.com/zed-industries/zed/issues/30972 brought up another case where our context is not enough to track the actual source of the issue: we get a general top-level error without inner error. The reason for this was `.ok_or_else(\|\| anyhow!("failed to read HEAD SHA"))?; ` on the top level. The PR finally reworks the way we use anyhow to reduce such issues (or at least make it simpler to bubble them up later in a fix). On top of that, uses a few more anyhow methods for better readability. * `.ok_or_else(\|\| anyhow!("..."))`, `map_err` and other similar error conversion/option reporting cases are replaced with `context` and `with_context` calls * in addition to that, various `anyhow!("failed to do ...")` are stripped with `.context("Doing ...")` messages instead to remove the parasitic `failed to` text * `anyhow::ensure!` is used instead of `if ... { return Err(...); }` calls * `anyhow::bail!` is used instead of `return Err(anyhow!(...));` Release Notes: - N/A	2025-05-20 23:06:07 +00:00
Richard Feldman	4bb04cef9d	Accept wrapped text content from LLM providers (#31048 ) Some providers sometimes send `{ "type": "text", "text": ... }` instead of just the text as a string. Now we accept those instead of erroring. Release Notes: - N/A	2025-05-20 20:50:02 +00:00
Umesh Yadav	926f377c6c	language_models: Add tool use support for Mistral models (#29994 ) Closes https://github.com/zed-industries/zed/issues/29855 Implement tool use handling in Mistral provider, including mapping tool call events and updating request construction. Add support for tool_choice and parallel_tool_calls in Mistral API requests. This works fine with all the existing models. Didn't touched anything else but for future. Fetching models using their models api, deducting tool call support, parallel tool calls etc should be done from model data from api response. <img width="547" alt="Screenshot 2025-05-06 at 4 52 37 PM" src="https://github.com/user-attachments/assets/4c08b544-1174-40cc-a40d-522989953448" /> Tasks: - [x] Add tool call support - [x] Auto Fetch models using mistral api - [x] Add tests for mistral crates. - [x] Fix mistral configurations for llm providers. Release Notes: - agent: Add tool call support for existing mistral models --------- Co-authored-by: Peter Tripp <peter@zed.dev> Co-authored-by: Bennet Bo Fenner <bennet@zed.dev>	2025-05-19 18:36:59 +02:00
Ben Brandt	57424e4743	language_models: Update tiktoken-rs to support newer models (#30951 ) I was able to get this fix in upstream, so now we can have simpler code paths for our model selection. I also added a test to catch if this would cause a bug again in the future. Release Notes: - N/A	2025-05-19 11:40:36 +00:00
Oleksiy Syvokon	2b6dab9197	agent: Fix OpenAI models not getting first message (#30941 ) Closes #30733 Release Notes: - N/A	2025-05-19 09:09:03 +00:00
Marshall Bowers	c80bd698f8	language_models: Don't mark local subscription binding as unused (#30867 ) This PR removes an instance of marking a local `Subscription` binding as unused. While we `_` the field to prevent unused warnings, the locals shouldn't be marked as unused as we do use them (and want them to participate in usage tracking). Release Notes: - N/A	2025-05-17 10:23:08 +00:00
Oleksiy Syvokon	d42cb111f4	agent: Fix tool use in Gemini (#30689 ) Thread doesn't run pending tools when `stop_reason` is not `ToolUse`. Perhaps we should change that so that it always runs pending tools if there are some, but for now this change just fixes setting `stop_reason` for Google models. Release Notes: - N/A	2025-05-14 15:43:17 +03:00
Agus Zubiaga	a4766e296f	Add tool result image support to Gemini models (#30647 ) Release Notes: - Add tool result image support to Gemini models	2025-05-14 00:51:31 +00:00
Agus Zubiaga	dd6594621f	Add image input support for OpenAI models (#30639 ) Release Notes: - Added input image support for OpenAI models	2025-05-13 17:32:42 +02:00
Marshall Bowers	7cad943fde	agent: Remove unused max monthly spend reached error (#30615 ) This PR removes the code for showing the max monthly spend limit reached error, as it is no longer used. Release Notes: - N/A	2025-05-13 09:43:13 +00:00
Richard Feldman	8fdf309a4a	Have read_file support images (#30435 ) This is very basic support for them. There are a number of other TODOs before this is really a first-class supported feature, so not adding any release notes for it; for now, this PR just makes it so that if read_file tries to read a PNG (which has come up in practice), it at least correctly sends it to Anthropic instead of messing up. This also lays the groundwork for future PRs for more first-class support for images in tool calls across more image file formats and LLM providers. Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <hi@aguz.me> Co-authored-by: Agus Zubiaga <agus@zed.dev>	2025-05-13 10:58:00 +02:00
Bennet Bo Fenner	3ea86da16f	Copilot fix o1 model (#30581 ) Release Notes: - Fixed an issue where the `o1` model would not work when using Copilot Chat	2025-05-12 15:27:24 +00:00
Umesh Yadav	a6c3d49bb9	language_models: Add vision support for Copilot Chat models (#30155 ) Problem Statement: Support for image analysis (vision) is currently restricted to Anthropic and Gemini models. This limits users who wish to leverage vision capabilities available in other models, such as Copilot, for tasks like attaching image context within the agent message editor. Proposed Change: This PR extends vision support to include Copilot models that are already equipped with vision capabilities. This integration will allow users within VS Code to attach and analyze images using supported Copilot models via the agent message editor. Scope Limitation: This PR does not implement controls within the message editor to ensure that image context (e.g., through copy-paste or attachment) is exclusively enabled or prompted only when a vision-supported model is active. Long term the message editor should have access to each models vision capability and stop the users from attaching images by either greying out the context saying it's not support or not work through both copy paste and file/directory search. Closes #30076 Release Notes: - Add vision support for Copilot Chat models --------- Co-authored-by: Bennet Bo Fenner <bennet@zed.dev>	2025-05-12 13:11:38 +00:00
Liam	f14e48d202	language_models: Dynamically detect Copilot Chat models (#29027 ) I noticed the discussion in #28881, and had thought of exactly the same a few days prior. This implementation should preserve existing functionality fairly well. I've added a dependency (serde_with) to allow the deserializer to skip models which cannot be deserialized, which could occur if a future provider, for instance, is added. Without this modification, such a change could break all models. If extra dependencies aren't desired, a manual implementation could be used instead. - Closes #29369 Release Notes: - Dynamically detect available Copilot Chat models, including all models with tool support --------- Co-authored-by: AidanV <aidanvanduyne@gmail.com> Co-authored-by: imumesh18 <umesh4257@gmail.com> Co-authored-by: Bennet Bo Fenner <bennet@zed.dev> Co-authored-by: Agus Zubiaga <hi@aguz.me>	2025-05-12 11:28:41 +00:00
Danilo Leal	83319c8a6d	agent: Fix instruction list item with multiple buttons not working (#30541 ) This was a particular problem in the Amazon Bedrock section (at least for now) where there were multiple buttons and none of them actually worked because they all had the same id. Release Notes: - agent: Fixed Amazon Bedrock settings link buttons not working.	2025-05-12 06:19:20 -03:00
Kirill Bulatov	471e02d48f	Separate timeout and connection dropped errors out (#30457 )	2025-05-10 15:12:58 +03:00
Marshall Bowers	f29c6e5661	Update `zed_llm_client` to v0.8.1 (#30433 ) This PR updates the `zed_llm_client` crate to v0.8.1. The name of `Plan::Free` changed to `Plan::ZedFree` in this version. Release Notes: - N/A	2025-05-09 21:08:03 +00:00
Antonio Scandurra	25ced2e3c2	Fix error when Copilot calls tools without arguments (#30371 ) Fixes https://github.com/zed-industries/zed/issues/30346 The model can output an empty string to indicate the absence of arguments, which can't be parsed as a `serde_json::Value`. When that happens, we now create an empty object instead on behalf of the model. Release Notes: - Fixed a bug that prevented Copilot models from calling the `diagnostic` tool.	2025-05-09 12:14:36 +00:00
Shardul Vaidya	648d0054de	bedrock: Fix UX bug (#28350 ) Closes #29072, #28390, Release Notes: - AWS Bedrock: Fixed case where user couldn't delete manually added AWS credentials. --------- Co-authored-by: Marshall Bowers <git@maxdeviant.com> Co-authored-by: Peter Tripp <peter@zed.dev>	2025-05-08 22:09:18 +00:00
Marshall Bowers	f21780cef3	Remove individual URL overrides for LLM service (#30290 ) This PR removes the individual URL overrides for the LLM service. We initially had `ZED_PREDICT_EDITS_URL` to allow for directing traffic to the LLM Worker back when there was still the split of the Collab-based LLM Service and the Cloudflare-based LLM Worker. But now that all of the LLM functionality has been moved into the Worker, we can just direct all traffic there. Release Notes: - N/A	2025-05-08 17:54:46 +00:00
Marshall Bowers	b343a8aa22	language_models: Improve subscription states in the Agent configuration view (#30252 ) This PR improves the subscription states in the Agent configuration view to the new billing system. Zed Free (legacy): <img width="638" alt="Screenshot 2025-05-08 at 8 42 59 AM" src="https://github.com/user-attachments/assets/7b62d4c1-2a9c-4c6a-aa8f-060730b6d7b3" /> Zed Free (new): <img width="640" alt="Screenshot 2025-05-08 at 8 43 56 AM" src="https://github.com/user-attachments/assets/8a48448e-813e-4633-955d-623d3e6d603c" /> Zed Pro trial: <img width="641" alt="Screenshot 2025-05-08 at 8 45 52 AM" src="https://github.com/user-attachments/assets/1ec7ee62-e954-48e7-8447-4584527307c9" /> Zed Pro: <img width="636" alt="Screenshot 2025-05-08 at 8 47 21 AM" src="https://github.com/user-attachments/assets/f934b2e3-0943-4b78-b8dc-0a31e731d8fb" /> Release Notes: - agent: Improved the subscription-related information in the configuration view.	2025-05-08 09:10:50 -04:00
Ben Brandt	3a3d3c05e8	Improve token counting for OpenAI models (#30242 ) tiktoken_rs is a bit behind (and even upstream tiktoken doesn't have all of these models) We were incorrectly using the cl100k tokenizer for some models that actually use the o200k tokenizers. So that is updated. I also made the match arms specific so that we do a better job of catching whether or not tiktoken-rs accurately supports new models we add in. I will also do a PR upstream to see if we can move some of this logic back out if tiktoken better supports the newer models. Release Notes: - Improved tokenizer support for openai models.	2025-05-08 13:09:29 +00:00
Antonio Scandurra	9f6809a28d	Reuse conversation cache when streaming edits (#30245 ) Release Notes: - Improved latency when the agent applies edits.	2025-05-08 14:36:34 +02:00
Max Brunsfeld	3c128ef83f	Avoid empty schema in copilot dummy tool (#30178 ) Copilot chat still returns a 400 if the dummy tool uses the `{}` schema. This is a follow-up to https://github.com/zed-industries/zed/pull/30007. Release Notes: - Fixed a bug where agent edits would fail when using GitHub Copilot Chat. Co-authored-by: Agus Zubiaga <hi@aguz.me>	2025-05-07 20:08:38 +00:00
Marshall Bowers	4469b7339f	language_models: Update copy for Zed Pro subscription (#30152 ) This PR updates the copy around the Zed Pro description to be more accurate. Release Notes: - agent: Updated some copy about Zed Pro in the configuration view.	2025-05-07 17:15:02 +00:00
Marshall Bowers	a34fb6f6b1	Send up Zed version with edit prediction and completion requests (#30136 ) This PR makes it so we send up an `x-zed-version` header with the client's version when making a request to llm.zed.dev for edit predictions and completions. Release Notes: - N/A	2025-05-07 15:44:30 +00:00
Richard Feldman	fcb9706022	Improve Ollama tool use (#30120 ) <img width="458" alt="Screenshot 2025-05-07 at 9 37 39 AM" src="https://github.com/user-attachments/assets/80f8a9b8-6a13-4e84-b91d-140e11475638" /> <img width="603" alt="Screenshot 2025-05-07 at 9 37 33 AM" src="https://github.com/user-attachments/assets/7fe67a68-3885-4a0e-a282-aad37e92068b" /> Release Notes: - Ollama models no longer require the supports_tools field in settings (defaults to false) --------- Co-authored-by: Antonio Scandurra <me@as-cii.com>	2025-05-07 15:37:06 +00:00
Agus Zubiaga	3cdf5ce947	agent: Allow customizing temperature by provider/model (#30033 ) Adds a new `agent.model_parameters` setting that allows the user to specify a custom temperature for a provider AND/OR model: ```json5 "model_parameters": [ // To set parameters for all requests to OpenAI models: { "provider": "openai", "temperature": 0.5 }, // To set parameters for all requests in general: { "temperature": 0 }, // To set parameters for a specific provider and model: { "provider": "zed.dev", "model": "claude-3-7-sonnet-latest", "temperature": 1.0 } ], ``` Release Notes: - agent: Allow customizing temperature by provider/model --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Marshall Bowers <git@maxdeviant.com>	2025-05-06 20:36:25 +00:00
Umesh Yadav	a743035286	lmstudio: Fix streaming not working in v0.3.15 (#30013 ) Closes #29781 Tested this with llama3, gemma3 and qwen3. This is a breaking change, which means after adding this code changes in future version zed we will require atleast lmstudio >= 0.3.15. For context why it's breaking changes check out the issue: #29781. What this doesn't try to solve is: * Tool calling, thinking text rendering. Will raise a seperate PR for these as those are not required in this PR to make it work. https://github.com/user-attachments/assets/945f9c73-6323-4a88-92e2-2219b760a249 Release Notes: - lmstudio: Fixed Zed support for LMStudio >= v0.3.15 (breaking change -- older versions are no longer supported). --------- Co-authored-by: Peter Tripp <peter@zed.dev>	2025-05-06 12:59:36 -04:00
Antonio Scandurra	0f50e6b1d1	Fix error when requesting completion to Copilot Chat without tools (#30007 ) The API will return a Bad Request (with no error message) when tools were used previously in the conversation but no tools are provided as part of a new request. Inserting a dummy tool seems to circumvent this error. Release Notes: - Fixed an error that could sometimes occur when editing using Copilot Chat. Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>	2025-05-06 14:19:59 +00:00
Bennet Bo Fenner	e44367c6d0	agent: Disable claude-3-7-sonnet-thinking tool support for Copilot Chat (#29999 ) We started getting Bad Requests from the Copilot Chat API. Seems like Microsoft stopped supporting this: <img width="331" alt="image" src="https://github.com/user-attachments/assets/46050063-f031-4836-82ff-219bdd45639a" /> Release Notes: - agent: Disable `claude-3-7-sonnet-thinking` for Copilot Chat Provider because it is not supported by Copilot Chat Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>	2025-05-06 12:47:26 +00:00
Max Brunsfeld	2eb10ab9fb	openai: Don't append tool calls to prior assistant messages (#29969 ) Closes https://github.com/zed-industries/zed/issues/29821 Release Notes: - Fixed an issue in the agent panel where OpenAI requests would fail if the assistant begins its response with a tool call.	2025-05-05 22:04:56 -07:00
tidely	769ec59162	ollama: Add tool call support (#29563 ) The goal of this PR is to support tool calls using ollama. A lot of the serialization work was done in https://github.com/zed-industries/zed/pull/15803 however the abstraction over language models always disables tools. ## Changelog: - Use `serde_json::Value` inside `OllamaFunctionCall` just as it's used in `OllamaFunctionCall`. This fixes deserialization of ollama tool calls. - Added deserialization tests using json from official ollama api docs. - Fetch model capabilities during model enumeration from ollama provider - Added `supports_tools` setting to manually configure if a model supports tools ## TODO: - [x] Fix tool call serialization/deserialization - [x] Fetch model capabilities from ollama api - [x] Add tests for parsing model capabilities - [ ] Documentation for `supports_tools` field for ollama language model config - [ ] Convert between generic language model types - [x] Pass tools to ollama Release Notes: - N/A --------- Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: Nathan Sobo <nathan@zed.dev>	2025-05-05 17:52:23 +00:00
Umesh Yadav	251f26d48a	copilot: Add support for tool_calls for gpt-4.1, gpt-4o, o4-mini (#29369 ) Github Copilot currently supports following models for agent mode with tool calls. Currently we are only supporting anthropic models and not openai and gemini. This PR add support for the openai models. I have tested it and it works for all of them. For gemini models it seems there is a issues from copilot side so not adding that in this PR as enabling gemini model breaks it in the ask mode as well. <img width="392" alt="image" src="https://github.com/user-attachments/assets/fb7a4148-e48c-45c5-9ff9-c02f71217dfb" /> - [x] GPT-4.1 - [x] GPT-4.0 - [x] o4-mini Release Notes: - agent: Add tool calling support for gpt-4.1, gpt-4o, o4-mini when using Copilot Chat as a provider Signed-off-by: Umesh Yadav <umesh4257@gmail.com>	2025-05-05 13:59:12 +02:00
Michael Sloan	76ad1a29a5	Add support for getting the token count for all parts of Gemini generation requests (#29630 ) * `CountTokensRequest` now takes a full `GenerateContentRequest` instead of just content. * Fixes use of `models/` prefix in `model` field of `GenerateContentRequest`, since that's required for use in `CountTokensRequest`. This didn't cause issues before because it was always cleared and used in the path. Release Notes: - N/A	2025-05-04 21:32:45 +00:00
Michael Sloan	f4e9ea3cd8	In error text of cloud LLM API: `completion failed` -> `request failed` (#29888 ) This error is used for more requests than completion requests Release Notes: - N/A	2025-05-04 21:04:34 +00:00

1 2 3 4 5 ...

262 commits