Fixes https://github.com/zed-industries/zed/issues/30346
The model can output an empty string to indicate the absence of
arguments, which can't be parsed as a `serde_json::Value`. When that
happens, we now create an empty object instead on behalf of the model.
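A minimal sketch of the fallback, assuming a helper along these lines (names are illustrative, not the exact code in this PR):

```rust
use serde_json::Value;

/// Parse a tool call's arguments, treating an empty string from the model
/// as "no arguments", i.e. an empty JSON object.
fn parse_tool_arguments(raw: &str) -> serde_json::Result<Value> {
    if raw.trim().is_empty() {
        // The model sent no arguments at all; substitute `{}` on its behalf.
        Ok(Value::Object(serde_json::Map::new()))
    } else {
        serde_json::from_str(raw)
    }
}
```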
Release Notes:
- Fixed a bug that prevented Copilot models from calling the
`diagnostic` tool.
This PR removes the individual URL overrides for the LLM service.
We initially had `ZED_PREDICT_EDITS_URL` to allow for directing traffic
to the LLM Worker back when there was still the split of the
Collab-based LLM Service and the Cloudflare-based LLM Worker.
But now that all of the LLM functionality has been moved into the
Worker, we can just direct all traffic there.
Release Notes:
- N/A
tiktoken_rs is a bit behind (and even upstream tiktoken doesn't have all of these models).
We were incorrectly using the cl100k tokenizer for some models that actually use the o200k tokenizer. That is now fixed.
I also made the match arms explicit so that we do a better job of catching whether or not tiktoken-rs accurately supports new models as we add them.
I will also open a PR upstream to see if we can move some of this logic back out once tiktoken better supports the newer models.
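Roughly, the explicit mapping looks like this (a sketch using tiktoken-rs; the model lists here are illustrative, not the exact arms in the PR):

```rust
use tiktoken_rs::{cl100k_base, o200k_base, CoreBPE};

/// Pick the tokenizer per model explicitly, so an unrecognized model is an
/// error we notice instead of a silently wrong token count.
fn tokenizer_for(model: &str) -> anyhow::Result<CoreBPE> {
    match model {
        // Older chat models use the cl100k vocabulary.
        "gpt-4" | "gpt-3.5-turbo" => cl100k_base(),
        // gpt-4o and newer models use the o200k vocabulary.
        "gpt-4o" | "gpt-4o-mini" | "o1" | "o3-mini" => o200k_base(),
        other => anyhow::bail!("no known tokenizer for model: {other}"),
    }
}
```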
Release Notes:
- Improved tokenizer support for OpenAI models.
Copilot Chat still returns a 400 if the dummy tool uses the `{}` schema.
This is a follow-up to https://github.com/zed-industries/zed/pull/30007.
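For reference, a sketch of the working shape, assuming the fix is to give the dummy tool a minimal object schema rather than a bare `{}` (names are hypothetical):

```rust
use serde_json::{json, Value};

/// A dummy tool whose input schema is a minimal object schema; a bare `{}`
/// schema triggers the 400 described above.
fn dummy_tool() -> Value {
    json!({
        "type": "function",
        "function": {
            "name": "noop", // hypothetical name
            "description": "Placeholder tool; never invoked.",
            "parameters": { "type": "object", "properties": {} }
        }
    })
}
```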
Release Notes:
- Fixed a bug where agent edits would fail when using GitHub Copilot
Chat.
Co-authored-by: Agus Zubiaga <hi@aguz.me>
This PR updates the copy around the Zed Pro description to be more
accurate.
Release Notes:
- agent: Updated some copy about Zed Pro in the configuration view.
This PR makes it so we send up an `x-zed-version` header with the
client's version when making a request to llm.zed.dev for edit
predictions and completions.
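Conceptually it's just one extra header on the request; a sketch using reqwest (Zed's actual HTTP client and endpoint wiring differ):

```rust
/// Send a completion request to llm.zed.dev, tagged with the client version.
async fn post_completion(
    client: &reqwest::Client,
    body: serde_json::Value,
) -> reqwest::Result<reqwest::Response> {
    client
        .post("https://llm.zed.dev/completions")
        // `x-zed-version` carries the Zed client's version.
        .header("x-zed-version", env!("CARGO_PKG_VERSION"))
        .json(&body)
        .send()
        .await
}
```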
Release Notes:
- N/A
Adds a new `agent.model_parameters` setting that allows the user to
specify a custom temperature for a provider AND/OR model:
```json5
"model_parameters": [
// To set parameters for all requests to OpenAI models:
{
"provider": "openai",
"temperature": 0.5
},
// To set parameters for all requests in general:
{
"temperature": 0
},
// To set parameters for a specific provider and model:
{
"provider": "zed.dev",
"model": "claude-3-7-sonnet-latest",
"temperature": 1.0
}
],
```
Release Notes:
- agent: Allow customizing temperature by provider/model
---------
Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>
Co-authored-by: Marshall Bowers <git@maxdeviant.com>
Closes #29781
Tested this with llama3, gemma3, and qwen3.
This is a breaking change: once these changes land, future versions of Zed will require LM Studio >= 0.3.15. For context on why it's a breaking change, see issue #29781.
What this doesn't try to solve:
* Tool calling and thinking-text rendering. I will raise a separate PR for those, as they are not required to make this work.
https://github.com/user-attachments/assets/945f9c73-6323-4a88-92e2-2219b760a249
Release Notes:
- lmstudio: Fixed Zed support for LM Studio >= v0.3.15 (breaking change -- older versions are no longer supported).
---------
Co-authored-by: Peter Tripp <peter@zed.dev>
The API will return a Bad Request (with no error message) when tools
were used previously in the conversation but no tools are provided as
part of a new request.
Inserting a dummy tool seems to circumvent this error.
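A sketch of the workaround (illustrative; the actual field types and tool name in the PR may differ):

```rust
use serde_json::{json, Value};

/// If earlier turns in the conversation used tools but this request carries
/// none, insert a dummy tool so the API doesn't return a Bad Request.
fn ensure_tools(tools: &mut Vec<Value>, conversation_used_tools: bool) {
    if conversation_used_tools && tools.is_empty() {
        tools.push(json!({
            "type": "function",
            "function": {
                "name": "noop", // hypothetical placeholder
                "description": "Placeholder tool; never invoked.",
                "parameters": { "type": "object", "properties": {} }
            }
        }));
    }
}
```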
Release Notes:
- Fixed an error that could sometimes occur when editing using Copilot
Chat.
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
We started getting Bad Requests from the Copilot Chat API.
Seems like Microsoft stopped supporting this:
<img width="331" alt="image"
src="https://github.com/user-attachments/assets/46050063-f031-4836-82ff-219bdd45639a"
/>
Release Notes:
- agent: Disable `claude-3-7-sonnet-thinking` for Copilot Chat Provider
because it is not supported by Copilot Chat
Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>
The goal of this PR is to support tool calls using ollama. A lot of the
serialization work was done in
https://github.com/zed-industries/zed/pull/15803; however, the abstraction
over language models always disabled tools.
## Changelog:
- Use `serde_json::Value` for the arguments inside `OllamaFunctionCall`. This fixes deserialization of ollama tool calls (see the sketch after this changelog).
- Added deserialization tests using JSON from the official ollama API docs.
- Fetch model capabilities from the ollama provider during model enumeration.
- Added a `supports_tools` setting to manually configure whether a model supports tools.
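As a sketch, the fixed shape looks roughly like this (field names follow the ollama API docs; the exact Zed types may differ):

```rust
use serde::Deserialize;
use serde_json::Value;

/// A tool call as returned by ollama. The arguments arrive as arbitrary
/// JSON, so they're held as a `serde_json::Value` rather than a string.
#[derive(Debug, Deserialize)]
struct OllamaFunctionCall {
    name: String,
    arguments: Value,
}
```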
## TODO:
- [x] Fix tool call serialization/deserialization
- [x] Fetch model capabilities from ollama api
- [x] Add tests for parsing model capabilities
- [ ] Documentation for `supports_tools` field for ollama language model
config
- [ ] Convert between generic language model types
- [x] Pass tools to ollama
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Nathan Sobo <nathan@zed.dev>
GitHub Copilot currently supports the following models for agent mode with
tool calls. We currently only support the Anthropic models, not the OpenAI
and Gemini ones. This PR adds support for the OpenAI models; I have tested
it, and it works for all of them. The Gemini models seem to have an issue on
Copilot's side, so they are not added in this PR, since enabling them breaks
ask mode as well.
<img width="392" alt="image"
src="https://github.com/user-attachments/assets/fb7a4148-e48c-45c5-9ff9-c02f71217dfb"
/>
- [x] GPT-4.1
- [x] GPT-4o
- [x] o4-mini
Release Notes:
- agent: Add tool calling support for gpt-4.1, gpt-4o, o4-mini when
using Copilot Chat as a provider
Signed-off-by: Umesh Yadav <umesh4257@gmail.com>
* `CountTokensRequest` now takes a full `GenerateContentRequest` instead
of just content.
* Fixes the use of the `models/` prefix in the `model` field of
`GenerateContentRequest`, since that's required for use in
`CountTokensRequest`. This didn't cause issues before because the field was
always cleared, with the model name supplied via the request path instead.
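For illustration, the body of a count-tokens call now looks roughly like this (a sketch per Google's API docs; Zed's typed structs differ):

```rust
use serde_json::{json, Value};

/// Body for a `countTokens` call: the nested `GenerateContentRequest` must
/// spell the model as `models/<name>`, even though a plain generate call
/// names the model in the URL path instead.
fn count_tokens_body(model: &str, text: &str) -> Value {
    json!({
        "generateContentRequest": {
            "model": format!("models/{model}"),
            "contents": [{ "role": "user", "parts": [{ "text": text }] }]
        }
    })
}
```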
Release Notes:
- N/A
I don't think this makes much of a difference in current use, but it more
closely matches other providers and cleans up the "Response" section of the
eval markdown output.
Release Notes:
- N/A
This PR adds a notice when reaching the consecutive tool use limit in normal mode.
Here's an example with the limit artificially lowered to 2 consecutive
tool uses:
https://github.com/user-attachments/assets/32da8d38-67de-4d6b-8f24-754d2518e5d4
Release Notes:
- agent: Added a notice when reaching consecutive tool use limits when
using a model in normal mode.
This sets us up to display queue position information to the user, once
our language model backend is updated to support request queuing.
The JSON returned by the LLM backend will need to look like this:
```json
{"queue": {"status": "queued", "position": 1}}
{"queue": {"status": "started"}}
{"event": {"THE_UPSTREAM_MODEL_PROVIDER_EVENT": "..."}}
```
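On the client side, these lines could be deserialized with something like this (an illustrative sketch; the actual Zed types may differ):

```rust
use serde::Deserialize;
use serde_json::Value;

/// One line of the response stream: either a queue update from our backend
/// or a passed-through event from the upstream model provider.
#[derive(Debug, Deserialize)]
enum StreamLine {
    #[serde(rename = "queue")]
    Queue(QueueEvent),
    #[serde(rename = "event")]
    Event(Value),
}

#[derive(Debug, Deserialize)]
#[serde(tag = "status", rename_all = "snake_case")]
enum QueueEvent {
    Queued { position: u64 },
    Started,
}
```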
Release Notes:
- N/A
---------
Co-authored-by: Marshall Bowers <git@maxdeviant.com>
This PR changes the default fast model for the Zed provider from Claude
3.5 Haiku to Claude 3.5 Sonnet.
We don't offer Claude 3.5 Haiku to users.
Closes https://github.com/zed-industries/zed/issues/29505.
Release Notes:
- agent: Changed the default fast model for the Zed provider to Claude
3.5 Sonnet.
This PR makes it so we pass up the `mode` from the
`LanguageModelRequest` when interacting with the Zed provider instead of
passing a hard-coded value.
Release Notes:
- N/A
This PR removes the `language-models` feature flag.
This feature is already generally available, so we no longer need the
feature flag.
Release Notes:
- N/A
This PR adds the `FeatureFlag` suffix to the feature flag types that
were missing them.
This makes the names easier to search in the codebase.
Release Notes:
- N/A
This PR updates the Zed provider to use the `POST /completions`
endpoint.
There is no functional difference from `POST /completion`, but the
pluralized version reads better.
Release Notes:
- N/A
#29354 introduced a bug where we would append tool uses to the last
assistant message even if it was from a previous request.
Release Notes:
- N/A
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
This is to enable alternative streaming solutions at the application
layer. I'm not sure we really should have performed parsing of the input
at this layer. Either way I want to experiment with streaming approaches
in a separate crate on a branch, and this will help.
/cc @maxdeviant @bennetbo @rtfeldman
Release Notes:
- N/A
This PR wires the counting of Google AI tokens back up.
It now goes through the LLM service instead of collab's RPC.
Still only available for Zed staff.
Release Notes:
- N/A
This PR removes the `CountLanguageModelTokens` RPC message from collab.
We were only using this for Google AI models through the Zed provider
(which is only available to Zed staff).
For now we're returning `0`, but we'll bring this back soon.
Release Notes:
- N/A
Things this doesn't currently handle:
- [x] ~testing~
- ~we really need a snapshot test that takes a vscode settings file
with all options that we support, and verifies the zed settings file you
get from importing it, both from an empty starting file or one with lots
of conflicts. that way we can open said vscode settings file in vscode
to ensure that those options all still exist in the future.~
- Discussed this, we don't think this will meaningfully protect us from
future failures, and we will just do this as a manual validation step
before merging this PR. Any imports that have meaningfully complex
translation steps should still be tested.
- [x] confirmation (right now it just clobbers your settings file
silently)
- it'd be really cool if we could show a diff multibuffer of your
current settings with the result of the vscode import and let you pick
"hunks" to keep, but that's probably too much effort for this feature,
especially given that we expect most of the people using it to have an
empty/barebones zed config when they run the import.
- [x] ~UI in the "welcome" page~
- we're planning on redoing our welcome/walkthrough experience anyways,
but in the meantime it'd be nice to conditionally show a button there if
we see a user level vscode config
- we'll add it to the UI when we land the new walkthrough experience,
for now it'll be accessible through the action
- [ ] project-specific settings
- handling translation of `.vscode/settings.json` or `.code-workspace`
settings to `.zed/settings.json` will come in a future PR, along with UI
to prompt the user for those actions when opening a project with local
vscode settings for the first time
- [ ] extension settings
- we probably want to do a best-effort pass of popular extensions like
vim and git lens
- it's also possible to look for installed/enabled extensions with `code
--list-extensions`, but we'd have to maintain some sort of mapping of
those to our settings and/or extensions
- [ ] LSP settings
- these are tricky without access to the json schemas for various
language server extensions. we could probably manage to do translations
for a couple popular languages and avoid solving it in the general case.
- [ ] platform specific settings (`[macos].blah`)
- this is blocked on #16392 which I'm hoping to address soon
- [ ] language specific settings (`[rust].foo`)
- totally doable, just haven't gotten to it yet
~We may want to put this behind some kind of flag and/or not land it
until some of the above issues are addressed, given that we expect
people to only run this importer once there's an incentive to get it
right the first time. Maybe we land it alongside a keymap importer so
you don't have to go through separate imports for those?~
We are gonna land this as-is; all the unchecked items at the bottom will be
addressed in followup PRs, so maybe don't run the importer for now if you
have a large and complex VSCode settings file you'd like to import.
Release Notes:
- Added a VSCode settings importer, available via a
`zed::ImportVsCodeSettings` action
---------
Co-authored-by: Mikayla Maki <mikayla@zed.dev>
Co-authored-by: Kirill Bulatov <kirill@zed.dev>
Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com>
Co-authored-by: Marshall Bowers <git@maxdeviant.com>
We're now more tolerant of invalid JSON coming back from the model (often
because it's incomplete while still streaming). If we still end up with
invalid JSON once everything has streamed back, we now report what the
malformed JSON actually was:
<img width="444" alt="Screenshot 2025-04-23 at 1 49 14 PM"
src="https://github.com/user-attachments/assets/480f5da7-869b-49f3-9ffd-8f08ccddb33d"
/>
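The gist of the tolerant parsing, as a sketch (illustrative, not the exact code):

```rust
use serde_json::Value;

/// Accumulates streamed JSON. Mid-stream parse failures are treated as
/// "incomplete so far"; only once the stream ends do we report an error,
/// including the malformed input itself.
#[derive(Default)]
struct StreamingJson {
    buffer: String,
}

impl StreamingJson {
    /// Append a chunk and return the value if the buffer parses so far.
    fn push(&mut self, chunk: &str) -> Option<Value> {
        self.buffer.push_str(chunk);
        serde_json::from_str(&self.buffer).ok()
    }

    /// Called when the stream ends: a parse failure is now a real error.
    fn finish(self) -> Result<Value, String> {
        serde_json::from_str(&self.buffer)
            .map_err(|e| format!("invalid JSON ({e}): {}", self.buffer))
    }
}
```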
Release Notes:
- N/A
* Adds a fast / cheaper model to providers and defaults thread
summarization to this model. Initial motivation for this was that
https://github.com/zed-industries/zed/pull/29099 would cause these
requests to fail when used with a thinking model. It doesn't seem
correct to use a thinking model for summarization.
* Skips system prompt, context, and thinking segments.
* If tool use is happening, allows 2 tool uses + one more agent response
before summarizing.
The downside is that there was previously some potential for prefix-cache
reuse, especially for title summarization (thread summarization omitted
tool results and so would not share a prefix for those). This seems fine,
as these requests should typically be fairly small. Even for full thread
summarization, skipping all tool use / context should greatly reduce
token use.
Release Notes:
- N/A
Looks like the required backend component of this was deployed.
https://github.com/zed-industries/monorepo/actions/runs/14541199197
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Agus Zubiaga <hi@aguz.me>
Co-authored-by: Richard Feldman <oss@rtfeldman.com>
Co-authored-by: Nathan Sobo <nathan@zed.dev>
This PR attaches the thread ID and the new prompt ID to telemetry events
for completions in the Agent panel.
Release Notes:
- N/A
---------
Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com>
This PR updates the Agent to extract the usage information from the
response headers, if they are present.
For now we just log the information, but we'll be using this soon to
populate some UI.
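The logging step might look something like this (a sketch; the header names here are hypothetical, since the PR doesn't spell them out):

```rust
use http::HeaderMap;

/// Log any usage-related headers present on the response.
fn log_usage(headers: &HeaderMap) {
    // Hypothetical header names for illustration only.
    for name in ["x-zed-usage-amount", "x-zed-usage-limit"] {
        if let Some(value) = headers.get(name).and_then(|v| v.to_str().ok()) {
            log::info!("usage header {name}: {value}");
        }
    }
}
```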
Release Notes:
- N/A