This uses the `current_user` watch in the `UserStore` instead of polling
every 100ms to detect whether the user has signed in.
We are changing this because we noticed it was causing the deterministic
executor in tests to never detect a "parking with nothing left to run"
situation.
This seems better in production as well, especially for users who never
sign in.
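A minimal sketch of the pattern, using a tokio-style `watch` channel as a
stand-in for Zed's actual watch type (names here are illustrative):

```rust
use tokio::sync::watch;

struct User;

// Instead of waking every 100ms to poll sign-in state, await
// notifications from the `current_user` watch.
async fn wait_for_sign_in(mut current_user: watch::Receiver<Option<User>>) {
    while current_user.borrow().is_none() {
        if current_user.changed().await.is_err() {
            return; // sender dropped; the user will never sign in
        }
    }
    // Signed in. While we wait, no timer is constantly re-waking the
    // task, so a deterministic test executor can observe "parked with
    // nothing left to run".
}
```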
/cc @maxdeviant
Release Notes:
- N/A
Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>
This is really just a small beginning, as there are many other icons to
be revised and cleaned up. Our current set is a bit of a mess in terms
of dimension, spacing, stroke width, and terminology. I'm sure there are
more unused icons I'm not covering here, too. We'll hopefully tackle
it all soon leading up to 1.0.
Closes https://github.com/zed-industries/zed/issues/35576
Release Notes:
- N/A
This pull request should not change any behavior, but it lays the
groundwork for avoiding having to connect to Collab in order to interact
with AI features provided by Zed.
Release Notes:
- N/A
---------
Co-authored-by: Marshall Bowers <git@maxdeviant.com>
Co-authored-by: Richard Feldman <oss@rtfeldman.com>
This PR updates the Agent panel to work with the `CloudUserStore`
instead of the `UserStore`, reducing its reliance on being connected to
Collab to function.
Release Notes:
- N/A
---------
Co-authored-by: Richard Feldman <oss@rtfeldman.com>
Closes #26030
*Note: This is my first contribution to Zed*
This addresses a second streaming bottleneck in Bedrock that remained
after the initial fix in #28281 (released in preview 194).
The issue is in the mechanism used to convert Zed's internal `AsyncBody`
into the `SdkBody` expected by the Bedrock language provider. We are
using a non-streaming converter that buffers responses.
**How the fix works:**
The AWS SDK provides streaming-compatible converters to create `SdkBody`
instances, but these require the input body to implement the `Body`
trait from the `http-body` crate.
This PR enables streaming by implementing the required trait and
switching to the streaming-compatible converter.
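A rough sketch of both pieces, assuming `AsyncBody` can be reduced to a
pinned `AsyncRead` (the real Zed type is more involved):

```rust
use std::{
    pin::Pin,
    task::{Context, Poll},
};

use aws_smithy_types::body::SdkBody;
use bytes::Bytes;
use futures::AsyncRead;
use http_body::{Body, Frame};

// Stand-in for Zed's `AsyncBody`: a pinned async reader.
struct AsyncBody(Pin<Box<dyn AsyncRead + Send + Sync + 'static>>);

impl Body for AsyncBody {
    type Data = Bytes;
    type Error = std::io::Error;

    fn poll_frame(
        self: Pin<&mut Self>,
        cx: &mut Context<'_>,
    ) -> Poll<Option<Result<Frame<Self::Data>, Self::Error>>> {
        let this = self.get_mut(); // `AsyncBody` is Unpin: the reader is boxed
        // Read whatever chunk is available right now and forward it as a
        // frame, instead of buffering the whole response first.
        let mut buf = [0u8; 8 * 1024];
        match this.0.as_mut().poll_read(cx, &mut buf) {
            Poll::Ready(Ok(0)) => Poll::Ready(None), // EOF
            Poll::Ready(Ok(n)) => {
                // The one copy discussed in the FAQ below: the chunk is
                // copied into an owned `Bytes` before reaching the SDK.
                Poll::Ready(Some(Ok(Frame::data(Bytes::copy_from_slice(&buf[..n])))))
            }
            Poll::Ready(Err(e)) => Poll::Ready(Some(Err(e))),
            Poll::Pending => Poll::Pending,
        }
    }
}

// The streaming-compatible converter (requires the `http-body-1-x`
// feature of `aws-smithy-types`).
fn to_sdk_body(body: AsyncBody) -> SdkBody {
    SdkBody::from_body_1_x(body)
}
```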
**Changes (2 commits):**
* 1st Commit - **Implement http-body Body trait for AsyncBody:**
- Add `http-body = 1.0` dependency (already an indirect dependency)
- Implement the `Body` trait for our existing `AsyncBody` type
- Uses `poll_frame` to read data chunks asynchronously, preserving
streaming behavior
* 2nd Commit - **Use streaming-compatible AWS SDK converter:**
- Create `SdkBody` using `SdkBody::from_body_1_x()` with the new `Body`
trait implementation
**Details/FAQ:**
**Q: Why add another dependency?**
A: We tried to avoid adding a dependency, but the AWS SDK requires the
`Body` trait and `http-body` is where it's defined. The crate is already
an indirect dependency, making this a reasonable solution.
**Q: Why modify the shared `http_client` crate instead of just
`aws_bedrock_client`?**
A: We considered implementing the `Body` trait on a wrapper in
`aws_bedrock_client`, but since `AsyncBody` already uses `http` crate
types, extending support to the companion `http-body` crate seems
reasonable and may benefit other integrations.
**Q: How was this bottleneck discovered?**
A: After @5herlocked's initial streaming fix in #28281, I tested preview
194 and noticed streaming still had issues. I found a way to reproduce
the problem and chatted with @5herlocked about it. He immediately
pinpointed the exact location where the issue was occurring; his
diagnosis made this fix possible.
**Q: How does this relate to the previous fix?**
A: #28281 fixed buffering issues higher in the stack, but unfortunately
there was another bottleneck lower down in the aws-http-client. This PR
addresses that separate buffering issue.
**Q: Does this use zero-copy or one-copy?**
A: The `Body` implementation includes one copy. Someone more
knowledgeable might be able to achieve zero-copy, but we opted for the
conservative approach. The performance impact should not be perceptible
in typical usage.
**Testing:**
Confirmed that Bedrock streaming now works without buffering delays in a
local build.
Release Notes:
- Improved Bedrock streaming by eliminating response buffering delays
---------
Co-authored-by: Marshall Bowers <git@maxdeviant.com>
TODO
- [x] OpenAI Compatible API Icon
- [x] Docs
- [x] Link to docs in OpenAI provider section about configuring OpenAI
API compatible providers
Closes #33992
Related to #30010
Release Notes:
- agent: Add support for adding multiple OpenAI API compatible providers
---------
Co-authored-by: MrSubidubi <dev@bahn.sh>
Co-authored-by: Danilo Leal <daniloleal09@gmail.com>
Closes #32675
Exactly the same changes as in #33640 by @sviande
The PR has been in WIP state for 3 weeks with no activity, and the issue
basically makes Mistral models unusable. I have tested the changes
locally, and it does indeed work. Full credit goes to @sviande, I just
want this feature to be finished.
Release Notes:
- agent: Fixed an issue with tool calling with the Mistral provider
(thanks [@sviande](https://github.com/sviande) and
[@armyhaylenko](https://github.com/armyhaylenko))
Co-authored-by: sviande <sviande@gmail.com>
This includes making sure that both the agent panel and Zed's edit
prediction have a consistent narrative when it comes to onboarding users
into the AI features, considering the different possible plans and
conditions (such as being signed in/out, account age, etc.).
Release Notes:
- N/A
---------
Co-authored-by: Bennet Bo Fenner <53836821+bennetbo@users.noreply.github.com>
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
The Google Gemini docs now recommend using `GEMINI_API_KEY`, and the
legacy `GOOGLE_AI_API_KEY` variable is no longer supported in the modern
SDKs.
Zed will now accept either.
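A minimal sketch of the fallback (hypothetical helper, not the actual
code):

```rust
// Prefer the recommended variable, but fall back to the legacy one so
// existing setups keep working.
fn gemini_api_key_from_env() -> Option<String> {
    std::env::var("GEMINI_API_KEY")
        .or_else(|_| std::env::var("GOOGLE_AI_API_KEY"))
        .ok()
}
```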
Release Notes:
- N/A
This PR makes it so all LLM traffic is routed through `cloud.zed.dev`.
We're already routing `llm.zed.dev` to `cloud.zed.dev` on the server,
but we want to standardize on `cloud.zed.dev` moving forward.
Release Notes:
- N/A
This PR makes it so we refresh the list of models whenever the LLM token
is refreshed.
This allows us to add or remove models based on the plan in the new
token.
Release Notes:
- Fixed model list not refreshing when subscribing to Zed Pro.
---------
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
This introduces a new field, `thinking_allowed`, on
`LanguageModelRequest`, which lets us control whether thinking should be
enabled if the model supports it.
We disable thinking in the Inline Assistant, the Edit File tool, and the
Git commit message generator; this should make generation faster when
using a thinking model, e.g. `claude-sonnet-4-thinking`.
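A sketch of the gating this enables (the field name comes from this PR;
everything else is illustrative):

```rust
// Abbreviated request type; the real struct has many more fields.
pub struct LanguageModelRequest {
    pub thinking_allowed: bool,
    // ...
}

// Hypothetical provider-side check: thinking is only turned on when the
// model supports it *and* the requesting feature allows it.
fn use_thinking(request: &LanguageModelRequest, model_supports_thinking: bool) -> bool {
    request.thinking_allowed && model_supports_thinking
}
```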
Release Notes:
- N/A
This PR adds a new `zed-cloud` feature flag that can be used to send
traffic to `cloud.zed.dev` instead of `llm.zed.dev`.
This is just so Zed staff can test the new infrastructure. When we're
ready for prime-time we'll reroute traffic on the server.
Release Notes:
- N/A
Per [GitHub's documentation for VSCode's agent
mode](https://docs.github.com/en/copilot/how-tos/chat/asking-github-copilot-questions-in-your-ide#agent-mode),
a premium request is charged per user-submitted prompt, rather than per
individual request the agent makes to an LLM. This PR matches Zed's
functionality to VSCode's, accurately indicating to GitHub's API whether
a given request is initiated by the user or by an agent, allowing a user
to be metered only for prompts they send.
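A sketch of the distinction (the header name below is an assumption for
illustration, not taken from this PR):

```rust
// Tag each request with who initiated it so GitHub's API can meter
// premium requests per user prompt rather than per agent round-trip.
// NOTE: "X-Initiator" is a hypothetical header name.
fn initiator_header(user_initiated: bool) -> (&'static str, &'static str) {
    ("X-Initiator", if user_initiated { "user" } else { "agent" })
}
```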
See also: #31068
Release Notes:
- Improve Copilot premium request tracking
As we are in the process of improving our Onboarding UX for Zed AI, I
added component previews for the Zed AI Configuration section. This
should make it easier to inspect the different states we can run into.
<img width="1198" alt="image"
src="https://github.com/user-attachments/assets/eb774f27-9091-450d-bfae-c688d533c25e"
/>
Release Notes:
- N/A
Closes #26030
Release Notes:
- Fixed Bedrock bug causing streaming responses to return as one big
chunk
---------
Co-authored-by: Peter Tripp <peter@zed.dev>
* Updates to `zed_llm_client-0.8.5`, which adds support for
`retry_after` when Anthropic provides it.
* Distinguishes upstream provider errors and rate limits from errors
that originate from zed's servers
* Moves `LanguageModelCompletionError::BadInputJson` to
`LanguageModelCompletionEvent::ToolUseJsonParseError`. While arguably
this is an error case, the logic in `Thread` is cleaner with this move.
There is also precedent for inclusion of errors in the event type -
`CompletionRequestStatus::Failed` is how cloud errors arrive.
* Updates `PROVIDER_ID` / `PROVIDER_NAME` constants to use proper types
instead of `&str`, since they can be constructed in a const fashion (see
the sketch after this list).
* Removes use of `CLIENT_SUPPORTS_EXA_WEB_SEARCH_PROVIDER_HEADER_NAME`
as the server no longer reads this header and just defaults to that
behavior.
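As referenced above, a sketch of the const-constructible ID types (the
wrapper definitions here are assumptions, not Zed's actual types):

```rust
// Hypothetical newtypes; the point is that they can be built in const
// context, so the constants get real types instead of bare `&str`s.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct LanguageModelProviderId(pub &'static str);

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct LanguageModelProviderName(pub &'static str);

pub const PROVIDER_ID: LanguageModelProviderId = LanguageModelProviderId("anthropic");
pub const PROVIDER_NAME: LanguageModelProviderName = LanguageModelProviderName("Anthropic");
```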
Release notes for this are covered by #33275.
Release Notes:
- N/A
---------
Co-authored-by: Richard Feldman <oss@rtfeldman.com>
Co-authored-by: Richard <richard@zed.dev>
Seeing this come up in our server logs when sending requests to
Anthropic: `final assistant content cannot end with trailing
whitespace`.
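A minimal sketch of the kind of fix this implies (hypothetical helper):
trim trailing whitespace from the final assistant message before sending
the request.

```rust
// Anthropic rejects requests whose final assistant content ends with
// trailing whitespace, so strip it off before sending.
fn trim_trailing_whitespace(content: &mut String) {
    let len = content.trim_end().len();
    content.truncate(len);
}
```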
Release Notes:
- agent: Fixed an issue where Anthropic requests would sometimes fail
because of malformed assistant messages
Context: In this PR: https://github.com/zed-industries/zed/pull/33362,
we started using the underlying `open_ai` crate to make API calls for
Vercel as well. Now, whenever an error occurs, we get something like the
message below, where one part of the error mentions OpenAI while the
rest contains the actual error from the provider. This PR makes the
error generic for now so that people don't get confused seeing OpenAI in
their v0 integration.
```
Error interacting with language model
Failed to connect to OpenAI API: 403 Forbidden {"success":false,"error":"Premium or Team plan required to access the v0 API: https://v0.dev/chat/settings/billing"}
```
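A sketch of the kind of change implied (illustrative, not the actual
diff): thread the provider's name into the message instead of
hard-coding OpenAI.

```rust
// With the provider name parameterized, a v0 failure reads
// "Failed to connect to Vercel API: ..." instead of blaming OpenAI.
fn api_connect_error(provider: &str, status: u16, body: &str) -> String {
    format!("Failed to connect to {provider} API: {status} {body}")
}
```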
Release Notes:
- N/A
This cleans up our settings to not include any `version` fields, as we
have an actual settings migrator now.
This PR removes `language_models > anthropic > version`,
`language_models > openai > version` and `agent > version`.
We had migration paths in the code for a long time, so in practice
almost everyone should be using the latest version of these settings.
Release Notes:
- Remove `version` fields in settings for `agent`, `language_models >
anthropic`, `language_models > openai`. Your settings will automatically
be migrated. If you're running into issues with this open an issue
[here](https://github.com/zed-industries/zed/issues)
Follow-up to previous PRs:
- Return `true` from `supports_images`; v0 already supports images
- Rename the model ID to match the exact version of the model,
`v0-1.5-md` (for now we do not expose the `sm`/`lg` variants, since they
do not seem to be available via the API)
- Provide autocompletion in settings for using `vercel` as a `provider`
Release Notes:
- N/A
Closes #30714
The Bedrock Converse API expects tool configuration to be present if at
least one tool was used earlier in the conversation. Right now, if
`LanguageModelToolChoice::None` isn't supported, the edit agent
[removes][1] tools from the request, which breaks Bedrock's Converse
API. As proposed in [the issue][2], we won't drop the tool
configuration; instead, we will deny any tool call the model makes.
[1]:
fceba6c795/crates/assistant_tools/src/edit_agent.rs (L703)
[2]:
https://github.com/zed-industries/zed/issues/30714#issuecomment-2886422716
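A sketch of the proposed behavior (names are illustrative): keep the
tool configuration in the request, and if the model still chooses a
tool, answer the call with a refusal instead of executing it.

```rust
// Instead of stripping tools from the request (which Bedrock's Converse
// API rejects once tools have appeared earlier in the conversation),
// leave them in place and deny any tool call the model makes.
fn deny_tool_call(tool_name: &str) -> String {
    format!(
        "The tool `{tool_name}` is not available right now; \
         please respond without using tools."
    )
}
```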
Release Notes:
- Fixed Bedrock tool calls in edit mode
cc @osyvokon
We were seeing a bunch of errors in our backend when people were using
Claude models with thinking enabled.
In the logs we would see
> an error occurred while interacting with the Anthropic API:
invalid_request_error: messages.x.content.0.type: Expected `thinking` or
`redacted_thinking`, but found `text`. When `thinking` is enabled, a
final `assistant` message must start with a thinking block (preceeding
the lastmost set of `tool_use` and `tool_result` blocks). We recommend
you include thinking blocks from previous turns. To avoid this
requirement, disable `thinking`. Please consult our documentation at
https://docs.anthropic.com/en/docs/build-with-claude/extended-thinking
However, this issue did not occur frequently and was not easily
reproducible. It turns out it was triggered by us not correctly handling
[Redacted Thinking
Blocks](https://docs.anthropic.com/en/docs/build-with-claude/extended-thinking#thinking-redaction).
I could consistently reproduce the issue by including the magic string
`ANTHROPIC_MAGIC_STRING_TRIGGER_REDACTED_THINKING_46C9A13E193C177646C7398A98432ECCCE4C1253D5E2D82641AC0E52CC2876CB`
in the request, which forces `claude-3-7-sonnet` to emit redacted
thinking blocks (confusingly, the magic string does not seem to work for
`claude-sonnet-4`). As soon as we hit a tool call, Anthropic would
return an error.
Thanks to @osyvokon for pointing me in the right direction 😄!
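A sketch of the content-block shape involved (field names follow
Anthropic's documented wire format; the enum itself is illustrative).
The fix amounts to preserving `redacted_thinking` blocks and sending
them back verbatim instead of dropping them:

```rust
use serde::{Deserialize, Serialize};

// Illustrative content-block enum mirroring Anthropic's wire format.
#[derive(Serialize, Deserialize)]
#[serde(tag = "type", rename_all = "snake_case")]
enum ContentBlock {
    Text { text: String },
    Thinking { thinking: String, signature: String },
    // Opaque, encrypted reasoning; must be round-tripped untouched so
    // the final assistant message still starts with a thinking block.
    RedactedThinking { data: String },
}
```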
Release Notes:
- agent: Fixed an issue where Anthropic models would sometimes return an
error when thinking was enabled
Vercel v0 is an OpenAI-compatible model, so this is mostly a dupe of the
OpenAI provider files with some adaptations for v0, including using the
custom endpoint for the API URL field.
Release Notes:
- Added support for Vercel as a language model provider.
This PR is in preparation for doing automatic retries for certain
errors, e.g. Overloaded. It doesn't change behavior yet (aside from some
granularity of error messages shown to the user), but rather mostly
changes some error handling to be exhaustive enum matches instead of
`anyhow` downcasts, and leaves some comments for where the behavior
change will be in a future PR.
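A sketch of the shift in style (variant names are illustrative, not
Zed's actual enum):

```rust
use std::time::Duration;

enum CompletionError {
    Overloaded,
    RateLimited { retry_after: Duration },
    Other(String),
}

fn handle(error: CompletionError) {
    // Exhaustive match: adding a variant later is a compile error here,
    // which is exactly what an `anyhow` downcast would silently miss.
    match error {
        // Future PR: retry automatically instead of surfacing the error.
        CompletionError::Overloaded => eprintln!("provider overloaded"),
        CompletionError::RateLimited { retry_after } => {
            eprintln!("rate limited; retry after {retry_after:?}")
        }
        CompletionError::Other(message) => eprintln!("{message}"),
    }
}
```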
Release Notes:
- N/A
Also did a bit of cleanup of the code for loading models from settings,
as it's no longer required: we fetch all models from OpenRouter, so it's
better to maintain a single source of truth.
Release Notes:
- Add thinking support to OpenRouter provider