Yehowshua/ZIm - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
Michael Sloan	76ad1a29a5	Add support for getting the token count for all parts of Gemini generation requests (#29630 ) * `CountTokensRequest` now takes a full `GenerateContentRequest` instead of just content. * Fixes use of `models/` prefix in `model` field of `GenerateContentRequest`, since that's required for use in `CountTokensRequest`. This didn't cause issues before because it was always cleared and used in the path. Release Notes: - N/A	2025-05-04 21:32:45 +00:00
Michael Sloan	f4e9ea3cd8	In error text of cloud LLM API: `completion failed` -> `request failed` (#29888 ) This error is used for more requests than completion requests Release Notes: - N/A	2025-05-04 21:04:34 +00:00
Max Brunsfeld	c3d9cdecab	Change cloud language model provider JSON protocol to surface errors and usage information (#29830 ) Release Notes: - N/A --------- Co-authored-by: Nathan Sobo <nathan@zed.dev> Co-authored-by: Marshall Bowers <git@maxdeviant.com>	2025-05-04 17:37:42 +00:00
Marshall Bowers	f0515d1c34	agent: Show a notice when reaching consecutive tool use limits (#29833 ) This PR adds a notice when reaching consecutive tool use limits when using normal mode. Here's an example with the limit artificially lowered to 2 consecutive tool uses: https://github.com/user-attachments/assets/32da8d38-67de-4d6b-8f24-754d2518e5d4 Release Notes: - agent: Added a notice when reaching consecutive tool use limits when using a model in normal mode.	2025-05-03 02:09:54 +00:00
Max Brunsfeld	04772bf17d	Add support for queuing status updates in cloud language model provider (#29818 ) This sets us up to display queue position information to the user, once our language model backend is updated to support request queuing. The JSON returned by the LLM backend will need to look like this: ```json {"queue": {"status": "queued", "position": 1}} {"queue": {"status": "started"}} {"event": {"THE_UPSTREAM_MODEL_PROVIDER_EVENT": "..."}} ``` Release Notes: - N/A --------- Co-authored-by: Marshall Bowers <git@maxdeviant.com>	2025-05-02 20:36:39 +00:00
Marshall Bowers	b2df395918	language_models: Change default fast model for Zed provider (#29600 ) This PR changes the default fast model for the Zed provider from Claude 3.5 Haiku to Claude 3.5 Sonnet. We don't offer Claude 3.5 Haiku to users. Closes https://github.com/zed-industries/zed/issues/29505. Release Notes: - agent: Changed the default fast model for the Zed provider to Claude 3.5 Sonnet.	2025-04-29 14:46:27 +00:00
Marshall Bowers	cd86905ebe	language_models: Pass up `mode` from the `LanguageModelRequest` (#29552 ) This PR makes it so we pass up the `mode` from the `LanguageModelRequest` when interacting with the Zed provider instead of passing a hard-coded value. Release Notes: - N/A	2025-04-28 17:38:55 +00:00
Marshall Bowers	187f851613	feature_flags: Add `FeatureFlag` suffix to feature flag types (#29392 ) This PR adds the `FeatureFlag` suffix to the feature flag types that were missing them. This makes the names easier to search in the codebase. Release Notes: - N/A	2025-04-25 04:07:49 +00:00
Marshall Bowers	6bb6be826d	language_models: Use `POST /completions` endpoint for Zed provider (#29389 ) This PR updates the Zed provider to use the `POST /completions` endpoint. There is no functional difference from `POST /completion`, but the pluralized version reads better. Release Notes: - N/A	2025-04-25 02:58:02 +00:00
Richard Feldman	720dfee803	Treat invalid JSON in tool calls as failed tool calls (#29375 ) Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev> Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2025-04-24 16:54:27 -04:00
Marshall Bowers	fef2681cfa	language_models: Count Google AI tokens through LLM service (#29319 ) This PR wires the counting of Google AI tokens back up. It now goes through the LLM service instead of collab's RPC. Still only available for Zed staff. Release Notes: - N/A	2025-04-24 01:21:53 +00:00
Marshall Bowers	74442b68ea	collab: Remove `CountLanguageModelTokens` RPC message (#29314 ) This PR removes the `CountLanguageModelTokens` RPC message from collab. We were only using this for Google AI models through the Zed provider (which is only available to Zed staff). For now we're returning `0`, but will bring back soon. Release Notes: - N/A	2025-04-23 23:10:47 +00:00
Marshall Bowers	92e810bfec	language_models: Pass up `mode` for completion requests through Zed (#29294 ) This PR makes it so we pass up the `mode` for completion requests through the Zed provider. Release Notes: - N/A	2025-04-23 18:02:03 +00:00
Michael Sloan	fbf7caf93e	Default to fast model for thread summaries and titles + don't include system prompt / context / thinking segments (#29102 ) * Adds a fast / cheaper model to providers and defaults thread summarization to this model. Initial motivation for this was that https://github.com/zed-industries/zed/pull/29099 would cause these requests to fail when used with a thinking model. It doesn't seem correct to use a thinking model for summarization. * Skips system prompt, context, and thinking segments. * If tool use is happening, allows 2 tool uses + one more agent response before summarizing. Downside of this is that there was potential for some prefix cache reuse before, especially for title summarization (thread summarization omitted tool results and so would not share a prefix for those). This seems fine as these requests should typically be fairly small. Even for full thread summarization, skipping all tool use / context should greatly reduce the token use. Release Notes: - N/A	2025-04-19 23:26:29 +00:00
Marshall Bowers	9875521d4e	language_models: Fix passing of `thread_id` and `prompt_id` (#29071 ) This PR is a follow-up to https://github.com/zed-industries/zed/pull/29069 that fixes an issue where the thread ID and prompt ID were not being sent up correctly. Release Notes: - N/A	2025-04-18 21:12:23 +00:00
Marshall Bowers	7abe2c9c31	agent: Attach thread ID and prompt ID to telemetry events (#29069 ) This PR attaches the thread ID and the new prompt ID to telemetry events for completions in the Agent panel. Release Notes: - N/A --------- Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com>	2025-04-18 20:41:02 +00:00
Marshall Bowers	d93141bded	agent: Extract usage information from response headers (#29002 ) This PR updates the Agent to extract the usage information from the response headers, if they are present. For now we just log the information, but we'll be using this soon to populate some UI. Release Notes: - N/A	2025-04-17 20:11:07 +00:00
Marshall Bowers	3fef3cc392	Use more types/constants from `zed_llm_client` (#28909 ) This PR makes it so we use more types and constants from the `zed_llm_client` crate to avoid duplicating information. Also updates the current usage endpoint to use limits derived from the `Plan`. Release Notes: - N/A	2025-04-16 20:58:00 +00:00
Marshall Bowers	fcb1efdf21	rpc: Remove `llm` module in favor of `zed_llm_client` (#28900 ) This PR removes the `llm` module of the `rpc` crate in favor of using the types from the `zed_llm_client`. Release Notes: - N/A	2025-04-16 20:22:44 +00:00
Marshall Bowers	cb79420773	agent: Show an error when the model requests limit has been reached (#28868 ) This PR adds an error message when the model requests limit has been hit. Release Notes: - N/A Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com>	2025-04-16 15:11:35 +00:00
Agus Zubiaga	b45230784d	agent: Handle context window exceeded errors from Anthropic (#28688 ) ![CleanShot 2025-04-14 at 11 15 38@2x](https://github.com/user-attachments/assets/9e803ffb-74fd-486b-bebc-2155a407a9fa) Release Notes: - agent: Handle context window exceeded errors from Anthropic	2025-04-14 14:39:33 +00:00
Bennet Bo Fenner	b22faf96e0	agent: Refine language model selector (#28597 ) Release Notes: - agent: Show recommended models in the agent model selector and display the provider in the model selector's trigger. --------- Co-authored-by: Danilo Leal <daniloleal09@gmail.com> Co-authored-by: Danilo Leal <67129314+danilo-leal@users.noreply.github.com>	2025-04-11 23:02:50 +00:00
Marshall Bowers	61b7a05792	language_models: Allow overriding Zed completions URL via environment variable (#28323 ) This PR adds support for overriding the Zed completions URL via the `ZED_COMPLETIONS_URL` environment variable. Release Notes: - N/A	2025-04-08 14:46:15 +00:00
Marshall Bowers	819bb8fffb	open_ai: Disable `parallel_tool_calls` (#28056 ) This PR disables `parallel_tool_calls` for the models that support it, as the Agent currently expects at most one tool use per turn. It was a bit of trial and error to figure this out. OpenAI's API annoyingly will return an error if passing `parallel_tool_calls` to a model that doesn't support it. Release Notes: - N/A	2025-04-03 22:07:37 +00:00
Marshall Bowers	7492ec3f67	Add tool use support for OpenAI models (#28051 ) This PR adds support for using tools to the OpenAI models. Release Notes: - agent: Added support for tool use with OpenAI models (Preview only).	2025-04-03 20:55:11 +00:00
Marshall Bowers	889bc13b7d	language_model: Remove `use_any_tool` method from `LanguageModel` (#27930 ) This PR removes the `use_any_tool` method from the `LanguageModel` trait. It was not being used anywhere, and doesn't really fit in our new tool use story. Release Notes: - N/A	2025-04-02 15:49:21 +00:00
Danilo Leal	192097f58f	assistant2: Ensure errors are also displayed in populated new thread view (#27869 ) Follow-up to https://github.com/zed-industries/zed/pull/27812 This PR makes sure these errors cases also show up in the panel's empty state even when there is past data. \| No ToS \| Missing Provider \| \|--------\|--------\| \| ![CleanShot 2025-04-01 at 4  49 36@2x](https://github.com/user-attachments/assets/6da6bdc9-daa6-4a7b-a224-989eb845e205) \| ![CleanShot 2025-04-01 at 4  50 04@2x](https://github.com/user-attachments/assets/bddf62cb-3727-44b5-b115-9a88313c6d85) \| Release Notes: - N/A	2025-04-01 17:06:34 -03:00
Marshall Bowers	5880271b11	language_model: Add `supports_tools` method to `LanguageModel` (#27867 ) This PR adds a new `supports_tools` method to the `LanguageModel` trait to indicate whether a given model supports tool use. Release Notes: - N/A	2025-04-01 19:56:05 +00:00
Bennet Bo Fenner	5509e0141a	Return language model events when using Google model via zed.dev (#27831 ) Release Notes: - N/A	2025-04-01 08:58:17 +00:00
Danilo Leal	dce824f095	assistant2: Refine empty states design (#27812 ) \| No LLM provider \| Fresh Start \| No ToS \| \|--------\|--------\|--------\| \| ![CleanShot 2025-03-31 at 7  04 17@2x](https://github.com/user-attachments/assets/aab5987c-1530-401d-acc6-65e4f2fc13b8) \| ![CleanShot 2025-03-31 at 7  04 39@2x](https://github.com/user-attachments/assets/b2c7a2e0-5178-4bcb-a917-da7bf8e6246c) \| ![CleanShot 2025-03-31 at 7  05 10@2x](https://github.com/user-attachments/assets/4a656e82-0e1d-4d11-8d34-8eeeadd4814c) \| Release Notes: - N/A	2025-03-31 19:31:56 -03:00
Piotr Osiewicz	dc64ec9cc8	chore: Bump Rust edition to 2024 (#27800 ) Follow-up to https://github.com/zed-industries/zed/pull/27791 Release Notes: - N/A	2025-03-31 20:55:27 +02:00
Bennet Bo Fenner	01a2c8eb01	Set tool schema format for zed.dev language model (#27788 ) Release Notes: - N/A	2025-03-31 16:49:59 +00:00
Richard Feldman	76d3a9a0f0	Retry on 5xx errors from cloud language model providers (#27584 ) Release Notes: - N/A	2025-03-27 09:35:16 -04:00
Bennet Bo Fenner	a52e2f9553	Show claude-3-7-sonnet-thinking model for all users (#27256 ) Release Notes: - N/A	2025-03-21 17:23:36 +01:00
Bennet Bo Fenner	a709d4c7c6	assistant: Add support for `claude-3-7-sonnet-thinking` (#27085 ) Closes #25671 Release Notes: - Added support for `claude-3-7-sonnet-thinking` in the assistant panel --------- Co-authored-by: Danilo Leal <daniloleal09@gmail.com> Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: Agus Zubiaga <hi@aguz.me>	2025-03-21 12:29:07 +00:00
Mikayla Maki	1aefa5178b	Move "async move" a few characters to the left in cx.spawn() (#26758 ) This is the core change: https://github.com/zed-industries/zed/pull/26758/files#diff-044302c0d57147af17e68a0009fee3e8dcdfb4f32c27a915e70cfa80e987f765R1052 TODO: - [x] Use AsyncFn instead of Fn() -> Future in GPUI spawn methods - [x] Implement it in the whole app - [x] Implement it in the debugger - [x] Glance at the RPC crate, and see if those box future methods can be switched over. Answer: It can't directly, as you can't make an AsyncFn* into a trait object. There's ways around that, but they're all more complex than just keeping the code as is. - [ ] Fix platform specific code Release Notes: - N/A	2025-03-19 02:09:02 +00:00
Danilo Leal	5c400dac8d	assistant2: Adjust empty state layout (#25745 ) Going for a different, arguably simpler design for the Assistant 2 empty state here. Also took the opportunity to adjust other elements like the toolbar, message editor, and some items in the configuration page. <img src="https://github.com/user-attachments/assets/03fd1d48-a675-4eac-b694-bbe4eeaf06e9" width="700px"/> Release Notes: - N/A	2025-02-27 11:33:53 -03:00
Marshall Bowers	75dbe189bd	Give Zed AI users access to Claude 3.7 Sonnet (#25577 ) This PR updates the client-side checks to give Zed AI users access to Claude 3.7 Sonnet. Requires https://github.com/zed-industries/zed/pull/25576 to be deployed. Release Notes: - Added support for Claude 3.7 Sonnet to Zed AI.	2025-02-25 12:15:15 -05:00
Marshall Bowers	def342e35c	Remove dependents of `language_models` (#25511 ) This PR removes the dependents of the `language_models` crate. The following types have been moved from `language_models` to `language_model` to facilitate this: - `LlmApiToken` - `RefreshLlmTokenListener` - `MaxMonthlySpendReachedError` - `PaymentRequiredError` With this change only `zed` now depends on `language_models`. Release Notes: - N/A	2025-02-24 22:46:45 +00:00
Marshall Bowers	0acd556106	language_model: Remove dependencies on individual model provider crates (#25503 ) This PR removes the dependencies on the individual model provider crates from the `language_model` crate. The various conversion methods for converting a `LanguageModelRequest` into its provider-specific request type have been inlined into the various provider modules in the `language_models` crate. The model providers we provide via Zed's cloud offering get to stay, for now. Release Notes: - N/A	2025-02-24 16:41:35 -05:00
Antonio Scandurra	f517050548	Partially fix assistant onboarding (#25313 ) While investigating #24896, I noticed two issues: 1. The default configuration for the `zed.dev` provider was using the wrong string for Claude 3.5 Sonnet. This meant the provider would always result as not configured until the user selected it from the model picker, because we couldn't deserialize that string to a valid `anthropic::Model` enum variant. 2. When clicking on `Open New Chat`/`Start New Thread` in the provider configuration, we would select `Claude 3.5 Haiku` by default instead of Claude 3.5 Sonnet. Release Notes: - Fixed some issues that caused AI providers to sometimes be misconfigured.	2025-02-24 07:29:55 +00:00
Marshall Bowers	7a6b652ebc	language_model: Return `AuthenticateError`s from `LanguageModelProvider::authenticate` (#25126 ) This PR updates the `LanguageModelProvider::authenticate` method to return an `AuthenticateError` instead of an `anyhow::Error`. This allows us to model the "credentials not found" state explicitly as `AuthenticateError::CredentialsNotFound`, which enables the caller to check for this state and act accordingly. Planning to use this in #25123 to silence errors about missing credentials when authenticating providers in the background. Release Notes: - N/A	2025-02-19 00:01:48 +00:00
Mikayla Maki	9cae96f82f	Remove more references to 'model' in GPUI APIs (#23693 ) Release Notes: - N/A	2025-01-27 04:00:27 +00:00
Mikayla Maki	a6b1514246	Fix missed renames in #22632 (#23688 ) Fix a bug where a GPUI macro still used `ModelContext` Rename `AsyncAppContext` -> `AsyncApp` Rename update_model, read_model, insert_model, and reserve_model to update_entity, read_entity, insert_entity, and reserve_entity Release Notes: - N/A	2025-01-26 23:37:34 +00:00
Nathan Sobo	6fca1d2b0b	Eliminate GPUI View, ViewContext, and WindowContext types (#22632 ) There's still a bit more work to do on this, but this PR is compiling (with warnings) after eliminating the key types. When the tasks below are complete, this will be the new narrative for GPUI: - `Entity<T>` - This replaces `View<T>`/`Model<T>`. It represents a unit of state, and if `T` implements `Render`, then `Entity<T>` implements `Element`. - `&mut App` This replaces `AppContext` and represents the app. - `&mut Context<T>` This replaces `ModelContext` and derefs to `App`. It is provided by the framework when updating an entity. - `&mut Window` Broken out of `&mut WindowContext` which no longer exists. Every method that once took `&mut WindowContext` now takes `&mut Window, &mut App` and every method that took `&mut ViewContext<T>` now takes `&mut Window, &mut Context<T>` Not pictured here are the two other failed attempts. It's been quite a month! Tasks: - [x] Remove `View`, `ViewContext`, `WindowContext` and thread through `Window` - [x] [@cole-miller @mikayla-maki] Redraw window when entities change - [x] [@cole-miller @mikayla-maki] Get examples and Zed running - [x] [@cole-miller @mikayla-maki] Fix Zed rendering - [x] [@mikayla-maki] Fix todo! macros and comments - [x] Fix a bug where the editor would not be redrawn because of view caching - [x] remove publicness window.notify() and replace with `AppContext::notify` - [x] remove `observe_new_window_models`, replace with `observe_new_models` with an optional window - [x] Fix a bug where the project panel would not be redrawn because of the wrong refresh() call being used - [x] Fix the tests - [x] Fix warnings by eliminating `Window` params or using `_` - [x] Fix conflicts - [x] Simplify generic code where possible - [x] Rename types - [ ] Update docs ### issues post merge - [x] Issues switching between normal and insert mode - [x] Assistant re-rendering failure - [x] Vim test failures - [x] Mac build issue Release Notes: - N/A --------- Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: Cole Miller <cole@zed.dev> Co-authored-by: Mikayla <mikayla@zed.dev> Co-authored-by: Joseph <joseph@zed.dev> Co-authored-by: max <max@zed.dev> Co-authored-by: Michael Sloan <michael@zed.dev> Co-authored-by: Mikayla Maki <mikaylamaki@Mikaylas-MacBook-Pro.local> Co-authored-by: Mikayla <mikayla.c.maki@gmail.com> Co-authored-by: joão <joao@zed.dev>	2025-01-26 03:02:45 +00:00
Agus Zubiaga	ba16b4eb65	assistant2: Show accept terms UI in thread empty state (#23630 ) <img src="https://github.com/user-attachments/assets/cea93cfb-8a40-48c4-9d90-f1751c79603b" width=400> Release Notes: - N/A --------- Co-authored-by: Danilo <danilo@zed.dev>	2025-01-24 19:34:46 -03:00
Danilo Leal	802d7421bc	assistant: Adjust the ToS acceptance card design (#23599 ) Just fine-tuning the copywriting and design here. \| Before \| After \| \|--------\|--------\| \| <img width="1233" alt="Screenshot 2025-01-24 at 9 28 30 AM" src="https://github.com/user-attachments/assets/ca91a985-8a20-4ece-b0e4-3a6779db2fda" /> \| <img width="1233" alt="Screenshot 2025-01-24 at 9 27 49 AM" src="https://github.com/user-attachments/assets/edc9c2ef-4ae0-4caf-a496-9887748673c9" /> \| Release Notes: - N/A	2025-01-24 09:44:09 -03:00
Roy Williams	b1a6e2427f	anthropic: Allow specifying additional beta headers for custom models (#20551 ) Release Notes: - Added the ability to specify additional beta headers for custom Anthropic models. --------- Co-authored-by: David Soria Parra <167242713+dsp-ant@users.noreply.github.com> Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>	2025-01-03 23:46:32 +00:00
Antonio Scandurra	77b8296fbb	Introduce staff-only inline completion provider (#21739 ) Release Notes: - N/A --------- Co-authored-by: Thorsten Ball <mrnugget@gmail.com> Co-authored-by: Bennet <bennet@zed.dev> Co-authored-by: Thorsten <thorsten@zed.dev>	2024-12-09 14:26:36 +01:00
Marshall Bowers	cbba44900d	Add `language_models` crate to house language model providers (#20945 ) This PR adds a new `language_models` crate to house the various language model providers. By extracting the provider definitions out of `language_model`, we're able to remove `language_model`'s dependency on `editor`, which improves incremental compilation when changing `editor`. Release Notes: - N/A	2024-11-20 18:49:34 -05:00

1 2

100 commits