Commit graph

54 commits

Author SHA1 Message Date
Marshall Bowers
cbba44900d
Add language_models crate to house language model providers (#20945)
This PR adds a new `language_models` crate to house the various language
model providers.

By extracting the provider definitions out of `language_model`, we're
able to remove `language_model`'s dependency on `editor`, which improves
incremental compilation when changing `editor`.

Release Notes:

- N/A
2024-11-20 18:49:34 -05:00
Thorsten Ball
aee01f2c50
assistant: Remove low_speed_timeout (#20681)
This removes the `low_speed_timeout` setting from all providers as a
response to issue #19509.

Reason being that the original `low_speed_timeout` was only added as part
of #9913 because users wanted to _get rid of timeouts_. They wanted to bump
the default timeout from 5sec to a lot more.

Then, in the meantime, the meaning of `low_speed_timeout` changed in
#19055: it became a normal `timeout`, which is a different thing and
breaks slower LLMs that don't reply with a complete response within the
configured timeout.

So we figured: let's remove the whole thing and replace it with a
default _connect_ timeout to make sure that we can connect to a server
in 10s, but then give the server as long as it wants to complete its
response.
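
To illustrate, here's a minimal sketch of the new behavior using isahc's builder (the actual client construction in Zed lives elsewhere and may differ): cap only the time it takes to establish a connection, and leave the response itself without a deadline.

```rust
use std::time::Duration;

use isahc::config::Configurable;
use isahc::HttpClient;

fn build_client() -> Result<HttpClient, isahc::Error> {
    HttpClient::builder()
        // Fail fast if we can't even reach the server...
        .connect_timeout(Duration::from_secs(10))
        // ...but set no overall request timeout, so a slow LLM can take as
        // long as it needs to finish its response.
        .build()
}
```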

Closes #19509

Release Notes:

- Removed the `low_speed_timeout` setting from LLM provider settings. It
was only ever used to _increase_ the timeout to give LLMs more time; since
we have no other use for it, we simply remove the setting and give LLMs as
long as they need.

---------

Co-authored-by: Antonio <antonio@zed.dev>
Co-authored-by: Peter Tripp <peter@zed.dev>
2024-11-15 07:37:31 +01:00
Danilo Leal
187356ab9b
assistant: Show only configured models in the model picker (#20392)
Closes https://github.com/zed-industries/zed/issues/16568

This PR introduces some changes to how we display models in the model
selector within the assistant panel. Basically, it comes down to this:

- If you don't have any provider configured, you should see _all_
available models in the picker
- But, once you've configured some, you should _only_ see models from
them in the picker (see the sketch below)
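
Here's a rough sketch of that selection logic (the `Provider` and `Model` types are hypothetical stand-ins, not the actual registry API):

```rust
struct Model {
    name: String,
}

struct Provider {
    configured: bool,
    models: Vec<Model>,
}

/// If at least one provider is configured, show only its models;
/// otherwise fall back to showing every available model.
fn models_for_picker(providers: &[Provider]) -> Vec<&Model> {
    let any_configured = providers.iter().any(|provider| provider.configured);
    providers
        .iter()
        .filter(|provider| !any_configured || provider.configured)
        .flat_map(|provider| provider.models.iter())
        .collect()
}
```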

Visually, nothing's changed much aside from the added "Configured
Models" label at the top, which makes it clear that it's a list of, well,
configured models only. 😬

<img width="700" alt="Screenshot 2024-11-07 at 23 42 41"
src="https://github.com/user-attachments/assets/219ed386-2318-43a6-abea-1de0cda8dc53">

Release Notes:

- Change model selector in the assistant panel to only show configured
models
2024-11-08 10:08:59 -03:00
Marshall Bowers
2bcf9fc490
Add client::zed_urls module for constructing zed.dev URLs (#19391)
This PR adds a new `zed_urls` module to the `client` crate.

This module contains functions for constructing URLs to Zed properties,
such as zed.dev.

The URLs produced by this module will respect the server URL set via
settings or the `ZED_SERVER_URL` environment variable. This allows them
to correctly reflect the current environment (such as when testing Zed
against a local collab/zed.dev).
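
As an illustration, a helper in the spirit of this module might look like the following (the `account_url` name and the environment-variable-only lookup are simplifications; the real module also consults the server URL from settings):

```rust
/// Resolve the base server URL, preferring `ZED_SERVER_URL` over the
/// production default.
fn server_url() -> String {
    std::env::var("ZED_SERVER_URL").unwrap_or_else(|_| "https://zed.dev".to_string())
}

/// Build a URL to the account page on whichever server we're pointed at.
fn account_url() -> String {
    format!("{}/account", server_url())
}

fn main() {
    // Prints "https://zed.dev/account" unless ZED_SERVER_URL overrides it,
    // e.g. when testing against a local collab/zed.dev.
    println!("{}", account_url());
}
```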

Release Notes:

- N/A
2024-10-17 16:18:35 -04:00
Marshall Bowers
84b61c8b1a
assistant: Add support for displaying billing-related errors (#19082)
This PR adds support to the assistant for displaying billing-related
errors.

Pulling this out of #19081 to make it easier to cherry-pick.

Release Notes:

- N/A

Co-authored-by: Antonio <antonio@zed.dev>
Co-authored-by: Richard <richard@zed.dev>
2024-10-11 13:22:45 -04:00
Richard Feldman
caaa9a00a9
Remove Qwen2 model (#18444)
Removed deprecated Qwen2 7B Instruct model from zed.dev provider (staff
only).

Release Notes:

- N/A
2024-09-27 13:30:25 -04:00
Conrad Irwin
e28496d4e2
Stop leaking isahc assumption (#18408)
Users of our `http_client` crate knew they were interacting with isahc
because they set its extensions on the request. This change adds our own
equivalents for those APIs in preparation for changing the default HTTP
client.
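
The rough idea, sketched with a hypothetical `RedirectPolicy` extension (the concrete names in `http_client` may differ): callers attach our own extension type to the request, and only the isahc-backed client translates it into isahc's configuration internally.

```rust
use http::Request;

/// Hypothetical client-agnostic option owned by our `http_client` crate,
/// so callers no longer need to know about isahc's extension types.
#[derive(Clone, Debug)]
enum RedirectPolicy {
    NoFollow,
    FollowLimit(u32),
}

fn build_request(url: &str) -> http::Result<Request<()>> {
    let mut request = Request::builder().uri(url).body(())?;
    // Callers only see our own type; the isahc-backed implementation reads
    // this extension and maps it onto isahc's equivalent option.
    request
        .extensions_mut()
        .insert(RedirectPolicy::FollowLimit(10));
    Ok(request)
}
```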

Release Notes:

- N/A
2024-09-26 14:01:05 -06:00
Roy Williams
5905fbb9ac
Allow Anthropic custom models to override temperature (#18160)
Release Notes:

- Allow Anthropic custom models to override "temperature"

This also centralizes the defaulting of `temperature` inside each model's
`into_x` call instead of sprinkling it around the code.
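
As a sketch of what "defaulting inside each model's `into_x` call" means (field names and the fallback value here are illustrative, not the exact Zed code):

```rust
struct LanguageModelRequest {
    temperature: Option<f32>,
}

struct AnthropicRequest {
    temperature: f32,
}

impl LanguageModelRequest {
    /// The fallback lives in one place: a custom model's configured
    /// temperature (if any) overrides the request's, which overrides the
    /// provider default.
    fn into_anthropic(self, model_temperature: Option<f32>) -> AnthropicRequest {
        AnthropicRequest {
            temperature: model_temperature.or(self.temperature).unwrap_or(1.0),
        }
    }
}
```
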
2024-09-20 14:59:12 -06:00
jvmncs
9f6ff29a54
Reuse OpenAI low_speed_timeout setting for zed.dev provider (#18144)
Release Notes:

- N/A
2024-09-20 12:57:35 -04:00
jvmncs
c71f052276
Add ability to use o1-preview and o1-mini as custom models (#17804)
This is a barebones modification of the OpenAI provider code to
accommodate non-streaming completions. This is specifically for the o1
models, which do not support streaming. Tested that this is working by
running a `/workflow` with the following (arbitrarily chosen) settings:

```json
{
  "language_models": {
    "openai": {
      "version": "1",
      "available_models": [
        {
          "name": "o1-preview",
          "display_name": "o1-preview",
          "max_tokens": 128000,
          "max_completion_tokens": 30000
        },
        {
          "name": "o1-mini",
          "display_name": "o1-mini",
          "max_tokens": 128000,
          "max_completion_tokens": 20000
        }
      ]
    }
  }
}
```
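
The gist of accommodating non-streaming completions is roughly the following (a simplified sketch; the real provider code works in terms of Zed's own request and event types):

```rust
use futures::stream::{self, BoxStream, StreamExt};

struct Completion {
    text: String,
}

fn supports_streaming(model_name: &str) -> bool {
    // The o1 models do not support streamed responses.
    !model_name.starts_with("o1")
}

fn complete(model_name: &str) -> BoxStream<'static, Completion> {
    if supports_streaming(model_name) {
        // Normal path: one chunk per server-sent event (elided here).
        stream::iter(vec![
            Completion { text: "chunk 1".into() },
            Completion { text: "chunk 2".into() },
        ])
        .boxed()
    } else {
        // o1 path: issue a single non-streaming request and wrap its one
        // response in a one-item stream so callers need no special case.
        let response = Completion { text: "full response".into() };
        stream::once(async move { response }).boxed()
    }
}
```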

Release Notes:

- Changed `low_speed_timeout_in_seconds` option to `600` for OpenAI
provider to accommodate recent o1 model release.

---------

Co-authored-by: Peter <peter@zed.dev>
Co-authored-by: Bennet <bennet@zed.dev>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-09-13 15:42:15 -04:00
Peter Tripp
fb9d01b0d5
assistant: Add display_name for OpenAI and Gemini (#17508) 2024-09-10 13:41:06 -04:00
Piotr Osiewicz
e6c1c51b37
chore: Fix several style lints (#17488)
It's not comprehensive enough to start linting on the `style` group, but
hey, it's a start.

Release Notes:

- N/A
2024-09-06 11:58:39 +02:00
Marshall Bowers
f38956943b
assistant: Propagate LLM stop reason upwards (#17358)
This PR makes it so we propagate the `stop_reason` from Anthropic up to
the Assistant so that we can take action based on it.

The `extract_content_from_events` function was moved from the `anthropic`
crate to the `anthropic` module in `language_model`, since it is more
useful if it can name the `LanguageModelCompletionEvent` type; otherwise
we'd need an additional layer of plumbing.

Release Notes:

- N/A
2024-09-04 12:31:10 -04:00
Marshall Bowers
452272e5df
assistant: Stream tool uses as structured data (#17322)
This PR adjusts the approach we use to encoding tool uses in the
completion response to use a structured format rather than simply
injecting it into the response stream as text.

In #17170 we would encode the tool uses as XML and insert them as text.
This would require then re-parsing the tool uses out of the buffer in
order to use them.

The approach taken in this PR is to make `stream_completion` return a
stream of `LanguageModelCompletionEvent`s. Each of these events can be
either text, or a tool use.

A new `stream_completion_text` method has been added to `LanguageModel`
for scenarios where we only care about textual content (currently,
everywhere that isn't the Assistant context editor).
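
Conceptually, the event type and the convenience method look something like this (a sketch; the real definitions carry more data, such as structured tool input and a stop reason):

```rust
use futures::stream::{BoxStream, StreamExt};

/// Sketch of the events emitted by `stream_completion`.
enum LanguageModelCompletionEvent {
    Text(String),
    ToolUse { name: String, input: String },
}

/// For callers that only care about textual content: filter the richer
/// event stream down to a stream of strings.
fn stream_completion_text(
    events: BoxStream<'static, LanguageModelCompletionEvent>,
) -> BoxStream<'static, String> {
    events
        .filter_map(|event| async move {
            match event {
                LanguageModelCompletionEvent::Text(text) => Some(text),
                LanguageModelCompletionEvent::ToolUse { .. } => None,
            }
        })
        .boxed()
}
```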

Release Notes:

- N/A
2024-09-03 15:04:51 -04:00
Marshall Bowers
68ea661711
assistant: Add foundation for receiving tool uses from Anthropic models (#17170)
This PR updates the Assistant with support for receiving tool uses from
Anthropic models and capturing them as text in the context editor.

This is just laying the foundation for tool use. We don't fulfill the
tool uses yet, or define any tools for the model to use.

Here's an example of what it looks like using the example `get_weather`
tool from the Anthropic docs:

<img width="644" alt="Screenshot 2024-08-30 at 1 51 13 PM"
src="https://github.com/user-attachments/assets/3614f953-0689-423c-8955-b146729ea638">

Release Notes:

- N/A
2024-08-30 14:05:55 -04:00
Thorsten Ball
7647644602
zed ai: Show ToS form in Configuration View (#16736)
Related #16618

Release Notes:

- N/A
2024-08-23 11:17:21 +02:00
Marshall Bowers
93642c9c51
Pass through Anthropic cache configuration when using Zed provider (#16685)
This PR makes it so the model's cache configuration gets passed through
from the base model when using the Zed provider.

Release Notes:

- Fixed caching for Anthropic models when using the Zed provider.
2024-08-22 12:48:47 -04:00
邻二氮杂菲
f1778dd9de
Add max_output_tokens to OpenAI models and integrate into requests (#16381)
### Pull Request Title
Introduce `max_output_tokens` Field for OpenAI Models


https://platform.deepseek.com/api-docs/news/news0725/#4-8k-max_tokens-betarelease-longer-possibilities

### Description
This commit introduces a new field `max_output_tokens` to the OpenAI
models, which allows specifying the maximum number of tokens that can be
generated in the output. This field is now integrated into the request
handling across multiple crates, ensuring that the output token limit is
respected during language model completions.

Changes include:
- Adding `max_output_tokens` to the `Custom` variant of the
`open_ai::Model` enum.
- Updating the `into_open_ai` method in `LanguageModelRequest` to accept
and use `max_output_tokens`.
- Modifying the `OpenAiLanguageModel` and `CloudLanguageModel`
implementations to pass `max_output_tokens` when converting requests.
- Ensuring that the `max_output_tokens` field is correctly serialized
and deserialized in relevant structures.

This enhancement provides more control over the output length of OpenAI
model responses, improving the flexibility and accuracy of language
model interactions.
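
In sketch form, the plumbing looks like this (heavily simplified; the real `into_open_ai` maps many more fields):

```rust
struct LanguageModelRequest {
    prompt: String,
    temperature: Option<f32>,
}

/// Subset of the OpenAI request body; when `max_tokens` is `None`, the
/// field is omitted and the API default applies.
struct OpenAiRequest {
    prompt: String,
    temperature: f32,
    max_tokens: Option<u32>,
}

impl LanguageModelRequest {
    /// `max_output_tokens` comes from the configured model (e.g. the
    /// `Custom` variant of `open_ai::Model`) and is threaded through here.
    fn into_open_ai(self, max_output_tokens: Option<u32>) -> OpenAiRequest {
        OpenAiRequest {
            prompt: self.prompt,
            temperature: self.temperature.unwrap_or(1.0),
            max_tokens: max_output_tokens,
        }
    }
}
```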


### Related Issue
https://github.com/zed-industries/zed/pull/16358

### Screenshots / Media
N/A

### Checklist
- [x] Code compiles correctly.
- [x] All tests pass.
- [ ] Documentation has been updated accordingly.
- [ ] Additional tests have been added to cover new functionality.
- [ ] Relevant documentation has been updated or added.

### Release Notes

- Added `max_output_tokens` field to OpenAI models for controlling
output token length.
2024-08-21 00:39:10 -04:00
Max Brunsfeld
b5bd8a5c5d
Add logic for closed beta LLM models (#16482)
Release Notes:

- N/A

---------

Co-authored-by: Marshall <marshall@zed.dev>
2024-08-19 11:09:52 -07:00
Nathan Sobo
b9176fe4bb
Add custom icon for Anthropic hosted models (#16436)
This commit adds a custom icon for Anthropic hosted models.


![CleanShot 2024-08-18 at 15 40
38@2x](https://github.com/user-attachments/assets/d467ccab-9628-4258-89fc-782e0d4a48d4)
![CleanShot 2024-08-18 at 15 40
34@2x](https://github.com/user-attachments/assets/7efaff9c-6a58-47ba-87ea-e0fe0586fedc)


- Adding a new SVG icon for Anthropic hosted models.
  - The new icon is located at: `assets/icons/ai_anthropic_hosted.svg`
- Updating the LanguageModel trait to include an optional icon method
- Implementing the icon method for CloudModel to return the custom icon
for Anthropic hosted models
- Updating the UI components to use the model-specific icon when
available
- Adding a new IconName variant for the Anthropic hosted icon

We should change the non-hosted icon in some small way to distinguish it
from the hosted version. I duplicated the path for now so we can
hopefully add it for the next release.
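
The trait change itself boils down to something like this (a sketch; the variant and the implementing type are stand-ins for the real `IconName` addition and `CloudModel`):

```rust
#[derive(Clone, Copy, Debug)]
enum IconName {
    AiAnthropicHosted,
}

trait LanguageModel {
    /// Optional model-specific icon; `None` falls back to the provider icon.
    fn icon(&self) -> Option<IconName> {
        None
    }
}

struct AnthropicHostedModel;

impl LanguageModel for AnthropicHostedModel {
    fn icon(&self) -> Option<IconName> {
        Some(IconName::AiAnthropicHosted)
    }
}
```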

Release Notes:

- N/A
2024-08-18 16:07:15 -06:00
Nathan Sobo
907d76208d
Allow display name of custom Anthropic models to be customized (#16376)
Also added some docs for our settings.

Release Notes:

- N/A
2024-08-16 14:02:37 -06:00
Roy Williams
b4f5f5024e
Support 8192 output tokens for Claude Sonnet 3.5 (#16358)
Release Notes:

- Added support for 8192 output tokens from Claude Sonnet 3.5
(https://x.com/alexalbert__/status/1812921642143900036)
2024-08-16 11:47:39 -04:00
Roy Williams
46fb917e02
Implement Anthropic prompt caching (#16274)
Release Notes:

- Adds support for Prompt Caching in Anthropic. For models that support
it, this can dramatically lower cost while improving performance.
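
For context, Anthropic's prompt caching is opted into per content block with the documented `cache_control` field (at launch it also sat behind an `anthropic-beta` header); a request-body sketch, independent of Zed's own wrapper types:

```rust
use serde_json::json;

fn request_body() -> serde_json::Value {
    json!({
        "model": "claude-3-5-sonnet-20240620",
        "max_tokens": 1024,
        "system": [{
            "type": "text",
            "text": "A long, stable system prompt worth caching...",
            // Marks this block as cacheable so repeated requests reuse it.
            "cache_control": { "type": "ephemeral" }
        }],
        "messages": [
            { "role": "user", "content": "Hello!" }
        ]
    })
}
```
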
2024-08-15 22:21:06 -05:00
Max Brunsfeld
4c390b82fb
Make LanguageModel::use_any_tool return a stream of chunks (#16262)
This PR is a refactor to pave the way for allowing the user to view and
edit workflow step resolutions. I've made tool calls work more like
normal streaming completions for all providers. The `use_any_tool`
method returns a stream of strings (which contain chunks of JSON). I've
also done some minor cleanup of language model providers in general,
removing the duplication around handling streaming responses.
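
In practice, callers now accumulate the streamed JSON chunks and parse the result once the stream ends; a hedged sketch of the consuming side (simplified signature, using `anyhow` and `serde_json` for brevity):

```rust
use anyhow::Result;
use futures::stream::{BoxStream, StreamExt};

/// Each item from `use_any_tool` is a chunk of the tool's JSON arguments,
/// streamed the same way completion text is.
async fn collect_tool_input(
    mut chunks: BoxStream<'static, Result<String>>,
) -> Result<serde_json::Value> {
    let mut raw = String::new();
    while let Some(chunk) = chunks.next().await {
        raw.push_str(&chunk?);
    }
    // Only once the stream is exhausted do we have a complete JSON document.
    Ok(serde_json::from_str(&raw)?)
}
```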

Release Notes:

- N/A
2024-08-14 18:02:46 -07:00
Bennet Bo Fenner
ccd8f75cff
assistant: Adjust terms of service notice (#16235)
Co-Authored-by: Max <max@zed.dev>
Co-Authored-by: Marshall <marshall@zed.dev>
Co-Authored-by: Peter <peter@zed.dev>

<img width="396" alt="image"
src="https://github.com/user-attachments/assets/62282506-c74a-455e-ae4d-0438d47fed96">

Release Notes:

- N/A

Co-authored-by: Max <max@zed.dev>
Co-authored-by: Marshall <marshall@zed.dev>
Co-authored-by: Peter <peter@zed.dev>
2024-08-14 19:21:07 +02:00
Danilo Leal
c6a1d9aa33
assistant: Polish terms of service toast design (#16183)
Pushing in tiny design tweaks and a wording change on the button so it's a bit more explicit.

Release Notes:

- N/A
2024-08-13 17:31:46 -03:00
Marshall Bowers
8a148f3a13
Add feature-flagged access to LLM service (#16136)
This PR adds feature-flagged access to the LLM service.

We've repurposed the `language-models` feature flag to be used for
providing access to Claude 3.5 Sonnet through the Zed provider.

The remaining RPC endpoints that were previously behind the
`language-models` feature flag are now behind a staff check.

We also put some Zed Pro related messaging behind a feature flag.

Release Notes:

- N/A

---------

Co-authored-by: Max <max@zed.dev>
2024-08-12 18:13:40 -04:00
Max Brunsfeld
1674e12ccb
Expose anthropic API errors to the client (#16129)
Now, when an Anthropic request is invalid or Anthropic's API is down,
we'll expose that to the user instead of just returning a generic 500.

Release Notes:

- N/A

Co-authored-by: Marshall <marshall@zed.dev>
2024-08-12 13:11:48 -07:00
Marshall Bowers
ebdb755fef
Surface upstream rate limits from Anthropic (#16118)
This PR makes it so hitting upstream rate limits from Anthropic results
in an HTTP 429 response instead of an HTTP 500.

To do this we need to surface structured errors out of the `anthropic`
crate.
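
Roughly, "structured errors" means the crate returns a typed error that the service can map onto a status code, rather than a stringly-typed failure; a sketch with hypothetical names:

```rust
/// Hypothetical shape of the error type surfaced by the `anthropic` crate.
enum AnthropicError {
    RateLimit { retry_after_secs: Option<u64> },
    Other { status: u16, message: String },
}

/// The LLM service can now tell upstream rate limiting apart from other
/// failures and forward an appropriate status code to the client.
fn status_code_for(error: &AnthropicError) -> u16 {
    match error {
        AnthropicError::RateLimit { .. } => 429,
        AnthropicError::Other { .. } => 500,
    }
}
```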

Release Notes:

- N/A
2024-08-12 11:59:24 -04:00
Thorsten Ball
fbb533b3e0
assistant: Require user to accept TOS for cloud provider (#16111)
This adds the requirement for users to accept the terms of service the
first time they send a message with the Cloud provider.

Once this is out and in a nightly, we need to add the check to the
server side too, to authenticate access to the models.

Demo:


https://github.com/user-attachments/assets/0edebf74-8120-4fa2-b801-bb76f04e8a17



Release Notes:

- N/A
2024-08-12 17:43:35 +02:00
Marshall Bowers
6389c613a2
Always stream completions through the LLM service (#16113)
This PR removes the `llm-service` feature flag and makes it so all
completions are done via the LLM service when using the Zed provider.

Release Notes:

- N/A
2024-08-12 09:33:24 -04:00
Max Brunsfeld
fbebb73d7b
Use LLM service for tool call requests (#16046)
Release Notes:

- N/A

---------

Co-authored-by: Marshall <marshall@zed.dev>
2024-08-09 16:22:58 -04:00
Piotr Osiewicz
874f0c0712
assistant: Use tools in other providers (#15803)
- [x] OpenAI
- [ ] ~Google~ Moved into a separate branch at:
https://github.com/zed-industries/zed/tree/tool-calls-in-google-ai. I've
run into issues with having the API digest our schema without tripping
over itself - the function call parameters are malformed and whatnot. We
can resume from that branch if needed.
- [x] Ollama
- [x] Cloud
- [ ] ~Copilot Chat (?)~

Release Notes:

- Added tool calling capabilities to OpenAI and Ollama models.
2024-08-06 15:45:47 +02:00
Bennet Bo Fenner
d6e5265e84
assistant: Limit model access (#15820)
Release Notes:

- N/A

---------

Co-authored-by: Thorsten <thorsten@zed.dev>
2024-08-06 12:19:19 +02:00
Marshall Bowers
ca9511393b
collab: Add support for more providers to the LLM service (#15832)
This PR adds support for additional providers to the LLM service:

- OpenAI
- Google
- Custom Zed models (through Hugging Face)

Release Notes:

- N/A
2024-08-05 21:16:18 -04:00
Max Brunsfeld
8e9c2b1125
Introduce a separate backend service for LLM calls (#15831)
This PR introduces a separate backend service for making LLM calls.

It exposes an HTTP interface that can be called by Zed clients. To call
these endpoints, the client must provide a `Bearer` token. These tokens
are issued/refreshed by the collab service over RPC.

We're adding this in a backwards-compatible way. Right now the access
tokens can only be minted for Zed staff, and calling this separate LLM
service is behind the `llm-service` feature flag (which is not
automatically enabled for Zed staff).
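
A sketch of the shape of a client-side call (the endpoint path is hypothetical; issuing and refreshing the token over RPC is not shown):

```rust
use http::Request;

/// Build an HTTP request to the LLM service, authenticating with the access
/// token previously issued by the collab service over RPC.
fn completion_request(
    llm_service_url: &str,
    access_token: &str,
    body: String,
) -> http::Result<Request<String>> {
    Request::builder()
        .method("POST")
        // Hypothetical endpoint path, for illustration only.
        .uri(format!("{llm_service_url}/completion"))
        .header("Authorization", format!("Bearer {access_token}"))
        .header("Content-Type", "application/json")
        .body(body)
}
```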

Release Notes:

- N/A

---------

Co-authored-by: Marshall <marshall@zed.dev>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-08-05 20:26:21 -04:00
Thorsten Ball
49d0672cdd
assistant panel: Fix wrong state for Zed.dev provider being shown (#15800)
Release Notes:

- N/A

Co-authored-by: Bennet <bennet@zed.dev>
2024-08-05 15:35:58 +02:00
Thorsten Ball
390815dd76
assistant panel: Tab-less configuration view (#15682)
TODOs for follow-up:
- [ ] When opening panel: nudge user to sign in if they're not signed-in
and have no provider configured (or if they're not signed-in and have
Zed AI configured)
- [ ] Configuration page is not scrollable
- [ ] Design tweaks

Current status:



https://github.com/user-attachments/assets/d26d65ea-43e8-481b-81a3-b3cba01704a8


Release Notes:

- N/A
2024-08-02 17:16:18 +02:00
Nate Butler
b4dcd6d394
Update model selector (#15665)
Release Notes:

- N/A

---------

Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-08-01 21:57:51 -04:00
Marshall Bowers
5e011ab029
language_model: Denote the availability of language models (#15660)
This PR updates the `LanguageModel` trait with a new method for denoting
the availability of a model.

Right now we have two variants:

- `Public` for models that have no additional restrictions (other than
their respective setup/authentication requirements)
- `RequiresPlan` for models that require a specific Zed plan
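
Sketched out, the addition looks something like this (the `Plan` type and the default implementation are paraphrased from the description above, not copied from the code):

```rust
#[derive(Clone, Copy, Debug, PartialEq)]
enum Plan {
    ZedPro,
}

/// Availability of a language model, as described above.
#[derive(Clone, Copy, Debug, PartialEq)]
enum LanguageModelAvailability {
    /// No additional restrictions beyond the provider's own setup/auth.
    Public,
    /// Only available to users on the given Zed plan.
    RequiresPlan(Plan),
}

trait LanguageModel {
    /// The new method; most models are simply public.
    fn availability(&self) -> LanguageModelAvailability {
        LanguageModelAvailability::Public
    }
}
```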

Release Notes:

- N/A
2024-08-01 18:26:27 -04:00
Antonio Scandurra
21816d1ff5
Add Qwen2-7B to the list of zed.dev models (#15649)
Release Notes:

- N/A

---------

Co-authored-by: Nathan <nathan@zed.dev>
2024-08-01 22:26:07 +02:00
Marshall Bowers
4bfb8fda8d
Rename zed.dev/settings to zed.dev/account (#15636)
This PR renames the links to the `zed.dev/settings` page to the
`zed.dev/account`.

Some of these spots will likely link out to a marketing page later.

Release Notes:

- N/A
2024-08-01 13:59:21 -04:00
Nate Butler
70b2da78f8
Update assistant config UI (#15630)
![CleanShot 2024-08-01 at 12 55
01@2x](https://github.com/user-attachments/assets/f9ed44ba-6bff-4805-ad71-2e3538315e57)

- Remove assistant_description for now.
- Updates assistant config UI
- Updates Ollama and zed.dev provider UIs
- Updates download icon

Release Notes:

- N/A

---------

Co-authored-by: Marshall Bowers <1486634+maxdeviant@users.noreply.github.com>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-08-01 13:30:35 -04:00
Bennet Bo Fenner
be3a8584ff
assistant: Add a Configuration page (#15490)
- [x] bug: setting a key doesn't update anything
- [x] show high-level text on configuration page to explain what it is
- [x] show "everything okay!" status when credentials are set
- [x] maybe: add "verify" button to check credentials
- [x] open configuration page when opening panel for first time and
nothing is configured
- [x] BUG: need to fix empty assistant panel if provider is `zed.dev`
but not logged in


Co-Authored-By: Thorsten <thorsten@zed.dev>

Release Notes:

- N/A

---------

Co-authored-by: Thorsten <thorsten@zed.dev>
Co-authored-by: Nate Butler <iamnbutler@gmail.com>
Co-authored-by: Thorsten Ball <mrnugget@gmail.com>
2024-08-01 15:54:47 +02:00
Thorsten Ball
874fedd717
assistant panel: Fix panic when opening panel with zed.dev provider (#15538)
There was/is some race condition, triggered only when opening the panel
with the zed.dev provider, that would cause a double-borrow on the
workspace.

This PR fixes the issue by cloning the workspace weakly. Turns out we
can go very far with just the weak reference.

We're still a bit unsure why exactly the race condition happened, since
it's hard to reproduce, but we're working on configuration
view/management in #15490 anyway.

Release Notes:

- N/A

Co-authored-by: Bennet <bennet@zed.dev>
2024-07-31 16:57:24 +02:00
Bennet Bo Fenner
821ce2fc7c
assistant panel: Fix panel not reloading after entering credentials (#15531)
This is the revised version of #15527.

We also added new events to notify subscribers when new providers are
added or removed.

Co-Authored-by: Thorsten <thorsten@zed.dev>

Release Notes:

- N/A

---------

Co-authored-by: Thorsten <thorsten@zed.dev>
Co-authored-by: Thorsten Ball <mrnugget@gmail.com>
2024-07-31 14:12:17 +02:00
Bennet Bo Fenner
380a19fcf2
Revert "assistant panel: Fix entering credentials not updating view" (#15528)
Reverts zed-industries/zed#15527

We broke the assistant panel in the process...

Release Notes:

- N/A
2024-07-31 13:26:27 +02:00
Thorsten Ball
b571bc800d
assistant panel: Fix entering credentials not updating view (#15527)
Co-authored-by: Bennet <bennet@zed.dev>

Release Notes:

- N/A

Co-authored-by: Bennet <bennet@zed.dev>
2024-07-31 12:51:41 +02:00
Antonio Scandurra
99bc90a372
Allow customization of the model used for tool calling (#15479)
We also eliminated the `completion` crate and moved its logic into
`LanguageModelRegistry`.

Release Notes:

- N/A

---------

Co-authored-by: Nathan <nathan@zed.dev>
2024-07-30 16:18:53 +02:00
Antonio Scandurra
6e1f7c6e1d
Use tool calling instead of XML parsing to generate edit operations (#15385)
Release Notes:

- N/A

---------

Co-authored-by: Nathan <nathan@zed.dev>
2024-07-29 16:42:08 +02:00