Commit graph

45 commits

Author SHA1 Message Date
jvmncs
c71f052276
Add ability to use o1-preview and o1-mini as custom models (#17804)
This is a barebones modification of the OpenAI provider code to
accommodate non-streaming completions. This is specifically for the o1
models, which do not support streaming. Tested that this is working by
running a `/workflow` with the following (arbitrarily chosen) settings:

```json
{
  "language_models": {
    "openai": {
      "version": "1",
      "available_models": [
        {
          "name": "o1-preview",
          "display_name": "o1-preview",
          "max_tokens": 128000,
          "max_completion_tokens": 30000
        },
        {
          "name": "o1-mini",
          "display_name": "o1-mini",
          "max_tokens": 128000,
          "max_completion_tokens": 20000
        }
      ]
    }
  },
}
```
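
For context, a minimal sketch of the general technique, assuming only the `futures` crate; the function names are made up and this is not the PR's actual code:

```rust
// A hedged sketch: adapt a provider that returns a complete, non-streaming
// response (as the o1 models do) to a streaming interface by wrapping the
// single response in a one-item stream.
use futures::{stream, Stream, StreamExt};

// Placeholder for a blocking, non-streaming completion call (hypothetical).
async fn fetch_completion(prompt: &str) -> String {
    format!("response to: {prompt}")
}

// Downstream code written against streaming completions keeps working,
// because the single response arrives as a stream that yields one chunk.
fn complete_as_stream(prompt: String) -> impl Stream<Item = String> {
    stream::once(async move { fetch_completion(&prompt).await })
}

fn main() {
    futures::executor::block_on(async {
        let mut chunks = Box::pin(complete_as_stream("hello".into()));
        while let Some(chunk) = chunks.next().await {
            println!("{chunk}");
        }
    });
}
```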

Release Notes:

- Changed the `low_speed_timeout_in_seconds` option to `600` for the OpenAI
provider to accommodate the recent o1 model release.

---------

Co-authored-by: Peter <peter@zed.dev>
Co-authored-by: Bennet <bennet@zed.dev>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-09-13 15:42:15 -04:00
Peter Tripp
fb9d01b0d5
assistant: Add display_name for OpenAI and Gemini (#17508) 2024-09-10 13:41:06 -04:00
Piotr Osiewicz
e6c1c51b37
chore: Fix several style lints (#17488)
It's not comprehensive enough to start linting on the `style` group, but
hey, it's a start.

Release Notes:

- N/A
2024-09-06 11:58:39 +02:00
Marshall Bowers
f38956943b
assistant: Propagate LLM stop reason upwards (#17358)
This PR makes it so we propagate the `stop_reason` from Anthropic up to
the Assistant so that we can take action based on it.

The `extract_content_from_events` function was moved from the `anthropic`
crate to the `anthropic` module in `language_model`, since it is more useful if it
is able to name the `LanguageModelCompletionEvent` type, as otherwise
we'd need an additional layer of plumbing.

Release Notes:

- N/A
2024-09-04 12:31:10 -04:00
Marshall Bowers
452272e5df
assistant: Stream tool uses as structured data (#17322)
This PR adjusts the approach we use to encoding tool uses in the
completion response to use a structured format rather than simply
injecting it into the response stream as text.

In #17170 we would encode the tool uses as XML and insert them as text.
This would require then re-parsing the tool uses out of the buffer in
order to use them.

The approach taken in this PR is to make `stream_completion` return a
stream of `LanguageModelCompletionEvent`s. Each of these events can be
either text, or a tool use.

A new `stream_completion_text` method has been added to `LanguageModel`
for scenarios where we only care about textual content (currently,
everywhere that isn't the Assistant context editor).
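
A minimal sketch of the event shape this describes; the type and field names below are assumptions for illustration, not the exact Zed definitions:

```rust
// A hedged sketch of a completion event that is either text or a tool use.
struct LanguageModelToolUse {
    name: String,
    input: String, // raw JSON arguments for the tool
}

enum LanguageModelCompletionEvent {
    Text(String),
    ToolUse(LanguageModelToolUse),
}

fn handle(event: LanguageModelCompletionEvent) {
    match event {
        // Plain text chunks stream straight into the context editor.
        LanguageModelCompletionEvent::Text(chunk) => print!("{chunk}"),
        // Tool uses arrive as structured data, with no XML re-parsing needed.
        LanguageModelCompletionEvent::ToolUse(tool_use) => {
            println!("tool `{}` requested with input {}", tool_use.name, tool_use.input);
        }
    }
}

fn main() {
    handle(LanguageModelCompletionEvent::Text("hello".into()));
}
```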

Release Notes:

- N/A
2024-09-03 15:04:51 -04:00
Marshall Bowers
68ea661711
assistant: Add foundation for receiving tool uses from Anthropic models (#17170)
This PR updates the Assistant with support for receiving tool uses from
Anthropic models and capturing them as text in the context editor.

This is just laying the foundation for tool use. We don't fulfill the tool
uses yet, or define any tools for the model to use.

Here's an example of what it looks like using the example `get_weather`
tool from the Anthropic docs:

<img width="644" alt="Screenshot 2024-08-30 at 1 51 13 PM"
src="https://github.com/user-attachments/assets/3614f953-0689-423c-8955-b146729ea638">

Release Notes:

- N/A
2024-08-30 14:05:55 -04:00
Thorsten Ball
7647644602
zed ai: Show ToS form in Configuration View (#16736)
Related #16618

Release Notes:

- N/A
2024-08-23 11:17:21 +02:00
Marshall Bowers
93642c9c51
Pass through Anthropic cache configuration when using Zed provider (#16685)
This PR makes it so the model's cache configuration gets passed through
from the base model when using the Zed provider.

Release Notes:

- Fixed caching for Anthropic models when using the Zed provider.
2024-08-22 12:48:47 -04:00
邻二氮杂菲
f1778dd9de
Add max_output_tokens to OpenAI models and integrate into requests (#16381)
### Pull Request Title
Introduce `max_output_tokens` Field for OpenAI Models


https://platform.deepseek.com/api-docs/news/news0725/#4-8k-max_tokens-betarelease-longer-possibilities

### Description
This commit introduces a new field `max_output_tokens` to the OpenAI
models, which allows specifying the maximum number of tokens that can be
generated in the output. This field is now integrated into the request
handling across multiple crates, ensuring that the output token limit is
respected during language model completions.

Changes include:
- Adding `max_output_tokens` to the `Custom` variant of the
`open_ai::Model` enum.
- Updating the `into_open_ai` method in `LanguageModelRequest` to accept
and use `max_output_tokens`.
- Modifying the `OpenAiLanguageModel` and `CloudLanguageModel`
implementations to pass `max_output_tokens` when converting requests.
- Ensuring that the `max_output_tokens` field is correctly serialized
and deserialized in relevant structures.

This enhancement provides more control over the output length of OpenAI
model responses, improving the flexibility and accuracy of language
model interactions.
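
For illustration, a hedged sketch of the shape described above; the variant, fields, and conversion below follow the wording of this description rather than the exact Zed code:

```rust
// A minimal sketch: a custom model carries an optional output-token cap that
// is forwarded onto the provider request.
enum Model {
    Custom {
        name: String,
        max_tokens: usize,              // context window
        max_output_tokens: Option<u32>, // cap on generated tokens
    },
}

struct OpenAiRequest {
    model: String,
    max_tokens: Option<u32>, // provider-side limit on output tokens
}

fn into_open_ai(model: &Model) -> OpenAiRequest {
    match model {
        Model::Custom { name, max_output_tokens, .. } => OpenAiRequest {
            model: name.clone(),
            max_tokens: *max_output_tokens,
        },
    }
}

fn main() {
    let model = Model::Custom {
        name: "my-custom-model".into(), // hypothetical model name
        max_tokens: 128_000,
        max_output_tokens: Some(16_384),
    };
    let request = into_open_ai(&model);
    println!("{} (max output tokens: {:?})", request.model, request.max_tokens);
}
```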

### Related Issue
https://github.com/zed-industries/zed/pull/16358

### Screenshots / Media
N/A

### Checklist
- [x] Code compiles correctly.
- [x] All tests pass.
- [ ] Documentation has been updated accordingly.
- [ ] Additional tests have been added to cover new functionality.
- [ ] Relevant documentation has been updated or added.

### Release Notes

- Added `max_output_tokens` field to OpenAI models for controlling
output token length.
2024-08-21 00:39:10 -04:00
Max Brunsfeld
b5bd8a5c5d
Add logic for closed beta LLM models (#16482)
Release Notes:

- N/A

---------

Co-authored-by: Marshall <marshall@zed.dev>
2024-08-19 11:09:52 -07:00
Nathan Sobo
b9176fe4bb
Add custom icon for Anthropic hosted models (#16436)
This commit adds a custom icon for Anthropic hosted models.


![CleanShot 2024-08-18 at 15 40
38@2x](https://github.com/user-attachments/assets/d467ccab-9628-4258-89fc-782e0d4a48d4)
![CleanShot 2024-08-18 at 15 40
34@2x](https://github.com/user-attachments/assets/7efaff9c-6a58-47ba-87ea-e0fe0586fedc)


- Adding a new SVG icon for Anthropic hosted models.
  - The new icon is located at: `assets/icons/ai_anthropic_hosted.svg`
- Updating the LanguageModel trait to include an optional icon method
- Implementing the icon method for CloudModel to return the custom icon
for Anthropic hosted models
- Updating the UI components to use the model-specific icon when
available
- Adding a new IconName variant for the Anthropic hosted icon
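
A minimal sketch of the optional icon hook mentioned in the list above; the trait method, types, and variant names are assumptions:

```rust
// A hedged sketch: models may provide a custom icon, with a generic fallback.
#[derive(Clone, Copy, Debug)]
enum IconName {
    ZedAssistant,      // generic fallback icon (assumed name)
    AiAnthropicHosted, // maps to assets/icons/ai_anthropic_hosted.svg
}

trait LanguageModel {
    // Optional icon; providers without a custom icon fall back to the default.
    fn icon(&self) -> Option<IconName> {
        None
    }
}

struct AnthropicHostedModel;

impl LanguageModel for AnthropicHostedModel {
    fn icon(&self) -> Option<IconName> {
        Some(IconName::AiAnthropicHosted)
    }
}

fn main() {
    let model = AnthropicHostedModel;
    println!("{:?}", model.icon().unwrap_or(IconName::ZedAssistant));
}
```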

We should change the non-hosted icon in some small way to distinguish it
from the hosted version. I duplicated the path for now so we can
hopefully add it for the next release.

Release Notes:

- N/A
2024-08-18 16:07:15 -06:00
Nathan Sobo
907d76208d
Allow display name of custom Anthropic models to be customized (#16376)
Also added some docs for our settings.

Release Notes:

- N/A
2024-08-16 14:02:37 -06:00
Roy Williams
b4f5f5024e
Support 8192 output tokens for Claude Sonnet 3.5 (#16358)
Release Notes:

- Added support for 8192 output tokens from Claude Sonnet 3.5
(https://x.com/alexalbert__/status/1812921642143900036)
2024-08-16 11:47:39 -04:00
Roy Williams
46fb917e02
Implement Anthropic prompt caching (#16274)
Release Notes:

- Adds support for Prompt Caching in Anthropic. For models that support
it this can dramatically lower cost while improving performance.
2024-08-15 22:21:06 -05:00
Max Brunsfeld
4c390b82fb
Make LanguageModel::use_any_tool return a stream of chunks (#16262)
This PR is a refactor to pave the way for allowing the user to view and
edit workflow step resolutions. I've made tool calls work more like
normal streaming completions for all providers. The `use_any_tool`
method returns a stream of strings (which contain chunks of JSON). I've
also done some minor cleanup of language model providers in general,
removing the duplication around handling streaming responses.
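
A hedged sketch of consuming such a stream (illustrative only, assuming the `futures` and `serde_json` crates): accumulate the string chunks, then parse the JSON once the stream ends.

```rust
use futures::{stream, StreamExt};

fn main() -> serde_json::Result<()> {
    futures::executor::block_on(async {
        // Stand-in for the chunks a model might stream back for one tool call.
        let mut chunks = stream::iter(vec![
            r#"{"city": "San"#.to_string(),
            r#" Francisco", "unit": "celsius"}"#.to_string(),
        ]);

        // Individual chunks are not valid JSON on their own; accumulate first.
        let mut buffer = String::new();
        while let Some(chunk) = chunks.next().await {
            buffer.push_str(&chunk);
        }

        let arguments: serde_json::Value = serde_json::from_str(&buffer)?;
        println!("{arguments}");
        Ok(())
    })
}
```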

Release Notes:

- N/A
2024-08-14 18:02:46 -07:00
Bennet Bo Fenner
ccd8f75cff
assistant: Adjust terms of service notice (#16235)
Co-Authored-by: Max <max@zed.dev>
Co-Authored-by: Marshall <marshall@zed.dev>
Co-Authored-by: Peter <peter@zed.dev>

<img width="396" alt="image"
src="https://github.com/user-attachments/assets/62282506-c74a-455e-ae4d-0438d47fed96">

Release Notes:

- N/A

Co-authored-by: Max <max@zed.dev>
Co-authored-by: Marshall <marshall@zed.dev>
Co-authored-by: Peter <peter@zed.dev>
2024-08-14 19:21:07 +02:00
Danilo Leal
c6a1d9aa33
assistant: Polish terms of service toast design (#16183)
Pushing in tiny design tweaks and a wording change on the button so it's a bit more explicit.

Release Notes:

- N/A
2024-08-13 17:31:46 -03:00
Marshall Bowers
8a148f3a13
Add feature-flagged access to LLM service (#16136)
This PR adds feature-flagged access to the LLM service.

We've repurposed the `language-models` feature flag to be used for
providing access to Claude 3.5 Sonnet through the Zed provider.

The remaining RPC endpoints that were previously behind the
`language-models` feature flag are now behind a staff check.

We also put some Zed Pro related messaging behind a feature flag.

Release Notes:

- N/A

---------

Co-authored-by: Max <max@zed.dev>
2024-08-12 18:13:40 -04:00
Max Brunsfeld
1674e12ccb
Expose anthropic API errors to the client (#16129)
Now, when an Anthropic request is invalid or Anthropic's API is down,
we'll expose that to the user instead of just returning a generic 500.

Release Notes:

- N/A

Co-authored-by: Marshall <marshall@zed.dev>
2024-08-12 13:11:48 -07:00
Marshall Bowers
ebdb755fef
Surface upstream rate limits from Anthropic (#16118)
This PR makes it so hitting upstream rate limits from Anthropic result
in an HTTP 429 response instead of an HTTP 500.

To do this we need to surface structured errors out of the `anthropic`
crate.
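
A minimal sketch of the idea, with illustrative types rather than the actual `anthropic` crate definitions: a structured error lets the service map an upstream rate limit to HTTP 429 while everything else still gets a 500.

```rust
use std::time::Duration;

// A hedged sketch of a structured provider error.
enum AnthropicError {
    RateLimit { retry_after: Option<Duration> },
    Other(String),
}

fn response_status(error: &AnthropicError) -> u16 {
    match error {
        // Upstream rate limits surface as 429 so clients can back off.
        AnthropicError::RateLimit { .. } => 429,
        // Everything else still falls back to a generic 500.
        AnthropicError::Other(_) => 500,
    }
}

fn main() {
    let upstream = AnthropicError::RateLimit {
        retry_after: Some(Duration::from_secs(30)),
    };
    println!("responding with HTTP {}", response_status(&upstream));
}
```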

Release Notes:

- N/A
2024-08-12 11:59:24 -04:00
Thorsten Ball
fbb533b3e0
assistant: Require user to accept TOS for cloud provider (#16111)
This adds the requirement for users to accept the terms of service the
first time they send a message with the Cloud provider.

Once this is out and in a nightly, we need to add the check to the
server side too, to authenticate access to the models.

Demo:


https://github.com/user-attachments/assets/0edebf74-8120-4fa2-b801-bb76f04e8a17



Release Notes:

- N/A
2024-08-12 17:43:35 +02:00
Marshall Bowers
6389c613a2
Always stream completions through the LLM service (#16113)
This PR removes the `llm-service` feature flag and makes it so all
completions are done via the LLM service when using the Zed provider.

Release Notes:

- N/A
2024-08-12 09:33:24 -04:00
Max Brunsfeld
fbebb73d7b
Use LLM service for tool call requests (#16046)
Release Notes:

- N/A

---------

Co-authored-by: Marshall <marshall@zed.dev>
2024-08-09 16:22:58 -04:00
Piotr Osiewicz
874f0c0712
assistant: Use tools in other providers (#15803)
- [x] OpenAI
- [ ] ~Google~ Moved into a separate branch at:
https://github.com/zed-industries/zed/tree/tool-calls-in-google-ai I've
run into issues with having the API digest our schema without tripping
over itself - the function call parameters are malformed and whatnot. We
can resume from that branch if needed.
- [x] Ollama
- [x] Cloud
- [ ] ~Copilot Chat (?)~

Release Notes:

- Added tool calling capabilities to OpenAI and Ollama models.
2024-08-06 15:45:47 +02:00
Bennet Bo Fenner
d6e5265e84
assistant: Limit model access (#15820)
Release Notes:

- N/A

---------

Co-authored-by: Thorsten <thorsten@zed.dev>
2024-08-06 12:19:19 +02:00
Marshall Bowers
ca9511393b
collab: Add support for more providers to the LLM service (#15832)
This PR adds support for additional providers to the LLM service:

- OpenAI
- Google
- Custom Zed models (through Hugging Face)

Release Notes:

- N/A
2024-08-05 21:16:18 -04:00
Max Brunsfeld
8e9c2b1125
Introduce a separate backend service for LLM calls (#15831)
This PR introduces a separate backend service for making LLM calls.

It exposes an HTTP interface that can be called by Zed clients. To call
these endpoints, the client must provide a `Bearer` token. These tokens
are issued/refreshed by the collab service over RPC.
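
For illustration, a hedged sketch of the client side of this flow; the endpoint URL, payload shape, and crates used below are assumptions, not the actual Zed LLM service API.

```rust
// A hedged sketch assuming reqwest (with the `json` feature) and tokio; the
// endpoint and model name are hypothetical.
use serde_json::json;

async fn llm_completion(llm_token: &str) -> Result<reqwest::Response, reqwest::Error> {
    let client = reqwest::Client::new();
    client
        .post("https://llm.example.com/completion") // hypothetical endpoint
        .bearer_auth(llm_token) // token issued/refreshed by collab over RPC
        .json(&json!({
            "provider": "anthropic",
            "model": "claude-3-5-sonnet",
            "messages": [{ "role": "user", "content": "Hello" }]
        }))
        .send()
        .await
}

#[tokio::main]
async fn main() -> Result<(), reqwest::Error> {
    let response = llm_completion("token-from-collab").await?;
    println!("status: {}", response.status());
    Ok(())
}
```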

We're adding this in a backwards-compatible way. Right now the access
tokens can only be minted for Zed staff, and calling this separate LLM
service is behind the `llm-service` feature flag (which is not
automatically enabled for Zed staff).

Release Notes:

- N/A

---------

Co-authored-by: Marshall <marshall@zed.dev>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-08-05 20:26:21 -04:00
Thorsten Ball
49d0672cdd
assistant panel: Fix wrong state for Zed.dev provider being shown (#15800)
Release Notes:

- N/A

Co-authored-by: Bennet <bennet@zed.dev>
2024-08-05 15:35:58 +02:00
Thorsten Ball
390815dd76
assistant panel: Tab-less configuration view (#15682)
TODOs for follow-up:
- [ ] When opening panel: nudge user to sign in if they're not signed-in
and have no provider configured (or if they're not signed-in and have
Zed AI configured)
- [ ] Configuration page is not scrollable
- [ ] Design tweaks

Current status:



https://github.com/user-attachments/assets/d26d65ea-43e8-481b-81a3-b3cba01704a8


Release Notes:

- N/A
2024-08-02 17:16:18 +02:00
Nate Butler
b4dcd6d394
Update model selector (#15665)
Release Notes:

- N/A

---------

Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-08-01 21:57:51 -04:00
Marshall Bowers
5e011ab029
language_model: Denote the availability of language models (#15660)
This PR updates the `LanguageModel` trait with a new method for denoting
the availability of a model.

Right now we have two variants:

- `Public` for models that have no additional restrictions (other than
their respective setup/authentication requirements)
- `RequiresPlan` for models that require a specific Zed plan
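
A minimal sketch of the availability concept described above; method and type names are assumptions and may differ from the Zed codebase:

```rust
// A hedged sketch of per-model availability.
#[derive(Clone, Copy, Debug, PartialEq)]
enum LanguageModelAvailability {
    /// No additional restrictions beyond the provider's own setup/auth.
    Public,
    /// Requires a specific Zed plan.
    RequiresPlan,
}

trait LanguageModel {
    fn availability(&self) -> LanguageModelAvailability {
        LanguageModelAvailability::Public
    }
}

struct HostedModel;

impl LanguageModel for HostedModel {
    fn availability(&self) -> LanguageModelAvailability {
        LanguageModelAvailability::RequiresPlan
    }
}

fn main() {
    println!("{:?}", HostedModel.availability());
}
```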

Release Notes:

- N/A
2024-08-01 18:26:27 -04:00
Antonio Scandurra
21816d1ff5
Add Qwen2-7B to the list of zed.dev models (#15649)
Release Notes:

- N/A

---------

Co-authored-by: Nathan <nathan@zed.dev>
2024-08-01 22:26:07 +02:00
Marshall Bowers
4bfb8fda8d
Rename zed.dev/settings to zed.dev/account (#15636)
This PR renames the links to the `zed.dev/settings` page to
`zed.dev/account`.

Some of these spots will likely link out to a marketing page later.

Release Notes:

- N/A
2024-08-01 13:59:21 -04:00
Nate Butler
70b2da78f8
Update assistant config UI (#15630)
![CleanShot 2024-08-01 at 12 55
01@2x](https://github.com/user-attachments/assets/f9ed44ba-6bff-4805-ad71-2e3538315e57)

- Remove `assistant_description` for now.
- Updates assistant config UI
- Updates Ollama and zed.dev provider UIs
- Updates download icon

Release Notes:

- N/A

---------

Co-authored-by: Marshall Bowers <1486634+maxdeviant@users.noreply.github.com>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
2024-08-01 13:30:35 -04:00
Bennet Bo Fenner
be3a8584ff
assistant: Add a Configuration page (#15490)
- [x] bug: setting a key doesn't update anything
- [x] show high-level text on configuration page to explain what it is
- [x] show "everything okay!" status when credentials are set
- [x] maybe: add "verify" button to check credentials
- [x] open configuration page when opening panel for first time and
nothing is configured
- [x] BUG: need to fix empty assistant panel if provider is `zed.dev`
but not logged in


Co-Authored-By: Thorsten <thorsten@zed.dev>

Release Notes:

- N/A

---------

Co-authored-by: Thorsten <thorsten@zed.dev>
Co-authored-by: Nate Butler <iamnbutler@gmail.com>
Co-authored-by: Thorsten Ball <mrnugget@gmail.com>
2024-08-01 15:54:47 +02:00
Thorsten Ball
874fedd717
assistant panel: Fix panic when opening panel with zed.dev provider (#15538)
There was/is some race condition, triggered only with the zed.dev provider
when opening the panel, that would cause a double borrow on the workspace.

This PR fixes the issue by cloning the workspace weakly. Turns out we
can go very far with just the weak reference.

We're still a bit unsure why exactly the race condition happened, since
it's hard to reproduce, but we're working on configuration
view/management in #15490 anyway.

Release Notes:

- N/A

Co-authored-by: Bennet <bennet@zed.dev>
2024-07-31 16:57:24 +02:00
Bennet Bo Fenner
821ce2fc7c
assistant panel: Fix panel not reloading after entering credentials (#15531)
This is the revised version of #15527.

We also added new events to notify subscribers when new providers are
added or removed.

Co-Authored-by: Thorsten <thorsten@zed.dev>

Release Notes:

- N/A

---------

Co-authored-by: Thorsten <thorsten@zed.dev>
Co-authored-by: Thorsten Ball <mrnugget@gmail.com>
2024-07-31 14:12:17 +02:00
Bennet Bo Fenner
380a19fcf2
Revert "assistant panel: Fix entering credentials not updating view" (#15528)
Reverts zed-industries/zed#15527

We broke the assistant panel in the process...

Release Notes:

- N/A
2024-07-31 13:26:27 +02:00
Thorsten Ball
b571bc800d
assistant panel: Fix entering credentials not updating view (#15527)
Co-authored-by: Bennet <bennet@zed.dev>

Release Notes:

- N/A

Co-authored-by: Bennet <bennet@zed.dev>
2024-07-31 12:51:41 +02:00
Antonio Scandurra
99bc90a372
Allow customization of the model used for tool calling (#15479)
We also eliminate the `completion` crate and moved its logic into
`LanguageModelRegistry`.

Release Notes:

- N/A

---------

Co-authored-by: Nathan <nathan@zed.dev>
2024-07-30 16:18:53 +02:00
Antonio Scandurra
6e1f7c6e1d
Use tool calling instead of XML parsing to generate edit operations (#15385)
Release Notes:

- N/A

---------

Co-authored-by: Nathan <nathan@zed.dev>
2024-07-29 16:42:08 +02:00
Antonio Scandurra
d6bdaa8a91
Simplify LLM protocol (#15366)
In this pull request, we change the zed.dev protocol so that we pass the
raw JSON for the specified provider directly to our server. This avoids
the need to define a protobuf message that's a superset of all these
formats.

@bennetbo: We also changed the settings for available_models under
zed.dev to be a flat format, because the nesting seemed too confusing.
Can you help us upgrade the local provider configuration to be
consistent with this? We do whatever we need to do when parsing the
settings to make this simple for users, even if it's a bit more complex
on our end. We want to use versioning to avoid breaking existing users,
but need to keep making progress.

```json
"zed.dev": {
  "available_models": [
    {
      "provider": "anthropic",
      "name": "some-newly-released-model-we-havent-added",
      "max_tokens": 200000
    }
  ]
}
```

Release Notes:

- N/A

---------

Co-authored-by: Nathan <nathan@zed.dev>
2024-07-28 11:07:10 +02:00
Danilo Leal
912b396e58
Adjust model selector popover design (#15056)
This PR mostly refines the model selector popover design by formatting
the models names' and adjusting spacing/alignment in the list-related
items. The list component changes could've been made in a separate PR
but it was also very practical to do it here as I was already
in-context. Either way, I'm happy to separate if that's better!

One thing I couldn't quite figure out, though, is why the order changed
(e.g., Anthropic now appears last). I wonder if that was because of the
separator logic somehow? I'd love guidance here, as I'm new to Rust!

| Before | After |
|--------|--------|
| <img width="228" alt="Screenshot 2024-07-23 at 21 02 33"
src="https://github.com/user-attachments/assets/3372c6c9-08dc-4d71-9265-26f015e2dbc2">
| <img width="228" alt="Screenshot 2024-07-23 at 21 01 45"
src="https://github.com/user-attachments/assets/624cc7db-a3d9-48e3-99d7-c29829501130">
|

---

Release Notes:

- N/A

---------

Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
Co-authored-by: Bennet Bo Fenner <bennet@zed.dev>
Co-authored-by: Antonio <antonio@zed.dev>
Co-authored-by: Antonio Scandurra <me@as-cii.com>
2024-07-24 12:24:54 +02:00
Bennet Bo Fenner
af4b9805c9
assistant: Fix issues when configuring different providers (#15072)
Release Notes:

- N/A

---------

Co-authored-by: Antonio Scandurra <me@as-cii.com>
2024-07-24 11:21:31 +02:00
Bennet Bo Fenner
d0f52e90e6
assistant: Overhaul provider infrastructure (#14929)
<img width="624" alt="image"
src="https://github.com/user-attachments/assets/f492b0bd-14c3-49e2-b2ff-dc78e52b0815">

- [x] Correctly set custom model token count
- [x] How to count tokens for Gemini models?
- [x] Feature flag zed.dev provider
- [x] Figure out how to configure custom models
- [ ] Update docs

Release Notes:

- Added support for quickly switching between multiple language model
providers in the assistant panel

---------

Co-authored-by: Antonio <antonio@zed.dev>
2024-07-23 19:48:41 +02:00