### Pull Request Title
Introduce `max_output_tokens` Field for OpenAI Models
See: https://platform.deepseek.com/api-docs/news/news0725/#4-8k-max_tokens-betarelease-longer-possibilities
### Description
This commit introduces a new field, `max_output_tokens`, on the OpenAI
models, allowing callers to specify the maximum number of tokens that
can be generated in the output. The field is integrated into request
handling across multiple crates, ensuring that the output token limit
is respected during language model completions.
This enhancement provides more control over the output length of OpenAI
model responses, improving the flexibility and accuracy of language
model interactions.
### Changes
- Added `max_output_tokens` to the `Custom` variant of the
`open_ai::Model` enum.
- Updated the `into_open_ai` method in `LanguageModelRequest` to accept
and use `max_output_tokens`.
- Modified the `OpenAiLanguageModel` and `CloudLanguageModel`
implementations to pass `max_output_tokens` when converting requests.
- Ensured that the `max_output_tokens` field is correctly serialized and
deserialized in relevant structures.
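A hedged sketch of the shape of the change (the exact Zed definitions may differ; the surrounding fields and types are assumptions):
```rust
// Sketch: the `Custom` variant of `open_ai::Model` gains an optional
// output-token cap (field types are assumptions, not Zed's exact code).
#[derive(Clone, Debug)]
pub enum Model {
    // ... built-in model variants elided ...
    Custom {
        name: String,
        max_tokens: usize,
        // Maximum number of tokens the model may generate in its output;
        // `None` leaves the limit unset on the request.
        max_output_tokens: Option<u32>,
    },
}

impl Model {
    pub fn max_output_tokens(&self) -> Option<u32> {
        match self {
            Model::Custom { max_output_tokens, .. } => *max_output_tokens,
            // Built-in models would return their own limits here.
        }
    }
}
```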
### Related Issue
https://github.com/zed-industries/zed/pull/16358
### Screenshots / Media
N/A
### Checklist
- [x] Code compiles correctly.
- [x] All tests pass.
- [ ] Documentation has been updated accordingly.
- [ ] Additional tests have been added to cover new functionality.
### Release Notes
- Added `max_output_tokens` field to OpenAI models for controlling
output token length.
We also eliminated the `completion` crate and moved its logic into
`LanguageModelRegistry`.
---------
Co-authored-by: Nathan <nathan@zed.dev>
This adds the optional `PRESERVED_KEYS` constant to the `Settings` trait,
which allows users of the trait to specify which keys should be written to
the settings file, even if their current value matches the default value.
That's useful for tagged settings that have, for example, a `"version"`
field that should always be present in the user settings file, so that we
can then reparse the user settings based on the version.
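A minimal, self-contained sketch of the idea (this mirrors the description above, not Zed's actual `Settings` trait):
```rust
// Sketch: a `Settings`-like trait with an optional `PRESERVED_KEYS`
// constant. Keys listed here are written to the settings file even when
// their current value matches the default.
trait Settings {
    const PRESERVED_KEYS: Option<&'static [&'static str]> = None;
}

struct TaggedSettings;

impl Settings for TaggedSettings {
    // Always persist the "version" tag so the file can later be reparsed
    // according to its version.
    const PRESERVED_KEYS: Option<&'static [&'static str]> = Some(&["version"]);
}

fn main() {
    assert_eq!(TaggedSettings::PRESERVED_KEYS, Some(&["version"][..]));
}
```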
Co-Authored-By: Thorsten <thorsten@zed.dev>
Release Notes:
- N/A
---------
Co-authored-by: Thorsten <thorsten@zed.dev>
# Summary
This commit implements GitHub Copilot Chat support within the existing
Assistant panel/framework. It required a little bit of trickery and
internal API modification, as Copilot doesn't use the same
authentication style as the existing providers, opting for OAuth and a
short-lived API key instead of a plain API key. All existing Assistant
features should work.
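A hedged sketch of the auth flow described above (names, the key lifetime, and the exchange call are assumptions, not Copilot's actual API):
```rust
use std::time::{Duration, Instant};

// Sketch: hold a long-lived OAuth token, exchange it for a short-lived
// API key, and refresh the key once it expires.
struct CopilotChatAuth {
    oauth_token: String,
    api_key: Option<(String, Instant)>, // (key, expires_at)
}

impl CopilotChatAuth {
    fn api_key(&mut self) -> String {
        if let Some((key, expires_at)) = &self.api_key {
            if *expires_at > Instant::now() {
                return key.clone();
            }
        }
        // Hypothetical exchange; the real endpoint and lifetime differ.
        let key = exchange_oauth_for_api_key(&self.oauth_token);
        self.api_key = Some((key.clone(), Instant::now() + Duration::from_secs(25 * 60)));
        key
    }
}

fn exchange_oauth_for_api_key(_oauth_token: &str) -> String {
    "short-lived-api-key".into()
}
```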
Release Notes:
- Added GitHub Copilot Chat support
([#4673](https://github.com/zed-industries/zed/issues/4673)).
## Screenshots
<img width="1552" alt="A screenshot showing a conversation between a
user and Github Copilot Chat within the Zed editor."
src="https://github.com/user-attachments/assets/73eaf6a2-792b-4c40-a7fe-f763bd6417d7">
---------
Co-authored-by: Bennet Bo Fenner <bennet@zed.dev>
In this pull request, we change the zed.dev protocol so that we pass the
raw JSON for the specified provider directly to our server. This avoids
the need to define a protobuf message that's a superset of all these
formats.
@bennetbo: We also changed the settings for available_models under
zed.dev to be a flat format, because the nesting seemed too confusing.
Can you help us upgrade the local provider configuration to be
consistent with this? We'll do whatever we need to do when parsing the
settings to make this simple for users, even if it's a bit more complex
on our end. We want to use versioning to avoid breaking existing users,
but we need to keep making progress.
```json
"zed.dev": {
"available_models": [
{
"provider": "anthropic",
"name": "some-newly-released-model-we-havent-added",
"max_tokens": 200000
}
]
}
```
Release Notes:
- N/A
---------
Co-authored-by: Nathan <nathan@zed.dev>
<img width="624" alt="image"
src="https://github.com/user-attachments/assets/f492b0bd-14c3-49e2-b2ff-dc78e52b0815">
- [x] Correctly set custom model token count
- [x] How to count tokens for Gemini models?
- [x] Feature flag zed.dev provider
- [x] Figure out how to configure custom models
- [ ] Update docs
Release Notes:
- Added support for quickly switching between multiple language model
providers in the assistant panel
---------
Co-authored-by: Antonio <antonio@zed.dev>
We will soon need `semantic_index` to be able to use
`CompletionProvider`. This is currently impossible due to a cyclic crate
dependency, because `CompletionProvider` lives in the `assistant` crate,
which depends on `semantic_index`.
This PR breaks the dependency cycle by extracting two crates out of
`assistant`: `language_model` and `completion`.
Only one piece of logic changed: [this
code](922fcaf5a6 (diff-3857b3707687a4d585f1200eec4c34a7a079eae8d303b4ce5b4fce46234ace9fR61-R69)).
* As of https://github.com/zed-industries/zed/pull/13276, whenever we
ask a given completion provider for its available models, OpenAI
providers would go and ask the global assistant settings whether the
user had configured an `available_models` setting, and if so, return
that.
* This PR changes it so that instead of eagerly asking the assistant
settings for this info (the new crate must not depend on `assistant`, or
else the dependency cycle would be back), OpenAI completion providers
now store the user-configured settings as part of their struct, and
whenever the settings change, we update the provider.
In theory, this change should not change user-visible behavior...but
since it's the only change in this large PR that's more than just moving
code around, I'm mentioning it here in case there's an unexpected
regression in practice! (cc @amtoaer in case you'd like to try out this
branch and verify that the feature is still working the way you expect.)
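A hedged sketch of the pattern described in the second bullet above (type and method names are assumptions):
```rust
// Sketch: the provider caches the user-configured models and is updated
// from a settings-changed observer, rather than reading the global
// assistant settings on demand (which would reintroduce the cycle).
struct OpenAiCompletionProvider {
    available_models: Vec<String>,
}

impl OpenAiCompletionProvider {
    // Called whenever the settings change.
    fn settings_changed(&mut self, configured_models: Vec<String>) {
        self.available_models = configured_models;
    }

    fn available_models(&self) -> &[String] {
        &self.available_models
    }
}
```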
Release Notes:
- N/A
---------
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
This PR refactors the completion providers to process only a maximum
number of completion requests at a time.
It also starts refactoring language model providers to use traits, so
it's easier to allow specifying multiple providers in the future.
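The PR doesn't spell out its mechanism here; as one way to picture capping in-flight requests, a counting semaphore works (a sketch; `tokio` as the runtime is an assumption):
```rust
use std::sync::Arc;
use tokio::sync::Semaphore;

// At most `permits` completion requests run concurrently; the rest wait.
async fn complete_with_limit(limiter: Arc<Semaphore>, prompt: String) -> String {
    let _permit = limiter.acquire().await.expect("semaphore closed");
    run_completion(prompt).await
}

async fn run_completion(prompt: String) -> String {
    // Stand-in for the actual provider call.
    format!("completion for: {prompt}")
}

#[tokio::main]
async fn main() {
    let limiter = Arc::new(Semaphore::new(4));
    let out = complete_with_limit(limiter, "hello".into()).await;
    println!("{out}");
}
```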
Release Notes:
- N/A
Closes #4424.
A few design decisions that may need some rethinking or later PRs:
* Other providers have a check for authentication. I use this
opportunity to fetch the models, which doubles as a way of finding out
whether the Ollama server is running.
* Ollama has _no_ API for getting the max tokens per model.
* Ollama has _no_ API for getting the current token count:
https://github.com/ollama/ollama/issues/1716
* Ollama does allow setting the `num_ctx`, so I've defaulted this to
4096. It can be overridden in settings.
* Ollama models will be "slow" to start inference because they're
loading the model into memory. It's faster after that. There's no UI
affordance to show that the model is being loaded.
Release Notes:
- Added an Ollama Provider for the assistant. If you have
[Ollama](https://ollama.com/) running locally on your machine, you can
enable it in your settings under:
```jsonc
"assistant": {
"version": "1",
"provider": {
"name": "ollama",
// Recommended setting to allow for model startup
"low_speed_timeout_in_seconds": 30,
}
}
```
Chat like usual
<img width="1840" alt="image"
src="https://github.com/zed-industries/zed/assets/836375/4e0af266-4c4f-4d9e-9d74-1a91f76a12fe">
Interact with any model from the [Ollama
Library](https://ollama.com/library)
<img width="587" alt="image"
src="https://github.com/zed-industries/zed/assets/836375/87433ac6-bf87-4a99-89e1-96a93bf8de8a">
Open up the terminal to download new models via `ollama pull`.
This PR adds some ergonomic improvements when working with GPUI
`Global`s.
Two new traits, `ReadGlobal` and `UpdateGlobal`, provide associated
functions on any type that implements `Global` for accessing and
updating the global without needing to call the methods on the `cx`
directly (which generally involves qualifying the type).
I looked into adding `ObserveGlobal` as well, but this seems a bit
trickier to implement, as the signatures of `cx.observe_global` vary
slightly between the different contexts.
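A self-contained sketch of the ergonomic difference (not GPUI's actual definitions; a `HashMap` of globals stands in for the context):
```rust
use std::any::{Any, TypeId};
use std::collections::HashMap;

// Stand-in for a context that owns globals keyed by type.
struct AppContext {
    globals: HashMap<TypeId, Box<dyn Any>>,
}

// Default-method accessors on the marker trait let call sites write
// `Counter::global(cx)` / `Counter::update_global(cx, ...)` instead of
// `cx.global::<Counter>()` with an explicit type parameter.
trait Global: Any + Sized {
    fn global(cx: &AppContext) -> &Self {
        cx.globals[&TypeId::of::<Self>()].downcast_ref().unwrap()
    }
    fn update_global<R>(cx: &mut AppContext, f: impl FnOnce(&mut Self) -> R) -> R {
        f(cx.globals
            .get_mut(&TypeId::of::<Self>())
            .unwrap()
            .downcast_mut()
            .unwrap())
    }
}

struct Counter(usize);
impl Global for Counter {}

fn main() {
    let mut cx = AppContext { globals: HashMap::new() };
    cx.globals.insert(TypeId::of::<Counter>(), Box::new(Counter(0)));
    Counter::update_global(&mut cx, |c| c.0 += 1);
    println!("count = {}", Counter::global(&cx).0);
}
```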
Release Notes:
- N/A
Release Notes:
- Added support for interacting with Claude in the assistant panel. You
can enable it by adding the following to your `settings.json`:
```json
"assistant": {
"version": "1",
"provider": {
"name": "anthropic"
}
}
```
This PR adds a setting to allow configuring the low-speed timeout for
the Assistant when using the OpenAI provider.
The `low_speed_timeout_in_seconds` accepts a number of seconds that the
HTTP client can go below a minimum speed limit (currently set to 100
bytes/second) before it times out.
```json
{
  "assistant": {
    "version": "1",
    "provider": { "name": "openai", "low_speed_timeout_in_seconds": 60 }
  }
}
```
This should help the case where the `openai` provider is being used with
a local model that requires higher timeouts.
Issue: https://github.com/zed-industries/zed/issues/9913
Release Notes:
- Added a `low_speed_timeout_in_seconds` setting to the Assistant's
OpenAI provider
([#9913](https://github.com/zed-industries/zed/issues/9913)).
This PR adds the ability for extensions to provide certain language
settings via the language `config.toml`.
These settings are then merged in with the rest of the settings when the
language is loaded from the extension.
The language settings that are available are:
- `tab_size`
- `hard_tabs`
- `soft_wrap`
Additionally, for bundled languages we moved these settings out of
`settings/default.json` and into their respective `config.toml`s.
For languages currently provided by extensions, we are leaving the
values in the `settings/default.json` temporarily until all released
versions of Zed are able to load these settings from the extension.
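For illustration, an extension language's `config.toml` could carry these settings like so (a sketch; keys mirror the list above, the name and values are made up):
```toml
name = "My Language"
tab_size = 2
hard_tabs = false
soft_wrap = "editor_width"
```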
---
Along the way we ended up refactoring the `Settings::load` method
slightly, introducing a new `SettingsSources` struct to better convey
where the settings are being loaded from.
This makes it easier to load settings from specific locations/sets of
locations in an explicit way.
Release Notes:
- N/A
---------
Co-authored-by: Max <max@zed.dev>
Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>
This pull request introduces a new `InlineCompletionProvider` trait,
which enables making `Editor` copilot-agnostic and lets us push all the
copilot functionality into the `copilot_ui` module. Long-term, I would
like to merge `copilot` and `copilot_ui`, but right now `project`
depends on `copilot`, which makes this impossible.
The reason for adding this new trait is so that we can experiment with
other inline completion providers and swap them at runtime using config
settings.
Please also note that we renamed some of the existing copilot actions
to be more agnostic (see release notes below). We still kept the old
actions bound for backwards compatibility, but we should probably
remove them in a later version.
Also, as a drive-by, we added new methods to the `Global` trait that let
you read or mutate a global directly, e.g.:
```rs
MyGlobal::update(cx, |global, cx| {
});
```
Release Notes:
- Renamed the `copilot::Suggest` action to
`editor::ShowInlineCompletion`
- Renamed the `copilot::NextSuggestion` action to
`editor::NextInlineCompletion`
- Renamed the `copilot::PreviousSuggestion` action to
`editor::PreviousInlineCompletion`
- Renamed the `editor::AcceptPartialCopilotSuggestion` action to
`editor::AcceptPartialInlineCompletion`
---------
Co-authored-by: Nathan <nathan@zed.dev>
Co-authored-by: Kyle <kylek@zed.dev>
Co-authored-by: Kyle Kelley <rgbkrk@gmail.com>
This PR adds a new `assistant.enabled` setting that controls whether the
Zed Assistant is enabled.
Some users have requested the ability to disable the AI-related features
in Zed if they don't use them. Changing `assistant.enabled` to `false`
will hide the Assistant icon in the status bar (taking priority over the
`assistant.button` setting) as well as filter out the `assistant:`
actions.
The Assistant is enabled by default.
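For example, disabling it in `settings.json` (a minimal snippet, assuming the nested settings shape used elsewhere in this document):
```json
{
  "assistant": {
    "enabled": false
  }
}
```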
Release Notes:
- Added an `assistant.enabled` setting to control whether the Assistant
is enabled.
Co-authored-by: Antonio <antonio@zed.dev>
Resurrected this from some assistant work I did in Spring of 2023.
- [x] Resurrect streaming responses
- [x] Use streaming responses to enable AI via Zed's servers by default
(but preserve API key option for now)
- [x] Simplify protobuf
- [x] Proxy to OpenAI on zed.dev
- [x] Proxy to Gemini on zed.dev
- [x] Improve UX for switching between openAI and google models
- We currently disallow cycling when setting a custom model, but we
need a better solution to keep OpenAI models available while testing
the Google ones
- [x] Show remaining tokens correctly for Google models
- [x] Remove semantic index
- [x] Delete `ai` crate
- [x] CloudFront so we can ban abuse
- [x] Rate-limiting
- [x] Fix panic when using inline assistant
- [x] Double check the upgraded `AssistantSettings` are
backwards-compatible
- [x] Add hosted LLM interaction behind a `language-models` feature
flag.
Release Notes:
- We are temporarily removing the semantic index in order to redesign it
from scratch.
---------
Co-authored-by: Antonio <antonio@zed.dev>
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Thorsten <thorsten@zed.dev>
Co-authored-by: Max <max@zed.dev>
This PR wires up support for [Azure
OpenAI](https://learn.microsoft.com/en-us/azure/ai-services/openai/overview)
as an alternative AI provider in the assistant panel.
This can be configured using the following in the settings file:
```json
{
  "assistant": {
    "provider": {
      "type": "azure_openai",
      "api_url": "https://{your-resource-name}.openai.azure.com",
      "deployment_id": "gpt-4",
      "api_version": "2023-05-15"
    }
  }
}
```
You will need to deploy a model within Azure and update the settings
accordingly.
Release Notes:
- N/A
Partially fixes #4321, since the Azure OpenAI API can be converted to
the OpenAI API.
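A sketch of the new setting in use (the URL is illustrative; any OpenAI-compatible endpoint applies):
```json
{
  "assistant": {
    "openai_api_url": "https://{your-resource-name}.openai.azure.com"
  }
}
```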
Release Notes:
- Added `assistant.openai_api_url` setting to allow OpenAI API URL to be
configured.
---------
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
This PR updates the casing of "OpenAI" when used in Rust identifiers to
match the [Rust naming
guidelines](https://rust-lang.github.io/api-guidelines/naming.html):
> In `UpperCamelCase`, acronyms and contractions of compound words count
as one word: use `Uuid` rather than `UUID`, `Usize` rather than `USize`
or `Stdin` rather than `StdIn`.
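Concretely, the guideline implies renames of this shape (the identifier here is illustrative):
```rust
// Before: struct OpenAICompletionProvider;
// After: the acronym counts as one word in UpperCamelCase.
struct OpenAiCompletionProvider;
```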
Release Notes:
- N/A