Yehowshua/ZIm - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
Marshall Bowers	968ffaa3fd	assistant2: Restructure storage of tool uses and results (#21194 ) This PR restructures the storage of the tool uses and results in `assistant2` so that they don't live on the individual messages. It also introduces a `LanguageModelToolUseId` newtype for better type safety. Release Notes: - N/A	2024-11-25 21:53:27 -05:00
Marshall Bowers	f059b6a24b	assistant2: Add support for using tools (#21190 ) This PR adds rudimentary support for using tools to `assistant2`. There are currently no visual affordances for tool use. This is gated behind the `assistant-tool-use` feature flag. <img width="1079" alt="Screenshot 2024-11-25 at 7 21 31 PM" src="https://github.com/user-attachments/assets/64d6ca29-c592-4474-8e9d-c344f855bc63"> Release Notes: - N/A	2024-11-25 19:44:34 -05:00
Marshall Bowers	3901d46101	Factor tool definitions out of `assistant` (#21189 ) This PR factors the tool definitions out of the `assistant` crate so that they can be shared between `assistant` and `assistant2`. `ToolWorkingSet` now lives in `assistant_tool`. The tool definitions themselves live in `assistant_tools`, with the exception of the `ContextServerTool`, which has been moved to the `context_server` crate. As part of this refactoring I needed to extract the `ContextServerSettings` to a separate `context_server_settings` crate so that the `extension_host`—which is referenced by the `remote_server`—can name the `ContextServerSettings` type without pulling in some undesired dependencies. Release Notes: - N/A	2024-11-25 18:26:34 -05:00
Marshall Bowers	cbba44900d	Add `language_models` crate to house language model providers (#20945 ) This PR adds a new `language_models` crate to house the various language model providers. By extracting the provider definitions out of `language_model`, we're able to remove `language_model`'s dependency on `editor`, which improves incremental compilation when changing `editor`. Release Notes: - N/A	2024-11-20 18:49:34 -05:00
Marshall Bowers	f5cbfa718e	assistant: Fix evaluating slash commands in slash command output (like `/default`) (#20864 ) This PR fixes an issue where slash commands in the output of other slash commands were not being evaluated when configured to do so. Closes https://github.com/zed-industries/zed/issues/20820. Release Notes: - Fixed slash commands from other slash commands (like `/default`) not being evaluated (Preview only).	2024-11-19 11:20:30 -05:00
Danilo Leal	3f905d57e5	assistant: Adjust title summarization prompt (#20822 ) Meant to avoid the excessive use of "Here's a concise 3-7 word title..." and "Title:" instances we've been seeing lately. Follow up to: https://github.com/zed-industries/zed/pull/19530 Release Notes: - Improve prompt for generating title summaries, avoiding preambles	2024-11-18 12:44:06 -03:00
Marshall Bowers	da09cbd055	assistant: Show more details for assist errors (#20740 ) This PR updates the Assistant to show more detailed error messages when the user encounters an assist error. Here are some examples: <img width="415" alt="Screenshot 2024-11-15 at 1 47 03 PM" src="https://github.com/user-attachments/assets/5e7c5d5f-bd78-4af3-86ed-af4c6712770f"> <img width="417" alt="Screenshot 2024-11-15 at 2 11 14 PM" src="https://github.com/user-attachments/assets/02cb659b-1239-4e24-865f-3a512703a94f"> The notification will scroll if the error lines overflow the set maximum height. Release Notes: - Updated the Assistant to show more details in error cases.	2024-11-15 14:23:46 -05:00
Antonio Scandurra	2fe9cd8faa	Fix regression in producing sections when converting `SlashCommandOutput` to event stream (#20404 ) Closes #20243 Release Notes: - N/A	2024-11-08 09:29:14 +01:00
Marshall Bowers	7e7f25df6c	Scope slash commands, context servers, and tools to individual Assistant Panel instances (#20372 ) This PR reworks how the Assistant Panel references slash commands, context servers, and tools. Previously we were always reading them from the global registries, but now we store individual collections on each Assistant Panel instance so that there can be different ones registered for each project. Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev> Co-authored-by: Antonio <antonio@zed.dev> Co-authored-by: Joseph <joseph@zed.dev> Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2024-11-07 18:23:25 -05:00
Antonio Scandurra	16cbff9118	Polish streaming slash commands (#20345 ) This improves the experience in a few ways: - It avoids merging slash command output sections that are adjacent. - When hitting cmd-z, all the output from a command is undone at once. - When deleting a pending command, it stops the command and prevents new output from flowing in. Release Notes: - N/A	2024-11-07 13:25:26 +01:00
Marshall Bowers	b129e18396	Make slash command output streamable (#19632 ) This PR adds support for streaming output from slash commands. In this PR we are focused primarily on the interface of the `SlashCommand` trait to support streaming the output. We will follow up later with support for extensions and context servers to take advantage of the streaming nature. Release Notes: - N/A --------- Co-authored-by: David Soria Parra <davidsp@anthropic.com> Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: David <david@anthropic.com> Co-authored-by: Antonio <antonio@zed.dev> Co-authored-by: Max <max@zed.dev> Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Will <will@zed.dev>	2024-11-06 16:24:43 -08:00
Richard Feldman	bc4bd2e168	Don't conservatively include Suggest Edits token in token count (#20180 ) Before: (note the 1.3k in the upper right corner instead of 3 in the second screenshot) <img width="459" alt="Screenshot 2024-11-04 at 11 37 58 AM" src="https://github.com/user-attachments/assets/64c06aff-f7d2-42a4-a767-0d7a4ba0f486"> Now: <img width="631" alt="Screenshot 2024-11-04 at 11 38 11 AM" src="https://github.com/user-attachments/assets/22af974d-915a-41e1-9ee0-f0622901e242"> This was intended to be a conservative estimate in case you pressed Suggest Edits (and therefore might have an unpleasant surprise if you were under the context limit but Suggest Edits failed with a "too much context" error message anyway, after the Suggest Edits context got added for you behind the scenes). However, in retrospect this design created more [confusion in the common case](https://github.com/zed-industries/zed/pull/19900#issuecomment-2453456569) because it made it look like more context had been actually consumed than what was actually consumed. This does raise a potential design question for the future: the Suggest Edits button adds more context at the last minute without ever communicating that it's going to do that. In the meantime it seems best to go back to the less-confusing way of reporting the token counts, especially since only users of the experimental flag could possibly press Suggest Edits anyway. Release Notes: - Fixed issue where initial token count was over-reported as 1.3k instead of 3 (for the context string "You").	2024-11-04 15:40:10 -05:00
Boris Cherny	b87c4a1e13	assistant: Add health telemetry (#19928 ) This PR adds a bit of telemetry for Anthropic models, in order to understand model health. With this logging, we can monitor and diagnose dips in performance, for example due to model rollouts. Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2024-10-31 16:21:26 -07:00
Richard Feldman	c564a4a26c	Require /file or /tab when using Suggest Edits (#19960 ) Now if you try to do Suggest Edits without a file context, you see this (and it doesn't run the query). <img width="635" alt="Screenshot 2024-10-30 at 10 51 24 AM" src="https://github.com/user-attachments/assets/a3997ba6-98a9-4bfa-81b6-1d8579c26fd7"> Release Notes: - N/A --------- Co-authored-by: Antonio <antonio@zed.dev>	2024-10-30 11:38:43 -04:00
Nathan Sobo	cfa20ff221	Sketch in assistant edit button (#19705 ) Add an edit button to the assistant. This is totally hacked in for now, just to see how this would feel rendered simply in the UI. ![CleanShot 2024-10-24 at 16 26 14@2x](https://github.com/user-attachments/assets/e630d078-78b7-42d7-93f1-cf61c00bd20e) cc @as-cii @danilo-leal Release Notes: - N/A --------- Co-authored-by: Danilo Leal <67129314+danilo-leal@users.noreply.github.com> Co-authored-by: Richard Feldman <oss@rtfeldman.com>	2024-10-29 13:21:10 -04:00
Marshall Bowers	d30361537e	assistant: Update `SlashCommand` trait with streaming return type (#19652 ) This PR updates the `SlashCommand` trait to use a streaming return type. This change is just at the trait layer. The goal here is to decouple changing the trait's API while preserving behavior on either side. The `SlashCommandOutput` type now has two methods for converting to and from a stream to use in cases where we're not yet doing streaming. On the `SlashCommand` implementer side, the implementer can call `to_event_stream` to produce a stream of events based off the `SlashCommandOutput`. On the slash command consumer side we use `SlashCommandOutput::from_event_stream` to convert a stream of events back into a `SlashCommandOutput`. The `/file` slash command has been updated to emit `SlashCommandEvent`s directly in order for it to work properly. Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev>	2024-10-23 21:26:50 -04:00
Marshall Bowers	9c0dba4ce1	Add a `SlashCommandResult` type alias (#19633 ) This PR adds a new `SlashCommandResult` type alias. We're going to be changing what slash commands can return in order to support streaming, so having this type alias in place will make that switch a bit more neat. Release Notes: - N/A	2024-10-23 14:32:43 -04:00
Adam Wolff	680b3dd80b	Refine AI context summary prompt (#19530 ) Release Notes: - Improved prompt for generating context editor summaries.	2024-10-22 11:02:04 -04:00
Max Brunsfeld	411f64b374	Restructure assistant edits to show all changes in a proposed-change editor (#18240 ) This changes the `/workflow` command so that instead of emitting edits in separate steps, the user is presented with a single tab, with an editable diff that they can apply to the buffer. Todo * Assistant panel * [x] Show a patch title and a list of changed files in a block decoration * [x] Don't store resolved patches as state on Context. Resolve on demand. * [ ] Better presentation of patches in the panel * [ ] Show a spinner while patch is streaming in * Patches * [x] Preserve leading whitespace in new text, auto-indent insertions * [x] Ensure patch title is very short, to fit better in tab * [x] Improve patch location resolution, prefer skipping whitespace over skipping `}` * [x] Ensure patch edits are auto-indented properly * [ ] Apply `Update` edits via a diff between the old and new text, to get fine-grained edits. * Proposed changes editor * [x] Show patch title in the tab * [x] Add a toolbar with an "Apply all" button * [x] Make `open excerpts` open the corresponding location in the base buffer (https://github.com/zed-industries/zed/pull/18591) * [x] Add an apply button above every hunk (https://github.com/zed-industries/zed/pull/18592) * [x] Expand all diff hunks by default (https://github.com/zed-industries/zed/pull/18598) * [x] Fix https://github.com/zed-industries/zed/issues/18589 * [x] Syntax highlighting doesn't work until the buffer is edited (https://github.com/zed-industries/zed/pull/18648) * [x] Disable LSP interaction in Proposed Changes editor (https://github.com/zed-industries/zed/pull/18945) * [x] No auto-indent? (https://github.com/zed-industries/zed/pull/18984) * Prompt * [ ] make sure old_text is unique Release Notes: - N/A --------- Co-authored-by: Marshall Bowers <elliott.codes@gmail.com> Co-authored-by: Antonio <antonio@zed.dev> Co-authored-by: Richard <richard@zed.dev> Co-authored-by: Marshall <marshall@zed.dev> Co-authored-by: Nate Butler <iamnbutler@gmail.com> Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: Richard Feldman <oss@rtfeldman.com>	2024-10-17 13:18:13 -04:00
Shish	f1c45d988e	collab: Remove dependency on X11 (#19079 ) collab: Remove dependency on X11 I'm not sure if this is the best solution (perhaps pulling `LanguageName` into a separate `language_types` crate would be better...?) - but it massively reduces build time / dependencies / size and means that the collab server no longer requires X11 libraries to be installed. tl;dr: `telemetry_events` requires the `language` crate, and the language crate requires a whole ton of extra stuff. Since telemetry_events only uses `language` for a single type definition (`LanguageName`, aka `String`), we can cut all of these out by using the base `String` type (This doesn't seem too terrible, given that all other telemetry fields are using basic datatypes like String as opposed to more strongly-typed variants). FYI the dependency tree for "why does collab need X11 libraries??" looks like this: ``` collab \- telemetry_events \- language |- gpui |- fuzzy | \- gpui |- git | \- gpui |- lsp | |- gpui | \- release_channel | \- gpui |- settings | |- fs | | \- gpui | \- gpui |- task | \- gpui \- theme \- gpui ``` Release Notes: - N/A	2024-10-11 13:28:34 -04:00
Marshall Bowers	84b61c8b1a	assistant: Add support for displaying billing-related errors (#19082 ) This PR adds support to the assistant for displaying billing-related errors. Pulling this out of #19081 to make it easier to cherry-pick. Release Notes: - N/A Co-authored-by: Antonio <antonio@zed.dev> Co-authored-by: Richard <richard@zed.dev>	2024-10-11 13:22:45 -04:00
Piotr Osiewicz	03c84466c2	chore: Fix some violations of 'needless_pass_by_ref_mut' lint (#18795 ) While this lint is allow-by-default, it seems pretty useful to get rid of mutable borrows when they're not needed. Closes #ISSUE Release Notes: - N/A	2024-10-07 01:29:58 +02:00
Boris Cherny	01ad22683d	telemetry: Add `language_name` and `model_provider` (#18640 ) This PR adds a bit more metadata for assistant logging. Release Notes: - Assistant: Added `language_name` and `model_provider` fields to telemetry events. --------- Co-authored-by: Marshall Bowers <elliott.codes@gmail.com> Co-authored-by: Max <max@zed.dev>	2024-10-04 14:37:27 -04:00
Marshall Bowers	e3a6f89e2d	Make `report_assistant_event` take an `AssistantEvent` struct (#18741 ) This PR makes the `report_assistant_event` method take an `AssistantEvent` struct instead of all of the struct fields as individual parameters. Release Notes: - N/A	2024-10-04 13:19:18 -04:00
Max Brunsfeld	743feb98bc	Add the ability to propose changes to a set of buffers (#18170 ) This PR introduces functionality for creating branches of buffers that can be used to preview and edit change sets that haven't yet been applied to the buffers themselves. Release Notes: - N/A --------- Co-authored-by: Marshall Bowers <elliott.codes@gmail.com> Co-authored-by: Marshall <marshall@zed.dev>	2024-09-20 18:28:50 -04:00
Max Brunsfeld	e309fbda2a	Add a slash command for automatically retrieving relevant context (#17972 ) * [x] put this slash command behind a feature flag until we release embedding access to the general population * [x] choose a name for this slash command and name the rust module to match Release Notes: - N/A --------- Co-authored-by: Jason <jason@zed.dev> Co-authored-by: Richard <richard@zed.dev> Co-authored-by: Jason Mancuso <7891333+jvmncs@users.noreply.github.com> Co-authored-by: Richard Feldman <oss@rtfeldman.com>	2024-09-20 18:09:18 -04:00
Roy Williams	5905fbb9ac	Allow Anthropic custom models to override temperature (#18160 ) Release Notes: - Allow Anthropic custom models to override "temperature" This also centralized the defaulting of "temperature" to be inside of each model's `into_x` call instead of being sprinkled around the code.	2024-09-20 14:59:12 -06:00
Marshall Bowers	1fc391f696	Make `Buffer::apply_ops` infallible (#18089 ) This PR makes the `Buffer::apply_ops` method infallible for `text::Buffer` and `language::Buffer`. We discovered that `text::Buffer::apply_ops` was only fallible due to `apply_undo`, which didn't actually need to be fallible. Release Notes: - N/A	2024-09-19 13:14:15 -04:00
Conrad Irwin	b43b800a54	More assistant events (#18032 ) Release Notes: - N/A	2024-09-18 18:07:39 -06:00
Antonio Scandurra	54b8232be2	Introduce a new `/delta` command (#17903 ) Release Notes: - Added a new `/delta` command to re-insert changed files that were previously included in a context. --------- Co-authored-by: Roy <roy@anthropic.com>	2024-09-17 08:47:08 -06:00
Conrad Irwin	e66ea9e5d4	Fix renames over language server for SSH remoting (#17897 ) Release Notes: - ssh remoting: Fix rename over language server --------- Co-authored-by: Mikayla <mikayla@zed.dev> Co-authored-by: Max <max@zed.dev>	2024-09-16 16:20:17 -06:00
Marshall Bowers	3ff81c2e86	assistant: Simplify image insertion (#17668 ) This PR simplifies how images are inserted into the context editor. We don't need to hold the `images` in a `HashMap` on the `Context`, as we were only inserting them to pull them out again. Release Notes: - N/A	2024-09-10 17:37:26 -04:00
Marshall Bowers	a23e381096	assistant: Pass up tool results in LLM request messages (#17656 ) This PR makes it so we pass up the tool results in the `tool_results` field in the request message to the LLM. This required reworking how we track non-text content in the context editor. We also removed serialization of images in context history, as we were never deserializing it, and thus it was unneeded. Release Notes: - N/A --------- Co-authored-by: Antonio <antonio@zed.dev>	2024-09-10 15:25:57 -04:00
Piotr Osiewicz	095a08d9c8	chore: Another round of style lints fixes (#17519 ) Closes #ISSUE Release Notes: - N/A	2024-09-07 02:36:55 +02:00
Marshall Bowers	b8d3af35fd	assistant: Insert creases for tool output (#17464 ) This PR makes it so we insert creases for tool output. Release Notes: - N/A	2024-09-05 18:23:27 -04:00
Marshall Bowers	01525f17fa	assistant: Add basic tool invocation (#17368 ) This PR adds the initial groundwork for invoking tools in response to tool uses from the model. Tool uses are run when the model responds with a `stop_reason` of `tool_use`. Currently the tool results are just inserted as text into the user message. We'll want to include these as `tool_result` content on the message, but Claude seems to understand it regardless. Release Notes: - N/A	2024-09-04 14:32:20 -04:00
Marshall Bowers	f38956943b	assistant: Propagate LLM stop reason upwards (#17358 ) This PR makes it so we propagate the `stop_reason` from Anthropic up to the Assistant so that we can take action based on it. The `extract_content_from_events` function was moved from `anthropic` to the `anthropic` module in `language_model` since it is more useful if it is able to name the `LanguageModelCompletionEvent` type, as otherwise we'd need an additional layer of plumbing. Release Notes: - N/A	2024-09-04 12:31:10 -04:00
Marshall Bowers	e81b484bf2	assistant: Add tool registry (#17331 ) This PR adds a tool registry to hold tools that can be called by the Assistant. Currently we just have a `now` tool for retrieving the current datetime. This is all behind the `assistant-tool-use` feature flag which currently needs to be explicitly opted-in to in order for the LLM to see the tools. Release Notes: - N/A	2024-09-03 19:14:36 -04:00
Marshall Bowers	c2448e1673	assistant: Insert creases for tool uses (#17330 ) This PR makes it so we create creases for each of the tool uses in the context editor. <img width="1290" alt="Screenshot 2024-09-03 at 5 37 33 PM" src="https://github.com/user-attachments/assets/94e943fd-3f05-4bc4-9672-94bff42ec500"> Release Notes: - N/A	2024-09-03 17:52:52 -04:00
Marshall Bowers	452272e5df	assistant: Stream tool uses as structured data (#17322 ) This PR adjusts the approach we use for encoding tool uses in the completion response to use a structured format rather than simply injecting it into the response stream as text. In #17170 we would encode the tool uses as XML and insert them as text. This would require then re-parsing the tool uses out of the buffer in order to use them. The approach taken in this PR is to make `stream_completion` return a stream of `LanguageModelCompletionEvent`s. Each of these events can be either text, or a tool use. A new `stream_completion_text` method has been added to `LanguageModel` for scenarios where we only care about textual content (currently, everywhere that isn't the Assistant context editor). Release Notes: - N/A	2024-09-03 15:04:51 -04:00
Max Brunsfeld	b41ddbd018	Have models indicate code locations in workflows using textual search, not symbol names (#17282 ) Release Notes: - N/A --------- Co-authored-by: Antonio Scandurra <me@as-cii.com>	2024-09-02 18:20:05 -07:00
Marshall Bowers	68ea661711	assistant: Add foundation for receiving tool uses from Anthropic models (#17170 ) This PR updates the Assistant with support for receiving tool uses from Anthropic models and capturing them as text in the context editor. This is just laying the foundation for tool use. We don't yet fulfill the tool uses, or define any tools for the model to use. Here's an example of what it looks like using the example `get_weather` tool from the Anthropic docs: <img width="644" alt="Screenshot 2024-08-30 at 1 51 13 PM" src="https://github.com/user-attachments/assets/3614f953-0689-423c-8955-b146729ea638"> Release Notes: - N/A	2024-08-30 14:05:55 -04:00
Marshall Bowers	89487772b0	assistant: Remove outdated comment (#17105 ) This used to appear on a call to `prune_invalid_workflow_steps`, but that method doesn't exist anymore. Release Notes: - N/A	2024-08-29 14:24:08 -04:00
Max Brunsfeld	f84ef5e48a	Immediate edit step resolution (#16447 ) ## Todo * [x] Parse and present new XML output * [x] Resolve new edits to buffers and anchor ranges * [x] Surface resolution errors * [x] Steps fail to resolve because language hasn't loaded yet * [x] Treat empty `<symbol>` tag as None * [x] duplicate assists when editing steps * [x] step footer blocks can appear below the following message header block ## Release Notes: - N/A --------- Co-authored-by: Mikayla <mikayla@zed.dev> Co-authored-by: Peter <peter@zed.dev> Co-authored-by: Marshall <marshall@zed.dev> Co-authored-by: Antonio <antonio@zed.dev> Co-authored-by: Antonio Scandurra <me@as-cii.com>	2024-08-29 10:18:52 -07:00
Piotr Osiewicz	aaddb73b28	assistant: Refresh message headers only for dirty messages (#16881 ) We've noticed performance issues in long conversations with assistants; the profiles pointed to slowness in WrapMap (and indeed there were some low hanging fruits that we picked up in https://github.com/zed-industries/zed/pull/16761). That however did not fully resolve the issue, as WrapMap still cracked through in profiles; basically, the speedup I've landed has just moved the post elsewhere. The higher level issue is that we were trying to refresh message headers for all messages, irrespective of whether they actually needed to be updated. This PR fixes that by using `replace_blocks` API where possible. Release Notes: - Improved performance of Assistant Panel with long conversations.	2024-08-26 18:16:23 +02:00
Roy Williams	0042c24d3c	Simplify logic & add UI affordances to show model cache status (#16395 ) Release Notes: - Adds UI affordances to the assistant panel to show which messages have been cached - Migrate cache invalidation to be based on `has_edits_since_in_range` to be smarter and more selective about when to invalidate the cache and when to fetch. <img width="310" alt="Screenshot 2024-08-16 at 11 19 23 PM" src="https://github.com/user-attachments/assets/4ee2d111-2f55-4b0e-b944-50c4f78afc42"> <img width="580" alt="Screenshot 2024-08-18 at 10 05 16 PM" src="https://github.com/user-attachments/assets/17630a60-7b78-421c-ae39-425246638a12"> I had originally added the lightning bolt on every message and only added the tooltip warning about editing prior messages on the first anchor, but thought it looked too busy, so I settled on just annotating the last anchor.	2024-08-19 12:06:14 -07:00
Kirill Bulatov	69aae2037d	Display default prompts more elaborately (#16471 ) Show them under `User` role instead of a `System` one, and insert them expanded. Release Notes: - N/A	2024-08-19 18:44:52 +03:00
Max Brunsfeld	8841d6faad	Avoid redundant newline insertion after file command (#16419 ) Release Notes: - Fixed an issue where an extra newline was inserted after running a `/file` command in the assistant.	2024-08-17 15:10:10 -07:00
Kyle Kelley	bac39d7743	assistant: Only push text content if not empty with image content (#16270 ) If you submit an image with empty space above it and text below, it will fail with this error: ![image](https://github.com/user-attachments/assets/a4a2265e-815f-48b5-b09e-e178fce82ef7) Now instead it fails with an error about needing a message. <img width="640" alt="image" src="https://github.com/user-attachments/assets/72b267eb-b288-40a5-a829-750121ff16cc"> It will however work with text above and empty text below the image now. Release Notes: - Improved conformance with Anthropic Images in Chat Completions API	2024-08-15 22:38:52 -05:00
Roy Williams	46fb917e02	Implement Anthropic prompt caching (#16274 ) Release Notes: - Adds support for Prompt Caching in Anthropic. For models that support it this can dramatically lower cost while improving performance.	2024-08-15 22:21:06 -05:00


90 commits