This PR attaches two new properties to the `Language Model Used` event:
- `has_llm_subscription` - This will tell us if a user is a paid
subscriber.
- `max_monthly_spend_in_cents` - This will indicate what their maximum
monthly spend is set to.
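As a rough sketch of the shape (struct and field names here are illustrative, not the actual telemetry schema), the event payload gains two fields:

```rust
use serde::Serialize;

// Illustrative only: the real event definition lives in the telemetry
// code and may be structured differently.
#[derive(Serialize)]
struct LanguageModelUsedEvent {
    model: String,
    /// Whether the user is a paid subscriber.
    has_llm_subscription: bool,
    /// The user's configured maximum monthly spend, in cents.
    max_monthly_spend_in_cents: u32,
}
```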
Release Notes:
- N/A
This removes the `low_speed_timeout` setting from all providers as a
response to issue #19509.
The original `low_speed_timeout` was only added as part of #9913 because
users wanted to _get rid of timeouts_: they wanted to bump the default
timeout from 5 seconds to something much higher.
Then, in #19055, the meaning of `low_speed_timeout` changed: it was
turned into a normal `timeout`, which is a different thing and breaks
slower LLMs that don't deliver a complete response within the
configured time.
So we figured: let's remove the whole thing and replace it with a
default _connect_ timeout to make sure that we can connect to a server
in 10s, but then give the server as long as it wants to complete its
response.
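As a minimal sketch of that idea, assuming a `ureq`-style client (the actual client configuration in the codebase may differ):

```rust
use std::time::Duration;

fn build_agent() -> ureq::Agent {
    ureq::AgentBuilder::new()
        // Fail fast if we can't reach the server within 10 seconds...
        .timeout_connect(Duration::from_secs(10))
        // ...but set no read/overall timeout, so slow LLMs can stream a
        // complete response at their own pace.
        .build()
}
```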
Closes #19509
Release Notes:
- Removed the `low_speed_timeout` setting from LLM provider settings,
since it was only ever used to _increase_ the timeout to give LLMs more
time. With no other use for it, we removed the setting entirely so LLMs
get as long as they need.
---------
Co-authored-by: Antonio <antonio@zed.dev>
Co-authored-by: Peter Tripp <peter@zed.dev>
This PR updates the usage limit check to exempt Zed staff members from
usage limits.
We previously had such an exemption for the rate limits, but hadn't yet
carried it over to usage-based billing.
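A hedged sketch of the resulting check (names are illustrative, not the service's actual types):

```rust
// Illustrative only; the real enforcement lives in the LLM service.
struct User {
    is_staff: bool,
    has_llm_subscription: bool,
}

fn check_usage_limit(user: &User, spend_this_month_in_cents: u32) -> Result<(), String> {
    const FREE_TIER_MONTHLY_SPENDING_LIMIT_IN_CENTS: u32 = 1_000; // illustrative value

    // Zed staff members are exempt from usage limits entirely.
    if user.is_staff {
        return Ok(());
    }
    if spend_this_month_in_cents >= FREE_TIER_MONTHLY_SPENDING_LIMIT_IN_CENTS
        && !user.has_llm_subscription
    {
        return Err("monthly usage limit exceeded".into());
    }
    Ok(())
}
```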
Release Notes:
- N/A
This PR removes the conditional checks around the billing-related
enforcement for LLM completions.
These were just in place to prevent executing any billing code before we
had rolled it out. Now that it is rolled out, we don't need this
conditional execution anymore.
Release Notes:
- N/A
This PR removes the lifetime spending limit that was added in #16780.
We had previously added this as a way to prevent runaway usage, but now
that we have a cap on free usage per month with paid access after that,
we don't need this check anymore.
Release Notes:
- N/A
This PR adjusts the billing logic to not write any records to
`billing_events` if:
- The user is staff, as we don't want to bill staff members
- Billing is disabled (we currently enable billing based on the presence
of the Stripe API key)
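Sketched as a guard (function and parameter names are assumptions):

```rust
// Illustrative guard applied before writing to `billing_events`.
fn should_record_billing_event(user_is_staff: bool, stripe_api_key: Option<&str>) -> bool {
    // Billing counts as enabled only when a Stripe API key is configured.
    let billing_enabled = stripe_api_key.is_some();
    // Skip the write for staff (we don't bill them) and when billing is off.
    billing_enabled && !user_is_staff
}
```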
Release Notes:
- N/A
This PR adds usage-based billing for LLM interactions in the Assistant.
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Antonio <antonio@zed.dev>
Co-authored-by: Richard <richard@zed.dev>
Co-authored-by: Richard Feldman <oss@rtfeldman.com>
This PR makes the `has_llm_subscription` and
`max_monthly_spend_in_cents` fields in the `LlmTokenClaims` required.
This change will be safe to deploy in ~45 minutes.
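The ~45 minutes presumably covers the lifetime of already-issued tokens that still lack these fields. A sketch of the shape (the real `LlmTokenClaims` has more fields):

```rust
use serde::{Deserialize, Serialize};

#[derive(Serialize, Deserialize)]
struct LlmTokenClaims {
    // Previously `Option<bool>` / `Option<u32>` so that tokens issued
    // before these fields existed would still deserialize; once those
    // tokens have expired, the fields can safely be made required.
    has_llm_subscription: bool,
    max_monthly_spend_in_cents: u32,
}
```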
Release Notes:
- N/A
This PR adds a new `billing_preferences` table.
Right now there is a single preference: the maximum monthly spend for
LLM usage.
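The row shape might look roughly like this (column names are assumptions based on the description, not the actual migration):

```rust
// Illustrative model of a `billing_preferences` row.
struct BillingPreferences {
    id: i32,
    user_id: i32,
    /// The only preference so far: the cap on paid monthly LLM spend, in cents.
    max_monthly_llm_usage_spending_in_cents: i32,
}
```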
Release Notes:
- N/A
---------
Co-authored-by: Richard <richard@zed.dev>
This PR renames the `MONTHLY_SPENDING_LIMIT` constant to
`FREE_TIER_MONTHLY_SPENDING_LIMIT` to clarify its meaning.
This will help distinguish it from the user's specified limit on their
paid monthly spending.
Release Notes:
- N/A
This PR adds a new `Cents` type that can be used to represent a monetary
value in cents.
This cuts down on the primitive obsession in our billing code, where
monetary values were previously passed around as bare integers.
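A sketch of the idea behind such a newtype (the actual type in the codebase may differ in representation and API):

```rust
/// A monetary value in cents, so amounts can't be confused with other
/// bare integers (token counts, IDs, etc.).
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub struct Cents(pub u32);

impl Cents {
    pub const ZERO: Cents = Cents(0);

    pub fn saturating_sub(self, other: Cents) -> Cents {
        Cents(self.0.saturating_sub(other.0))
    }
}

impl std::ops::Add for Cents {
    type Output = Cents;

    fn add(self, other: Cents) -> Cents {
        Cents(self.0 + other.0)
    }
}
```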
Release Notes:
- N/A
This PR reworks our existing billing code in preparation for charging
based on LLM usage.
We aren't yet exercising the new billing-related code outside of
development.
There are some noteworthy changes for our existing LLM usage tracking:
- A new `monthly_usages` table has been added for tracking usage
per-user, per-model, per-month
- The per-month usage measures have been removed, in favor of the
`monthly_usages` table
- All of the per-month metrics in the Clickhouse rows have been changed
from a rolling 30-day window to a calendar month
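For the last point, a small sketch of the bucketing change (using `chrono` for illustration):

```rust
use chrono::{Datelike, Utc};

fn main() {
    // Usage is now keyed by (user, model, calendar month) rather than
    // aggregated over a trailing 30-day window.
    let now = Utc::now();
    let (year, month) = (now.year(), now.month());
    // A `monthly_usages` row would be found or created for
    // (user_id, model_id, year, month) and its counters incremented.
    println!("usage bucket: {year}-{month:02}");
}
```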
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Richard <richard@zed.dev>
Co-authored-by: Max <max@zed.dev>
This PR extends the LLM usage tracking to support tracking usage for
cache writes and reads for Anthropic models.
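A sketch of the extended usage shape; the cache-related counts mirror the `cache_creation_input_tokens` and `cache_read_input_tokens` fields reported by Anthropic's API (the service's own struct may differ):

```rust
// Illustrative: token usage now distinguishes cache writes and reads.
#[derive(Default)]
struct TokenUsage {
    input_tokens: u64,
    output_tokens: u64,
    cache_creation_input_tokens: u64, // prompt-cache writes
    cache_read_input_tokens: u64,     // prompt-cache reads
}
```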
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Antonio <antonio@zed.dev>
Replace isahc with ureq everywhere gpui is used.
This should allow us to make HTTP requests without libssl, and avoid a
long tail of panics caused by isahc.
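For reference, a minimal (illustrative) example of the ureq API now used under the hood, noting that real requests go through gpui's HTTP client abstraction:

```rust
fn fetch(url: &str) -> Result<String, Box<dyn std::error::Error>> {
    // ureq uses rustls by default, so no libssl is required.
    let body = ureq::get(url).call()?.into_string()?;
    Ok(body)
}
```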
Release Notes:
- (potentially breaking change) updated our http client
---------
Co-authored-by: Mikayla <mikayla@zed.dev>
Add `/auto` behind a feature flag that's disabled for now, even for
staff.
We've decided on a different design for context inference, but there are
parts of /auto that will be useful for that, so we want them in the code
base even if they're unused for now.
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
This PR adds a `GET /models` endpoint to the LLM service.
This endpoint returns the models that the authenticated user has access
to.
This is the first step towards populating the models for the hosted
service from the server.
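A hedged sketch of the endpoint using axum (handler and response types here are illustrative; the real handler filters models by the authenticated user's access):

```rust
use axum::{routing::get, Json, Router};
use serde::Serialize;

#[derive(Serialize)]
struct ModelsResponse {
    models: Vec<String>,
}

async fn list_models() -> Json<ModelsResponse> {
    Json(ModelsResponse {
        models: vec!["claude-3-5-sonnet".into()], // placeholder list
    })
}

fn router() -> Router {
    Router::new().route("/models", get(list_models))
}
```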
Release Notes:
- N/A
This PR adds a lifetime spending limit on LLM usage.
Exceeding this limit will prevent further use of the Zed LLM provider.
Currently the cap is $1,000.
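Conceptually (constant and error names are illustrative):

```rust
const LIFETIME_SPENDING_LIMIT_IN_CENTS: u64 = 100_000; // $1,000

fn check_lifetime_spend(lifetime_spend_in_cents: u64) -> Result<(), String> {
    if lifetime_spend_in_cents >= LIFETIME_SPENDING_LIMIT_IN_CENTS {
        // Past the cap, further use of the Zed LLM provider is blocked.
        return Err("lifetime spending limit reached".into());
    }
    Ok(())
}
```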
Release Notes:
- N/A
This PR adds additional reporting of the active user counts as separate
logs.
We were already reporting these counts on individual rate-limit
events/logs, but they seem worth reporting independently of user
activity as well.
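Something along these lines, using `tracing` (field names are assumptions):

```rust
fn report_active_users(active_user_count: usize) {
    // Emit the count as its own structured log line, rather than only as
    // a field on individual rate-limit events.
    tracing::info!(active_user_count, "active user count");
}
```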
Release Notes:
- N/A
This PR fixes an issue where the active user count was computed across
all models combined.
We now track the active user counts on a per-model basis.
Release Notes:
- N/A
This PR adds traces for when users hit LLM rate limits.
We were already emitting telemetry events for these to Clickhouse, but
it will be handy to have them available in Axiom as well.
Release Notes:
- N/A
This PR adds the `is_staff` field to the `upstream rate limit` spans.
Since we use different API keys for staff vs non-staff, it will be
useful to break down the rate limits accordingly.
Release Notes:
- N/A
This PR reworks how we do checks for model names in the LLM service.
We now normalize the model names using the models defined in the
database.
Release Notes:
- N/A
This PR updates the LLM service to include the GitHub login on its
spans.
We need to pass this information through on the LLM token, so it will
temporarily be `None` until this change is deployed and new tokens have
been issued.
Release Notes:
- N/A
This PR adds the ability to revoke access tokens for the LLM service.
There is a new `revoked_access_tokens` table that contains the
identifiers (`jti`) of revoked access tokens.
To revoke an access token, insert a record into this table:
```sql
insert into revoked_access_tokens (jti) values ('1e887b9e-37f5-49e8-8feb-3274e5a86b67');
```
We now attach the `jti` as `authn.jti` to the tracing spans so that we
can associate an access token with a given request to the LLM service.
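The verification step might look roughly like this (using sqlx for illustration; the service's actual data layer may differ):

```rust
async fn is_token_revoked(db: &sqlx::PgPool, jti: &str) -> Result<bool, sqlx::Error> {
    // A token is revoked if its `jti` appears in `revoked_access_tokens`.
    let row: Option<(String,)> =
        sqlx::query_as("select jti from revoked_access_tokens where jti = $1")
            .bind(jti)
            .fetch_optional(db)
            .await?;
    Ok(row.is_some())
}
```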
Release Notes:
- N/A
Now, when an Anthropic request is invalid or Anthropic's API is down,
we'll expose that to the user instead of just returning a generic 500.
Release Notes:
- N/A
Co-authored-by: Marshall <marshall@zed.dev>
This PR makes it so Zed staff can use a separate Anthropic API key for
the LLM service.
We also added an `is_staff` column to the `usages` table so that we can
exclude staff usage from the "active users" metrics that influence the
rate limits.
Release Notes:
- N/A
---------
Co-authored-by: Max <max@zed.dev>
This PR makes it so hitting upstream rate limits from Anthropic results
in an HTTP 429 response instead of an HTTP 500.
To do this we need to surface structured errors out of the `anthropic`
crate.
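Sketching the shape of that mapping (the enum here is illustrative, not the crate's actual error type):

```rust
enum AnthropicError {
    // Upstream returned 429 / told us to slow down.
    RateLimit { retry_after_secs: Option<u64> },
    // Any other structured API error from Anthropic.
    ApiError { status: u16, message: String },
}

fn response_status(error: &AnthropicError) -> u16 {
    match error {
        AnthropicError::RateLimit { .. } => 429, // surface as Too Many Requests
        AnthropicError::ApiError { .. } => 500,
    }
}
```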
Release Notes:
- N/A
This PR makes it so staff members will be exempt from rate limiting by
the LLM service.
This is just a temporary measure until we can tweak the rate-limiting
heuristics.
Staff members are still subject to upstream LLM provider rate limits.
Release Notes:
- N/A