Yehowshua/ZIm - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
Marshall Bowers	96bcceed40	collab: Add traces for user LLM rate limits (#16610 ) This PR adds traces for when users hit LLM rate limits. We were already emitting telemetry events for these to Clickhouse, but it will be handy to have them available in Axiom as well. Release Notes: - N/A	2024-08-21 15:13:55 -04:00
Marshall Bowers	d274be67d6	Mark the `user-backfiller` secret as optional	2024-08-21 13:25:05 -04:00
Marshall Bowers	19f0c4af6d	collab: Update user backfiller to be mindful of GitHub rate limits (#16602 ) This PR updates the user backfiller to be mindful of GitHub rate limits and back off when rate-limited. Release Notes: - N/A	2024-08-21 13:23:24 -04:00
Marshall Bowers	8a5fcc2c22	collab: Backfill `github_user_created_at` on users (#16600 ) This PR adds a backfiller to backfill the `github_user_created_at` column on users. Release Notes: - N/A	2024-08-21 12:38:51 -04:00
Marshall Bowers	395a68133d	Add Postgrest to Docker Compose (#16498 ) This PR adds two Postgrest containers—one for the app database and one for the LLM database—to the Docker Compose cluster. Also fixed an issue where `postgres_app.conf` and `postgres_llm.conf` had been switched. Release Notes: - N/A	2024-08-19 20:50:45 -04:00
Max Brunsfeld	b5bd8a5c5d	Add logic for closed beta LLM models (#16482 ) Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev>	2024-08-19 11:09:52 -07:00
Marshall Bowers	de41c151c8	collab: Add `is_staff` to upstream rate limit spans (#16463 ) This PR adds the `is_staff` field to the `upstream rate limit` spans. Since we use different API keys for staff vs non-staff, it will be useful to break down the rate limits accordingly. Release Notes: - N/A	2024-08-19 10:15:25 -04:00
Joseph T. Lyons	ebecd7e65f	Fix issue with fetching users in seed script (#16393 ) Release Notes: - N/A	2024-08-16 21:51:51 -04:00
Marshall Bowers	3d997e5fd6	collab: Add `is_staff` to spans (#16389 ) This PR adds the `is_staff` field to our LLM spans so that we can distinguish between staff and non-staff traffic. Release Notes: - N/A	2024-08-16 18:42:44 -04:00
Max Brunsfeld	1b1070e0f7	Add tracing needed for LLM rate limit dashboards (#16388 ) Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev>	2024-08-16 17:52:31 -04:00
Joseph T. Lyons	9ef3306f55	Add feature flags to seed script (#16385 ) Release Notes: - N/A	2024-08-16 17:08:44 -04:00
Marshall Bowers	35cd397a40	collab: Allow enabling feature flags for all users (#16372 ) This PR adds a new `enabled_for_all` column to the `feature_flags` table to allow enabling a feature flag for all users. Release Notes: - N/A	2024-08-16 15:17:03 -04:00
Marshall Bowers	a9441879c3	collab: Fix writing LLM rate limit events to Clickhouse (#16367 ) This PR fixes the writing of LLM rate limit events to Clickhouse. We had a typo in the table name: `llm_rate_limits` instead of `llm_rate_limit_events`. I also extracted a helper function to write to Clickhouse so we can use it anywhere we need to. Release Notes: - N/A	2024-08-16 14:03:34 -04:00
Marshall Bowers	7a5acc0b0c	collab: Rework model name checks (#16365 ) This PR reworks how we do checks for model names in the LLM service. We now normalize the model names using the models defined in the database. Release Notes: - N/A	2024-08-16 13:54:28 -04:00
Marshall Bowers	583959f82a	collab: Add support for reading boolean values from `.env.toml` (#16317 ) This PR adds support for reading boolean values from `.env.toml`, since it wasn't supported previously. Release Notes: - N/A	2024-08-15 17:07:17 -04:00
Marshall Bowers	9233418cb8	collab: Attach GitHub login to LLM spans (#16316 ) This PR updates the LLM service to include the GitHub login on its spans. We need to pass this information through on the LLM token, so it will temporarily be `None` until this change is deployed and new tokens have been issued. Release Notes: - N/A	2024-08-15 17:06:20 -04:00
Marshall Bowers	5e05821d18	collab: Attach `user_id` to LLM spans (#16311 ) This PR updates the LLM service to attach the user ID to the spans. Release Notes: - N/A	2024-08-15 15:49:12 -04:00
Max Brunsfeld	6b7664ef4a	Fix bugs preventing non-staff users from using LLM service (#16307 ) - db deadlock in GetLlmToken for non-staff users - typo in allowed model name for non-staff users Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev> Co-authored-by: Joseph <joseph@zed.dev>	2024-08-15 11:21:19 -07:00
Marshall Bowers	b4c22cc861	collab: Add ability to revoke LLM service access tokens (#16143 ) This PR adds the ability to revoke access tokens for the LLM service. There is a new `revoked_access_tokens` table that contains the identifiers (`jti`) of revoked access tokens. To revoke an access token, insert a record into this table: ```sql insert into revoked_access_tokens (jti) values ('1e887b9e-37f5-49e8-8feb-3274e5a86b67'); ``` We now attach the `jti` as `authn.jti` to the tracing spans so that we can associate an access token with a given request to the LLM service. Release Notes: - N/A	2024-08-12 21:47:05 -04:00
Max Brunsfeld	dbcd06642c	Track lifetime spending for each user and model (#16137 ) Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-12 20:15:26 -04:00
Marshall Bowers	8a148f3a13	Add feature-flagged access to LLM service (#16136 ) This PR adds feature-flagged access to the LLM service. We've repurposed the `language-models` feature flag to be used for providing access to Claude 3.5 Sonnet through the Zed provider. The remaining RPC endpoints that were previously behind the `language-models` feature flag are now behind a staff check. We also put some Zed Pro related messaging behind a feature flag. Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev>	2024-08-12 18:13:40 -04:00
Marshall Bowers	98516b5527	collab: Restrict usage of the LLM service to accounts older than 30 days (#16133 ) This PR restricts usage of the LLM service to accounts older than 30 days. We now store the GitHub user's `created_at` timestamp to check the GitHub account age. If this is not set—which it won't be for existing users—then we use the `created_at` timestamp in the Zed database. Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev>	2024-08-12 17:27:21 -04:00
Max Brunsfeld	a3c79218c4	Report telemetry events for rate limit errors (#16130 ) clickhouse telemetry schema: ``` CREATE TABLE default.llm_rate_limit_events ( `time` DateTime64(3), `user_id` Int32, `is_staff` Bool, `plan` LowCardinality(String), `model` String, `provider` LowCardinality(String), `usage_measure` LowCardinality(String), `requests_this_minute` UInt64, `tokens_this_minute` UInt64, `tokens_this_day` UInt64, `max_requests_per_minute` UInt64, `max_tokens_per_minute` UInt64, `max_tokens_per_day` UInt64, `users_in_recent_minutes` UInt64, `users_in_recent_days` UInt64 ) ORDER BY tuple() ``` Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-12 16:31:11 -04:00
Max Brunsfeld	1674e12ccb	Expose anthropic API errors to the client (#16129 ) Now, when an anthropic request is invalid or anthropic's API is down, we'll expose that to the user instead of just returning a generic 500. Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-12 13:11:48 -07:00
Marshall Bowers	f3ec8d425f	collab: Use a separate Anthropic API key for Zed staff (#16128 ) This PR makes it so Zed staff can use a separate Anthropic API key for the LLM service. We also added an `is_staff` column to the `usages` table so that we can exclude staff usage from the "active users" metrics that influence the rate limits. Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev>	2024-08-12 15:20:34 -04:00
Marshall Bowers	ebdde5994d	collab: Don't issue LLM API tokens if the user has not accepted the ToS (#16123 ) This PR adds a check to the LLM API token issuance to ensure that we only issue tokens to users that have accepted the terms of service. Release Notes: - N/A	2024-08-12 14:10:08 -04:00
Marshall Bowers	ebdb755fef	Surface upstream rate limits from Anthropic (#16118 ) This PR makes it so hitting upstream rate limits from Anthropic results in an HTTP 429 response instead of an HTTP 500. To do this we need to surface structured errors out of the `anthropic` crate. Release Notes: - N/A	2024-08-12 11:59:24 -04:00
Thorsten Ball	fbb533b3e0	assistant: Require user to accept TOS for cloud provider (#16111 ) This adds the requirement for users to accept the terms of service the first time they send a message with the Cloud provider. Once this is out and in a nightly, we need to add the check to the server side too, to authenticate access to the models. Demo: https://github.com/user-attachments/assets/0edebf74-8120-4fa2-b801-bb76f04e8a17 Release Notes: - N/A	2024-08-12 17:43:35 +02:00
Marshall Bowers	f952126319	collab: Remove LLM completions over RPC (#16114 ) This PR removes the LLM completion messages from the RPC protocol, as these now go through the LLM service as of #16113. Release Notes: - N/A	2024-08-12 10:08:56 -04:00
Marshall Bowers	3140d6ce8c	collab: Temporarily bypass LLM rate limiting for staff (#16089 ) This PR makes it so staff members will be exempt from rate limiting by the LLM service. This is just a temporary measure until we can tweak the rate-limiting heuristics. Staff members are still subject to upstream LLM provider rate limits. Release Notes: - N/A	2024-08-11 14:41:49 -04:00
Max Brunsfeld	33e120d964	Capture telemetry data on per-user monthly LLM spending (#16050 ) Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev>	2024-08-09 16:38:37 -07:00
Max Brunsfeld	8688b2ad19	Add telemetry for LLM usage (#16049 ) Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-09 18:15:57 -04:00
Max Brunsfeld	423c7b999a	Larger rate limit integers (#16047 ) Tokens per day may exceed the range of Postgres's 32-bit `integer` data type. Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-09 14:07:49 -07:00
Max Brunsfeld	fbebb73d7b	Use LLM service for tool call requests (#16046 ) Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev>	2024-08-09 16:22:58 -04:00
Max Brunsfeld	d96afde5bf	Avoid `insert ... on conflict` on startup (#16045 ) These queries advance the id sequence even when there's nothing to insert. Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-09 15:32:11 -04:00
Max Brunsfeld	b1c69c2178	Fix usage recording in llm service (#16044 ) Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-09 11:48:18 -07:00
Peter Tripp	eb3c4b0e46	Docs Party 2024 (#15876 ) Co-authored-by: Raunak Raj <nkray21111983@gmail.com> Co-authored-by: Thorsten Ball <mrnugget@gmail.com> Co-authored-by: Bennet <bennet@zed.dev> Co-authored-by: Marshall Bowers <elliott.codes@gmail.com> Co-authored-by: Joseph T Lyons <JosephTLyons@gmail.com> Co-authored-by: Mikayla <mikayla@zed.dev> Co-authored-by: Jason <jason@zed.dev> Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Marshall <marshall@zed.dev> Co-authored-by: Nathan Sobo <nathan@zed.dev> Co-authored-by: Jason Mancuso <7891333+jvmncs@users.noreply.github.com> Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com>	2024-08-09 13:37:54 -04:00
Marshall Bowers	49f760eeda	collab: Set `LLM_DATABASE_MAX_CONNECTIONS` (#16035 ) This PR updates the collab template to set the `LLM_DATABASE_MAX_CONNECTIONS` environment variable for the LLM service. Release Notes: - N/A	2024-08-09 11:16:02 -04:00
Max Brunsfeld	225726ba4a	Remove code paths that skip LLM db in prod (#16008 ) Release Notes: - N/A	2024-08-09 10:41:50 -04:00
Max Brunsfeld	240b7c641c	Fix llm queries (#16006 ) Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev>	2024-08-08 17:21:38 -07:00
Max Brunsfeld	06625bfe94	Apply rate limits in LLM service (#15997 ) Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev> Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>	2024-08-08 15:46:33 -07:00
Bennet Bo Fenner	514b79e461	collab: Always use newest anthropic model version (#15978 ) When Anthropic releases a new version of their models, Zed AI users should always get access to the new version even when using an old version of zed. Co-Authored-By: Thorsten <thorsten@zed.dev> Release Notes: - N/A Co-authored-by: Thorsten <thorsten@zed.dev>	2024-08-08 15:24:08 +02:00
Marshall Bowers	6f6eeb6595	collab: Update how mode is displayed in root endpoint (#15911 ) This PR adjusts how we display the "mode" collab is running in on the root endpoint. It's minor, but it does make things a bit cleaner. Release Notes: - N/A	2024-08-07 12:09:43 -04:00
Bennet Bo Fenner	3a52d6cc52	assistant: Limit model access for Zed AI users to Claude-3.5-sonnet (#15904 ) This prevents users from accessing other models, such as OpenAI's GPT-4 or Google's Gemini-Pro. Staff members can still access all models. Co-authored-by: Thorsten <thorsten@zed.dev> Release Notes: - N/A --------- Co-authored-by: Thorsten <thorsten@zed.dev>	2024-08-07 16:26:56 +02:00
Nathan Sobo	990774247e	Allow /workflow and step resolution prompts to be overridden (#15892 ) This will help us as we hit issues with the /workflow and step resolution. We can override the baked-in prompts and make tweaks, then import our refinements back into the source tree when we're ready. Release Notes: - N/A	2024-08-06 21:47:42 -06:00
Marshall Bowers	a54e16b7ea	collab: Add `usages` table to LLM database (#15884 ) This PR adds a `usages` table to the LLM database. We'll use this to track usage for rate-limiting purposes. Release Notes: - N/A	2024-08-06 18:40:10 -04:00
Marshall Bowers	b19f85f9b5	collab: Remove unused parameter to `run_database_migrations` (#15883 ) This PR removes the unused `ignore_checksum_mismatch` parameter to `run_database_migrations`. We were always passing `false`, which meant the behavior didn't need to be parameterized. Release Notes: - N/A	2024-08-06 17:31:52 -04:00
Marshall Bowers	7f6d0919c9	collab: Setup database for LLM service (#15882 ) This PR puts the initial infrastructure for the LLM service's database in place. The LLM service will be using a separate Postgres database, with its own set of migrations. Currently we only connect to the database in development, as we don't yet have the database set up for the staging/production environments. Release Notes: - N/A	2024-08-06 17:18:08 -04:00
Max Brunsfeld	33afbe9a94	Add LLM service to kubernetes deployment action (#15863 ) Release Notes: - N/A Co-authored-by: Marshall <marshall@zed.dev>	2024-08-06 12:35:00 -04:00
Marshall Bowers	cf5f4dddf5	Authorize access to language model providers based on country (#15859 ) This PR updates the LLM service to authorize access to language model providers based on the requester's country. We detect the country using Cloudflare's [`CF-IPCountry`](https://developers.cloudflare.com/fundamentals/reference/http-request-headers/#cf-ipcountry) header. The country code is then checked against the list of supported countries for the given LLM provider. Countries that are not supported will receive an `HTTP 451: Unavailable For Legal Reasons` response. Release Notes: - N/A	2024-08-06 11:49:04 -04:00
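
The country check described in the last entry above (cf5f4dddf5) can be pictured roughly as follows. This is only a minimal sketch, not the code from that commit: the `Access` type, the `authorize_country` function, and the idea of passing the supported countries as a `HashSet` are all hypothetical names chosen for illustration. The treatment of a missing `CF-IPCountry` header is also an assumption, since the commit message does not say what happens in that case.

```rust
use std::collections::HashSet;

/// Outcome of the (hypothetical) country check.
#[derive(Debug, PartialEq, Eq)]
enum Access {
    Allowed,
    /// Carries the HTTP status to return; 451 is
    /// "Unavailable For Legal Reasons", as named in the commit message.
    Denied { status: u16 },
}

/// Hypothetical helper, not taken from the repository.
///
/// `cf_ipcountry` is the value of the `CF-IPCountry` request header, if any.
/// `supported` is the set of country codes accepted by the chosen provider.
fn authorize_country(cf_ipcountry: Option<&str>, supported: &HashSet<&str>) -> Access {
    match cf_ipcountry {
        // The header value is compared as-is against the supported list.
        Some(code) if supported.contains(code) => Access::Allowed,
        // Assumption: an unlisted country, or a request without the header,
        // is rejected with HTTP 451.
        _ => Access::Denied { status: 451 },
    }
}
```

With a supported set containing `"US"`, a request carrying `CF-IPCountry: US` would be allowed under this sketch, while any code outside the set would map to a 451 response.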


1867 commits