zed/crates/collab/migrations
Kyle Kelley 49371b44cb
Semantic Index (#10329)
This introduces semantic indexing in Zed based on chunking text from
files in the developer's workspace and creating vector embeddings using
an embedding model. As part of this, we've created an embeddings
provider trait that allows us to work with OpenAI, a local Ollama model,
or a Zed hosted embedding.

The semantic index is built by breaking down text for known
(programming) languages into manageable chunks that are smaller than the
max token size. Each chunk is then fed to a language model to create a
high dimensional vector which is then normalized to a unit vector to
allow fast comparison with other vectors with a simple dot product.
Alongside the vector, we store the path of the file and the range within
the document where the vector was sourced from.

Zed will soon grok contextual similarity across different text snippets,
allowing for natural language search beyond keyword matching. This is
being put together both for human-based search as well as providing
results to Large Language Models to allow them to refine how they help
developers.

Remaining todo:

* [x] Change `provider` to `model` within the zed hosted embeddings
database (as its currently a combo of the provider and the model in one
name)


Release Notes:

- N/A

---------

Co-authored-by: Nathan Sobo <nathan@zed.dev>
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Conrad Irwin <conrad@zed.dev>
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
Co-authored-by: Antonio <antonio@zed.dev>
2024-04-12 11:40:59 -06:00
..
20210527024318_initial_schema.sql Add bootstrap script, avoid hard-coding zed team members 2022-08-15 13:25:31 -07:00
20210607190313_create_access_tokens.sql Rename zed-server to collab 2022-04-09 08:30:42 -06:00
20210805175147_create_chat_tables.sql Rename zed-server to collab 2022-04-09 08:30:42 -06:00
20210916123647_add_nonce_to_channel_messages.sql Rename zed-server to collab 2022-04-09 08:30:42 -06:00
20210920192001_add_interests_to_signups.sql Rename zed-server to collab 2022-04-09 08:30:42 -06:00
20220421165757_drop_signups.sql Drop signups table 2022-04-21 10:59:13 -06:00
20220505144506_add_trigram_index_to_users.sql Add fuzzy_search_users to Db trait, PostgresDb 2022-05-05 09:58:18 -06:00
20220506130724_create_contacts.sql Start work on RPC endpoints for dealing with contact requests 2022-05-06 15:44:47 -07:00
20220518151305_add_invites_to_users.sql Accept an optional email address when creating new users 2022-05-20 20:25:21 -06:00
20220523232954_allow_user_deletes.sql Update foreign key constraints to allow users to be deleted 2022-05-23 17:46:06 -06:00
20220620211403_create_projects.sql Register projects in the database and record worktree extensions 2022-06-21 10:29:26 +02:00
20220913211150_create_signups.sql Identify users in amplitude via a separate 'metrics_id' UUID 2022-09-29 13:15:21 -07:00
20220929182110_add_metrics_id.sql Identify users in amplitude via a separate 'metrics_id' UUID 2022-09-29 13:15:21 -07:00
20221111092550_reconnection_support.sql Don't drop unregistered column in reconnection support migration 2022-12-09 08:11:18 +01:00
20221125192125_add_added_to_mailing_list_to_signups.sql Add "added_to_mailing_list" column on signups table 2022-11-29 02:13:13 -05:00
20221207165001_add_connection_lost_to_room_participants.sql Move creation of room_id index into its own migration 2022-12-13 13:57:41 +01:00
20221213125710_index_room_participants_on_room_id.sql Move creation of room_id index into its own migration 2022-12-13 13:57:41 +01:00
20221214144346_change_epoch_from_uuid_to_integer.sql Allow nulls in projects.host_connection_{id,server_id} 2022-12-15 11:30:51 +01:00
20221219181850_project_reconnection_support.sql WIP 2022-12-19 20:05:00 +01:00
20230103200902_replace_is_completed_with_completed_scan_id.sql Avoid dropping is_complete column for backward compatibility 2023-01-11 13:25:02 -08:00
20230202155735_followers.sql Begin tracking follow states on collab server 2023-02-22 11:21:23 -05:00
20230508211523_add-repository-entries.sql Add postgres migration 2023-05-08 14:33:59 -07:00
20230511004019_add_repository_statuses.sql Add postgres migration 2023-05-10 17:50:35 -07:00
20230529164700_add_worktree_settings_files.sql Replicate project-specific settings when collaborating 2023-05-30 18:08:03 -07:00
20230605191135_remove_repository_statuses.sql Avoid dropping the worktree_repository_statuses table for now 2023-06-09 11:37:39 -07:00
20230616134535_add_is_external_to_worktree_entries.sql Add a bit to each entry indicating if it's outside of the worktree root 2023-06-22 10:34:28 -07:00
20230727150500_add_channels.sql Represent channel relationships using paths table 2023-08-09 10:36:27 -07:00
20230819154600_add_channel_buffers.sql Simplify buffer_operations schema 2023-08-23 18:37:01 -07:00
20230825190322_add_server_feature_flags.sql Update database and RPC to provide configured feature flags 2023-08-25 14:34:32 -07:00
20230907114200_add_channel_messages.sql Start work on restoring server-side code for chat messages 2023-09-07 16:32:49 -07:00
20230925210437_add_channel_changes.sql Avoid N+1 query for channels with notes changes 2023-10-02 15:58:34 -07:00
20230926102500_add_participant_index_to_room_participants.sql Rename color_index to participant_index 2023-09-28 11:37:22 -07:00
20231004130100_create_notifications.sql Index notifications to allow faster lookup by kind, entity id 2023-10-19 13:03:27 -07:00
20231009181554_add_release_channel_to_rooms.sql Rename release channel to enviroment 2023-10-10 13:23:03 -07:00
20231010114600_add_unique_index_on_rooms_channel_id.sql Update channel rooms to be ephemeral 2023-10-10 12:39:16 -07:00
20231011214412_add_guest_role.sql fix migration 2023-10-13 15:08:09 -06:00
20231017185833_projects_room_id_fkey_on_delete_cascade.sql update sidebar for public channels 2023-10-17 13:30:09 -06:00
20231018102700_create_mentions.sql Fix possibility of extra mention insertion on nonce collision 2023-10-18 18:04:56 -07:00
20231024085546_move_channel_paths_to_channels_table.sql Remove logic for multiple channel parents 2023-10-24 17:29:23 +02:00
20240103025509_add_role_to_room_participants.sql Plumbing to pass role for room participants 2024-01-03 19:30:32 -07:00
20240111085546_fix_column_name.sql Switch to non-destructive migration 2024-01-11 10:33:33 -08:00
20240117150300_add_impersonator_to_access_tokens.sql For impersonating access tokens, store impersonatee in the new column 2024-01-17 18:06:16 -08:00
20240122174606_add_contributors.sql Add REST APIs for getting and adding contributors 2024-01-22 10:48:33 -08:00
20240122224506_add_requires_zed_cla_column_to_channels.sql Add requires_zed_cla column to channels table 2024-01-22 16:41:24 -08:00
20240129193601_fix_parent_path_index.sql Fix slow query for fetching descendants of channels (#7008) 2024-01-29 13:24:59 -08:00
20240203113741_add_reply_to_message.sql Add the ability to reply to a message (#7170) 2024-02-06 13:22:54 -07:00
20240207041417_add_in_call_column_to_room_participants.sql single click channel (#7596) 2024-02-09 14:18:27 -07:00
20240213200201_remove_unused_room_columns.sql drop columns (#7742) 2024-02-14 14:30:48 -07:00
20240214102900_add_extensions.sql Add an extensions API to the collaboration server (#7807) 2024-02-15 12:53:57 -08:00
20240220234826_add_rate_buckets.sql Allow AI interactions to be proxied through Zed's server so you don't need an API key (#7367) 2024-03-19 19:22:26 +01:00
20240221151017_add_edited_at_field_to_channel_message.sql Channel chat: Add edit message (#9035) 2024-03-19 19:49:04 -06:00
20240226163408_hosted_projects.sql channel projects (#8456) 2024-02-26 22:15:11 -07:00
20240226164505_unique_channel_names.sql fix migration (#8451) 2024-02-26 13:50:26 -07:00
20240227215556_hosted_projects_in_projects.sql hosted projects (#8627) 2024-03-04 19:17:40 -07:00
20240307163119_denormalize_buffer_ops.sql Denormalize buffer operations (#9026) 2024-03-07 11:35:47 -07:00
20240315182903_non_null_channel_role.sql Finish migration to role instead of is_admin (#9414) 2024-03-15 13:04:48 -06:00
20240315183903_channel_parent_path_not_null.sql Finish migration to role instead of is_admin (#9414) 2024-03-15 13:04:48 -06:00
20240320124800_add_extension_schema_version.sql Add a schema to extensions, to prevent installing extensions on too old of a Zed version (#9599) 2024-03-20 17:33:26 -04:00
20240321162658_add_devservers.sql remoting (#9680) 2024-03-22 08:44:56 -06:00
20240335123500_add_extension_wasm_api_version.sql Add telemetry events for loading extensions (#9793) 2024-03-25 17:30:48 -04:00
20240402155003_add_dev_server_projects.sql WIP: remoting (#10085) 2024-04-11 15:36:35 -06:00
20240409082755_create_embeddings.sql Semantic Index (#10329) 2024-04-12 11:40:59 -06:00