Summary:
Switch derivation of `blame` to the `DerivedDataManager`.
This is mostly the same as the existing derivation implementation. The main difference is that when `blame` is derived using the backfilling config,
the unodes it depends on are also derived using the backfilling config.
Reviewed By: mitrandir77
Differential Revision: D30974102
fbshipit-source-id: 5f69f8c218806bb7606b2af4b831e2104b8440d6
Summary: Why not, right? Fixes a few build warnings that showed up while building.
Reviewed By: kulshrax
Differential Revision: D30933487
fbshipit-source-id: 318fbd2c5697914fd0bfa723e678dc710524dc02
Summary: There were already helpers to make this code less copy-pasty, this diff just uses them.
Reviewed By: markbt
Differential Revision: D30933408
fbshipit-source-id: acc27a0904425eccfc71fee884a8f2035ed0c37f
Summary:
We already have a macro to make it easier to create wire representation of hash types, let's use it on `HgId` to reduce copy-pasting.
Changes:
- Added `Ord` implementations to wire hash types, as `WireHgId` used it.
- Added `From`/`Into` implementations between `HgId` and byte arrays, which are used by the macro.
- Changed the `Debug` implementation so it prints hex instead of a raw array of bytes.
Reviewed By: krallin
Differential Revision: D30933067
fbshipit-source-id: c88911bfc91e44e07f2f658098036b766495d05f
Summary:
I imagine a pretty common case (especially for automation that's trying to keep two clones in sync) will be restoring a snapshot and then restoring another snapshot after that.
Currently, this doesn't work very well, as it fails in some (but not all) cases where there are uncommitted changes. Handling that is tedious because you need to run `hg purge && hg revert -a -C`.
This diff adds a `--clean` option to `hg snapshot restore` that will clean the working copy before updating to the given snapshot. The command will now also fail if you try to update to a snapshot while you have untracked files.
Reviewed By: markbt
Differential Revision: D30903851
fbshipit-source-id: 387eeeee882093389649dc337c861291c35f4b94
Summary:
The `backfill_batch_dangerous` method requires that the caller ensures
that all dependencies of the batch have been derived, otherwise errors,
such as mappings being written out before the things they map to, can
occur.
When the derived data manager takes over batch derivation, it will enforce this
requirement, so that it is no longer dangerous. However, the backfiller tests
were not ensuring this invariant, so they would fail with the new derivation
implementation.
Fix the tests by ensuring the parent commits are always derived before a
batch is started. The test is also extended to expose the failure mode
of accidentally deriving batch parents. This will be fixed in the next
commit.
Reviewed By: yancouto
Differential Revision: D30959132
fbshipit-source-id: 8489a5d0b375692a903854294e3810846c9e13de
Summary:
Implement `DerivedUtils` using the `DerivedDataManager`.
This is just for migration. In the future `DerivedUtils` will be replaced by the manager.
Reviewed By: yancouto
Differential Revision: D30944568
fbshipit-source-id: 32376e3b4aeb959e63f66e989a663c21dee30ba5
Summary:
Implement a new version of data derivation in the derived data manager. This is different from the old version in a few ways:
* `derived_data::BonsaiDerivable` is replaced by `derived_data_manager::BonsaiDerivable`. This trait defines both how to perform derivation and how to store and retrieve mapping values. Derivation is performed with reference to the derived data manager, rather than `BlobRepo`.
* The old `Mapping` structs and traits are replaced with a direct implementation in the derived data manager, using the `BonsaiDerivable` trait to handle the derived-data-type-specific parts.
* The new implementation assumes we will stick with parallel derivation, and doesn't implement serial derivation.
Code is copied from the `derived_data` crate, as it is intended to be a replacement once all the derived data types are migrated, and re-using code would create a circular dependency during migration.
This only covers the basic derivation implementation used during production. The derived data manager will also take over backfilling, but that will happen in a later diff.
Reviewed By: yancouto
Differential Revision: D30805046
fbshipit-source-id: b9660dd957fdf762f621b2cb37fc2eea7bf03074
Summary:
The `find_oldest_underived` method of `DerivedUtils` is used outside tests by
exactly one client (the backfiller in tailing mode). Simplify the
`DerivedUtils` trait by extracting this method from the trait, and replacing
it with a more general one that will be easier to implement in terms of the
derived data manager.
Reviewed By: yancouto
Differential Revision: D30944567
fbshipit-source-id: a1d408e091d145297241a5eebc02a87155bc3765
Summary:
Split the `BonsaiDerived` type in two:
* `BonsaiDerived` is now just the interface which is used by callers
who want to derive some derived data type. It will be implemented by
both old and new derivation.
* `BonsaiDerivedOld` is the interface that old derivation uses to
determine the default mapping for derivation. This will not be
implemented by new derivation, and will be removed once migration is
complete.
Reviewed By: yancouto
Differential Revision: D30944566
fbshipit-source-id: 5d30a44da22bcf290ed3123844eb712c7b37dea4
Summary:
The builder pattern turned out to be unnecessary, as mappings don't need to be
stored in the manager after all.
Reviewed By: StanislavGlebik
Differential Revision: D30944565
fbshipit-source-id: 4300cdcc871c89f98e42d5b47600ac640b4b94eb
Summary:
Make the derivation process for mercurial filenodes not depend on `BlobRepo`.
Instead, use the repo attributes (`RepoBlobstore` and `Filenodes`) directly.
This will allow us to migrate to using `DerivedDataManager` in preparation
for removing `BlobRepo` from derivation entirely.
The existing use of `changesets` for determining the commit's parents is
changed to use the parents from the bonsai changeset. For normal derivation,
the bonsai changeset is already loaded, so this saves a database round-trip.
For batch derivation we currently need to load the changeset, but it should
be in cache anyway, as other derived data types will also have loaded it.
We still need to keep a `BlobRepo` reference at the moment. This is because
filenodes depend on the mercurial derived data. The recursive derivation is
hidden in the call to `repo.get_hg_from_bonsai_changeset`. When derivation
is migrated to the derived data manager, we can replace this with a direct
derivation.
Reviewed By: StanislavGlebik
Differential Revision: D30765254
fbshipit-source-id: 20cc17c2eb611544869e5f1c15d858663cd60fd1
Summary:
Let's give them more descriptive names so that it's easier to understand
what's going on.
Reviewed By: markbt
Differential Revision: D31022612
fbshipit-source-id: 8e4f516f3d0b1cd661b1a8fceba80a8f85a2ed4f
Summary:
This is a new option in split_batch_in_linear_stacks - it controls whether
file changes are aggregated from all ancestors in the stack. Currently all of
our callsites want Aggregate, but in the next diff we'll add a new callsite
that doesn't.
Reviewed By: markbt
Differential Revision: D31022444
fbshipit-source-id: ce0613863855163f26ab18c7f35142ae569eb31a
Summary:
EdenFS would never log anything when mounting via NFS, let's make it more
visible and easier to grep.
Reviewed By: chadaustin
Differential Revision: D31022158
fbshipit-source-id: 99fd3a04c90526eedf9951ac7c2bcd9e18ef8953
Summary:
This relies on local changes to make it so cargo metadata actually finds this
binary; https://github.com/tokio-rs/console/pull/146 is where I'm trying to
upstream it.
Reviewed By: jsgf
Differential Revision: D30944630
fbshipit-source-id: 5d34a32a042f83eff7e7ae7445e23badf10fffe3
Summary: For the time being we don't have checksums in saved states. As a temporary workaround, add the ability to derive the checksum from the naming table.
Differential Revision: D30967637
fbshipit-source-id: 4ac34d988d08c9af6f08f7ce46206f756cf1cf0c
Summary: Watchman is a C++17 project now, so we can use std::optional.
Reviewed By: xavierd
Differential Revision: D30917549
fbshipit-source-id: 95d8ac15d4939a70347336ddfb120ab5025db993
Summary:
Having tons of booleans in a function signature can be very error prone from
the caller's perspective; passing the same information in a structure can
mitigate some of this.
Reviewed By: kmancini
Differential Revision: D30883743
fbshipit-source-id: dcf38d29bfe2cb5155879f7ae4eab5cea31f798a
Summary: Without this bit of information we can't tell where the sync came from (i.e. from which of two repos) so we can't reliably find a commit "source" for a landed commit.
Reviewed By: StanislavGlebik
Differential Revision: D30902774
fbshipit-source-id: d85d0d028fbd6bfb2d64bce89bc7934bad2e242b
Summary:
During an `eden chown`, EdenFS will try to chown both the repository and the
redirections. In some cases, chowning the redirections can both take a long
time and be unnecessary. Consider the case where some automation temporarily
chowns a repository to a service user that needs to access it, and then chowns
it back to the owner of the repository. In that case, changing the ownership
of the redirections is superfluous.
Reviewed By: mrkmndz
Differential Revision: D31010912
fbshipit-source-id: a882948005ac4fe29ff465088f196e0fc2bc10be
Summary:
This is a very basic command that uses debug-printing to display all the
request details. In the future we might want to make it more elaborate, but
as it is, it works.
Reviewed By: StanislavGlebik
Differential Revision: D30965076
fbshipit-source-id: 561c64597b94359843e575550be0ae6f39fad7bf
Summary:
This debug command will allow the user to see and interact with currently
running async requests.
Reviewed By: StanislavGlebik
Differential Revision: D30965077
fbshipit-source-id: 259f1af0eb6ade4a34f6004c7b1ad63cd5f0bc9f
Summary:
Not being able to compare these types makes it a bit hard to do experiments
and compare derivation results. They are easy to compare, so let's do it.
Reviewed By: mitrandir77
Differential Revision: D31017823
fbshipit-source-id: 6173bba53c7ee254198e023dde57564fe9c3efed
Summary:
This will be used in the next diffs to add batch derivation for unodes.
It also makes it symmetrical to `create_manifest_unode`.
Reviewed By: mitrandir77
Differential Revision: D31015719
fbshipit-source-id: 65e12901c6a004375c7c0e3b07f1632ac9c6eaa8
Summary:
In some cases (e.g. when master bookmark moves backwards) there might be
commits in segmented changelog that are not ancestors of master. When reseeding
we still want to build segments for these changesets, and this is what this
diff does (see D30898955 for more details about why we want to build segments
for these changesets).
Reviewed By: quark-zju
Differential Revision: D30996484
fbshipit-source-id: 864aaaacfc04d6169afd3d04ebcb6096ae2514e5
Summary:
In D29940980 (2e2b9755cf) we started using shlex for a redirect subprocess
command line. However, the list does not always contain strings, which
`shlex.quote` requires; my guess is that they are paths. We should `str()`
things before we `shlex.quote` them.
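The failure mode can be sketched in a few lines (the paths and argument list are illustrative, not the actual redirect command):

```python
import shlex
from pathlib import PurePosixPath

# Illustrative argument list: one element is a Path, not a str.
args = ["run", PurePosixPath("/tmp/dir with spaces/out.log")]

# Passing a Path-like object directly raises, since shlex.quote
# only accepts strings.
try:
    shlex.quote(args[1])
    raised = False
except TypeError:
    raised = True

# str()-ing each element first, as this diff does, works for all of them.
quoted = " ".join(shlex.quote(str(a)) for a in args)
```

With the `str()` call in place, `quoted` comes out shell-safe.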
Differential Revision: D31001622
fbshipit-source-id: 2a270781d7f2d84ad7a9a2f9975500b29306cfa8
Summary:
One of the largest contributors to EdenFS memory usage is the internal
IndexedLog buffers that hold data in memory until a threshold is reached. Since
the main benefit of these buffers is to utilize the disk bandwidth, very large
buffers aren't necessary and much smaller ones will be able to achieve similar
results.
A default 50MB buffer is used which will cap the memory usage to 50MB * 3:
- File IndexedLogDataStore
- Tree IndexedLogDataStore
- File LFS
The aux and history stores are also reduced down to 10MB.
Reviewed By: DurhamG
Differential Revision: D30948343
fbshipit-source-id: 74e789856ac995a5672b6aefe8a68c9580f69613
Summary:
We periodically need to dereference inodes on NFS because we get no other info
from the kernel about when we should dereference them.
This means the NFS kernel might have references to inodes after we delete them.
An unknown inode number is not a bug on NFS. It's just stale, so the error should
reflect that.
Reviewed By: xavierd
Differential Revision: D30144898
fbshipit-source-id: 3d448e94aea5acb02908ea443bcf3adae80eb975
Summary:
We periodically need to dereference inodes on NFS because we get no other info
from the kernel about when we should dereference them.
This can be disruptive to a user's workflow because open files that were rm'ed
or removed on checkout will no longer have their old content. (On a native
filesystem or FUSE, applications that had the file open prior to the removal
would still be able to access it.) For most editors this is not a problem
because they read the file on open (vim and vscode seem fine from testing).
However, folks could theoretically have a workflow this does not play well with.
Let's make it configurable how often this runs, so users can control how
much we disrupt their workflow.
Reviewed By: xavierd
Differential Revision: D30144899
fbshipit-source-id: 59cf5faea70b3aea216ca2bcb45b96e34f5e72b5
Summary:
NFSv3 has no inode invalidation flow built into the protocol. The kernel does not
send us forget messages like we get in FUSE. The kernel also does not send us
notifications when a file is closed. Thus EdenFS cannot easily tell when
all handles to a file have been closed.
As it is now, we never clean up inodes. This is bad for memory & disk usage.
We will never unload an inode, so we always keep it in memory once it's created.
Additionally, we never remove a materialized inode from the overlay. This means
we have unbounded memory and disk usage :/
We need to clean up these inodes at some point. There are a couple of high-level
options:
1. Support NFSv4. NFSv4 sends us a close message when a file handle is closed.
This would allow us to actually keep track of reference counts on an inode.
However, this is a lot of work. There are a lot of other things we would have to
support before we can move to NFSv4.
2. Run background inode cleanups.
NFSv4 is probably the right long-term solution, but for now we should be able to
get by with periodic unloads.
I considered a couple of options for unloads:
1. Unload inodes immediately when files are removed.
2. Delay cleaning up inodes until a while after they are removed. (i.e. clean
up inodes n seconds after an `unlink`, `rename`, `rmdir`, or checkout)
3. Run periodic inode unloading. (i.e. once a day unload inodes)
Option 1 feels a bit too hostile to applications that hold files open.
Option 3 means we will build up a lot of cruft over the course of the day, but is
probably the most application friendly.
I decided to try out option 2 first and see if it works well with the common
developer tools. It seems to work (see below), so I am going with it.
This diff only does inode cleanup after checkout. We might want to run inode
cleanup after unlink/rmdir as well, but this would be more expensive. Batch
unloading on checkout seems better to me and should happen frequently enough
to clean up space for people.
There is one known "broken" behavior in this diff. We unload all unlinked
inodes, which means we will erase more inodes than we should. Sometimes EdenFS
crashes or bugs out and unlinks legitimate inodes. Normally we let those live
in the overlay so we can go in and recover them. My plan to fix this is to mark
inodes for unloading instead of just unloading all unlinked inodes.
Reviewed By: xavierd
Differential Revision: D30144901
fbshipit-source-id: 345d0c04aa386e9fb2bd40906d6f8c41569c1d05
Summary:
Deleting a non-existent file is fine, and deleting a file when a directory
with the same name exists just ignores the delete.
This diff adds tests to cover these cases. Overall it seems like a bug, but I'm
not sure it's worth fixing - who knows if we have bonsai changesets that rely on
that!
Reviewed By: yancouto
Differential Revision: D30990826
fbshipit-source-id: b04992817469abe2fa82056c4fddac3689559855
Summary:
This method allows appending a value instead of just replacing it.
It will be used in the next diff when we derive manifest for a stack of commits
in one go.
Reviewed By: yancouto
Differential Revision: D30989889
fbshipit-source-id: dd9a574609b4d289c01d6eebcc6f5c76a973a96b
Summary:
The NFS protocol needs to know if a read reached the end-of-file to avoid a
subsequent read and thus reduce the chattiness of the protocol.
On top of avoiding RPC calls, this should also halve the amount of data read
from Mercurial due to the BlobCache freeing the in-memory cached blob when the
FS has read the file in its entirety. This meant that the second READ would
always force the blob to be reloaded from the Mercurial store, which would also
force that blob to be kept in memory until being evicted (due to it not being
fully read).
Reviewed By: kmancini
Differential Revision: D30871422
fbshipit-source-id: 8acf4e21ea22b2dfd7f81d2fdd1b137a6e90cc8f
Summary:
Changes:
- Limit simultaneous open git-repo objects to the number of CPUs.
- Put a semaphore limit in place so we wait inside the tokio::task domain instead of the tokio::blocking domain (the latter is more expensive and has a hard upper limit).
Reviewed By: mitrandir77
Differential Revision: D30976034
fbshipit-source-id: 3432983b5650bac6aa5178d98d8fd241398aa682
Summary:
This allows the mononoke_api user to choose whether the skiplists
should be used to speed up the ancestry checks or not.
The skiplists crate is already prepared for the situation where skiplist
entries are missing and traverses the graph then.
Reviewed By: yancouto
Differential Revision: D30958909
fbshipit-source-id: 7773487b78ac6641fa2a427c55f679b49f99ac8d
Summary:
Allow the mononoke_api user to choose whether they want
operations to be sped up using WBC or not.
Reviewed By: yancouto
Differential Revision: D30958908
fbshipit-source-id: 038cf77735e7c655f6801d714762e316b6817df5
Summary:
Some crates like mononoke_api depend on warm bookmark cache to speed up the
bookmark operations. This prevents them from being used in cases requiring
quick and low overhead startup like CLIs.
This diff makes it possible to swap out the warm bookmark cache for an
implementation that doesn't cache anything. (See next diffs to see how it's
used in mononoke_api crate).
Reviewed By: yancouto
Differential Revision: D30958910
fbshipit-source-id: 4d09367217a66f59539b566e48c8d271b8cc8c8e
Summary:
This method was added before the more generic list method was added. Let's get
rid of it for simplicity and to discourage listing all the bookmarks.
Reviewed By: yancouto
Differential Revision: D30958911
fbshipit-source-id: f4518da3f34591c313657161f69af96d15482e6c
Summary:
0.4.24 is incompatible with crates that use `deny(warnings)` on a compiler 1.55.0 or newer.
Example error:
```
error: unused borrow that must be used
--> common/rust/shed/futures_ext/src/stream/return_remainder.rs:22:1
|
22 | #[pin_project]
| ^^^^^^^^^^^^^^
|
= note: this error originates in the derive macro `::pin_project::__private::__PinProjectInternalDerive` (in Nightly builds, run with -Z macro-backtrace for more info)
```
The release notes for 0.4.28 call out this issue. https://github.com/taiki-e/pin-project/releases/tag/v0.4.28
Reviewed By: krallin
Differential Revision: D30858380
fbshipit-source-id: 98e98bcb5a6b795b93ed1efd706a1711f15c57db
Summary:
Move optional line handling logic into a separate function and simplify.
This diff is intended to be a pure refactoring with no observable changes in behavior. In particular, all the code dealing with the "optional" list appears to be dead code because if the line is optional, linematch will return "retry", so that branch is never reachable.
Reviewed By: DurhamG
Differential Revision: D30849757
fbshipit-source-id: 17283f9217466b3f85d913da66222b9a6779abe4
Summary:
This line was iterating over a list of files and looking each one up in the
manifest. This results in serial manifest reads, which can turn into
serial network requests.
Let's instead use manifest.matches() to test them all at once via the underlying
BFS, which does bulk fetching.
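The difference can be illustrated with a toy model (not the real Mercurial manifest API; `lookup`/`matches` are stand-ins that count simulated round-trips):

```python
fetch_count = 0  # counts simulated network round-trips

TOY_MANIFEST = {"a/1.txt": "n1", "a/2.txt": "n2", "b/3.txt": "n3"}

def lookup(path):
    # Per-file lookup: one round-trip per file.
    global fetch_count
    fetch_count += 1
    return TOY_MANIFEST.get(path)

def matches(paths):
    # Bulk match: a single BFS walk answers all paths at once.
    global fetch_count
    fetch_count += 1
    wanted = set(paths)
    return {p: n for p, n in TOY_MANIFEST.items() if p in wanted}

files = ["a/1.txt", "b/3.txt"]
serial = {p: lookup(p) for p in files}  # len(files) round-trips
bulk = matches(files)                   # 1 round-trip
```

Both approaches return the same mapping; the bulk path just does it in one walk instead of one fetch per file.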
Differential Revision: D30938359
fbshipit-source-id: 1af7d417288b82efdd537a4afeaf93c1b55eaf49
Summary:
Demonstrate issues with the vertex to path resolution. Basically, the vertex to
path resolution logic did not check whether the "parent of merge" being used is
actually valid (i.e. an ancestor of the provided heads) or not.
Reviewed By: DurhamG
Differential Revision: D30911150
fbshipit-source-id: 83d215910d5ba67ac0d5749927018a7aefcc6730
Summary:
The tree metadata fetching evolution goes as follow
(1) (commit, path) scs query
(2) tree manifest scs query [we are here]
(3) eden api manifest query [in development]
Option (1) is no longer used and is the only place that required the scs proxy hash.
Removing it will simplify the transition from (2) to (3) and also cleans up a bunch of unused code.
It also comes with a minor performance improvement, saving about 5% on file access time.
To be precise, this is measured by running fsprobe [this is probably too little to measure in a high-noise benchmark like running arc focus]:
```
fsprobe.sh run cat.targets --parallel 24
```
Results:
```
W/ scshash:
P24: 0.1044 0.1007 0.1005 (hot) 0.1019 avg
W/o scshash:
P24: 0.0954 0.0964 0.1008 (hot) 0.0975 avg
```
This performance improvement comes from the fact that even though the scs hash was never created or used, we still attempted to load it from the scs table, and even though this load always failed it contributed to execution time.
Reviewed By: xavierd
Differential Revision: D30942663
fbshipit-source-id: af84f1e5658e7d8d9fb6853cbb88f02b49cd050b
Summary: File access latency can actually be less than 1 ms, so it's good to show more digits.
Reviewed By: DurhamG
Differential Revision: D30942905
fbshipit-source-id: 2fc8d48dbc08c55b89d829d1474ae11c2c3df1c3
Summary:
Since fsprobe itself requires a 'plan' to run, we need a separate script to standardize the list of plans we think are relevant.
This script allows generating fsprobe plans and running them.
Reviewed By: DurhamG
Differential Revision: D30908892
fbshipit-source-id: eb722fe1f6d982e42b66614f08bc73345e04f9e6
Summary:
We got errors like:
```
error.IndexedLogError: "repo/.hg/store/lfs/pointers/meta": when reading LogMetadata
in log::OpenOptions::open(Filesystem("repo/.hg/store/lfs/pointers"))
Caused by 1 errors:
- failed to fill whole buffer
```
from Sandcastle. There seems to be no easy way to get a sample of the broken `meta`
file content. Let's include the file content to make progress on debugging.
Reviewed By: DurhamG
Differential Revision: D30939737
fbshipit-source-id: ccd77f6b67e4aaf75af2248118845fd5b3434ff1
Summary: This `allow` is no longer needed.
Reviewed By: yancouto
Differential Revision: D30859520
fbshipit-source-id: 36b810a72a28af25513404739bccf471e380cdf1
Summary: Update TreeStore to use CommonFetchState and update TreeStore and BackingStore to use the other utility types already in use for files (`StoreTree`, `FetchResults`, etc).
Reviewed By: andll
Differential Revision: D30739008
fbshipit-source-id: e210b8d76614c762ba127d5f2e26391681da004f
Summary: Adds a utility method for converting a `StoreTree` to a `manifest-tree::Entry`, which wraps an hg manifest blob and provides methods for parsing it as a tree manifest (and a `TryFrom` impl used to convert it to a pre-parsed `manifest::List`, which is used by BackingStore in the next change in this stack).
Reviewed By: andll
Differential Revision: D30859470
fbshipit-source-id: 411e80a14861e0739b0c398290055002b35e59d3
Summary: This change does not add aux data support, so for now the types are a bit useless.
Reviewed By: DurhamG
Differential Revision: D30313314
fbshipit-source-id: 11968199b12c4f870c58c7e939b5c8ed5cd9afea
Summary: More refactoring of scmstore `TreeStore`. Introducing a new `tree` submodule as we'll be adding tree-specific metrics, types, etc. soon (as currently exist for files).
Reviewed By: andll
Differential Revision: D30313460
fbshipit-source-id: f20d3ee62520b1d9ea34ad04eb1880ad9b5a00c3
Summary: Extract out `CommonFetchState` from `FileStore`'s `FetchState`. Currently, direct field access is still used for computing derivations and a few other places, but this will be changed in a later diff.
Reviewed By: DurhamG
Differential Revision: D30308289
fbshipit-source-id: 16d34904412572facc9f51cbd791e30413bfe634
Summary: Don't show progress bars for pending HTTP requests until they actually start running, so that the user always sees progress bars from active transfers.
Reviewed By: quark-zju
Differential Revision: D30914241
fbshipit-source-id: ca2f85af055dc9324123d0f9cc765f42d3b36ad4
Summary: Add a new `first_activity` event to the `Response` event listeners that fires the first time we detect nonzero progress for either uploading or downloading. This is useful for situations where requests are queued and we want to be notified when the request becomes active (e.g., to register progress bars).
Reviewed By: DurhamG
Differential Revision: D30914242
fbshipit-source-id: 83445724ed81e77ac25954b644e6bbafcbe5cadb
Summary: This adds inode number to NFS trace event so that we can use it in ActivityRecorder to show the filename of the FS request.
Reviewed By: xavierd
Differential Revision: D30849770
fbshipit-source-id: 580faf5fccb1a225399d9aec843e23eae1874e87
Summary:
We have an option on GlobFiles for listing hidden files, but we don't have a
CLI option; we default to false in the CLI. Let's pipe this option all the way
through so that we can control this flag from the CLI.
Reviewed By: xavierd
Differential Revision: D30915118
fbshipit-source-id: 28b91d4fd2dd4bdf9e342929f570f64db14e8ad0
Summary:
`eden prefetch` and `eden glob` return lists that, despite being called
"matching files", actually contain both files and directories.
In some cases, we only want the list of files, and it introduces unnecessary
overhead on our clients for them to have to stat all the files in the list to
filter out the dirs. Let's add an option to just list files.
Reviewed By: chadaustin
Differential Revision: D30816193
fbshipit-source-id: 6e264142162ce03e560c969a0c0dbbc2f418d7b9
Summary: The error message that currently exists here does not correspond to the command that was run; it's just missing the "redirect" part.
Reviewed By: xavierd
Differential Revision: D30914616
fbshipit-source-id: 866ab7d494b728af13fbb3656edb8740a399755f
Summary:
There's no real equivalent of an hg changeset for a snapshot, so let's not derive it.
Closes task T97939172
Reviewed By: liubov-dmitrieva
Differential Revision: D30902073
fbshipit-source-id: 8128597c25e12e40e719cdd7800d4b9b792391c9
Summary:
`hg snapshot info` command will be used to get information about the snapshot (similar to `hg show` for commits)
It's still not easy to do this, as we want to have derived data for snapshots, which is still unimplemented.
For now, this makes the command only check if the snapshot exists or not. In the future more functionality will be added (and likely the edenapi endpoint we query will be different).
Reviewed By: liubov-dmitrieva
Differential Revision: D30900088
fbshipit-source-id: 4dc6915d74694a03496c756f03bc073d1a0819f2
Summary: This is a similar diff to D30915090, but for EdenFS.
Differential Revision: D30915126
fbshipit-source-id: 9a718e47237924ebe20176c522a1b1193224236c
Summary:
To eliminate the need for proxy hashes, we need variable-width object
IDs. Introduce an ObjectId type much like RootId.
Reviewed By: genevievehelsel
Differential Revision: D30819412
fbshipit-source-id: 07a185ba6b866b475c92f811e70aa00a8a9f895f
Summary: As a first step to moving the repo name inside the EdenAPI client itself, add it as a (currently unused) field to the config. Later diffs will use this instead of having each method take a `repo` argument.
Reviewed By: quark-zju
Differential Revision: D30746379
fbshipit-source-id: 07957e53e940ce72f84b2297f506b796117ec46a
Summary: We use it as a unique key for the detector.
Reviewed By: ginfung
Differential Revision: D30703470
fbshipit-source-id: cb8e7dae5dc4192402530b2cfe564b86aa23c7c8
Summary:
Edenapi lookup (for file content, filenodes and trees): check all the multiplexed blobstores when we check is_present.
This will help us avoid undesired behaviour for commit cloud blobs that haven't been replicated to all blobstores. The healer currently doesn't check commit cloud blobs.
Reviewed By: StanislavGlebik
Differential Revision: D30839608
fbshipit-source-id: d13cd4500f7b14731d8b75c763c14a698399ba02
Summary:
The new debugscmstorereplay command replays scmstore fetches given an activity log created previously via the scmstore.activitylog config parameter.
Replaying activity logs may help to understand or reproduce performance issues related to file fetching. Currently the replay tool ignores all complications such as concurrent fetches or variable backends.
Differential Revision: D30288701
fbshipit-source-id: c6b24acdbd37b5a51ccba3e74e8f074062e880e5
Summary:
The new scmstore.activitylog config knob optionally specifies a file for scmstore to record fetch activity. Currently it only records file fetches, but it is intended to also record tree fetches once that is fully baked.
The purpose of the log is to record file access patterns to help debug command performance. The following commit will include a tool to replay scmstore activity from the log file.
Activity is stored in the log as newline delimited JSON objects. In addition to fetched keys, we also record the start time and duration of each fetch.
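A minimal sketch of that log format (field names here are illustrative; the real record layout may differ):

```python
import io
import json
import time

def record_fetch(fh, keys, fetch):
    # Write one newline-delimited JSON object per fetch, recording the
    # fetched keys plus the start time and duration of the fetch.
    start = time.time()
    fetch(keys)
    fh.write(json.dumps({
        "keys": keys,
        "start": start,
        "duration": time.time() - start,
    }) + "\n")

buf = io.StringIO()  # stands in for the configured log file
record_fetch(buf, ["a/file1", "a/file2"], lambda ks: None)
record_fetch(buf, ["b/file3"], lambda ks: None)

# A replay tool can read the log back one JSON object per line.
events = [json.loads(line) for line in buf.getvalue().splitlines()]
```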
Differential Revision: D30288715
fbshipit-source-id: c40177e95b1f613ebed41e50a476cbf39e6d9364
Summary:
Make it more detailed, especially about corner cases. Avoid ambiguous words
like "valid" etc.
Reviewed By: farnz
Differential Revision: D30876339
fbshipit-source-id: a45ca643c6454645f7729053a7ea5dd78016fc68
Summary:
Same fix as D30874167 (9edb2cafe7), but for hg-server. This was broken in a recent
update.
Reviewed By: yancouto
Differential Revision: D30882520
fbshipit-source-id: 7e23556f619e3ead585e9e756456f30578ff7cab
Summary:
Some time ago (see D25910464 (fca761e153)) we've started using Background session class
while deriving data. This was done to avoid overloading blobstore sync queue - if Background
session class is set then multiplex blobstore waits for all blobstores to
finish instead of writing to the blobstore sync queue right away. However if any of the
blobstores fails then we start writing to the blobstore sync queue. In theory it should have avoided the problem of overloading blobstore sync queue while having the same multiplex reliability (i.e. if only a single blobstore fails the whole multiplex put doesn't fail)
Unfortunately there was a flaw - if the put to a single blobstore wasn't
failing but was just too slow, then the whole multiplexed put operation became
too slow. This diff fixes the flaw by adding a timeout - if a multiplexed put is
taking too long then we fall back to writing entries to the blobstore sync
queue.
Note that I added a new session class - BackgroundUnlessTooSlow -
because I figured that in some cases we are OK with waiting a long time but not
writing to the sync queue. The skiplist builder might be a good example of that -
since it does overwrites, we don't want to write to the blobstore sync
queue at all, because the healer doesn't process overwrites correctly.
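The timeout-fallback idea, sketched in Python with asyncio (the real implementation is Rust; function names and timings here are made up):

```python
import asyncio

sync_queue = []  # stand-in for the blobstore sync queue

async def multiplexed_put(key, delay):
    # Stand-in for waiting on *all* blobstores to acknowledge the write.
    await asyncio.sleep(delay)
    return "all-acked"

async def put_background_unless_too_slow(key, delay, timeout=0.1):
    # Wait for the full multiplex; if it takes too long, fall back to
    # recording the write in the sync queue instead of blocking forever.
    try:
        return await asyncio.wait_for(multiplexed_put(key, delay), timeout)
    except asyncio.TimeoutError:
        sync_queue.append(key)
        return "queued"

fast = asyncio.run(put_background_unless_too_slow("blob1", delay=0.0))
slow = asyncio.run(put_background_unless_too_slow("blob2", delay=5.0))
```

Fast puts still complete without touching the queue; only the slow one falls back.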
Reviewed By: farnz
Differential Revision: D30892377
fbshipit-source-id: 69ac1795002b124e11daac13d8bfe59895191168
Summary: When specifying `HGPLAIN`, only the hash is output, which is easier for automation.
Reviewed By: StanislavGlebik
Differential Revision: D30899254
fbshipit-source-id: 32457c6b92d14305c5b0bafb1217d574ec83a85c
Summary:
I added logging in D30805504 (d5e2624fbb), however it wasn't really logging anything,
because I forgot to pass scuba sample builder to CoreContext (facepalm).
This diff fixes it.
Reviewed By: HarveyHunt
Differential Revision: D30899642
fbshipit-source-id: 6e20f1e84fc96175be8ca7a6f91c0fc61caf8e49
Summary:
It looks like the comment is misleading (we don't really derive anything in
this block, just find underived commits), and this CoreContext override
doesn't seem necessary anymore. Let's remove it.
Reviewed By: farnz
Differential Revision: D30899641
fbshipit-source-id: 2850905891a9bd8b01f3f6fa9ef15c572fc2f07a
Summary:
`createremote` is a slightly inconsistent name.
The reasoning behind it was that this command creates the snapshot on the server side only.
But since actually making the client snapshot-aware is pretty far away, I prefer to name it "create".
Reviewed By: StanislavGlebik
Differential Revision: D30871026
fbshipit-source-id: fde5d65e38249998f71e51b76ccb7d7b6b9bf24d
Summary:
This was a very well-hidden bug that I failed to notice in the integration tests.
It turns out that in serde, enum unit variants are serialized to strings, while struct variants are serialized to dictionaries.
My code assumed it was always a dictionary. And because Python is Python, it worked, as the `in` operator works for both strings and dicts. But `"Deletion" in fc` for strings checks whether "Deletion" is a substring of `fc`, which is also true for `UntrackedDeletion`, so all untracked deletions were treated like normal deletions.
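The pitfall translates directly to Rust's `str::contains`, which - like Python's `in` on strings - is substring containment. A minimal illustration (the helper names here are hypothetical):

```rust
// Buggy check, mirroring Python's `"Deletion" in fc` when fc is a string:
// substring containment matches "UntrackedDeletion" too.
fn is_deletion_buggy(fc: &str) -> bool {
    fc.contains("Deletion")
}

// Fixed check: require the exact variant name.
fn is_deletion_fixed(fc: &str) -> bool {
    fc == "Deletion"
}

fn main() {
    assert!(is_deletion_buggy("UntrackedDeletion")); // false positive
    assert!(!is_deletion_fixed("UntrackedDeletion"));
    assert!(is_deletion_fixed("Deletion"));
}
```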
Reviewed By: StanislavGlebik
Differential Revision: D30868534
fbshipit-source-id: e574de6493bcd8e8d42d6e22da4dc482d083f22d
Summary: the rage summary is getting hard to quickly parse, so this underlines each section header and unifies the underline style (with `eden stats`). This adopts the underline code from `eden du` and makes it a util function so it can be shared.
Differential Revision: D30857773
fbshipit-source-id: 66b5b06f5b0125304d45d3465a8bc2248693b791
Summary: I think it would be helpful to see the path of the inode that causes this check to fail
Reviewed By: kmancini
Differential Revision: D30880645
fbshipit-source-id: 08cad2277484568a6e325b1db7a89a9cf0fe1d3f
Summary: Refresh doesn't have any special meaning outside of packfiles, but in some contexts (like BackingStore) it is used to trigger a flush. Previously I'd implemented refresh as a no-op. With this change, refresh will trigger a flush, and if a contentstore fallback is available, will also forward the refresh call to the contentstore for packfile-specific refresh behavior.
Reviewed By: andll
Differential Revision: D30851223
fbshipit-source-id: 893b4256fd8edc0f60612e61c177d885252cb85b
Summary: This test can fail for not having the `USERNAME` environment variable when the test is run as root. Let's just skip the test when this happens, because it doesn't make sense to drop privileges as root.
Reviewed By: xavierd
Differential Revision: D30868518
fbshipit-source-id: 14ff6db218b1477f5905f2df3ad075a5ca186117
Summary:
Add an endpoint to provide repo configuration information, such as whether
segmented changelog is supported by the server or not. This helps the client
make decisions without hitting actual (expensive) endpoints, and distinguishes
this case from unrelated server errors. It would allow us to remove the
error-prone client-side config that decides whether to use segmented clone.
Reviewed By: krallin
Differential Revision: D30831346
fbshipit-source-id: 872e20a32879e075c75481f622b2a49000059d04
Summary:
In a future diff, we want an endpoint to test if segmented changelog is
supported for a repo without doing any real computation using segmented
changelog. This would be useful for the client to decide whether it can
use segmented changelog clone or not, instead of relying on fragile
per-repo configuration.
Reviewed By: farnz
Differential Revision: D30825920
fbshipit-source-id: 16dc5bf762da2d2b9cd808c129e1830285023f3d
Summary: it would be helpful to see a user's or sandcastle job's eden config, especially in the case of a gated or staged feature rollout.
Differential Revision: D30857763
fbshipit-source-id: ee2a311ee643fc9db5acef1b02017564c51d2362
Summary:
just a typo fix
Created from CodeHub with https://fburl.com/edit-in-codehub
Reviewed By: fanzeyi
Differential Revision: D30849172
fbshipit-source-id: 9779832870c909d080548ec71ecf86aa53767dbc
Summary:
This logic (a Wire and an Api object that are the same object) is used ad hoc in more places (for Vec<u8> and u32).
This diff makes it simpler by using the macro introduced before, and derives it for a lot of basic types (integers and bytes).
Reviewed By: kulshrax
Differential Revision: D30605781
fbshipit-source-id: 7520b529e52cfde0a5c5d17d91f5f85b0633fa7f
Summary:
It's nice to have these functions to open source and target repos.
Previously we always had to get the repo id first, and then call
`open_repo_internal_with_repo_id`.
Reviewed By: yancouto
Differential Revision: D30866314
fbshipit-source-id: dd74822da755de232f4701f8523088e0bb612cb9
Summary:
D30829928 (fd03bff2e2) updated some of Mononoke's integration tests to take into
account whitespace changes. However, it removed the globs from some parts of
the tests.
As the assigned port changes on each test run, the globs are required. Add them
back in, and fix up some whitespace in a test.
Reviewed By: markbt
Differential Revision: D30866884
fbshipit-source-id: 1557eee2143a2459a6412b8649e7e3dce5a607a4
Summary:
It's nice to have a tool that can quickly count and print stats about a commit. I'm
using it now to understand the performance of derived data.
Reviewed By: ahornby
Differential Revision: D30865267
fbshipit-source-id: 26b91c3c05a1c417015b5be228796589348bf064
Summary:
`rust_include_srcs` is supported on `thrift_library` as a way of including other Rust code in the generated crate, generally used to implement other traits on the generated types.
Add support for this in autocargo by copying these files into the output dir and making sure the option is specified to the thrift compiler.
Reviewed By: ahornby
Differential Revision: D30789835
fbshipit-source-id: 325cb59fdf85324dccfff20a559802c11816769f
Summary:
The default Windows encoding can't handle some unicode characters
apparently, so let's use utf-8 by default.
Reviewed By: quark-zju
Differential Revision: D30850982
fbshipit-source-id: 51a7fdf5464d075549afe4f0bcd307c0f2eb7fa0
Summary:
Add Layer impls for Box/Arc<L: Layer> and <dyn Layer>. There is also a pile of other
updates in git which haven't been published to crates.io yet, including proper
level filtering of trace events being fed into log.
Reviewed By: dtolnay
Differential Revision: D30829927
fbshipit-source-id: c01c9369222df2af663e8f8bf59ea78ee12f7866
Summary:
Bump all the versions on crates.io to highest to make migration to github
versions in next diff work.
Reviewed By: dtolnay
Differential Revision: D30829928
fbshipit-source-id: 09567c26f275b3b1806bf8fd05417e91f04ba2ef
Summary:
We don't need to pass the bubble id to the server, it can find it from the changeset id.
This fixes a TODO I added previously, and should make the `restore` command complete.
Reviewed By: ahornby
Differential Revision: D30609423
fbshipit-source-id: d1c8eb0e0556069fa408520a0aea91a0f865fbe1
Summary:
Uses the endpoint added on previous diffs to download the snapshot files to the repo, adding them correctly to the snapshot restore.
This almost completes the `snapshot restore` command; the missing piece is getting the bubble id from the snapshot hash.
Reviewed By: StanislavGlebik
Differential Revision: D30583038
fbshipit-source-id: 6549a52f767c50444c316b358d9704bc4a136934
Summary:
This adds the `downloadfiles` method to the python EdenApi wrapper.
It uses multiple calls to the endpoint added on previous diffs to download each file and place it in the repo. It also deduplicates downloads.
Reviewed By: StanislavGlebik
Differential Revision: D30582638
fbshipit-source-id: 34e864d03c0e48a7605ee8e4e92376881dbb2de9
Summary:
When using hashbinary with a removed/moved file, hg throws `TypeError: object supporting the buffer API required`. This is because we are trying to call `sha1(None)`.
This diff falls back to the `Binary file %s has changed` message when we have a removed file.
Reviewed By: quark-zju
Differential Revision: D30845897
fbshipit-source-id: a3d2b7d11d9c1ca3855140c9abd7550cf7076bbc
Summary: This adds support for FS event logging for NFS. For context, each type of event is assigned a sampling group that determines its sampling rate. In the TraceBus subscription callback, events are sent to `FsEventLogger` to be sampled and logged through `HiveLogger`.
Reviewed By: xavierd
Differential Revision: D30843863
fbshipit-source-id: 65394d31b1197efd69c7fd4c1b24562f5abd5785
Summary:
Previously, it was only possible to register event listeners for request completion on the `HttpClient` itself, rather than on individual `Request`s. This diff adds similar event listeners to `Request`s themselves, so that it's possible to register a callback that fires when any request completes, regardless of whether it was sent via an `HttpClient` or as a one-off.
This is similar to `RequestCreationEventListeners`, which run for the creation of every request, whether or not the request is associated with a client.
Notably, to avoid circular references the new event listeners take a `RequestInfo` argument instead of a `RequestContext` (since the listeners are themselves stored inside the `RequestContext`). In practice, the `RequestInfo` should contain all of the information one might want to access about the request.
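A minimal sketch of why the listeners take a plain info struct: they are stored inside the context, so handing them the context itself would create a reference cycle. All types and fields here are illustrative stand-ins, not the real `http-client` API.

```rust
use std::cell::Cell;
use std::rc::Rc;

// Plain data about a request; safe to hand to listeners.
struct RequestInfo {
    url: String,
    status: u16,
}

struct RequestContext {
    info: RequestInfo,
    // The listeners live inside the context, so they receive only the plain
    // RequestInfo, never the RequestContext that owns them: no cycle.
    on_complete: Vec<Box<dyn Fn(&RequestInfo)>>,
}

impl RequestContext {
    fn new(url: &str) -> Self {
        RequestContext {
            info: RequestInfo { url: url.to_string(), status: 0 },
            on_complete: Vec::new(),
        }
    }

    fn listen(&mut self, f: impl Fn(&RequestInfo) + 'static) {
        self.on_complete.push(Box::new(f));
    }

    fn complete(&mut self, status: u16) {
        self.info.status = status;
        for f in &self.on_complete {
            f(&self.info);
        }
    }
}

// Register a listener, complete the request, return the status it observed.
fn run_demo() -> u16 {
    let seen = Rc::new(Cell::new(0u16));
    let seen2 = seen.clone();
    let mut ctx = RequestContext::new("https://example.com");
    ctx.listen(move |info| seen2.set(info.status));
    ctx.complete(200);
    seen.get()
}

fn main() {
    assert_eq!(run_demo(), 200);
}
```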
Reviewed By: quark-zju
Differential Revision: D30831840
fbshipit-source-id: 77ca9dc5fd9f8fc5ee60319baabd77171af70d45
Summary:
The content store repair binding ate the entire repair message, making
it hard to debug when it wasn't working.
Reviewed By: quark-zju
Differential Revision: D30824740
fbshipit-source-id: 52dbfe79f2dd1568285cda63fb54cacf532aa4a1
Summary:
Make `verify` check the lazy changelog properties:
- Universal id <-> name mappings are known locally.
- Segments are sane (e.g. high-level segments are built from low-level ones and there
are no cycles).
With `--dag`, also check the graph with a newly cloned remote graph.
This just calls the verification logic added in Rust `dag` crate to do the
heavy lifting.
Differential Revision: D30820773
fbshipit-source-id: 8f62f41738c3c8e3fe88442860a83fdb4944f178
Summary:
In certain situations, users may cause EdenFS to falsely return a path-not-found result while the path is actually available. Windows will cache that, causing subsequent accesses to that file to automatically return a file-not-found error.
We currently only invalidate this negative cache during checkout and on rebooting the machine, as the cache is kept even across EdenFS restarts. In this diff, we start invalidating the negative path cache at startup, so if the user ever hits this issue, an `eden restart` will be sufficient to fix it.
Reviewed By: xavierd
Differential Revision: D30814059
fbshipit-source-id: 53283f471702762b2eed0c5d0f6a9cc49f4db739
Summary:
This adds the plumbing to download a file using the endpoint from the previous diff via the EdenApi trait, which does the actual http request.
It concats the stream into a Bytes object and returns it.
Reviewed By: StanislavGlebik
Differential Revision: D30582422
fbshipit-source-id: ed0fe5e34e3fecc6c1b26d2dceb322dfcf5f8e37
Summary:
This diff adds an endpoint `/download/file` that allows downloading a file given an upload token.
This will be used for snapshots, as we need to download the snapshot changes, and there's no way to do that right now.
Other options, and why I didn't do them:
- Using the existing `/files` endpoint: Not possible, as it needs hg filenodes and we don't have those.
- Returning the file contents in the fetch_snapshot request: Might make the response too big
- Returning just a single Bytes instead of a stream: I thought streaming would be preferred, and more future proof. In the stack I still put everything in memory in the client, but maybe in the future it should be possible to stream it directly to the file. I'm happy to remove if preferred, though.
Reviewed By: StanislavGlebik
Differential Revision: D30582411
fbshipit-source-id: f9423bc42867402d380e831bc45d3ce3b3825434
Summary: This proved useful a couple of times when folks experienced problems with the agent.
Reviewed By: ahornby
Differential Revision: D30837676
fbshipit-source-id: aec769f60a09ecb83857e6e60d49a5662b4ce0b2
Summary:
Add back the octopus merge support for revlog.
This recommits D30686451 (b13579fdf9) and D30686450 (7eb11cb392) as-is, with updates to test files.
Original commit changeset: 9f213766e7c4
Reviewed By: StanislavGlebik
Differential Revision: D30784681
fbshipit-source-id: ace0c317652ad8b657c8edd9a0130532dad53078
Summary:
As far as I could tell, this was legacy from some refactorings.
It was only used in one place, and it was easy to fix.
Also, if we really need it in the future, we can probably use `#[auto_impl]` instead of doing it manually.
Reviewed By: StanislavGlebik
Differential Revision: D30574803
fbshipit-source-id: 20715364713775818fe0e83844637f48b310d87f
Summary: createremote only worked from the root of the repo. This fixes it and adds coverage in the integration test.
Reviewed By: StanislavGlebik
Differential Revision: D30546582
fbshipit-source-id: 84aa304d346e448b44e5d7fb9e9607d84a67da25
Summary:
This adds basic logic for `snapshot restore` command.
- It updates to the parent of the snapshot
- It loads the snapshot changes
For now I did not handle changes/tracked changes, as that will need downloading the file contents, which requires a new edenapi endpoint, so I'll leave it for a future diff. It just restores your deleted files for now.
Reviewed By: StanislavGlebik
Differential Revision: D30543507
fbshipit-source-id: 080588ceff0ecd595ce739044f0d4118fb8e1a3f
Summary:
Log the sync reason for `hg cloud sync`.
This will help us investigate issues better and measure the impact of the new Eden API uploads case by case (after amend, rebase, etc.) on different platforms.
Reviewed By: yancouto
Differential Revision: D30775519
fbshipit-source-id: 696e954ec8db19226fb67ad0952e23e2b67e9931
Summary:
Put code using the usage service behind an `EDEN_HAVE_USAGE_SERVICE` macro.
Previously the C++ code was simply guarded by a `__linux__` check, and the
CMake code did not have a guard at all. This caused builds from the GitHub
repository to fail on Linux, since the code attempted to use the usage service
client which was not available.
Reviewed By: xavierd
Differential Revision: D30797846
fbshipit-source-id: 32a0905d0e1d594c3cfb04a466aea456d0bd6ca1
Summary:
In the v1 sparse config arrangement, if all rules were excludes then we
would include a default "**" rule. This was always a little confusing and caused
some weird behavior. Let's remove it from the v2 world.
This actually bit us because the fbsource_exclude profile only has excludes,
which caused it to insert a ** include, which pulled in all of fbsource. We
could fix it to only check if a profile is excludes-only once all the transitive
profiles have been loaded, but I think the cleaner fix is to remove this logic
since it's confusing and never actually used in production.
Differential Revision: D30824082
fbshipit-source-id: adcf4c820cc9f7636f79759d03fc0b387b9f55fa
Summary:
Any error inside the decode stream was being propagated up as a decoder
error. This caused higher level code to not handle certain errors appropriately.
For instance, the lfs retry logic only retries for certain classes of curl
errors. So let's propagate up HttpClientErrors as is.
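The fix can be illustrated with toy error types (these enums and names are hypothetical, not the actual crate's): the transport error variant is passed through unchanged, so retry classification still sees it.

```rust
// Hypothetical error types. The retry logic inspects the transport error,
// so the decode stream must not wrap it into a decode error.
#[derive(Debug, PartialEq)]
enum HttpClientError {
    Curl(u32), // a curl error code
}

#[derive(Debug, PartialEq)]
enum FetchError {
    Http(HttpClientError),
    Decode(String),
}

fn classify(inner: Result<Vec<u8>, HttpClientError>) -> Result<Vec<u8>, FetchError> {
    // Before the fix this arm would have been something like
    // `Err(e) => Err(FetchError::Decode(format!("{:?}", e)))`,
    // hiding the curl error code from the retry logic.
    inner.map_err(FetchError::Http)
}

fn is_retryable(err: &FetchError) -> bool {
    // Only certain transport errors are retried, e.g. a recv error.
    matches!(err, FetchError::Http(HttpClientError::Curl(56)))
}

fn main() {
    let err = classify(Err(HttpClientError::Curl(56))).unwrap_err();
    assert!(is_retryable(&err));
    assert!(!is_retryable(&FetchError::Decode("truncated frame".to_string())));
}
```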
Reviewed By: kulshrax
Differential Revision: D30798108
fbshipit-source-id: 7316f6cdc47de090c202ff6a1f28d0fba60f7a15
Summary:
The previous version had two issues:
1. It's UB to cast uninit away, as the memory may be actually uninitialized.
2. Because of the cast, the buffer was not actually written to, nor advanced after being written, causing the caller to think nothing was read.
https://docs.rs/tokio/1.11.0/tokio/io/struct.ReadBuf.html
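The second issue can be demonstrated with a tiny stand-in for `ReadBuf` (an illustrative model, not tokio's actual type): bytes copied into the backing storage stay invisible to the caller until the filled region is advanced past them.

```rust
// Minimal stand-in for tokio's ReadBuf, tracking only the filled region.
struct MiniReadBuf {
    buf: Vec<u8>,
    filled: usize,
}

impl MiniReadBuf {
    fn with_capacity(n: usize) -> Self {
        MiniReadBuf { buf: vec![0; n], filled: 0 }
    }

    // Correct pattern: copy bytes, then advance the filled cursor.
    fn put_slice(&mut self, src: &[u8]) {
        self.buf[self.filled..self.filled + src.len()].copy_from_slice(src);
        self.filled += src.len();
    }

    // Buggy pattern: write into the storage but never advance.
    fn write_without_advance(&mut self, src: &[u8]) {
        self.buf[self.filled..self.filled + src.len()].copy_from_slice(src);
    }

    fn filled(&self) -> &[u8] {
        &self.buf[..self.filled]
    }
}

fn main() {
    let mut buggy = MiniReadBuf::with_capacity(8);
    buggy.write_without_advance(b"data");
    assert_eq!(buggy.filled(), b""); // caller thinks nothing was read

    let mut fixed = MiniReadBuf::with_capacity(8);
    fixed.put_slice(b"data");
    assert_eq!(fixed.filled(), b"data");
}
```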
Reviewed By: dtolnay
Differential Revision: D30823808
fbshipit-source-id: d5f67e4c03f1d63f2241421dd35082ee96b5afd8
Summary: For some reason it got broken; we need to call `as_ref()` to properly cast the type.
Reviewed By: quark-zju
Differential Revision: D30740629
fbshipit-source-id: f49275caae9d360859e97c03709a720dabc22e9e
Summary:
LocalStore no longer special-cases Tree objects with kZeroHash
ids. Instead, unconditionally write into LocalStore with the Tree's
hash.
Reviewed By: xavierd
Differential Revision: D29155470
fbshipit-source-id: aee3840fe8dfd7aa46305b6db6f7950efb2e41d2
Summary:
In preparation for expanding to variable-width hashes, rename the
existing hash type to Hash20.
Reviewed By: genevievehelsel
Differential Revision: D28967365
fbshipit-source-id: 8ca8c39bf03bd97475628545c74cebf0deb8e62f
Summary:
Do not assume `changelog.parents` returns 2 items.
This changes the behavior for root commits. `parents()` used to return
`[repo[nullid]]`, now it returns `[]`.
Reviewed By: andll
Differential Revision: D30784684
fbshipit-source-id: 73f58c85457391fb74b96b88dc4dcb69a25e81ac
Summary:
In a future change, `ctx.parents()` returns `[]` instead of `[repo[nullid]]`
for root commits. Make the change to preserve absorb behavior.
Differential Revision: D30816385
fbshipit-source-id: afded91a6e72d4eb54faf87dcdfc52a81ea1d66f
Summary:
In a future change, `ctx.parents()` returns `[]` instead of `[repo[nullid]]`
for root commits. Make the change to preserve rebase behavior.
Differential Revision: D30816386
fbshipit-source-id: ca7c489991ae149c9640b7da0e6e54f76afbc250
Summary:
We're going to change parents() to return an empty list instead of `[nullctx]`
for roots. This change makes it more compatible with upcoming changes.
Reviewed By: andll
Differential Revision: D30787305
fbshipit-source-id: 1de523964faa64a6496a7bb0197af597e393d859
Summary: This will be used by the next change.
Reviewed By: andll
Differential Revision: D30784683
fbshipit-source-id: 59a37c5f428eaf5950584d8f17471d358bfefee7
Summary: Integrate http hash prefix lookup into the pull operation. One unfortunate change here is that if the prefix is ambiguous, we're only able to output possible full hashes as suggestions. Previously we'd also print commit log information. To retain that we'd need to add an error option to the response and have the server send back an error message with the log information or send another request to download the extra information.
Reviewed By: andll
Differential Revision: D30716050
fbshipit-source-id: 33f8bc38b0bfe7fce4ec11cd8def7feda3b3d3da
Summary:
As title, sampling group determines the sampling rate at which an FS event is logged. The higher the sampling group the more heavily its events are dropped, thus, more frequent events are assigned to the higher sampling groups.
I ran activity recorders on a few workflows, buck build, getdepts, and vscode editing and came up with the following assignment. Note that only a subset of events are assigned to a sampling group (so events not included will not be logged) as we just start to tune the sampling rates and these events should be good for a start.
```
Group1 (1/10)
FUSE_MKDIR
FUSE_RMDIR
FUSE_CREATE
FUSE_RENAME
Group2 (1/100)
FUSE_WRITE
FUSE_LISTXATTR
FUSE_SETATTR
Group3 (1/1000)
FUSE_GETXATTR
FUSE_GETATTR
FUSE_READ
FUSE_READDIR
Group4 (1/10000)
FUSE_LOOKUP
```
For reference, here are the counts of FS events of a cold buck build. The frequencies of other workflows are similar.
```
FUSE_LOOKUP 60.09 98733
FUSE_READ 12.80 21037
FUSE_GETXATTR 8.91 14645
FUSE_FORGET 8.01 13162
FUSE_GETATTR 5.55 9116
FUSE_READDIR 3.21 5270
FUSE_LISTXATTR 0.59 969
FUSE_READLINK 0.54 892
FUSE_STATFS 0.21 338
FUSE_WRITE 0.04 64
FUSE_CREATE 0.02 28
FUSE_RENAME 0.01 23
FUSE_SETATTR 0.01 13
FUSE_UNLINK 0.00 6
FUSE_RMDIR 0.00 1
FUSE_MKDIR 0.00 1
FUSE_MKNOD 0.00 1
```
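The per-group sampling decision can be sketched as follows; the counter-based 1-in-N sampler and the names here are illustrative assumptions, not the actual `FsEventLogger` implementation.

```rust
use std::collections::HashMap;

// Illustrative sampler: each event type maps to a group rate N, and the
// sampler keeps 1 out of every N occurrences. Events without an assigned
// group are dropped entirely.
struct Sampler {
    rates: HashMap<&'static str, u64>,
    counters: HashMap<&'static str, u64>,
}

impl Sampler {
    fn new(rates: &[(&'static str, u64)]) -> Self {
        Sampler {
            rates: rates.iter().cloned().collect(),
            counters: HashMap::new(),
        }
    }

    // Returns true when this occurrence of `event` should be logged.
    fn should_log(&mut self, event: &'static str) -> bool {
        let n = match self.rates.get(event) {
            Some(&n) => n,
            None => return false, // not assigned to any sampling group
        };
        let c = self.counters.entry(event).or_insert(0);
        *c += 1;
        (*c - 1) % n == 0
    }
}

fn main() {
    // Group 1 (1/10) and group 4 (1/10000), as in the table above.
    let mut s = Sampler::new(&[("FUSE_MKDIR", 10), ("FUSE_LOOKUP", 10000)]);
    let kept = (0..100).filter(|_| s.should_log("FUSE_MKDIR")).count();
    assert_eq!(kept, 10);
    assert!(!s.should_log("FUSE_FORGET")); // unassigned, never logged
}
```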
Reviewed By: xavierd
Differential Revision: D30770533
fbshipit-source-id: 90be881ddbeba2113bbb190bdb1e300a68f500a0
Summary: The new `EdenApiHandler` framework for defining EdenAPI endpoints provides a common place where responses are encoded. This diff adds automatic content compression at this point, using the received `Accept-Encoding` header from the request to determine what compression, if any, should be used. As a result, all endpoints that implement `EdenApiHandler` will get compression for free.
Reviewed By: yancouto
Differential Revision: D30553242
fbshipit-source-id: 9eda54cbf81dd1e03abec769744c96b16fad64ea
Summary:
It can sometimes be difficult to work out from the logging which commit cloud
requests came from which client repo. Previously you could often infer it from
the client identities; however, if the request is proxied, the originating hostname can be
lost, and that still doesn't handle the case where the host contains multiple
repos.
This diff adds a new `ClientInfo` struct, which is included by the client
on every `get_references` and `update_references` request. This is logged
by the service, allowing us to correlate which client it came from, and what
workspace version the client had at that time.
Reviewed By: StanislavGlebik
Differential Revision: D30697839
fbshipit-source-id: 8fe2e03f0be2f443f8ae1814f083c04ba5d1805e
Summary: It was not used, and it was hard to implement retries with it.
Reviewed By: yancouto
Differential Revision: D30716647
fbshipit-source-id: a90b629f7758486c9e526d1eaf3fd29da305f2e7
Summary:
D30704344 (5704ad51f6) upgraded tokio for the buck build. We need to do the same for
the non-buck build. This unbreaks hgbuild.
Also clean up some compiler warnings while I'm here.
Reviewed By: fanzeyi
Differential Revision: D30798315
fbshipit-source-id: 47005c7674d87196aab42b3ddf2194acced3bb6c
Summary:
We have two mode of deriving data: the "normal" way and using backfilling.
Backfilling is different from "normal" mode in that it derives a few commits at
once, and saves them all to blobstore at once.
Backfilling mode seemed to have helped us when we need to derive a lot of data
(e.g. backfill the whole repo). But
a) We don't know how much it helps, and we don't know if it depends on the repo
b) We don't know if it helps when we derive data for newly landed commits (i.e.
we use "backfill" mode in derived data tailer to derive data for latest public
commits)
So this diff adds a bit of logging to a separate scuba table so that we can get
an idea about things like:
1) How long does it take to derive a stack of commits?
2) Where do we spend most of the time (e.g. deriving, saving the blobs, saving
the mapping).
Reviewed By: mitrandir77
Differential Revision: D30805504
fbshipit-source-id: d82c905cafa87459990d74769a0dddcc91fac174
Summary:
It allows us to do 3 things:
1) Remove derive function
2) Add support for backfill mode so that we can compare perf with and without
it
3) Use all derived data types, and not just 3 of them
Reviewed By: krallin
Differential Revision: D30804258
fbshipit-source-id: 604723a3d845a60cfd94b4e090a121f5b5191536
Summary: This command can be useful to split a large bonsai commit into smaller ones.
Reviewed By: mitrandir77
Differential Revision: D30776789
fbshipit-source-id: dc56d7c51eb0e9e0988dcba868c6008ebf488927
Summary:
While we don't really need it, mononoke matches fail if it is not
present. Let's just enable it here - it's not a bad thing to initialize it.
Reviewed By: mitrandir77
Differential Revision: D30780463
fbshipit-source-id: c4199c6711ae7bd9641e9f51643b94d020051dbd
Summary: The code for accessing config fields had a lot of repetitive boilerplate. Let's move that to a helper function.
Reviewed By: andll
Differential Revision: D30785932
fbshipit-source-id: fb4d47337a27bd6e75eeb38d5a9d1de5b1fac6ce
Summary: Implement serverside graph endpoint for fetching the mapping of commits to commit parents for the missing segment of a commit graph. This implementation uses the find_commits_to_send method from the get_bundle_response library. What may be missing from pull and the old bundle protocol now is mutation markers.
Reviewed By: yancouto
Differential Revision: D30485672
fbshipit-source-id: ba3a30d9e482d60831cbe7a8e89f20dab947d9a1
Summary:
Since the find_commits_to_send method was added, common is already a
hashset, not a vector, so it doesn't need to be converted to a hashset.
Reviewed By: quark-zju
Differential Revision: D30622028
fbshipit-source-id: e5d1b6c60115d13c906b25142043652ba9e89d70
Summary:
Not flushing the data to disk makes studying performance almost impossible due
to not being able to avoid fetching from the network. By forcing a flush to
disk, we can ensure that data will always be on disk, making performance
measurement easier. This will also prevent users from re-fetching the same data
multiple times.
Reviewed By: fanzeyi
Differential Revision: D30784399
fbshipit-source-id: 0250c209b5f49f95cf2f43873573cacc661a4989
Summary:
Since this method wasn't overridden, EdenFS would never periodically flush data
to disk.
Reviewed By: fanzeyi
Differential Revision: D30784400
fbshipit-source-id: d88e535250a476582868dd82e57137a0ac38f921
Summary: Previously, there were special variants for missing and invalid URLs (since the server URL is presently the only required config option). In order to support other required config options, let's simplify the enum to just have variants for missing and invalid config fields respectively.
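The simplified shape might look like this (an illustrative sketch with made-up names; the real enum and accessor live in the EdenAPI config code and differ in detail):

```rust
// Illustrative sketch of the simplified config error enum: instead of
// URL-specific variants, any required field can be reported as missing
// or invalid.
#[derive(Debug, PartialEq)]
enum ConfigError {
    Missing(&'static str),
    Invalid(&'static str, String),
}

// Hypothetical accessor for a required config field.
fn require(fields: &[(&str, &str)], name: &'static str) -> Result<String, ConfigError> {
    match fields.iter().find(|(k, _)| *k == name) {
        Some((_, v)) if !v.is_empty() => Ok(v.to_string()),
        Some(_) => Err(ConfigError::Invalid(name, "empty value".to_string())),
        None => Err(ConfigError::Missing(name)),
    }
}

fn main() {
    let fields = [("server-url", "https://example.com")];
    assert_eq!(require(&fields, "server-url").unwrap(), "https://example.com");
    assert_eq!(require(&fields, "cert"), Err(ConfigError::Missing("cert")));
    assert_eq!(
        require(&[("cert", "")], "cert"),
        Err(ConfigError::Invalid("cert", "empty value".to_string()))
    );
}
```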
Reviewed By: yancouto
Differential Revision: D30745971
fbshipit-source-id: e414ec2fadc5d04e9c788bf290a70f6cf52dbe58
Summary: Saw this when reading related code.
Reviewed By: kulshrax
Differential Revision: D30783665
fbshipit-source-id: f9b598b9301619346972bd0abf893f089d902022
Summary:
I'm not sure what this was for, but it doesn't seem necessary, and removing it simplifies the code a lot, enabling us to make other improvements later.
This is an alternate, less ambitious version of https://www.internalfb.com/diff/D30620443.
Reviewed By: DurhamG
Differential Revision: D30674016
fbshipit-source-id: 17dee50b82c78d31e45492dc23826d8c3fe838e5
Summary:
This test relies on Mononoke, so it fails for make local build/test,
which breaks hgbuild. Let's only enable it for buck tests.
Reviewed By: quark-zju
Differential Revision: D30782799
fbshipit-source-id: 4b543beeb248715702b9072d84cdb8211dcd4a9b
Summary:
Currently there are two things preventing us from running add_sync_target
on an existing target:
* already existing bookmark
* already existing config
Both need to be deleted to create a new target. This diff removes the second
one to simplify the code and make it easier to recreate the target (it's easy to
forget to manually remove the config, as configs otherwise don't need
human intervention).
Reviewed By: StanislavGlebik
Differential Revision: D30767613
fbshipit-source-id: f951c0e1ef9bde69d805dc911331fcdb220123f2
Summary:
This logic scans all the ancestors of the working copy that are not
ancestors of the graft source and checks their extras. With lazy changelog this
is extremely expensive. Let's just drop this logic.
Reviewed By: quark-zju
Differential Revision: D30734017
fbshipit-source-id: ca5606cea08efe10f29847970379d6bff4eb4aee
Summary:
Update the `Filenodes` trait so that it doesn't require the repository id to be
passed in every method invocation. In practice a filenodes instance can only
be used for a single repo, so it is safer for the implementation to store the
repository id.
At the same time, update the trait to use new futures and async-trait.
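The shape of the change can be sketched like this (simplified to synchronous methods with made-up types; the real trait is async and deals in filenode data, not strings):

```rust
// Sketch of the trait change: the repository id moves from every method's
// parameter list into the implementing struct.
#[derive(Clone, Copy, Debug, PartialEq)]
struct RepositoryId(i32);

// Before: every call had to thread the repo id through.
#[allow(dead_code)]
trait FilenodesOld {
    fn get_filenode(&self, repo_id: RepositoryId, path: &str) -> Option<String>;
}

// After: the instance is bound to a single repo at construction time,
// which is safer since a filenodes instance only ever serves one repo.
trait Filenodes {
    fn get_filenode(&self, path: &str) -> Option<String>;
}

struct SqlFilenodes {
    repo_id: RepositoryId,
}

impl Filenodes for SqlFilenodes {
    fn get_filenode(&self, path: &str) -> Option<String> {
        // A real implementation would query storage keyed by self.repo_id.
        Some(format!("filenode for {} in {:?}", path, self.repo_id))
    }
}

fn main() {
    let filenodes = SqlFilenodes { repo_id: RepositoryId(1) };
    assert!(filenodes.get_filenode("a/b.txt").is_some());
}
```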
Reviewed By: yancouto
Differential Revision: D30729630
fbshipit-source-id: a1f80a299d9b0a99ddb267d1f7093f27cf21f1af
Summary:
Make the derivation process for mercurial changesets and manifests not depend
on `BlobRepo`, but instead use the repo attribute (`RepoBlobstore`) directly.
This will allow us to migrate to using `DerivedDataManager` in preparation
for removing `BlobRepo` from derivation entirely.
Reviewed By: yancouto
Differential Revision: D30729629
fbshipit-source-id: cf478ffb97a919c78c7e6e574580218539eb0fdf
Summary:
Make the derivation process for blame and deleted files manifest not depend
on `BlobRepo`, but instead use the repo attribute (`RepoBlobstore`) directly.
This will allow us to migrate to using `DerivedDataManager` in preparation
for removing `BlobRepo` from derivation entirely.
A `BlobRepo` reference is still needed at the moment for derivation of the
unodes that these depend on. That will be removed when `DerivedDataManager`
takes over co-ordination of derivation.
Reviewed By: yancouto
Differential Revision: D30729628
fbshipit-source-id: 4504abbe63c9bf036d69cb4341c75b13061fae18
Summary:
Make the derivation process for fsnodes and skeleton manifests not depend on
`BlobRepo`, but instead take the `DerivedDataManager` from the `BlobRepo` and
use that instead. This is in preparation for removing `BlobRepo` from
derivation entirely.
Reviewed By: yancouto
Differential Revision: D30301855
fbshipit-source-id: a2ed1639526aad9ddbe8429988043f0499f7629c
Summary:
Make the derivation process for unodes not depend on `BlobRepo`, but instead
take the `DerivedDataManager` from the `BlobRepo` and use that instead. This
is in preparation for removing `BlobRepo` from derivation entirely.
Reviewed By: yancouto
Differential Revision: D30300408
fbshipit-source-id: c35e9e21366de74338f453aaf6be476e7305556d
Summary:
The derived data manager also has a reference to the repo blobstore. This must
also be overridden when we override the blobstore.
Reviewed By: yancouto
Differential Revision: D30738354
fbshipit-source-id: b0e16ef810c5244cd056a3c9e5b9ceaaddb5ecea
Summary:
Modernise tests (removing `disable treemanifest`).
treemanifest is the default now, so it shouldn't be disabled unless absolutely necessary.
Also, if we would like to switch some of the tests to Mononoke, it shouldn't be disabled.
Only two tests in commitcloud are left with it. Those two are a bit harder to fix.
```
devvm1006.cln0 {emoji:1f440} ~/fbsource/fbcode/eden/scm/tests
[139] → rg 'disable treemanifest' | grep cloud
test-commitcloud-backup-bundlestore-short-hash.t: $ disable treemanifest
test-commitcloud-backup.t: $ disable treemanifest
```
Reviewed By: kulshrax
Differential Revision: D30278754
fbshipit-source-id: cf450084669c2b6b361cd34952bf986e913de1a8
Summary:
I want to use the `ReplicaFirst` read connection type since `ReplicaOnly` is a bit too restrictive.
We've had 2 MySQL SEVs this year when all the replicas went down, crashing our services despite the primary instance working normally.
There was also a case when I deleted too many rows at once and all replicas went down due to replication lag (I know better now).
RFC:
- Yay or nay?
- Should I expand `ReadConnectionType` to mirror all options of `InstanceRequirement`?
- Perhaps it's worth moving it into the `common/rust/shed/sql` crate?
I kept the cleanup of all the usages out of this diff to keep the changes minimal for the RFC.
Differential Revision: D30574326
fbshipit-source-id: 1462b238305d47557372afe7763096c53df55f10
Summary:
The segmented changelog seeder spends a significant chunk of time fetching
changesets. By saving them to a file we can make reseeding significantly faster.
Reviewed By: farnz
Differential Revision: D30765374
fbshipit-source-id: 0e6adf12e334ad70486145173ae87c810880988a
Summary:
In backfill_derived_data we had a way to prefetch a lot of commits at once, so
that backfill_derived_data doesn't have to do it on every startup.
I'd like to use the same functionality in the segmented changelog seeder, so let's
move it to a separate binary.
Reviewed By: mitrandir77, farnz
Differential Revision: D30765375
fbshipit-source-id: f6930965b270cbaae95c3ac4390b3d367eaab338
Summary: ContentDataStore is meant to be local-only. Fetching remotely seems to cause the issue observed in https://fb.workplace.com/groups/scm/permalink/4192991577417097/ (though I'm not quite sure why yet).
Reviewed By: kmancini
Differential Revision: D30744817
fbshipit-source-id: 68875a4912905f9b8f88cf4be804c5d988c3905d
Summary: If the bind unmount fails in the privhelper, there's a possibility of infinite recursion in this method. This adds a flag to indicate whether we've tried the bind unmount before.
Differential Revision: D30732857
fbshipit-source-id: 6ee887d211977ee94c8e66531287f076a7e61a2c
Summary:
It sounds like macOS has a bug where an APFS subvolume may be falsely created.
Let's retry with the hope that the retry will succeed.
Differential Revision: D30657706
fbshipit-source-id: 60bc74f789a0d34b2be53073103b95474a9a18e6
Summary: This is regenerated rust lib using the latest compiler
Reviewed By: krallin
Differential Revision: D30720130
fbshipit-source-id: 3d3389ec8504568fc356dda1577e1f7801cb7e96
Summary:
~~Also enable the `derive` feature so it isn't necessary to separately
depend on `strum_macros`.~~
This turns out to break a lot.
Reviewed By: dtolnay
Differential Revision: D30709976
fbshipit-source-id: a9181070b8d7a8489eebc9e94fa24f334cd383d5
Summary: Move `edenapi::Client`'s internals to an `Arc<ClientInner>`. This makes the client `Clone`-able while sharing the same underlying state. This is particularly useful for scenarios where `Future`s or `Stream`s returned by the client need to hold a reference to the client itself (e.g., in order to issue subsequent HTTP requests).
Differential Revision: D30729803
fbshipit-source-id: c97e700c9e3702f818eb86ded1a46f920a55cfd1
Summary: The `Fetch<T>` type has basically turned into the canonical type for all EdenAPI responses. Originally, this type was merely an implementation detail (essentially just a named tuple returned by the `fetch()` method, hence the name), but given its prominence in the API, the name is confusing. As we add more functionality and usage to this type, it makes sense to give it a more suitable name.
Differential Revision: D30730573
fbshipit-source-id: 7acd2a86b55bdfc186bd9110f6a99333df9d29d3
Summary:
Some of the method names used internally by `edenapi::Client` are a bit terse.
This was OK back when there were only a handful of private methods, which were used by a small number of API methods that were doing more or less the same thing (sending concurrent POST requests for a set of keys).
Today, there are way more API methods, most of which set up requests in different ways. As such, it makes sense to give these older private methods more explicit and descriptive names so that their intended usage is clear.
Differential Revision: D30729802
fbshipit-source-id: 5adfd8e7ba153df8c036e4dbb312f95b9b1d7335
Summary: Allow repack to be called on treescmstore via the ContentStore shim, just as is already supported for filescmstore.
Reviewed By: andll
Differential Revision: D30687145
fbshipit-source-id: 7559af08e98cfb22da6dbf45dc1746312b1e6d28
Summary:
Provide a basic implementation of the LegacyStore trait for TreeStore to allow repack calls to be forwarded to the fallback ContentStore for trees.
Repack will be removed entirely before contentstore is deleted, and the `unimplemented` methods are never called, so this should be safe.
Reviewed By: andll
Differential Revision: D30687136
fbshipit-source-id: d238d70fbf6be5c25c2e1c9610430a53d031bf3b
Summary: Looks like it was lost during the last refactoring, let's add it back.
Reviewed By: farnz
Differential Revision: D30728456
fbshipit-source-id: 20c638b3c5a8664f2367f871cd29a793fd897de3
Summary:
Some users have reported errors of the form:
```
error.HttpError: [65] Send failed since rewinding of the data stream failed (seek callback returned error 2)
```
These are caused by the fact that we're passing the HTTP request body directly to libcurl in memory rather than via a file, but we haven't implemented the `seek()` method necessary for libcurl to retransmit the data if needed. This diff implements the method.
Reviewed By: DurhamG
Differential Revision: D30654625
fbshipit-source-id: f21a067ad02ee540b86cf2e6eff2c6f08f45a3e4
Summary:
Like it says in the title, this updates us to use Daemonize 0.5, though from
GitHub and not Crates.io, because it hasn't been released to the latter yet.
The main motivation here is to pull in
https://github.com/knsd/daemonize/pull/39 to avoid leaking PID files to
children of the daemon.
This required some changes in `hphp/hack/src/facebook/hh_decl` and `xplat/rust/mobium` since the way to
run code after daemonization has changed (and become more flexible).
Reviewed By: ndmitchell
Differential Revision: D30694946
fbshipit-source-id: d99768febe449d7a079feec78ab8826d0e29f1ef
Summary:
At the moment, when segmented changelog is updated and/or reseeded, mononoke
servers can pick it up only once an hour (this is the current reload schedule)
or when the mononoke server is restarted. However, during production issues (see
attached task for an example) it would be great to have a way to force all
servers to reload segmented changelog.
This diff makes it possible to do so with a tunable. Once the tunable changes its
value, mononoke servers reload it almost immediately (subject to jitter).
This implementation adds a special loop that polls tunables value and reloads
if it changes. Note that in theory it could avoid polling and watch for configerator
changes instead, but it would be harder to implement and I decided that it's
not worth it.
Reviewed By: farnz
Differential Revision: D30725095
fbshipit-source-id: da90ea06715c4b763d0de61e5899dfda8ffe2067