Summary:
All the bookmarks is *a lot* of bookmarks. Don't do them all at once. Also, add
some logging output so we can tell how far along we are.
Reviewed By: HarveyHunt
Differential Revision: D25397297
fbshipit-source-id: c19b99123f88e05e99bff61e2399a62d378a6671
Summary: Also added a TryShared future to futures_ext. The problem with the regular Shared is that if you want to share an anyhow::Result, the Error part of it is not cloneable. TryShared works nicely when returning anyhow::Result, which most of our code does.
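A minimal sketch of the underlying issue, using plain `Result`s instead of futures and a hypothetical `OpaqueError` standing in for `anyhow::Error`: a shared future hands every waiter a clone of the output, so a non-cloneable error has to be wrapped (e.g. in an `Arc`) before sharing.

```rust
use std::sync::Arc;

// Stand-in for anyhow::Error: carries context but is not Clone.
#[derive(Debug)]
pub struct OpaqueError(pub String);

// A Shared-style combinator must hand each waiter a clone of the output,
// so Result<T, OpaqueError> cannot be shared directly. Mapping the error
// into an Arc makes the whole Result cloneable, which is the idea behind
// a TryShared-style wrapper.
pub fn shareable<T: Clone>(r: Result<T, OpaqueError>) -> Result<T, Arc<OpaqueError>> {
    r.map_err(Arc::new)
}
```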
Reviewed By: aslpavel
Differential Revision: D25223317
fbshipit-source-id: cf21141701884317a87dc726478dcd7a5a820c73
Summary:
Like it says in the title. This is helpful to measure the number of SQL queries
we make. This required actually threading in a CoreContext, which we didn't
have before.
Reviewed By: StanislavGlebik
Differential Revision: D25336069
fbshipit-source-id: 35677c55550e95b5126de29c2a824b4eda32092c
Summary: Like it says in the title.
Reviewed By: StanislavGlebik
Differential Revision: D25336068
fbshipit-source-id: 113050215c28a28c820d938348a0a3e54c14c3ee
Summary:
Like it says in the title, this adds a caching layer around Globalrevs using
our existing `GetOrFillMultipleFromCacheLayers` abstraction.
Note: I've opted to not track dedicated metrics for this (compare to the hg
mapping to see them), since I don't believe we really ever look at them.
I'd like to do a little bit of refactoring of
`GetOrFillMultipleFromCacheLayers` to a) track them without having to ad-hoc
code it, b) convert it to 0.3 futures, and c) require less ceremony to call it.
However, I'll do so in another diff.
Reviewed By: StanislavGlebik
Differential Revision: D25334478
fbshipit-source-id: 1385298b8fdf1cd081b6e509c6c3e03b3abbfa5b
Summary: This lib.rs is getting too big. Split it.
Reviewed By: StanislavGlebik
Differential Revision: D25333510
fbshipit-source-id: ea15664d2de21a24ee107162e030b7762b1d413e
Summary:
I'd like to add a caching variant for this. Might as well not have to rewrite
those methods on an ad-hoc basis.
Reviewed By: StanislavGlebik
Differential Revision: D25333461
fbshipit-source-id: 632c0307189fe15a926d808c1eeca1e3f240eb19
Summary: Like it says in the title.
Reviewed By: StanislavGlebik
Differential Revision: D25333450
fbshipit-source-id: 49ad4b1964a4dfd9f3e0108fa421d451ef905256
Summary:
This makes logs go through a `Drain` which queries `ObservabilityContext` (introduced in a previous diff) for the current logging level. I did not add any tests for now; it would be easy to add a unit test checking that the drain indeed respects the level, but it's so simple that I am not 100% convinced that test would be all that valuable.
Note that currently `ObservabilityContext` is enabled in its `Static` variant.
Reviewed By: mitrandir77
Differential Revision: D25232400
fbshipit-source-id: 7499916e0a3ddab43538343e6ed215818517eaf7
Summary:
`ObservabilityContext` is a structure that helps logging facilities within Mononoke to make logging decisions. Specifically, the upcoming `DynamicLoggingDrain` and already existing `MononokeScubaSampleBuilder` will have this structure as a component and decide whether a particular logging statement (slog or scuba) should go ahead.
Internally, `ObservabilityContext` is a wrapper around data received from a [configerator endpoint](https://www.internalfb.com/intern/configerator/edit/?path=scm%2Fmononoke%2Fobservability%2Fobservability_config.cconf).
This diff makes a few unobvious decisions about how this is organized. My goals were:
1. to have production (i.e. reading from configerator), static (i.e. emulating current prod behavior) and test variants of `ObservabilityContext`
2. to avoid having consumers know which of these variants are used
3. to avoid making all consumers of `ObservabilityContext` necessarily generic
4. to avoid using dynamic dispatch of `ObservabilityContext`'s methods
Points 3 and 4 mean that `ObservabilityContext` cannot be a trait. `enum` is a common solution in such cases. However, if `ObservabilityContext` is an `enum`, consumers will know which variant they are using (because `enum` variants are public). So the solution is to use a private enum wrapped in a struct.
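A sketch of that shape (variant, field, and method names here are illustrative, not the actual API): the enum is private to the module, the struct is the only public surface, and dispatch is a plain `match` rather than a trait object.

```rust
// Private: consumers cannot name or match on these variants.
enum ObservabilityContextInner {
    // In production this would hold a configerator-backed handle;
    // a plain level threshold stands in for it here.
    Dynamic { min_level: u8 },
    Static,
}

// Public wrapper: the only thing consumers see.
pub struct ObservabilityContext {
    inner: ObservabilityContextInner,
}

impl ObservabilityContext {
    pub fn new_static() -> Self {
        Self { inner: ObservabilityContextInner::Static }
    }

    pub fn new_dynamic(min_level: u8) -> Self {
        Self { inner: ObservabilityContextInner::Dynamic { min_level } }
    }

    // Static dispatch via match: consumers are not generic over the
    // variant, and no vtable is involved.
    pub fn should_log(&self, level: u8) -> bool {
        match &self.inner {
            ObservabilityContextInner::Dynamic { min_level } => level >= *min_level,
            ObservabilityContextInner::Static => true,
        }
    }
}
```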
Reviewed By: mitrandir77
Differential Revision: D25287759
fbshipit-source-id: da034c71570137e8a8fb7749b1e4ad43be482f66
Summary: Can just pass on the iterator
Reviewed By: ikostia
Differential Revision: D25216892
fbshipit-source-id: 79c08737477ac7ed1f824c50105d5977ee592126
Summary: It's now the default for this binary, so we might as well shorten the test command lines
Reviewed By: ikostia
Differential Revision: D25219717
fbshipit-source-id: 8074145c6f05f26ab7fa18d2ff399482ad592885
Summary: Reduces boilerplate for binaries usually run in this mode, notably the walker
Reviewed By: ikostia
Differential Revision: D25216883
fbshipit-source-id: e31d2a6aec7da3baafd8bcf208cf79cc696752c0
Summary: This is useful to prevent accidentally consuming too much. Enabled it for the walker
Reviewed By: ikostia
Differential Revision: D25216880
fbshipit-source-id: e80f490d6ece40d64cc8609e7d6b80d0ecbb1671
Summary: Reduces boilerplate on the command line for binaries like the walker that want a different default
Reviewed By: krallin
Differential Revision: D25216876
fbshipit-source-id: 0df474568d28e0726be223e9dc0a760523063d21
Summary: A Darkisilon cell consists of multiple hosts which share underlying storage, so a write to one of them is visible to all hosts. Let's spread requests across all these hosts. I'll get the list of hosts from the SMC tier and randomly connect to one on each request.
Reviewed By: krallin
Differential Revision: D25163782
fbshipit-source-id: b28085dd37b15972469b7334a47def473e10f34e
Summary: This allows us to be more flexible in choosing authentication and expands variables used in configuration.
Reviewed By: singhsrb
Differential Revision: D25304008
fbshipit-source-id: 636893a9eaec31ca5acfa02f72931d5e56b695d0
Summary: Convert blobrepo:blobrepo_common to new type futures
Reviewed By: StanislavGlebik
Differential Revision: D25245426
fbshipit-source-id: d3db0e417545b2b0043e48a536737586005ac4bb
Summary:
I'd like to experiment with splitting this into its own service. To do that, I
don't want to have to include a path, since it's only used for reporting an
error that will never occur (because for that service I'll be using the
"generate" variant of the filenode id). Let's just make it optional.
Reviewed By: lukaspiatkowski
Differential Revision: D25220901
fbshipit-source-id: 6d3cf70a63b077de18a7d43f3b65766b453c425e
Summary: Like it says in the title. Let's turn this into an async fn.
Reviewed By: lukaspiatkowski
Differential Revision: D25220902
fbshipit-source-id: b5de783adaca05919eb5cd6858c8b0aaf03ddfc2
Summary:
This returns a Result of a tuple, but:
- This never errs.
- Nothing ever reads the left side of the tuple.
So let's stop doing that.
Reviewed By: StanislavGlebik
Differential Revision: D25219887
fbshipit-source-id: f33dcf6f6e68cb17b40c4638470312afae0662e6
Summary:
Like it says in the title. We've had this comment about potentially using this
for a couple of years now. It seems a bit unlikely at this point that we're
going to use this, but currently it makes the code that provides uploading hg
entries more complex than it needs to be, so let's just get rid of this.
Reviewed By: lukaspiatkowski, StanislavGlebik
Differential Revision: D25219728
fbshipit-source-id: 61d254edef16d24a4c29f96f983f894863b5232a
Summary:
This modifies our object popularity mechanism to account for the size of the
objects being downloaded. Indeed, considering our bottleneck is bandwidth, forcing
similar routing for 10 downloads of a 10MB object and 10 downloads of a
1GB object doesn't make much sense.
This diff updates our counting so that we now record the object size instead
of a count. I'll set up routing so that we disallow consistent routing when a
single object exceeds 250MiB/s of throughput ( = 1/4th of a task).
It's worth noting that this will be equivalent to what we have right now for
our most problematic objects (GraphQL schemas in Fbsource, 35M each), given
that we "unroute" at 150 requests / 20 seconds (`150 * 35 / 20 = 262`).
The key difference here is that this will work for all objects.
This does mean LFS needs to cache and know about content metadata. That's not
actually a big deal. Indeed, over a week, we serve 25K distinct objects
(https://fburl.com/scuba/mononoke_lfs/a2d26s1a), so considering content
metadata is a hundred bytes (and so is the key), we are looking at a few MBs of
cache space here.
As part of this, I've also refactored our config handling to stop duplicating
structures in Configerator and LFS by using the Thrift objects directly (we
still have a few dedicated structs when post-processing is necessary, but we
no longer have anything that deserializes straight from JSON).
Note that one further refinement here would be to consistently route but to
more tasks (i.e. return one of 2 routing tokens for an object that is being downloaded
at 500MiB/s). We'll see if we need that.
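The accounting change can be sketched as follows (constant and function names are illustrative, not the actual code): the popularity window sums bytes served rather than request counts, and consistent routing is dropped once the implied throughput passes the threshold.

```rust
// Window and threshold from the description above: 20 s windows,
// unroute above 250 MiB/s ( = 1/4th of a task's bandwidth).
const WINDOW_SECS: u64 = 20;
const THRESHOLD_BYTES_PER_SEC: u64 = 250 * 1024 * 1024;

// Old scheme: count requests in the window. New scheme: sum object
// sizes, so 10 downloads of a 1GB object weigh 100x more than 10
// downloads of a 10MB object.
pub fn allow_consistent_routing(bytes_served_in_window: u64) -> bool {
    bytes_served_in_window / WINDOW_SECS <= THRESHOLD_BYTES_PER_SEC
}
```

With the numbers quoted above, 150 downloads of a 35MB object in one window works out to ~262MB/s, just over the threshold, matching the current unrouting behavior for those objects.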
Reviewed By: HarveyHunt
Differential Revision: D24361314
fbshipit-source-id: 49e1f86cf49357f60f1eac298a753e0c1fcbdbe5
Summary:
D24761026 (caa684450f) formatted the default cachelib size with the specifier
`{:3}`. This specifier pads the left side of the string with spaces if there
are less than 3 digits.
Unfortunately, this means that attempting to parse the string into an `f64`
fails. Here's a minimal example:
https://play.rust-lang.org/?version=stable&mode=debug&edition=2018&gist=11a1e819f9919f7d02565cb8fa561b85
Remove the format specifier and instead call `.to_string()`.
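The failure mode is reproducible in a couple of lines (mirroring the linked playground; the helper name here is illustrative):

```rust
// `{:3}` right-aligns integers in a field of width 3, so values with
// fewer than 3 digits come back with leading spaces.
pub fn format_padded(gib: u64) -> String {
    format!("{:3}", gib)
}
```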
Reviewed By: ahornby
Differential Revision: D25302079
fbshipit-source-id: 461dd628a312a967f6cf5958d2e5d51b72b0ffd8
Summary:
Right now this is the same as our Filestore threshold so we always attempt to
compress Filestore chunks even though they were designed to fit in cache.
Whoops.
Reviewed By: ikostia
Differential Revision: D25274080
fbshipit-source-id: f17b54710fc36ca7c11c74247d038bf73777f7f9
Summary:
Cachelib wasn't initialized so a production configuration would crash because
the cache pools wouldn't be found.
Reviewed By: singhsrb
Differential Revision: D25291880
fbshipit-source-id: 382db0efda072b11da587e863566e816bd5393ca
Summary: Convert `ChangesetFetcher` to new type futures
Reviewed By: StanislavGlebik
Differential Revision: D25244213
fbshipit-source-id: 4207386d81397a930a566db008019bb8f31bf602
Summary:
If skeleton manifests are not enabled for a repository, ignore the tunables
that delay casefolding checks until land-time.
Reviewed By: StanislavGlebik
Differential Revision: D25271607
fbshipit-source-id: dcaf9291da31d0f57b3b632888ed688ecd6cecda
Summary:
Create `BlobstoreMapping` as a trait with the common implementations for
derived data mappings that are stored in the blobstore.
Reviewed By: StanislavGlebik
Differential Revision: D25099915
fbshipit-source-id: 8a62fbb809918045336944c8cd3584b109811012
Summary:
Batch derivation was disabled as it caused problems for other derived data types.
This was because the default batch implementation was wrong: it attempted to derive
potentially linear stacks concurrently, which would cause O(n^2) derivations.
Fix the default implementation to be a naive sequential iteration, and re-enable
batch derivation for fsnodes and skeleton manifests.
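A toy cost model of the difference on a linear stack of `n` commits (hypothetical, just to make the O(n^2) claim concrete): concurrent derivation with no shared mapping forces each commit to re-derive its whole ancestry, while sequential derivation does one unit of work per commit.

```rust
// Concurrent derivation with nothing persisted yet: commit i must
// derive i changesets (itself plus all undrived ancestors), so the
// total is 1 + 2 + ... + n = n * (n + 1) / 2, i.e. O(n^2).
pub fn concurrent_cost(n: u64) -> u64 {
    (1..=n).sum()
}

// Naive sequential iteration: each commit finds its parent already
// derived and only derives itself, i.e. O(n).
pub fn sequential_cost(n: u64) -> u64 {
    n
}
```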
Reviewed By: StanislavGlebik
Differential Revision: D25218195
fbshipit-source-id: 730555829f092cc36e1c81cf02c2b4962a3904da
Summary:
On the second and subsequent linear stack in a batch, the parent fsnode may
have been derived in the previous iteration of the loop. Since we haven't
completed this batch yet, the mappings have not been stored, and so the
attempt to derive the parent will result in the parent being derived again,
repeating the work of this batch sequentially.
Apply the optimization used in skeleton manifests and fetch the parent fsnodes
out of the result we are accumulating.
Reviewed By: StanislavGlebik
Differential Revision: D25218194
fbshipit-source-id: 5cc49204b53984f8aa73542f1a794a6251eb2b2e
Summary:
Similar to fsnodes, allow skeleton manifests to be derived in parallel in large
batches by splitting the changesets into linear stacks with no internal
conflicts, and deriving each changeset in that batch in parallel.
Reviewed By: StanislavGlebik
Differential Revision: D25218196
fbshipit-source-id: e578de9ffd472e732abb1e2ef9cd19c073280cd4
Summary:
The Mononoke SCS tests can be flaky as the server starts to accept SSL
connections some time before it is fully ready to accept requests, which can
result in request timeouts.
Use a simple request to test for the server being fully up and ready to
handle requests.
Reviewed By: krallin
Differential Revision: D25250245
fbshipit-source-id: b79106ed51e5163ebe5cd1db7b0deaab0035b9bc
Summary: Update the few test cases remaining on the older option to the newer option
Reviewed By: ikostia
Differential Revision: D25219710
fbshipit-source-id: 50af0dcac7ed980ec4c7180cf81e3c00ecc18b95
Summary: Remove this now that it is the walker default. Makes command lines shorter.
Reviewed By: ikostia
Differential Revision: D25219551
fbshipit-source-id: bc5ad4237cad35218a0b4c54aa81eb20edb3f3e1
Summary:
This will reduce boilerplate on the command line for the walker, as most of the time we want to run it with readonly storage.
Because the existing --readonly-storage flag can't take a value, this introduces a new --with-readonly-storage=<true|false> option.
Reviewed By: krallin
Differential Revision: D25216871
fbshipit-source-id: e1b83b428a9c3787f48c18fd396d23ac95991b77
Summary:
Previously we needed to pass cachelib settings once to MononokeAppBuilder and once to parse_and_init_cachelib.
This change adds a MononokeClapApp and MononokeMatches that preserve the settings, preventing the need to pass them in twice (and avoiding possible inconsistency).
MononokeMatches uses MaybeOwned to hold the inner ArgMatches, which allows us to hold both the usual reference case from get_matches and an owned case for get_matches_from which is used in test cases.
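The `MaybeOwned` idea looks roughly like this (a simplified sketch, not the real type's full API): one wrapper that can hold either a borrowed value or an owned one, exposing the same view over both.

```rust
// Either a reference (the usual get_matches path, where a borrowed
// ArgMatches comes back) or an owned value (the get_matches_from
// path used in test cases).
pub enum MaybeOwned<'a, T> {
    Borrowed(&'a T),
    Owned(T),
}

impl<'a, T> MaybeOwned<'a, T> {
    // Both cases hand out the same borrowed view.
    pub fn get(&self) -> &T {
        match self {
            MaybeOwned::Borrowed(t) => t,
            MaybeOwned::Owned(t) => t,
        }
    }
}
```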
Reviewed By: krallin
Differential Revision: D24788450
fbshipit-source-id: aad5fff2edda305177dcefa4b3a98ab99bc2d811
Summary:
Show the cachelib cache_size default in --help usage so it's clear what you'll get if no command line args are passed.
Because we need to convert from bytes to GiB, the lifetime of the help string isn't long enough for clap's reference-receiving default_value, so use OnceCell to be able to pass a static reference.
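The lifetime problem and the fix can be sketched with std's `OnceLock` (the diff itself uses `OnceCell`, but the shape is the same; the constant and function names are illustrative): `default_value` wants a `&str` that outlives the argument definitions, and a lazily initialized static provides one.

```rust
use std::sync::OnceLock;

// Illustrative default: 20 GiB, expressed in bytes.
const CACHE_SIZE_BYTES: u64 = 20 * 1024 * 1024 * 1024;

// A String formatted inside the arg-building function would be dropped
// too early to hand out as &str. Storing it in a static OnceLock
// promotes the reference to &'static str.
pub fn default_cache_size_gib() -> &'static str {
    static DEFAULT: OnceLock<String> = OnceLock::new();
    DEFAULT.get_or_init(|| (CACHE_SIZE_BYTES / (1024 * 1024 * 1024)).to_string())
}
```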
Reviewed By: krallin
Differential Revision: D24761026
fbshipit-source-id: 81b5e7ceb832d5cb55cc9faef59c5e6432f7c4b0
Summary:
Move expected_item_size_byte into CachelibSettings; it seems like it should be there.
To enable its use, this also exposes a parse_and_init_cachelib method for callers that have different defaults from the standard cachelib settings.
Reviewed By: krallin
Differential Revision: D24761024
fbshipit-source-id: 440082ab77b5b9f879c99b8f764e71bec68d270e
Summary:
Introduce a new API type, `TreeAttributes`, corresponding to the existing type `WireTreeAttributesRequest`, which exposes which optional attributes are available for fetching. An `Option<TreeAttributes>` parameter is added to the `trees` API, and if set to `None`, the client will make a request with TreeAttributes::default().
The Python bindings accept a dictionary for the attributes parameter, and any fields present will overwrite the default settings from TreeAttributes::default(). Unrecognized attributes will be silently ignored.
Reviewed By: kulshrax
Differential Revision: D25041255
fbshipit-source-id: 5c581c20aac06eeb0428fff42bfd93f6aecbb629
Summary: Basic observability for how the segmented changelog update process is performing.
Reviewed By: krallin
Differential Revision: D25108739
fbshipit-source-id: b1f406eb0c862464b186f933d126e0f3a08144e4
Summary:
The update of the segmented changelog is lightweight enough that we can
consider all repositories sharing a common tailer process. With all
repositories sharing a single tailer the maintenance burden will be lower.
Things that I am particularly unsure about are: tailer configuration setup and
tailer structure. With regards to setup, I am not sure if this is more or less
than what production servers do to instantiate. With regards to structure, I
think that it makes a lot of sense to have a function that takes a single repo
name as parameter but the configuration setup has an influence on the details.
I am also unsure how important it is to parallelize the instantiation of the
blobrepos.
Finally, it is worth mentioning that the SegmentedChangelogTailer waits for
`delay` after an update finishes rather than on a period. The benefit is that
we don't have large updates taking down a process because we schedule the same
large repo update too many times. The drawback is that scheduling gets messed
up over time and multiple repo updates can end up starting at the same time.
Reviewed By: farnz
Differential Revision: D25100839
fbshipit-source-id: 5fff9f87ba4dc44a17c4a7aaa715d0698b04f5c3
Summary: This makes it easier to enable all derived data for scrubbing
Reviewed By: ikostia
Differential Revision: D25188963
fbshipit-source-id: e9c981e33273d6b2eeadcce0d0a341b33e91e42d
Summary:
Show the options for specifying node and edge types in the --help output.
These changes removed the last use of lazy_static in the walker, so TARGETS is updated as well.
Reviewed By: krallin
Differential Revision: D25188964
fbshipit-source-id: c5ccb4f5a0f3be1b8cb7d51cd5f99236d60d3029
Summary:
It has a build() method, and later in the stack it will build a Mononoke-specific
type rather than the clap::App.
Differential Revision: D25216827
fbshipit-source-id: 24a531856405a702e7fecf54d60be1ea3d2aa6e7