Summary:
Add a cmdlib argument to control cachelib zstd compression. The default behaviour is unchanged: the CachelibBlobstore will attempt compression when putting to the cache if the object is larger than the cachelib max size.
To make the cache behaviour more testable, this change also adds an option to do an eager put to cache without the spawn. The default remains to do a lazy fire and forget put into the cache with tokio::spawn.
The motivation for the change is that when running the walker the compression putting to cachelib can dominate CPU usage for part of the walk, so it's best to turn it off and let those items be uncached as the walker is unlikely to visit them again (it only revisits items that were not fully derived).
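The lazy vs. eager distinction could be sketched roughly as below (hypothetical names; `std::thread::spawn` stands in for `tokio::spawn` so the sketch stays dependency-free):

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};
use std::thread;

type Cache = Arc<Mutex<HashMap<String, Vec<u8>>>>;

// Eager: block until the cache write completes (easier to test against).
fn put_eager(cache: &Cache, key: String, value: Vec<u8>) {
    cache.lock().unwrap().insert(key, value);
}

// Lazy: fire-and-forget; the caller does not wait for the cache write.
// In the real blobstore this is a tokio::spawn'd future.
fn put_lazy(cache: &Cache, key: String, value: Vec<u8>) -> thread::JoinHandle<()> {
    let cache = Arc::clone(cache);
    thread::spawn(move || {
        cache.lock().unwrap().insert(key, value);
    })
}

fn main() {
    let cache: Cache = Arc::new(Mutex::new(HashMap::new()));
    put_eager(&cache, "a".into(), vec![1]);
    // The value is visible immediately after an eager put.
    assert!(cache.lock().unwrap().contains_key("a"));
    // A lazy put is only guaranteed visible once its task finishes.
    put_lazy(&cache, "b".into(), vec![2]).join().unwrap();
    assert!(cache.lock().unwrap().contains_key("b"));
}
```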
Reviewed By: StanislavGlebik
Differential Revision: D22797872
fbshipit-source-id: d05f63811e78597bf3874d7fd0e139b9268cf35d
Summary:
This diff adds a minimal workflow for running integration tests for Mononoke. Currently only one test is run and it fails.
This also splits the regular Mononoke CI into separate files for Linux and Mac to match the current style in Eden repo.
There are also "scopeguard::defer" fixes here that somehow escaped the CI tests.
Some tweaks have been made to "integration_runner_real.py" to make it runnable outside FB context.
Lastly, the change from using "[[ -v ... ]]" to "[[ -n "${...:-}" ]]" in "library.sh" was made because the former is not supported by the default Bash version preinstalled on modern macOS.
Pull Request resolved: https://github.com/facebookexperimental/eden/pull/26
Reviewed By: krallin
Differential Revision: D22541344
Pulled By: lukaspiatkowski
fbshipit-source-id: 5023d147823166a8754be852c29b1e7b0e6d9f5f
Summary: D22381744 updated the version of `futures` in third-party/rust to 0.3.5, but did not regenerate the autocargo-managed Cargo.toml files in the repo. Although this is a semver-compatible change (and therefore should not break anything), it means that affected projects would see changes to all of their Cargo.toml files the next time they ran `cargo autocargo`.
Reviewed By: dtolnay
Differential Revision: D22403809
fbshipit-source-id: eb1fdbaf69c99549309da0f67c9bebcb69c1131b
Summary:
This updates the virtually_sharded_blobstore to deduplicate puts only if the
data being put is actually the data we have put in the past. This is done by
keeping track of the hash of things we've put in the presence cache.
This has 2 benefits:
- This is safer. We only dedupe puts we 100% know succeeded (because this
particular instance was the one to attempt the put).
- This creates fewer surprises; notably, it lets us overwrite data in the
backing store (if we are writing something different).
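A single-threaded sketch of the idea (all names hypothetical; the real presence cache and blobstore are async and shared):

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};

fn hash_of(data: &[u8]) -> u64 {
    let mut h = DefaultHasher::new();
    data.hash(&mut h);
    h.finish()
}

// Presence cache maps key -> hash of the data *this* instance successfully put.
struct DedupingStore {
    backing: HashMap<String, Vec<u8>>,
    presence: HashMap<String, u64>,
    puts_to_backing: usize,
}

impl DedupingStore {
    fn new() -> Self {
        DedupingStore { backing: HashMap::new(), presence: HashMap::new(), puts_to_backing: 0 }
    }

    fn put(&mut self, key: &str, data: Vec<u8>) {
        let h = hash_of(&data);
        // Only skip the put if we know *this exact data* was already put by us.
        if self.presence.get(key) == Some(&h) {
            return;
        }
        self.backing.insert(key.to_string(), data);
        self.puts_to_backing += 1;
        // Record the hash only after the backing put succeeded.
        self.presence.insert(key.to_string(), h);
    }
}

fn main() {
    let mut s = DedupingStore::new();
    s.put("k", b"aaa".to_vec());
    s.put("k", b"aaa".to_vec()); // deduped: identical data
    s.put("k", b"bbb".to_vec()); // different data: overwrites the backing store
    assert_eq!(s.puts_to_backing, 2);
    assert_eq!(s.backing.get("k"), Some(&b"bbb".to_vec()));
}
```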
Reviewed By: StanislavGlebik
Differential Revision: D22392809
fbshipit-source-id: d76a49baa9a5749b0fb4865ee1fc1aa5016791bc
Summary:
Running those on my devserver, I noticed they can be a bit flaky. They're
racy on purpose, but let's relax them a bit.
We have a lot of margin here — our blobstore is rate limited at once request
every 10ms, and we need to do 100 requests (the goal is to show that they don't
all wait), so 100ms is fine to prove that they're not rate limited when sharing
the same data.
Reviewed By: StanislavGlebik
Differential Revision: D22392810
fbshipit-source-id: 2e3c9cdf19b0e4ab979dfc000fbfa8da864c4fd6
Summary:
There is inevitably interaction between caching, deduplication and rate
limiting:
- You don't want the rate limiting to be above caching (in the blobstore stack,
that is), because you shouldn't rate limit cache hits (this is where we are
today).
- You don't want the rate limiting to be below deduplication, because then you get
priority inversion where a low-priority rate-limited request might hold the
semaphore while a higher-priority, non rate limited request wants to do the
same fetch (we could have moved rate limiting here prior to introducing
deduplication, but I didn't do it earlier because I wanted to eventually
introduce deduplication).
So, now that we have caching and deduplication in the same blobstore, let's
also incorporate rate limiting there!
Note that this also brings a potential motivation for moving Memcache into this
blobstore, in case we don't want rate limiting to apply to requests before they
go to the _actual_ blobstore (I did not do this in this diff).
The design here when accessing the blobstore is as follows:
- Get the semaphore
- Check if the data is in cache, if so release the semaphore and return the
data.
- Otherwise, check if we are rate limited.
Then, if we are rate limited:
- Release the semaphore
- Wait for our turn
- Acquire the semaphore again
- Check the cache again (someone might have put the data we want while we were
waiting).
- If the data is there, then return our rate limit token.
- If the data isn't there, then proceed to query the blobstore.
If we aren't rate limited, then we just proceed to query the blobstore.
There are a couple subtle aspects of this:
- If we have a "late" cache hit (i.e. after we waited for rate limiting), then
we'll have waited but we won't need to query the blobstore.
  - This is important when a large number of requests for the same key
arrive at the same time and get rate limited. If we don't do this second
cache check or if we don't return the token, then we'll consume a rate
limiting token for each request (instead of 1 for the first request).
- If a piece of data isn't cacheable, we should treat it like a cache hit with
regard to semaphores (i.e. release early), but like a miss with regard to
rate limits (i.e. wait).
Both of those are captured in the code by having the cache lookup hand back the
`Ticket`. We can then choose to either return the ticket (on a cache hit) or
wait for it (on a cache miss).
(All of this logic is covered by unit tests: remove any of the blocks in
`Shards::acquire` and a test will fail.)
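The flow could be sketched single-threaded as below (hypothetical names; the semaphore is elided, and the `while_waiting` hook simulates what concurrent requests do while this one waits for its rate-limit turn):

```rust
use std::collections::HashMap;

struct Shard {
    cache: HashMap<String, Vec<u8>>,
    tokens_consumed: usize, // rate-limit tokens actually spent
    backing_gets: usize,    // queries that reached the backing blobstore
}

impl Shard {
    fn new() -> Self {
        Shard { cache: HashMap::new(), tokens_consumed: 0, backing_gets: 0 }
    }

    fn get<F>(&mut self, key: &str, rate_limited: bool, while_waiting: F,
              backing: &HashMap<String, Vec<u8>>) -> Option<Vec<u8>>
    where
        F: FnOnce(&mut HashMap<String, Vec<u8>>),
    {
        // Acquire the semaphore, then check the cache; on a hit we release the
        // semaphore and hand the rate-limit ticket back unused.
        if let Some(v) = self.cache.get(key) {
            return Some(v.clone());
        }
        if rate_limited {
            // Release the semaphore and wait for our turn (others make progress)...
            while_waiting(&mut self.cache);
            // ...then re-acquire and check the cache again.
            if let Some(v) = self.cache.get(key) {
                // Late cache hit: return the token instead of consuming it.
                return Some(v.clone());
            }
        }
        // Genuine miss: consume a token and query the blobstore.
        self.tokens_consumed += 1;
        self.backing_gets += 1;
        let v = backing.get(key).cloned();
        if let Some(ref data) = v {
            self.cache.insert(key.to_string(), data.clone());
        }
        v
    }
}

fn main() {
    let mut backing = HashMap::new();
    backing.insert("k".to_string(), b"blob".to_vec());
    let mut shard = Shard::new();
    // While we wait on the rate limiter, a concurrent request fills the cache:
    let v = shard.get("k", true,
        |cache| { cache.insert("k".to_string(), b"blob".to_vec()); }, &backing);
    assert_eq!(v, Some(b"blob".to_vec()));
    // Late cache hit: no token consumed, no blobstore query issued.
    assert_eq!(shard.tokens_consumed, 0);
    assert_eq!(shard.backing_gets, 0);
}
```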
Reviewed By: farnz
Differential Revision: D22374606
fbshipit-source-id: c3a48805d3cdfed2a885bec8c47c173ee7ebfe2d
Summary:
If anything were to go wrong, we'd be happy to know which puts we ignored. So,
let's log them.
Reviewed By: farnz
Differential Revision: D22356714
fbshipit-source-id: 5687bf0fc426421c5f28b99a9004d87c97106695
Summary:
I canaried this on Fastreplay, but unfortunately that showed that sometimes we
just deadlock, or get so slow we might as well be deadlocked (and it happens
pretty quickly, after ~20 minutes). I tried spawning all the `get()` futures,
and that fixes the problem (but it makes gettreepack noticeably slower), so
that suggests something somewhere is creating futures, polling them a little
bit, then never driving them to completion.
For better or worse, I'd experienced the exact same problem with the
ContextConcurrencyBlobstore (my initial attempt at QOS, which also used a
semaphore), so I was kinda expecting this to happen.
In a sense, this is nice, because we've suspected there were things like that in
the codebase for a while (e.g. with the occasional SQL timeout we see where it
looks like MySQL responds fast but we don't actually poll it until past the
timeout), and it gives us a somewhat convenient repro.
In another sense, it's annoying because it blocks this work :)
So, to work around the problem, for now, let's spawn futures to force the work
to complete when a semaphore is held. I originally had an unconditional spawn
here, but that is too expensive for the cache-hit code path and slows things
down (by about ~2x).
However, having it only if we'll query the blobstore isn't as expensive,
and that seems to be fine (in fact it is a ~20% p99 perf improvement,
though the exact number depends on the number of shards we use for this, which I've had to tweak a bit).
https://pxl.cl/1c18H
I did find what I think is one potential instance of this problem in
`bounded_traversal_stream`, which is that we never try to poll `scheduled` to
completion. Instead, we just poll for the next ready future in our
FuturesUnordered, and if that turns out to be synchronous work then we'll just
re-enqueue more stuff (and sort of starve async work in this FuturesUnordered).
I tried updating bounded traversal to try a fairer implementation (which polls
everything), but that wasn't sufficient to make the problem go away, so I think
this is something we have to just accept for now (note that this actually has
some interesting perf impact in isolation: it's a free ~20% perf improvement on
p95+: https://pxl.cl/1c192
see 976b6b92293a0912147c09aa222b2957873ef0df if you're curious)
Reviewed By: farnz
Differential Revision: D22332478
fbshipit-source-id: 885b84cda1abc15c51fbc5dd34473e49338e13f4
Summary: Like it says in the title. Those are useful!
Reviewed By: farnz
Differential Revision: D22332479
fbshipit-source-id: f9bddad75fcbed2593c675f9ba45965bd87f1575
Summary:
The goal of this blobstore is to dedupe reads by waiting for them to finish and
hit cache instead (and also to dedupe writes, but that's not relevant here).
However, this is not a desirable feature if a blob cannot be stored in cache,
because then we're serializing accesses for no good reason. So, when that
happens, we store "this cannot be stored in cache", and we release reads
immediately.
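A minimal single-threaded sketch of the marker, with hypothetical names and an assumed size cutoff standing in for the real cacheability check:

```rust
use std::collections::HashMap;

// The cache can record "this blob cannot be cached", so later reads skip
// the read-deduplication wait entirely.
enum CacheEntry {
    Data(Vec<u8>),
    NotStorable, // too large (or otherwise unfit) for the cache
}

const MAX_CACHEABLE: usize = 4; // assumed cutoff, for illustration

struct Store {
    cache: HashMap<String, CacheEntry>,
    waits: usize, // reads that had to go through the dedup/wait path
}

impl Store {
    fn get(&mut self, key: &str, backing: &HashMap<String, Vec<u8>>) -> Option<Vec<u8>> {
        match self.cache.get(key) {
            Some(CacheEntry::Data(v)) => return Some(v.clone()),
            // Known-uncacheable: release immediately, no point serializing reads.
            Some(CacheEntry::NotStorable) => return backing.get(key).cloned(),
            None => {}
        }
        // First read: this is where concurrent reads would wait on each other.
        self.waits += 1;
        let v = backing.get(key).cloned()?;
        let entry = if v.len() <= MAX_CACHEABLE {
            CacheEntry::Data(v.clone())
        } else {
            CacheEntry::NotStorable
        };
        self.cache.insert(key.to_string(), entry);
        Some(v)
    }
}

fn main() {
    let mut backing = HashMap::new();
    backing.insert("big".to_string(), vec![0u8; 100]); // too big to cache
    let mut s = Store { cache: HashMap::new(), waits: 0 };
    s.get("big", &backing);
    s.get("big", &backing); // second read skips the wait path
    assert_eq!(s.waits, 1);
}
```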
Reviewed By: farnz
Differential Revision: D22285269
fbshipit-source-id: be7f1c73dc36b6d58c5075172e5e3c5764eed894
Summary:
I'm going to store things that aren't quite the exact blobs in here, so on the
off chance that we somehow have two caching blobstores (the old one and this
one) that use the same pools, we should avoid collisions by using a prefix.
And, since I'm going to use a prefix, I'm adding a newtype wrapper to not use
the prefixed key as the blobstore key by accident.
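The newtype idea could look like this (the `vsb.` prefix and names here are illustrative assumptions, not the actual values):

```rust
// A CachelibKey can only be built by prefixing a blobstore key, so the
// prefixed cache key and the raw blobstore key cannot be mixed up by accident.
#[derive(Debug, PartialEq)]
struct CachelibKey(String);

const PREFIX: &str = "vsb."; // assumed prefix, for illustration

impl CachelibKey {
    fn from_blobstore_key(key: &str) -> Self {
        CachelibKey(format!("{}{}", PREFIX, key))
    }
}

fn main() {
    let k = CachelibKey::from_blobstore_key("repo0001.content.blake2.abc");
    assert_eq!(k.0, "vsb.repo0001.content.blake2.abc");
    // A function taking CachelibKey cannot accidentally be handed a raw
    // blobstore key: the types don't match.
}
```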
Differential Revision: D22285271
fbshipit-source-id: e352ba107f205958fa33af829c8a46896c24027e
Summary:
This introduces a caching blobstore that deduplicates reads and writes. The
underlying motivation is to improve performance for processes that might find
themselves inadvertently reading the same data concurrently from a bunch of
independent callsites (most of Mononoke), or writing the same bit of data over
and over again.
The latter is particularly useful for things like commit cloud backfilling in
WWW, where some logger commits include the same blob being written hundreds or
thousands of times, and cause us to overload the underlying Zippy shard in
Manifold. This is however a problem we've also encountered in the past in e.g.
the deleted files manifest and had to solve there. This blobstore is a little
different in the sense that it solves that problem for all writers.
This comes at the cost of writes being dropped if they're known to be
redundant, which prevents updates through this blobstore. This is desirable for
most of Mononoke, but not all (notably, for skiplist updates it's not great).
For now, I'm going to add this behind an opt-in flag, and later on I'm planning
to make it opt-out and turn it off there (I'm thinking to use the CoreContext
for this).
Reviewed By: farnz
Differential Revision: D22285270
fbshipit-source-id: 4e3502ab2da52a3a0e0e471cd9bc4c10b84a3cc5