sapling

mirror of https://github.com/facebook/sapling.git synced 2024-10-11 17:27:53 +03:00

Author	SHA1	Message	Date
Thomas Orozco	07907b2b26	mononoke/virtually_sharded_blobstore: merge in the context_concurrency_blobstore Summary: There is inevitably interaction between caching, deduplication and rate limiting: - You don't want the rate limiting to be above caching (in the blobstore stack, that is), because you shouldn't rate limit cache hits (this is where we are today). - You don't want the rate limiting to be below deduplication, because then you get priority inversion where a low-priority rate-limited request might hold the semaphore while a higher-priority, non-rate-limited request wants to do the same fetch (we could have moved rate limiting here prior to introducing deduplication, but I didn't do it earlier because I wanted to eventually introduce deduplication). So, now that we have caching and deduplication in the same blobstore, let's also incorporate rate limiting there! Note that this also brings a potential motivation for moving Memcache into this blobstore, in case we don't want rate limiting to apply to requests before they go to the _actual_ blobstore (I did not do this in this diff). The design here when accessing the blobstore is as follows: - Get the semaphore - Check if the data is in cache, if so release the semaphore and return the data. - Otherwise, check if we are rate limited. Then, if we are rate limited: - Release the semaphore - Wait for our turn - Acquire the semaphore again - Check the cache again (someone might have put the data we want while we were waiting). - If the data is there, then return our rate limit token. - If the data isn't there, then proceed to query the blobstore. If we aren't rate limited, then we just proceed to query the blobstore. There are a couple subtle aspects of this: - If we have a "late" cache hit (i.e. after we waited for rate limiting), then we'll have waited but we won't need to query the blobstore. - This is important when a large number of requests for the same key arrive at the same time and get rate limited.
If we don't do this second cache check or if we don't return the token, then we'll consume a rate limiting token for each request (instead of 1 for the first request). - If a piece of data isn't cacheable, we should treat it like a cache hit with regard to semaphores (i.e. release early), but like a miss with regard to rate limits (i.e. wait). Both of those are captured in the code by returning the `Ticket` on a cache hit. We can then choose to either return the ticket on a cache hit, or wait for it on a cache miss. (all of this logic is captured in unit tests, we can remove any of the blocks there in `Shards::acquire` and a test will fail) Reviewed By: farnz Differential Revision: D22374606 fbshipit-source-id: c3a48805d3cdfed2a885bec8c47c173ee7ebfe2d	2020-07-06 04:38:31 -07:00
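The access sequence described in the commit above (semaphore, cache check, rate-limit wait, second cache check, token return) can be sketched roughly as follows. This is an illustrative asyncio model, not the Mononoke implementation (which is Rust): `RateLimiter`, `Shard`, `_fetch` and `demo` are hypothetical names, the limiter is a simple counter rather than a real token bucket, and the `Ticket` type and the uncacheable-data case are not modelled.

```python
import asyncio


class RateLimiter:
    """Hypothetical limiter: only tracks net tokens consumed (not a real token bucket)."""

    def __init__(self) -> None:
        self.consumed = 0

    async def wait_turn(self) -> None:
        await asyncio.sleep(0.01)  # stand-in for waiting on the rate limit
        self.consumed += 1

    def return_token(self) -> None:
        self.consumed -= 1


class Shard:
    """Hypothetical single shard: one semaphore, one cache, one backing store."""

    def __init__(self, limiter) -> None:
        self.semaphore = asyncio.Semaphore(1)
        self.cache = {}
        self.limiter = limiter
        self.backend_reads = 0

    async def _fetch(self, key: str) -> bytes:
        # Stand-in for querying the underlying blobstore.
        self.backend_reads += 1
        await asyncio.sleep(0.01)
        return ("blob:" + key).encode()

    async def get(self, key: str) -> bytes:
        await self.semaphore.acquire()
        if key in self.cache:
            # Early cache hit: release the semaphore, no rate limiting involved.
            self.semaphore.release()
            return self.cache[key]
        if self.limiter is not None:
            # Rate limited: do not hold the semaphore while waiting for our turn.
            self.semaphore.release()
            await self.limiter.wait_turn()
            await self.semaphore.acquire()
            if key in self.cache:
                # "Late" cache hit: give the token back, skip the blobstore.
                self.limiter.return_token()
                self.semaphore.release()
                return self.cache[key]
        try:
            value = await self._fetch(key)
            self.cache[key] = value
            return value
        finally:
            self.semaphore.release()


async def demo(requests: int):
    # Many concurrent requests for the same key, all rate limited.
    shard = Shard(RateLimiter())
    results = await asyncio.gather(*(shard.get("k") for _ in range(requests)))
    return results, shard.backend_reads, shard.limiter.consumed
```

In this model, running `asyncio.run(demo(10))` performs a single backend read and leaves one net token consumed, which is the behaviour the second cache check and the token return are meant to provide.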
Thomas Orozco	de731a89fc	mononoke/virtually_sharded_blobstore: log deduplicated puts Summary: If anything were to go wrong, we'd be happy to know which puts we ignored. So, let's log them. Reviewed By: farnz Differential Revision: D22356714 fbshipit-source-id: 5687bf0fc426421c5f28b99a9004d87c97106695	2020-07-03 05:53:11 -07:00
Thomas Orozco	c68100f46e	mononoke/virtually_sharded_blobstore: spawn before taking semaphores Summary: I canaried this on Fastreplay, but unfortunately that showed that sometimes we just deadlock, or get so slow we might as well be deadlocked (and it happens pretty quickly, after ~20 minutes). I tried spawning all the `get()` futures, and that fixes the problem (but it makes gettreepack noticeably slower), so that suggests something somewhere is creating futures, polling them a little bit, then never driving them to completion. For better or worse, I'd experienced the exact same problem with the ContextConcurrencyBlobstore (my initial attempt at QOS, which also used a semaphore), so I was kinda expecting this to happen. In a sense, this is nice because we've suspected there were things like that in the codebase for a while (e.g. with the occasional SQL timeout we see where it looks like MySQL responds fast but we don't actually poll it until past the timeout), and it gives us a somewhat convenient repro. In another sense, it's annoying because it blocks this work :) So, to work around the problem, for now, let's spawn futures to force the work to complete when a semaphore is held. I originally had an unconditional spawn here, but that is too expensive for the cache-hit code path and slows things down (by ~2x). However, having it only if we'll query the blobstore isn't as expensive, and that seems to be fine (in fact it is a ~20% p99 perf improvement, though the exact number depends on the number of shards we use for this, which I've had to tweak a bit). https://pxl.cl/1c18H I did find what I think is one potential instance of this problem in `bounded_traversal_stream`, which is that we never try to poll `scheduled` to completion. Instead, we just poll for the next ready future in our FuturesUnordered, and if that turns out to be synchronous work then we'll just re-enqueue more stuff (and sort of starve async work in this FuturesUnordered).
I tried updating bounded traversal to try a fairer implementation (which polls everything), but that wasn't sufficient to make the problem go away, so I think this is something we have to just accept for now (note that this actually has some interesting perf impact in isolation: it's a free ~20% perf improvement on p95+: https://pxl.cl/1c192, see 976b6b92293a0912147c09aa222b2957873ef0df if you're curious). Reviewed By: farnz Differential Revision: D22332478 fbshipit-source-id: 885b84cda1abc15c51fbc5dd34473e49338e13f4	2020-07-03 05:53:11 -07:00
Thomas Orozco	2082621d51	mononoke/virtually_sharded_blobstore: add ODS metrics Summary: Those are useful to track. Reviewed By: farnz Differential Revision: D22332480 fbshipit-source-id: 43f5cd7121c4aa497d961015e7c16973615798d1	2020-07-03 05:53:10 -07:00
Thomas Orozco	1db62473f2	mononoke/virtually_sharded_blobstore: track perf counters Summary: Like it says in the title. Those are useful! Reviewed By: farnz Differential Revision: D22332479 fbshipit-source-id: f9bddad75fcbed2593c675f9ba45965bd87f1575	2020-07-03 05:53:10 -07:00
Thomas Orozco	c297024a52	mononoke/virtually_sharded_blobstore: do not delay reads for uncacheable data Summary: The goal of this blobstore is to dedupe reads by waiting for them to finish and hit cache instead (and also to dedupe writes, but that's not relevant here). However, this is not a desirable feature if a blob cannot be stored in cache, because then we're serializing accesses for no good reason. So, when that happens, we store "this cannot be stored in cache", and we release reads immediately. Reviewed By: farnz Differential Revision: D22285269 fbshipit-source-id: be7f1c73dc36b6d58c5075172e5e3c5764eed894	2020-07-03 05:53:10 -07:00
Thomas Orozco	b9319a4d32	mononoke/virtually_sharded_blobstore: add a newtype for cache keys + a prefix Summary: I'm going to store things that aren't quite the exact blobs in here, so on the off chance that we somehow have two caching blobstores (the old one and this one) that use the same pools, we should avoid collisions by using a prefix. And, since I'm going to use a prefix, I'm adding a newtype wrapper to not use the prefixed key as the blobstore key by accident. Differential Revision: D22285271 fbshipit-source-id: e352ba107f205958fa33af829c8a46896c24027e	2020-07-03 05:53:10 -07:00
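The newtype-plus-prefix idea in the commit above could look something like the sketch below. It is an assumption-laden illustration: `CacheKey`, `CACHE_KEY_PREFIX`, `cache_put` and `cache_get` are hypothetical names, the prefix value is made up, and the runtime `isinstance` check stands in for the compile-time guarantee a Rust newtype would give.

```python
from dataclasses import dataclass

# Hypothetical prefix; the actual value used in Mononoke is not given in the log.
CACHE_KEY_PREFIX = "virtually_sharded."


@dataclass(frozen=True)
class CacheKey:
    """Wrapper type so a prefixed cache key is never confused with a blobstore key."""

    value: str

    @classmethod
    def from_blobstore_key(cls, key: str) -> "CacheKey":
        return cls(CACHE_KEY_PREFIX + key)


def cache_put(cache: dict, key: CacheKey, blob: bytes) -> None:
    # Only accept the wrapper type, never a raw blobstore key.
    if not isinstance(key, CacheKey):
        raise TypeError("cache access requires a CacheKey")
    cache[key.value] = blob


def cache_get(cache: dict, key: CacheKey):
    if not isinstance(key, CacheKey):
        raise TypeError("cache access requires a CacheKey")
    return cache.get(key.value)
```

With this shape, two caching layers sharing one pool would not collide as long as their prefixes differ, and passing a raw blobstore key to the cache helpers fails immediately.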
Thomas Orozco	bf3c2e19f0	mononoke/virtually_sharded_blobstore: a caching blobstore that deduplicates Summary: This introduces a caching blobstore that deduplicates reads and writes. The underlying motivation is to improve performance for processes that might find themselves inadvertently reading the same data concurrently from a bunch of independent callsites (most of Mononoke), or writing the same bit of data over and over again. The latter is particularly useful for things like commit cloud backfilling in WWW, where some logger commits include the same blob being written hundreds or thousands of times, and cause us to overload the underlying Zippy shard in Manifold. This is however a problem we've also encountered in the past in e.g. the deleted files manifest and had to solve there. This blobstore is a little different in the sense that it solves that problem for all writers. This comes at the cost of writes being dropped if they're known to be redundant, which prevents updates through this blobstore. This is desirable for most of Mononoke, but not all (notably, for skiplist updates it's not great). For now, I'm going to add this behind an opt-in flag, and later on I'm planning to make it opt-out and turn it off there (I'm thinking to use the CoreContext for this). Reviewed By: farnz Differential Revision: D22285270 fbshipit-source-id: 4e3502ab2da52a3a0e0e471cd9bc4c10b84a3cc5	2020-07-03 05:53:10 -07:00
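The write-deduplication trade-off described in the commit above (redundant puts are dropped, so updates to an existing key do not go through) can be illustrated with a small model. This is a simplified, synchronous sketch under stated assumptions: `DedupingStore` and its fields are hypothetical, a plain set stands in for the presence cache, and the `skipped` list stands in for the logging of deduplicated puts mentioned in a later commit.

```python
class DedupingStore:
    """Hypothetical model of a blobstore wrapper that drops puts for known keys."""

    def __init__(self) -> None:
        self.backend = {}        # stand-in for the underlying blobstore
        self.known = set()       # keys we believe are already written
        self.backend_puts = 0    # how many writes reached the backend
        self.skipped = []        # record of deduplicated puts

    def put(self, key: str, value: bytes) -> bool:
        """Return True if the write reached the backend, False if it was dropped."""
        if key in self.known:
            # Known key: treat the write as redundant, even if the value differs.
            self.skipped.append(key)
            return False
        self.backend[key] = value
        self.backend_puts += 1
        self.known.add(key)
        return True


store = DedupingStore()
store.put("blob1", b"v1")   # reaches the backend
store.put("blob1", b"v1")   # dropped as redundant
store.put("blob1", b"v2")   # also dropped: updates cannot go through
```

The last call shows why this is fine for immutable, content-addressed data but a poor fit for mutable keys such as skiplists.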

8 Commits