sapling

mirror of https://github.com/facebook/sapling.git synced 2024-12-26 22:47:26 +03:00

Author	SHA1	Message	Date
Lukas Piatkowski	12c684afcd	mononoke/hooks: make deny_files public Reviewed By: aslpavel Differential Revision: D23537799 fbshipit-source-id: 58c9568e30982f682b00faae42bc3a3f3595890f	2020-09-04 12:23:35 -07:00
Jun Wu	e74133f0fa	dag: limit max segment level to 4 Summary: This is based on fbsource data, building level 5 proves to be not useful. This would save 300ms in the write path. Reviewed By: sfilipco Differential Revision: D23494505 fbshipit-source-id: ca795b4900af40dbfdaa463d36f3169413bf6a62	2020-09-04 12:20:54 -07:00
Jun Wu	b4adf0602f	dag: remove non-master "Name -> Id" index on request Summary: Previously the IdMap's "Name -> Id" index simply ignores the "reassign non-master" request. It turns out stale entries in that index can cause issues as demonstrated by the previous diff. Update IdMap to actually remove both indexes of non-master group on remove_non_master so it cannot have stale entries. To optimize the index, the format of IdMap is changed from: [ 8 bytes Id (Big Endian) ] [ Name ] to: [ 8 bytes Id (Big Endian) ] [ 1 byte Group ] [ Name ] So the index can use reference to the slice, instead of embedding the bytes, to reduce index size. The filesystem directory name for IdMap used by NameDag is bumped to `idmap2` so it won't read the incompatible old `idmap` data. Reviewed By: sfilipco Differential Revision: D23494508 fbshipit-source-id: 3cb7782577750ba5bd13515b370f787519ed3894	2020-09-04 12:20:53 -07:00
Jun Wu	c5d6c9d0f2	dag: add a test showing non-master rebuild issues Summary: Some vertexes can disappear from the graph! Reviewed By: sfilipco Differential Revision: D23494506 fbshipit-source-id: ecbf2a4169e5fc82596e89a4bfe4c442a82e9cd2	2020-09-04 12:20:53 -07:00
Jun Wu	4aea3657e1	dag: move some test utilities to a TestDag struct Summary: The TestDag struct will be used to do some more complicated tests. Reviewed By: sfilipco Differential Revision: D23494507 fbshipit-source-id: 11350f9e448725ae49f50a7b6f19efc57ad84448	2020-09-04 12:20:53 -07:00
Thomas Orozco	3ba2c2b429	mononoke/hg_sync: make it work on Mercurial Python 3 Summary: A few things here: - The heads must be bytes. - The arguments to wireproto must be strings (we used to encode / decode them, but we shouldn't). - The bookmark must be a string (otherwise it gets serialized as `"b\"foo\""` and then it deserializes to that instead of `foo`). Reviewed By: StanislavGlebik Differential Revision: D23499846 fbshipit-source-id: c8a657f24c161080c2d829eb214d17bc1c3d13ef	2020-09-04 11:56:44 -07:00
Thomas Orozco	747b355236	mononoke: make mononoke_hg_sync_job sendunbundlereplaybatch more debuggable Summary: Right now we get very little logging out of errors in here, which is making it difficult to fix it on Py3 (where it currently is broken). This diff doesn't fix anything, but at the very least, let's make the errors better so we can make this easier to start debugging. Reviewed By: ahornby Differential Revision: D23499369 fbshipit-source-id: 7ee60b3f2a3be13f73b1f72dee062ca80cb8d8d9	2020-09-04 11:56:44 -07:00
Thomas Orozco	c8dd8ae4e3	mononoke: run tests using hg Python 3 as well Summary: The motivation for this is to surface potential regressions in hg Python 3 by testing code paths that are exercised in Mononoke. The primary driver for this were the regressions in the LFS extension that broke uploads, and for which we have test coverage here in Mononoke. To do this, I extracted the manifest generation (the manifest is the list of binaries that the tests know about, which is passed to the hg test runner), and moved it into its own function, then added a new target for the py3 tests. Unfortunately, a number of tests are broken in Python 3 currently. We should fix those. It looks like there are some errors in Mercurial when walking a manifest with non-UTF-8 files, and the other problem is that the hg sync job is in fact broken: https://fburl.com/testinfra/545af3p8. Reviewed By: ahornby Differential Revision: D23499370 fbshipit-source-id: 762764147f3b57b2493d017fb7e9d562a58d67ba	2020-09-04 11:56:44 -07:00
svcscm	704aaae026	Updating submodules Summary: GitHub commits: `44dacad838` `b2151a77ff` Reviewed By: 2d2d2d2d2d fbshipit-source-id: 0caf31f53977eae25eaa72894052d0c9726c9987	2020-09-04 09:20:23 -07:00
Stanislau Hlebik	7b323a4fd9	mononoke: add log-only mode in redaction Summary: Before redacting something it would be good to check that this file is not accessed by anything. Having log-only mode would help with that. Reviewed By: ikostia Differential Revision: D23503666 fbshipit-source-id: ae492d4e0e6f2da792d36ee42a73f591e632dfa4	2020-09-04 07:37:15 -07:00
Stanislau Hlebik	0740f99f13	mononoke: allow logging censored scuba accesses to file Summary: In the next diff I'm going to add log-only mode to redaction, and it would be good to have a way of testing it (i.e. testing that it actually logs accesses to bad keys). In this diff let's use a config option that allows logging censored scuba accesses to file, and let's update redaction integration test to use it Reviewed By: ikostia Differential Revision: D23537797 fbshipit-source-id: 69af2f05b86bdc0ff6145979f211ddd4f43142d2	2020-09-04 07:37:14 -07:00
Thomas Orozco	f1e4f62e2d	mononoke/fsnodes: expose FsnodeFile as the LeafId Summary: Fsnodes have a lot of data about files, but right now we can't access it through a Fsnode lookup or a manifest walk, because the LeafId for a Fsnode is just the content id and the file type. This is a bit sad, because it means we e.g. cannot dump a manifest with file sizes (D23471561 (`179e4eb80e`)). Just changing the LeafId is easy, but that brings a new problem with Fsnode derivation. Indeed, deriving manifests normally expects us to have the "derive leaf" function produce a LeafId (so we'd want to produce a `FsnodeFile`), but in Fsnodes, this currently happens in deriving trees instead. Unfortunately, we cannot easily just move the code that produces `FsnodeFile` from the tree derivation to the leaf derivation, that is, do: ``` fn check_fsnode_leaf( leaf_info: LeafInfo<FsnodeFile, (ContentId, FileType)>, ) -> impl Future<Item = (Option<FsnodeSummary>, FsnodeFile), Error = Error> ``` Indeed, the performance of Fsnode derivation relies on all the leaves for a given tree being derived together with the tree and its parents in context. So, we'd need the ability for deriving a new leaf to return something different from the actual leaf id. This means we want to return a `(ContentId, FileType)`, even though our `LeafId` is a `FsnodeFile`. To do this, this diff introduces a new `IntermediateLeafId` type in the derivation. This represents the type of the leaf that is passed from deriving a leaf to deriving a tree. We need to be able to turn a real `LeafId` into it, because sometimes we don't re-derive leaves. I think we could also refactor some of the code that passes a context here to just do this through the `IntermediateLeafId`, but I didn't look into this too much. So, this diff does that, and uses it in Mononoke Admin so we can print file sizes. Reviewed By: StanislavGlebik Differential Revision: D23497754 fbshipit-source-id: 2fc480be0b1e4d3d261da1d4d3dcd9c7b8501b9b	2020-09-04 06:30:18 -07:00
Mateusz Kwapich	f7be2eef14	tunable scuba sampling Summary: This allows us to sample the most popular method logs (`repo_list_hg_manifest` calls make up for 90% samples in our scuba table) while still have full logging for other queries end errors. The sampling can be eaily disabled via tunable. In case we get a lot of errors we can also start sampling the error request with a simple configerator change. Reviewed By: krallin Differential Revision: D23507333 fbshipit-source-id: c7e34467d99410ec3de08cce2db275a55394effd	2020-09-04 06:26:35 -07:00
Viet Hung Nguyen	437a0e905b	mononoke/repo_import: add deriving data types for multiple repos Summary: Previously, we only supported deriving data types for the repo we import into. This diff expands on this and now we can do that for multiple repos (e.g. small repos we backsync commits to from large repo we import to). Reviewed By: StanislavGlebik Differential Revision: D23499953 fbshipit-source-id: 223209a6a2739eae93082cae4f04e53e0cba0c58	2020-09-04 05:39:21 -07:00
Stanislau Hlebik	11a45b6b60	mononoke: do not pass tasks to find_files_with_given_content_id_blobstore_keys Summary: In the next diff I'm going to add log_only mode for redaction. And in this diff I make a small refactoring that makes next diff simpler. find_files_with_given_content_id_blobstore_keys don't accept tasks anymore, just content keys. Reviewed By: aslpavel Differential Revision: D23535829 fbshipit-source-id: 1dac37f5ea7038fc779ad51192a290fcc23e6556	2020-09-04 05:22:03 -07:00
Lukas Piatkowski	67a71d1f98	mononoke/hooks: make limit_commitsize and limit_filesize public Reviewed By: aslpavel Differential Revision: D23502908 fbshipit-source-id: 8b9070cfaa28af7b808d02548c0fb7c5d344550d	2020-09-04 04:23:05 -07:00
Lukas Piatkowski	462cb96cc2	mononoke/hooks: make no_questionable_filenames public Reviewed By: aslpavel Differential Revision: D23478259 fbshipit-source-id: 642948c2685690298a71fbe7177c4bd6a6e43f85	2020-09-04 04:23:05 -07:00
svcscm	9294504e2f	Updating submodules Summary: GitHub commits: `230f3152a3` `21d29c5416` `3965ff470f` Reviewed By: 2d2d2d2d2d fbshipit-source-id: ed4bbfd2a1f8def5532c52b5f80d45b902520e19	2020-09-04 02:40:37 -07:00
Lukas Piatkowski	eebdc0b896	mononoke/metaconfig: sync thrift changes from configerator for HookConfig Summary: Use the new fields from RawHookConfig in HookConfig Reviewed By: StanislavGlebik Differential Revision: D23499766 fbshipit-source-id: 43e9d2dfdcfb0fa0dd4de6310ea0013db1b69474	2020-09-04 02:02:06 -07:00
svcscm	66a67eeefb	Updating submodules Summary: GitHub commits: `f822c28c80` `b3b52cfd4c` `0887fa5eee` `06ad5dd293` `d315dce3a3` `18d14b742e` `a63372e639` `8c3871b03d` `1289a1fb2b` `a578bb71eb` Reviewed By: 2d2d2d2d2d fbshipit-source-id: ee0aebf40aec72592a0383d5aa0e969c5a9f3278	2020-09-04 02:02:06 -07:00
svcscm	4022d48598	Updating submodules Summary: GitHub commits: `11110b676d` Reviewed By: 2d2d2d2d2d fbshipit-source-id: 4bca8a2535ba02e18118475f295858be27f865cb	2020-09-03 20:21:07 -07:00
Jun Wu	c9e6995675	py2: fix crecord compatibility Summary: D23460476 (`c84653c7a9`) breaks Python 2: Python 2: bytes + bytearray -> bytearray Python 3: bytes + bytearray -> bytes Fix it. Python 2: b"%s" % bytearray -> bytes Python 2: b"%s" % bytearray -> bytes Reviewed By: singhsrb Differential Revision: D23514590 fbshipit-source-id: 7fd5f2372444732f13909c42251f000f05955228	2020-09-03 18:51:10 -07:00
svcscm	2293422228	Updating submodules Summary: GitHub commits: `dc8f483a80` `322fdec245` `64bc64b1ff` `b312125a22` Reviewed By: 2d2d2d2d2d fbshipit-source-id: ff3563a2dad35f7a7922e291bd1744ef19f8f83f	2020-09-03 18:51:10 -07:00
svcscm	60027bab68	Updating submodules Summary: GitHub commits: `82dfc151be` `7aae308fab` `f12d8e5401` `5746767387` `127834d403` Reviewed By: 2d2d2d2d2d fbshipit-source-id: 5cb1d4e688097ae6e27c62512534800903412a8d	2020-09-03 17:46:42 -07:00
Zeyi (Rice) Fan	8be3e1940a	change default configuration path Summary: `C:/tools/eden` will be overriden whenever a new EdenFS package is installed, therefore making it unsuitable to be managed by Chef. Changing the default configuration directory to `C:\ProgramData\eden` that aligns with other programs. Reviewed By: xavierd Differential Revision: D23484626 fbshipit-source-id: 763518c608b24caa08e089a738f5c3577a0d6483	2020-09-03 17:18:51 -07:00
Xavier Deguillard	5c6ab8afac	utils: rename ProcessAccessLog::AccessType enum Summary: Removing Fuse from the enum name makes it non tied to Fuse and thus makes it more portable. This also eliminates the last platform specific bit from RequestData. Reviewed By: chadaustin Differential Revision: D23467773 fbshipit-source-id: 52515522c8ac51d0c4b56dc5e42d4b6593df6623	2020-09-03 17:00:07 -07:00
Stefan Filip	3f0b08e46f	segmented_changelog: add version field to IdMap Summary: The version is going to be used to seamlessly upgrade the IdMap. We can generate the IdMap in a variety of ways. Naturally, algorithms for generating the IdMap may change, so we want a mechanism for updating the shared IdMap. A generated IdDag is going to require a specific IdMap version. To be more precise, the IdDag is going to specify which version of IdMap it has to be interpreted with. Reviewed By: quark-zju Differential Revision: D23501158 fbshipit-source-id: 370e6d9f87c433645d2a6b3336b139bea456c1a0	2020-09-03 16:33:20 -07:00
Stefan Filip	58a4821fe3	segmented_changelog: add IdMap trait with SqlIdMap implementation Summary: Separate the operational bits of the IdMap from the core SegmentedChangelog requirements. I debaded whether it make sense to add repo_id to SqlIdMap. Given the current architecture I don't see a reason not to do it. On the contrary separating two objects felt convoluted. Reviewed By: quark-zju Differential Revision: D23501160 fbshipit-source-id: dab076ab65286d625d2b33476569da99c7b733d9	2020-09-03 16:33:20 -07:00
Stefan Filip	f3c353edbc	segmented_changelog: change idmap module from file to directory Summary: Planning to add a trait for core idmap functionality (that's just translating cs_id to vertex and back). The current IdMap will then be an implementation of that trait. Reviewed By: quark-zju Differential Revision: D23501159 fbshipit-source-id: 34e3b26744e4b5465cd108cca362c38070317920	2020-09-03 16:33:20 -07:00
Stefan Filip	c09f80882c	edenapi: use async-runtime to schedule futures Summary: Replacing places where the tokio runtime is instantiated inside the edenapi client crate. Reviewed By: quark-zju Differential Revision: D23468596 fbshipit-source-id: ef68718c7d5b89b6477a2946daaa51618b53d06a	2020-09-03 15:45:34 -07:00
svcscm	4200a8c6b5	Updating submodules Summary: GitHub commits: `1a82d06a39` Reviewed By: 2d2d2d2d2d fbshipit-source-id: 079dddb8d07e271ec4edf166507159797b8d76dd	2020-09-03 15:45:34 -07:00
svcscm	b966d21042	Updating submodules Summary: GitHub commits: `716b4bb058` `c99481f541` `69c4e10693` `580c7e5035` `6f962ea365` Reviewed By: 2d2d2d2d2d fbshipit-source-id: 804b50d3de1691e06319c0deccd0fc94abab4e62	2020-09-03 14:03:46 -07:00
Jun Wu	cea2bf8728	dag: limit segment level at open time Summary: At open time, it's pointless to attempt to create new levels. So let's just read the existing max_level and do not try to build max_level + 1. This turns out to save 300ms in profiling result. Reviewed By: sfilipco Differential Revision: D23494509 fbshipit-source-id: 4ea326a3cc21792790ea0b87e5bf608a94ae382b	2020-09-03 13:48:43 -07:00
Jun Wu	f238529a97	multilog: use per-log meta to pick up updated indexes Summary: With MultiLog, per-log meta was previously entirely ignored. However, they can be useful for updated indexes. For example, application defines a new index, and opens a Log via MultiLog. The application would expect the new index is built only once. Without MultiLog, per-log meta is updated at open time in place. With MultiLog, the updated index meta is not written back to the multimeta so the new index would be rebuilt multiple times undesirably. Update MultiLog to reuse the per-log meta if it's compatible so it can pick up new indexes. Reviewed By: sfilipco Differential Revision: D23488212 fbshipit-source-id: c8b3e6b5589dbda2e76a143d15085862a93dae22	2020-09-03 13:48:43 -07:00
Jun Wu	f79e7657af	multilog: stop writing poisoned per-log meta Summary: The poisoned meta makes investigation harder. ex. `debugdumpindexlog` won't work on those logs. Reviewed By: sfilipco Differential Revision: D23488213 fbshipit-source-id: b33894d8c605694b6adf5afdaed45707fbd7357e	2020-09-03 13:48:43 -07:00
Stanislau Hlebik	4947e07cb7	mononoke: asyncify one function in redaction admin subcommand Summary: I'm going to change this function soon, so it's nice to asyncify it to make next diffs simpler and also remove duplicated logic. Also remove unnecessary `logger` parameter - we can always get logger from CoreContext Reviewed By: krallin Differential Revision: D23501634 fbshipit-source-id: 7ad2fc17167e4107481ceb230e0b7cb3e7f2549a	2020-09-03 12:22:24 -07:00
Mateusz Kwapich	20d096f5d5	add thrift metadata support Summary: This closely replicates EscapeZero work in D23328638 and will allow us to issue requests to SCS using Thrift Fiddle (https://www.internalfb.com/thrift_fiddle). Reviewed By: EscapeZero Differential Revision: D23475864 fbshipit-source-id: fb286e3fcd6ea79704fa2e7e1ed9ab5595ff7b81	2020-09-03 12:18:18 -07:00
Arun Kulshreshtha	858a080502	gotham_ext: make StreamBody automatically delay post-request callbacks Summary: Now that post-request callbacks are available in `gotham_ext`, we can make `StreamBody` use them directly instead of using an LFS-specific wrapper (previously required to access the LFS server's `RequestContext`). This also means that the EdenAPI server will get this behavior for free. Reviewed By: krallin Differential Revision: D23402969 fbshipit-source-id: 56ab710473f13e8983b136664af364af6884bd3f	2020-09-03 11:59:32 -07:00
Arun Kulshreshtha	5556a447d1	edenapi_server: use LogMiddleware Summary: Add `LogMiddleware` to the EdenAPI server, which will print a log message whenever a request is received or has completed. Reviewed By: DurhamG Differential Revision: D23299902 fbshipit-source-id: f44ef1b01692f0e4f9b109917fcee89a84ca4208	2020-09-03 11:59:32 -07:00
Arun Kulshreshtha	96a6a3fcfb	edenapi_server: use LoadMiddleware Summary: Use `LoadMiddleware` to track the number of outstanding requests in the server. Reviewed By: DurhamG Differential Revision: D23298415 fbshipit-source-id: bdcdb0f657d8deac593d356c87ac0d8d3f39e322	2020-09-03 11:59:32 -07:00
Arun Kulshreshtha	7144363d2c	gotham_ext: move LogMiddleware to gotham_ext Summary: Now that `LogMiddleware` no longer depends on `RequestContext`, it can be moved into `gotham_ext`. Reviewed By: DurhamG Differential Revision: D23298412 fbshipit-source-id: d5288decba98c3dd4605b9a44e41eba0f47fee37	2020-09-03 11:59:31 -07:00
Arun Kulshreshtha	35d292e513	gotham_ext: move LoadMiddleware to gotham_ext Summary: Now that `LoadMiddleware` no longer depends on `RequestContext`, it can be moved into `gotham_ext`. Reviewed By: DurhamG Differential Revision: D23298416 fbshipit-source-id: 5d29da492e39beb5621daf0570d9b3e657cbfc04	2020-09-03 11:59:31 -07:00
Arun Kulshreshtha	82c451fb9f	lfs_server: use PostRequestMiddleware Summary: This diff removes the post-request callback functionality from the LFS server's `RequestContext` and replaces it with the new `PostRequestMiddleware`. The middleware is directly based on `RequestContext`, so the underlying behavior is essentially the same as before. Reviewed By: krallin Differential Revision: D23298413 fbshipit-source-id: 1e58a40f6ce6d526456dbd9ae3a8efc85768bf04	2020-09-03 11:59:31 -07:00
Arun Kulshreshtha	3ad7fa8b6f	gotham_ext: allow applications to dynamically configure PostRequestMiddleware Summary: Make `PostRequestMiddleware` generic over a user-provided config struct which can be used to dynamically configure the behavior of post-request callback dispatching. Right now this is only used to support disabling hostname logging, but could be easily extended to cover more uses in the future. Reviewed By: krallin Differential Revision: D23495005 fbshipit-source-id: 3d59a8346f449775ec76d03c260d973d04fb90a9	2020-09-03 11:59:31 -07:00
Arun Kulshreshtha	cc0f2e4c40	gotham_ext: add PostRequestMiddleware Summary: Add new middleware that allows HTTP handlers and other middleware to register callbacks that will be run once the current request completes. This is heavily based on the post-request callback functionality from the LFS server's `RequestContext`. The intention here is to expose this functionality in a manner that's independent of other, application-specific logic. Reviewed By: krallin Differential Revision: D23298419 fbshipit-source-id: e4b1534b02c35f685ce544de13e331947e187818	2020-09-03 11:59:31 -07:00
svcscm	099cc184a0	Updating submodules Summary: GitHub commits: `a599e69180` `eedb340bd5` `9d1c6c33bb` `304c6af6c9` `c4d8838a2b` `afc553b07a` Reviewed By: 2d2d2d2d2d fbshipit-source-id: 849bc54f24474d4c71cc903f66dd41cf7aa10e76	2020-09-03 10:50:27 -07:00
Thomas Orozco	d77cf89ead	mononoke/admin: clean up unodes subcommand a bit Summary: I pattern matched off of this for the previous diff in this stack, and spotted a bit of clean up that might make sense here: - Using `.help()` for a subcommand overrides the whole help text. We meant to use `.about()` here. I fixed this in some copy-pasted code as well. - Printing debug output alongside real output makes it harder to select the real output. I fixed this by logging debug output to stderr instead. Reviewed By: StanislavGlebik Differential Revision: D23471560 fbshipit-source-id: 7900cfe65613c48abd77faad6d6a45a7aa523b36	2020-09-03 09:32:06 -07:00
Thomas Orozco	179e4eb80e	mononoke/admin: add a subcommand for dumping paths Summary: This adds a subcommand for dumping all the paths in a repository. This is helpful when you have a Content ID, limited imagination and time on your hands, and you'd like to turn those into a file path where that Content ID lives. This uses fsnodes for the traversal because that's O(# directories) as opposed top O(# files). I had an earlier implementation that used unodes, but that was really slow. Reviewed By: StanislavGlebik Differential Revision: D23471561 fbshipit-source-id: 948bfd20939adf4de0fb1e4b2852ad4d12182f16	2020-09-03 09:32:06 -07:00
Viet Hung Nguyen	7c34b39ec8	mononoke/repo_import: add backsyncing to rewrite file paths, remove backup file Summary: add backsyncing to rewrite file paths: After setting the variables for large repo (D23294833 (`d6895d837d`)), we try to import the git commits into large repo and rewrite the file paths. Following this, repo import tool should back-sync the commits into small_repo. next step: derive all the data types for both small and large repos. Currently, we only derive it for the large repo. ============== remove backup file: The backup file was a last-minute addition when trying to import a repo for the first time. Removed it, because we shouldn't write to external files. Future plan is to include better process recoverability across the whole tool and not just rewrite file paths functionality. Reviewed By: StanislavGlebik Differential Revision: D23452571 fbshipit-source-id: bda39694fa34788218be795319dbbfd014ba85ff	2020-09-03 06:43:08 -07:00
Stanislau Hlebik	a77d9f243a	mononoke: parallelize operations in create_commit scs method Reviewed By: krallin Differential Revision: D23496535 fbshipit-source-id: 18f88abb9b85d38a93d2aa99c38edcf8190343c3	2020-09-03 04:12:35 -07:00

... 3 4 5 6 7 ...

60450 Commits