sapling

mirror of https://github.com/facebook/sapling.git synced 2024-10-11 01:07:15 +03:00

Author	SHA1	Message	Date
Lukas Piatkowski	bccad1bf42	new_blobimport: import bookmarks from revlog repo Summary: This change isn't doing much on it's own since rocksdb's BlobRepo is using in memory Bookmarks ATM and they dissapear when import is finished. Later bookmarks will be used for Head discovery, then it will be properly tested Reviewed By: farnz Differential Revision: D7728716 fbshipit-source-id: ad50f35b18d93aa1e38951408092e46e67fde0c7	2018-04-24 11:17:06 -07:00
Lukas Piatkowski	a692a2a9ea	new_blobimport: split main.rs to make this crate more maintable Summary: I am planning to add importing bookmarks, doing it on current main.rs would make it unreadable, so I am splitting this file now Reviewed By: farnz Differential Revision: D7728185 fbshipit-source-id: fdfb4f60eecd9c8af7626bd0e892bb1bfbf7f081	2018-04-24 11:17:06 -07:00
Lukas Piatkowski	f28ce26504	blobrepo: handle changesets with null root manifests Summary: The eden integration test contains a commit with no content which new_blobimport couldn't import. With this changes the commit API is capable of handling such commits. Reviewed By: jsgf Differential Revision: D7709243 fbshipit-source-id: 7d55eb2ec421820d189ab05b0f8cb4411f850a7b	2018-04-24 11:17:06 -07:00
Stanislau Hlebik	4a731b2118	mononoke: do not fail if duplicate changeset was inserted Summary: Let's fail only if inconsistent data was inserted - for example, same commit hash but different parents. This matches core hg behavior, and also it's completely normal for commit cloud to send more parent commits than necessary. Reviewed By: lukaspiatkowski Differential Revision: D7722649 fbshipit-source-id: 172a0985fb3fda27d55e9dce8916ec3793de5db9	2018-04-24 08:49:48 -07:00
Stanislau Hlebik	b4c67170fd	mononoke: use exactly the same changesets during blobimport Summary: There are a few separate steps during blobimport. One of them is inserting the blobs and another is inserting the changesets. That worked fine if RevlogRepo was static. However if RevlogRepo has changed between first and second step we could've inserted blobs for N commits, but insert (N + X) changesets into changesets table. That means that a few commits would have no blobs at all. This diff fixes it by fixing the changesets that we want to blobimport beforehand. Reviewed By: farnz Differential Revision: D7615133 fbshipit-source-id: 1a66907e34a65588b101199c8f59abda53f7bc20	2018-04-24 02:18:38 -07:00
Stanislau Hlebik	392b020b8f	mononoke: add an option to create mysql changesets in blobimport Summary: As with filenodes, we also want to write changesets in a real store. Reviewed By: lukaspiatkowski Differential Revision: D7615101 fbshipit-source-id: 269deb8fc3c1f58afb82f453a68ea4d8a3f1f63d	2018-04-24 02:18:38 -07:00
Stanislau Hlebik	3f600af142	mononoke: add Changesets cache Summary: Use asyncmemo to cache Changesets. Unfortunately currently we are using separate asyncmemo cache, so we have to specify the size for the caches separately. Later we'll have a single cache for everything, and the number of config knobs will go down. Reviewed By: lukaspiatkowski Differential Revision: D7685376 fbshipit-source-id: efe8a3a95fcc72fab4f4af93564e706cd1540c2f	2018-04-22 04:59:27 -07:00
Stanislau Hlebik	a48e52fb7e	mononoke: use caching blobstore Summary: Let's use it! Pass config option that set's the cache max memory usage (don't put a limit on the number of entries, it's useless in that case). Currently we'll set a separate size for each of the caches that we use (blobstore, changesets, filenodes, etc). Later we'll have just one single option that sets the cache size for all of them. Reviewed By: lukaspiatkowski Differential Revision: D7671814 fbshipit-source-id: f9571078e6faaa80ea4c31c76a9eebcc24d8a68a	2018-04-22 04:59:27 -07:00
Siddharth Agarwal	c652345586	blobrepo: optionally verify that provided and computed entry hashes are consistent Summary: We know that the hashes for non-root-tree-manifests and filenodes should always be consistent. Verify that. Reviewed By: farnz Differential Revision: D7704087 fbshipit-source-id: 7f6207878c5cd372b272aa6970506dd63b5a3c7c	2018-04-20 08:40:54 -07:00
Siddharth Agarwal	9b5ee510cb	blobrepo: use the incoming provided hash as the blobstore key instead of computing it Summary: As the comment explains, sometimes the hashes don't match the contents. Accept such pushes. Reviewed By: farnz Differential Revision: D7699930 fbshipit-source-id: 376f01b6cf03f6cad84c2c878d192d55f8d81812	2018-04-20 08:40:54 -07:00
Siddharth Agarwal	9e49e6da1a	blobrepo: make upload_entry arguments named, move to Hg domain Summary: We're going to keep this around for now as part of double-writing. All the hashes here are definitely Mercurial hashes, so use them that way. Reviewed By: lukaspiatkowski Differential Revision: D7683890 fbshipit-source-id: 270091cd11f3cec7ef4cf565de5ef913fcf7adea	2018-04-20 08:40:54 -07:00
Siddharth Agarwal	3591b86b9f	mercurial: switch file::File to HgNodeHash Summary: file::File works entirely in the Mercurial domain, so these conversions are good. Reviewed By: StanislavGlebik Differential Revision: D7665973 fbshipit-source-id: 8a192c5d1886492ad21593693b080c8e5ddf8f7e	2018-04-20 08:40:54 -07:00
Siddharth Agarwal	ed1c842dcd	blobrepo: rename BlobEntry to HgBlobEntry Summary: This is because these Mercurial entries are (at least currently) going to be stored as they come in, and this data structure is entirely in the Mercurial domain. Reviewed By: lukaspiatkowski Differential Revision: D7664972 fbshipit-source-id: 9de5475eed0d7ab7085c29fd0282f205043cfe5a	2018-04-20 08:40:54 -07:00
Siddharth Agarwal	7dc019e422	blobrepo: add some context + clean up error messages Summary: I was trying to debug something with the new blobimport, and this was getting in the way. Reviewed By: StanislavGlebik Differential Revision: D7664660 fbshipit-source-id: 2ec4ee79fbe13584f35e7dcd9e8df2b8bdf181c0	2018-04-18 19:53:34 -07:00
Siddharth Agarwal	72be4dc3fe	newblobimport: remove some HgBlob code that isn't necessary Summary: The comment doesn't quite look right, and `HgBlob` will always have content available now. Reviewed By: StanislavGlebik Differential Revision: D7663200 fbshipit-source-id: 614b8bc97ece99aaefdc8fa6eaf36fe66779be13	2018-04-18 19:53:34 -07:00
Siddharth Agarwal	1cd28c2587	blobrepo: use named args for creating a changeset Summary: The list of arguments is becoming too long, and I need to add even more here. Reviewed By: StanislavGlebik, farnz Differential Revision: D7652096 fbshipit-source-id: 62a4631e163e95cf5c950a949e72facab629ea54	2018-04-18 19:53:34 -07:00
Siddharth Agarwal	4e9f826728	blobstore: restrict the types that can be stored Summary: Currently, any sort of `Bytes` can be stored in the blobstore. That caused me to make several mistakes while writing the code to store bonsai changesets, because I'd just end up storing the wrong set of `Bytes`. Introduce stronger typing so that only types that explicitly implement `BlobStorable` can be stored in the blobstore. Currently, these sorts of blobs can be stored in the blob store: * `ChangesetBlob` and `ContentBlob` in `mononoke-types` (these are Thrift-serialized structures) * The envelope `RawNodeBlob` and `RawCSBlob` types in `blobrepo`, once converted to `EnvelopeBlob` instances * `HgBlob`, which contains revlog data (manifests or files) exactly as serialized by Mercurial Reviewed By: StanislavGlebik Differential Revision: D7627290 fbshipit-source-id: d1bcbde8881e365dec99618556e7b054985bccf7	2018-04-18 19:53:32 -07:00
Stanislau Hlebik	23ce735b81	mononoke: add xdb tier to the config and use it for filenodes Summary: Pass mysql tier name to the BlobRepo, so that we can use it to connect to mysql based storages like mysql changeset storage, filenodes storage etc. Note that currently Filenodes storage only connects to master region. This will be fixed in the later diffs Reviewed By: lukaspiatkowski Differential Revision: D7585191 fbshipit-source-id: 168082abfeb7ccba549c7a49e6269cc01c490c14	2018-04-18 04:09:09 -07:00
Siddharth Agarwal	4b607998bf	mercurial: some improvements to File API Summary: Now that `BlobNode` no longer returns `None`: * don't expose the `BlobNode` API outside the crate because it turns out to not be very useful (it should probably go away eventually?) * make the `File` API not return `Option` types * Add a new `file_contents` that returns a brand-new `FileContents` (this is the first time we're tying together Mercurial and Mononoke data structures!) Also remove a `Symlink` API that isn't really correct honestly. Reviewed By: StanislavGlebik Differential Revision: D7624729 fbshipit-source-id: 38443093b8bfea91384c959f3425cf355fac9f65	2018-04-17 11:37:29 -07:00
Lukas Piatkowski	7114d4cb42	blobrepo: make RevlogChangest parsing into ChangesetContent explicit Summary: Having an implicit `From` parsing makes it hard to track the exceptional places where RevlogChangest can be directly translated to BlobChangeset. Make it explicit for better tracking of this behavior Reviewed By: StanislavGlebik Differential Revision: D7637247 fbshipit-source-id: 781341315102ea6b2265c33bb09a89aae3d0c329	2018-04-17 05:26:27 -07:00
Siddharth Agarwal	cf81c04664	move raw blob serialize/deserialize to a central location Summary: I'm going to put in some stronger typing around what can be stored in the blob store. Centralizing the management here makes that much easier. Reviewed By: StanislavGlebik Differential Revision: D7619519 fbshipit-source-id: a428679018f0a1571e54bc01bb5483ba9fdf1cb5	2018-04-16 17:16:28 -07:00
Lukas Piatkowski	af71f3005c	CODEMOD: rename mercurial::Parents to HgParents Reviewed By: sid0 Differential Revision: D7620436 fbshipit-source-id: 811c0694c3ee943e50d41c6d341e72f0ee9372ba	2018-04-16 03:40:25 -07:00
Lukas Piatkowski	2e0522f884	CODEMOD: rename mercurial::EntryId to HgEntryId Reviewed By: sid0 Differential Revision: D7620299 fbshipit-source-id: 9bc1505473567528d1f174ae0f0f1312af187814	2018-04-16 03:40:25 -07:00
Lukas Piatkowski	4c4a06c4fe	CODEMOD: replace mercurial::{HgChangesetId, HgManifestId, HgNodeHash} with {HgChangesetId, HgManifestId, HgNodeHash} Summary: The Hg prefix is unique now so let's not use verbose mercurial:: Reviewed By: sid0 Differential Revision: D7620112 fbshipit-source-id: 0aece310ed817445fef4c94b32f78fda3a3b1c49	2018-04-16 03:40:25 -07:00
Lukas Piatkowski	03255529fa	CODEMOD: rename mercurial::NodeHash to HgNodeHash Reviewed By: sid0 Differential Revision: D7619973 fbshipit-source-id: 229fea891788c33eb1f45446ba2333e945ca5553	2018-04-16 03:40:25 -07:00
Lukas Piatkowski	086640fdd0	CODEMOD: rename mercurial_types::BlobNode to DBlobNode Summary: mercurial_types::DBlobNode should be replaced by types from mononoke_types or mercurial in most cases. This rename should help with tracking this Reviewed By: sid0 Differential Revision: D7619793 fbshipit-source-id: 261fd92acae825dc4bc8011c3716c5585eb0413c	2018-04-16 03:40:25 -07:00
Lukas Piatkowski	9436963e20	CODEMOD: rename mercurial_types::Parent to DParent Summary: mercurial_types::DParent should be replaced by types from mononoke_types or mercurial in most cases. This rename should help with tracking this Reviewed By: sid0 Differential Revision: D7619686 fbshipit-source-id: 5ad105113779387f1408c806860483e06ed5fb3d	2018-04-16 03:40:25 -07:00
Lukas Piatkowski	73034a99b2	CODEMOD: rename mercurial_types::HgFileNodeId to DFileNodeId Summary: mercurial_types::DFileNodeId should be replaced by types from mononoke_types in most cases. This rename should help with tracking this Reviewed By: sid0 Differential Revision: D7619290 fbshipit-source-id: aa6a8e55ae3810c4531028c3b3db2e5730fe7846	2018-04-16 03:40:24 -07:00
Lukas Piatkowski	f6c3f72745	CODEMOD: rename mercurial_types::HgChangesetId to DChangesetId Summary: mercurial_types::DChangesetId should be replaced by types from mononoke_types in most cases and by mercurial::HgChangesetId in others. This rename should help with tracking this Reviewed By: sid0 Differential Revision: D7618897 fbshipit-source-id: 78904f57376606be99b56662164e0c110e632c64	2018-04-16 03:40:24 -07:00
Corbin Hall	ba4c029396	Update Mononoke to use with_context on futures/streams Summary: Migrates some uses of `.map_err()` and `.context(format!())` usage in Mononoke to `.with_context()` Reviewed By: lukaspiatkowski Differential Revision: D7607935 fbshipit-source-id: 551538c78a1755f7aa0716532ab437a1baf6dd89	2018-04-13 11:00:42 -07:00
Lukas Piatkowski	dd3c638c1d	api cleanup: clean api of mercurial and mercurial_types NodeHash Summary: This is a cleanup of NodeHash API. There were few unused methods and few ways to convert between mercurial and mercurial_types hashes. With this diff it is very easy to identify the places where this converstion happens. A followup of this diff will be to use this new API to easily replace the NodeHash convertions in places where it requires remapping. Reviewed By: sid0 Differential Revision: D7592876 fbshipit-source-id: 6875aa6df1a3708ce54ca5724f6eb960d179192b	2018-04-13 10:00:03 -07:00
Stanislau Hlebik	9d6f639389	mononoke: always fill changesets table Summary: Let's fill cs table even if we import only part of the repo. This let's us import new changesets incrementally. That can be dangerous since we don't check if parent commits are present. However this blobimport is a temporary measure until we get a full-fidelity blobimport that uses a commit API. Reviewed By: jsgf Differential Revision: D7485495 fbshipit-source-id: 63ba91bad4eb1c1662db73293c76a506f48a4753	2018-04-13 03:58:22 -07:00
Stanislau Hlebik	89befdb26d	mononoke: avoid using linknodes Summary: They are replaced by filenodes Reviewed By: farnz Differential Revision: D7443320 fbshipit-source-id: 13c7d07bc00dcbaa991663c8da8a07fcb0de1332	2018-04-13 02:47:24 -07:00
Siddharth Agarwal	8e7a1acc0f	mercurial-types: rename Blob to HgBlob Summary: This will probably go away soon, but for now I want to be able to disambiguate the new Thrift-encoded blobs in Mononoke from these. Reviewed By: StanislavGlebik Differential Revision: D7565808 fbshipit-source-id: d61f3096fa368b934a923dee54a0ea1e3469ae0d	2018-04-11 13:42:31 -07:00
Siddharth Agarwal	7cb68c0c60	mercurial-types: make the Type enum use FileType Summary: Since `FileType` now exists, the `Type` enum can use it instead of defining its own stuff. Reviewed By: farnz Differential Revision: D7526046 fbshipit-source-id: 3b8eb5502bee9bc410ced811dc019c1ce757633f	2018-04-06 13:16:55 -07:00
Stanislau Hlebik	7164741b13	mononoke: add an option to create mysql filenodes in blobimport Summary: Previously we were able to create just sqlite filenodes, now let's make it possible to create a mysql filenodes. Reviewed By: jsgf Differential Revision: D7485098 fbshipit-source-id: b9156e51d41a570f9e6aaf9eaa9e476222257bca	2018-04-06 04:36:37 -07:00
Stanislau Hlebik	9736a2ef4a	mononoke: generate filenodes in blobimport Summary: We'll need filenodes in blobimport when we'll add filenodes to the BlobRepo. The implementation is not great - it creates a separate thread for the filenodes (see "filenodeinserts"). New filenodes are sent via UnboundedSender from the parsing cpupool However it doesn't worth the effort to clean up the code that we are going to deprecate in a couple of weeks. Reviewed By: farnz Differential Revision: D7429440 fbshipit-source-id: 4a9220915bd27f5c1c2028ec604afd700bb8a509	2018-03-29 03:29:57 -07:00
Dino Wernli	b52cbcab45	Add support for context-enabled streams Summary: The diff adds an extension of streams to the `failures_ext` crate, allowing streams with error type `failure::Error` or `failure::Fail` to store a context. As a proof-of-concept, the resulting `context()` function is applied to a stream in use in mononoke. Reviewed By: lukaspiatkowski Differential Revision: D7336012 fbshipit-source-id: 822c9dcd5b6c0a60470e8fd98fecd569928be7d1	2018-03-25 08:06:04 -07:00
Jeremy Fitzhardinge	d6504de44c	scm/mononoke: make Changesets::add take ChangesetInsert by value Summary: There's no point passing it by reference since callers don't need to retain it, and the async implementation needs to move it into another context. Reviewed By: farnz Differential Revision: D7350001 fbshipit-source-id: 5947557a84621afae801dc20e3994496244e3a10	2018-03-23 13:47:54 -07:00
Lukas Piatkowski	d0e66cc5f7	mercurial: distinguish between NodeHash used in mercurial Revlogs and in Mononoke Blobstore Summary: This codemod tries not to change the existing behavior of system, only introduce new types specific to Mercurial Revlogs. It introduces a lot of copypasta intentionally and it will be cleaned in following diffs. Reviewed By: farnz Differential Revision: D7367191 fbshipit-source-id: 0a915f427dff431065e903b5f6fbd3cba6bc22a7	2018-03-22 12:24:35 -07:00
Siddharth Agarwal	42aa9019cf	blobimport: stop using empty MPaths Summary: We're going to get rid of empty MPaths very soon, so stop using them here. Reviewed By: StanislavGlebik, farnz Differential Revision: D7358023 fbshipit-source-id: 2d5fff40eae03cc63ef5514faee1a8505b3b2bd0	2018-03-22 05:21:57 -07:00
Siddharth Agarwal	df5c6c7202	use Option<MPath> instead of empty MPaths whenever desired Summary: Lots of places specifically want a non-empty `MPath`, and earlier those cases weren't type-checked. Now `Option<MPath>` stands for an empty `MPath`. In the next diff `MPath::empty()` will go away and the only way to represent an `MPath` that doesn't exist will be with `Option<MPath>`. Reviewed By: farnz Differential Revision: D7350970 fbshipit-source-id: 1612aec67134e7a0ebad15dbaa93b5ea972f8ddf	2018-03-21 15:59:31 -07:00
Siddharth Agarwal	2c2dabdd38	use Option<&MPathElement> instead of &Option<MPathElement> Summary: The `Option<&MPathElement>` type is more general -- it's easy to convert from `&Option<MPathElement>` to it, but the other way around can require a clone. Reviewed By: farnz Differential Revision: D7339161 fbshipit-source-id: 0c8ab57a19bc330245c612e3e0e3651e368ab8cb	2018-03-21 09:32:11 -07:00
Lukas Piatkowski	67dbc75f9d	mononoke: add new_blobimport that uses BlobRepo API for importing data Summary: The new_blobimport as opposed to the old one do two things differently: 1. It uses a better structured API of RevlogRepo. The old one reads the Revlogs directly and does not verify if the data it has read is correct or it does not let us fix it into canonical form easily (once we have a canonical form different from Revlog's). 2. It uses BlobRepo's Commit API instead of writing directly to storage. This ensures consistency in our code and let's us leverage the validation that is incorporated in Commit API. Reviewed By: farnz Differential Revision: D7041976 fbshipit-source-id: fe592524533955f364f1b037109b3b5b5bab6b02	2018-03-20 11:53:09 -07:00
Lukas Piatkowski	a4cd4005df	mercurial: refactor RevlogRepo to expose less methods publicly Summary: RevlogRepo exposes a ton of methods that are almost equvalent to taking Revlog directly and ignoring the RevloRepo abstraction above it. This diff cleans this up a bit, there are still some methods that the "old" blobimport uses, but the "new" one shouldn't need to do that. Reviewed By: StanislavGlebik Differential Revision: D7289445 fbshipit-source-id: ac7130fe41c4e4484d6986fe5b19d5adc751369a	2018-03-20 11:53:09 -07:00
Siddharth Agarwal	b338897dc4	prefix ChangesetId, ManifestId and BlobHash with Hg Summary: Mononoke will introduce its own ChangesetId, ManifestId and BlobHash, and it would be good to rename these before that lands. Reviewed By: farnz Differential Revision: D7293334 fbshipit-source-id: 7d9d5ddf1f1f45ad45f04194e4811b0f6decb3b0	2018-03-15 17:45:29 -07:00
Dino Wernli	fa2b14cd8b	Remove the generic types from Blob and BlobNode in favor of Bytes Summary: Replace the generic types if `Blob` and `BlobNode` with `Bytes`. Reviewed By: lukaspiatkowski Differential Revision: D7115361 fbshipit-source-id: 924d347377569c6d1b3b4aed14d584510598da7b	2018-03-02 09:45:04 -08:00
Jeremy Fitzhardinge	aeff388fef	tp2: update rust-crates-io Summary: Update to include num_cpu and blake2. Also update bincode to 1.0.0 Reviewed By: StanislavGlebik Differential Revision: D7098292 fbshipit-source-id: 67793a6f458d50fc049781f34abaf313c8ff7a79	2018-02-27 09:27:19 -08:00
Simon Farnsworth	5e074bdd43	Create changesets API in BlobRepo Summary: Provide an API to ask BlobRepo to create changesets for you from pieces that you either have to hand, or have created via upload_entry(). Parallelism is maintained in as far as possible - if you commit N changesets, they should all upload blobs in parallel, but the final completion future depends on the parents, so that completion order can be maintained. The ultimate goal of this API is to ensure that only valid commits are added to the `BlobRepo` - this means that, once the future returned by `create_changeset` resolves, you have a repo with commits and blobs in place. Until then, all the pieces can be uploaded, but are not guaranteed to be accessible to clients. Still TODO is teaching this to use the complete changesets infra so that we simply know which changesets are fully uploaded. Reviewed By: StanislavGlebik Differential Revision: D6743004 fbshipit-source-id: 813329058d85c022d75388890181b48b78d2acf3	2018-02-27 02:19:11 -08:00
Stanislau Hlebik	e8b335468a	mononoke: use changesets table in blobimport Summary: Iff all the inserts finished successfully, then it's safe to mark changesets as complete. This diff fills up changesets store after blobimport successfully finishes. For simplicity if --commit-limit or --skip is set then we skip filling up the changeset store. Reviewed By: sid0 Differential Revision: D7043831 fbshipit-source-id: 8ae864b45222d52281c885a49c2dca44ba577137	2018-02-26 09:43:24 -08:00
Stanislau Hlebik	1d7668ffba	mononoke: remove get_mpath and get_path and use get_name Summary: As we discussed before, let's add get_name() method that returns MPathElement, and remove get_path() and get_mpath(). Except for renaming, diff also make repoconfig work with tree manifest, and fixes linknodes creation in blobimport - previously basename was used instead of the whole path. Reviewed By: jsgf Differential Revision: D6857097 fbshipit-source-id: c09f3ff40d38643bd44aee8b4488277d658cf4f6	2018-02-07 07:53:48 -08:00
Katherine McKinley	5c1e5825b0	Convert NodeHash to ChangesetId in BlobChangeset Summary: Change BlobChangeset and callers to use ChangesetId instead of NodeId Reviewed By: lukaspiatkowski Differential Revision: D6835450 fbshipit-source-id: 7b20359837632aef4803e40965380c38f54c9d0a	2018-01-31 13:36:45 -08:00
Xiaotian Wu	2c4d93ceb8	modify entryid related code Summary: modify entryid code Reviewed By: lukaspiatkowski Differential Revision: D6722202 fbshipit-source-id: 1df45071709f4a425374a87a29553830071b5d2d	2018-01-16 07:57:25 -08:00
Xiaotian Wu	92c0528b1f	modify manifestid related code Summary: nodehash -> manifestid Reviewed By: lukaspiatkowski Differential Revision: D6719378 fbshipit-source-id: 1ec59b33270e389da8e74b3864c37a86c9d89f81	2018-01-16 07:57:25 -08:00
Stanislau Hlebik	5c43ec42c7	mononoke: add --inmemory-logs-capacity option Summary: Adds an option that sets the number of filelogs and revlogs that will be loaded in memory. That let's us use blobimporting in memory constrained enviroments. Reviewed By: jsgf Differential Revision: D6532734 fbshipit-source-id: b748478ec80e75f56a8e07ae1532b0d69c4a5e16	2017-12-13 06:58:24 -08:00
Simon Farnsworth	4d6f3fa51b	Remove generics from Linknode Summary: Like the other BlobState components, Linknode was too generic - reduce down to a practical set for live implementations. Error handling is not great here or in Bookmarks, but I'm going to await the decision on moving to Failure before I improve it. Reviewed By: jsgf Differential Revision: D6459012 fbshipit-source-id: 00314309f62ba070b5908a28f5174a31b6dd0d84	2017-12-11 07:05:53 -08:00
Simon Farnsworth	28caf3f938	Remove Error, GetBlob and PutBlob from BlobStore Summary: Remove the last associated types from BlobStore - this means that BlobStore now has an associated trait object type. Reviewed By: jsgf Differential Revision: D6425414 fbshipit-source-id: 7186dab9b56593dd1d70be732d4ad56d1e7b3c63	2017-12-11 07:05:53 -08:00
Jeremy Fitzhardinge	ac31713c84	rust: failure cleanup pass Summary: Don't use failure's bail!() and ensure!() macros. Instead, failure_ext provides: - bail_err!(err) - Converts its single parameter to the expected error and returns; ie `return Err(From::from(err));` - bail_msg!(fmt, ...) - takes format string parameters and returns a `failure::err_msg()` error - ensure_err!(), ensure_msg!() - corresponding changes Also: - remove all stray references to error-chain - remove direct references to failure_derive (it's reexported via failure and failure_ext) - replace uses of `Err(foo)?;` with `bail_err!()` (since `bail_err` unconditionally returns, but `Err(x)?` does not in principle, which can affect type inference) Reviewed By: kulshrax Differential Revision: D6507717 fbshipit-source-id: 635fb6f8c96d185b195dff171ea9c8db9e83af10	2017-12-07 14:10:17 -08:00
Stanislau Hlebik	2bf9290758	mononoke: add skip parameter to blobimport Summary: Make it possible to skip a number of commits. Also change type from usize to u64, to make sure it works the same on 32-bit platforms (although that shouldn't matter much). Reviewed By: jsgf Differential Revision: D6395743 fbshipit-source-id: 88a12583de2b23d4f55115d696c5398f6814c2da	2017-12-06 01:21:11 -08:00
Jeremy Fitzhardinge	dc5e78c1c1	rust: mass convert scm/mononoke/... to use `failure` Summary: Convert scm/mononoke to use failure, and update common/rust crates it depends on as well. What it looks like is a lot of deleted code... General strategy: - common/rust/failure_ext adds some things that are in git failure that aren't yet in crates.io (`bail!` and `ensure!`, `Result<T, Error>`) - everything returns `Result<T, failure::Error>` - crates with real error get an error type, with a derived Fail implementation - replicate error-chain by defining an `enum ErrorKind` where the fields match the declared errors in the error! macro - crates with dummy error-chain (no local errors) lose it - `.chain_err()` -> `.context()` or `.with_context()` So far the only place I've needed to extract an error is in a unit test. Having a single unified error type has simplified a lot of things, and removed a lot of error type parameters, error conversion, etc, etc. Reviewed By: sid0 Differential Revision: D6446584 fbshipit-source-id: 744640ca2997d4a85513c4519017f2e2e78a73f5	2017-12-05 18:11:13 -08:00
Simon Farnsworth	16da012250	Remove ValueIn/ValueOut from the BlobStore generic arguments. Summary: BlobStore is entirely generic, and puts no limits on its implementations. Remove ValueIn and ValueOut type parameters, and insist that all blobs are Bytes (as per production setups) Reviewed By: StanislavGlebik Differential Revision: D6425413 fbshipit-source-id: 455e526d8baebd0d0f1906941648acca89be4881	2017-12-04 10:22:09 -08:00
Simon Farnsworth	c1ee28dbc7	Remove Key from the BlobStore generic arguments. Summary: BlobStore is entirely generic, and puts no limits on its implementations. Remove the "Key" type parameter, and insist that all keys are String (as per production setups) Reviewed By: StanislavGlebik Differential Revision: D6425412 fbshipit-source-id: 1f1229bf8e001bf780964e883c6beb071e9ef1d8	2017-12-04 10:22:09 -08:00
Stanislau Hlebik	074fdbdc83	mononoke: make OUTPUT parameter optional in blobimport Summary: In some cases output path is not necessary at all - for example, if we put blobs into the remote storage and we don't care about heads. Let's make OUTPUT parameter optional for these cases. Reviewed By: jsgf Differential Revision: D6397168 fbshipit-source-id: 06ee3b2bba038ff5076040a01c9d73c2b6e2b5fc	2017-12-03 08:37:01 -08:00
Simon Farnsworth	16615b3749	Make Heads a parameterless trait Summary: As part of removing excess genericism, make Heads a trait with no associated types or type parameters. Reviewed By: StanislavGlebik Differential Revision: D6352727 fbshipit-source-id: df9ef87e0e0abe43c30e7318da38d7f930c37c6e	2017-11-23 07:05:35 -08:00
Siddharth Agarwal	a6c5093cc8	blobimport: write out linknodes Summary: This makes it quite easy to write out linknodes. Also regenerate linknodes for our test fixtures -- the next commit will bring them in. Reviewed By: jsgf Differential Revision: D6214033 fbshipit-source-id: 3b930fe9eda45a1b7bc6f0b3f81dd8af102061fc	2017-11-13 22:01:55 -08:00
Jeremy Fitzhardinge	66a5fa4362	mononoke: mothball hgqlserve Summary: It's an interesting prototype, but awkward to keep running and we only need it for reference. Reviewed By: StanislavGlebik Differential Revision: D6306296 fbshipit-source-id: 10b5bf3631debcb9de258d4d68089ff709dc1329	2017-11-13 17:45:49 -08:00
Stanislau Hlebik	83f2eb90a5	mononoke: remove retries Summary: Putting retries on this layer is not very good, because it requires every client to add RetryingBlobstore. Reviewed By: kulshrax Differential Revision: D6298254 fbshipit-source-id: dbdce7fe141f9e1511322e74a1258d3819a68eb5	2017-11-13 03:21:31 -08:00
Stanislau Hlebik	ab834dd388	blobimport: add more stats Reviewed By: lukaspiatkowski Differential Revision: D6207104 fbshipit-source-id: c124a39618a95bbb42cff22ce9120e517c26489f	2017-11-07 10:43:50 -08:00
Siddharth Agarwal	acda30f2fb	replace AsRef<Path> with Into<PathBuf> in a few spots Summary: We need ownership of the buffer in all of these cases, and `AsRef<Path>` could potentially create unnecessary copies. Reviewed By: jsgf Differential Revision: D6214034 fbshipit-source-id: 806a87bfe3b125febaaaaf26c8b8dcac407de145	2017-11-02 13:10:56 -07:00
Stanislau Hlebik	e64ef44fda	mononoke: add an option to limit blob size in blobimport Summary: This option can be used non-production ready blobstores that can't yet handle big blobs. Reviewed By: farnz Differential Revision: D6189922 fbshipit-source-id: fa4df5b49c6d1126d3b3114e9ebe376931947917	2017-11-01 13:01:58 -07:00
Stanislau Hlebik	8cc33676aa	mononoke: add --commits-limit option to blobimport Summary: It's quite useful option for testing and I had to reimplement this option a couple of time. It's time to land it. Reviewed By: farnz Differential Revision: D6172230 fbshipit-source-id: ec1b7c0453a3a612a173aec87978a4917568cd7b	2017-11-01 13:01:57 -07:00
Lukas Piatkowski	9712530c0e	blobstore: add RetryingBlobstore that retries failed put/get operations with delay Reviewed By: StanislavGlebik Differential Revision: D6203017 fbshipit-source-id: 277fa267e86d2cb5eede241bf80dd8d1c90a3b96	2017-10-31 20:53:08 -07:00
Siddharth Agarwal	c8d6e7f954	use RepoPath instead of MPath in a few places Summary: `RepoPath` represents any absolute path -- root, directory or file. There's a lot of code that manually switches between directory and file entries -- abstract all of that away. Reviewed By: farnz Differential Revision: D6201383 fbshipit-source-id: 0047023a67a5484ddbdd00bb57bca3bfb7d4dd3f	2017-10-31 14:26:39 -07:00
Siddharth Agarwal	3269b802a4	factor out code to put manifest and file blobs Summary: copy_changeset was getting a bit too long, and linknode stuff would have made it even longer. Reviewed By: StanislavGlebik Differential Revision: D6097840 fbshipit-source-id: 00800cf9516adf69f2ca19244d3e14268f148ae4	2017-10-19 09:39:17 -07:00
Siddharth Agarwal	92eb9c6e6a	blobimport: use (effectively) named params for conversion Summary: Going to add more params here, and this is becoming quite hard to read. Reviewed By: StanislavGlebik Differential Revision: D6096419 fbshipit-source-id: 50f0b99bb6b1804fc01f6a99fc0297c1695dbaa5	2017-10-19 09:39:17 -07:00
Siddharth Agarwal	a1c6c494fd	blobimport: nicer manifest method names Summary: Namespacing really helps here! Reviewed By: StanislavGlebik Differential Revision: D6095128 fbshipit-source-id: f7e4b6d011c2d55c3976a6ba9a20f17384c62b92	2017-10-19 09:39:17 -07:00
Siddharth Agarwal	675c1e0b56	blobimport: factor out conversion code into separate files Summary: I'm adding linknode support to this store, and with that and without this refactoring the code becomes quite unmanageable. I've recorded this as copies to preserve blame info for the bits that will remain untouched. This seems to work pretty well. Reviewed By: StanislavGlebik Differential Revision: D6094812 fbshipit-source-id: f7a7a1d3546d4ef2dbfa33a0a8e97d47b44f51a5	2017-10-19 09:39:17 -07:00
Siddharth Agarwal	7205d847f5	blobimport: factor out error definitions into another module Reviewed By: StanislavGlebik Differential Revision: D6094813 fbshipit-source-id: fa58b9c69c82c506435d7c7d3b259c8d9cb46545	2017-10-19 09:39:17 -07:00
Siddharth Agarwal	2fbdf6d207	move blobimport code into a subdirectory Summary: Will factor this out into several files in upcoming patches. Reviewed By: StanislavGlebik Differential Revision: D6094811 fbshipit-source-id: cd354888882aff2552e61dea788aeb5426e08f4d	2017-10-19 09:39:17 -07:00
Stanislau Hlebik	553343bc0d	mononoke: filter the same manifest entries in blobimporting Summary: There is no need to insert the same entries twice. Let's filter them. Note that while it's possible to have the same manifest entries (for example, file or dirs with the same content), all changeset entries should be unique, because each changeset in the repo is unique and is processed exactly once. Reviewed By: farnz Differential Revision: D6076667 fbshipit-source-id: 64bdf25a21884eb2faf43f32590f7cbb8f8dd300	2017-10-18 01:36:22 -07:00
Stanislau Hlebik	13823f0f7e	mononoke: add separate io thread to blobimport Summary: Let's move all IO to the separate thread. This helps quite a lot when used with slow blostore, because parser threads are not blocked on IO - importing upstream mercurial repo went from 20 mins to 9 mins. Reviewed By: lukaspiatkowski Differential Revision: D6050992 fbshipit-source-id: c3877b123bad993d819495247135544a141eab10	2017-10-18 01:36:22 -07:00
Arun Kulshreshtha	be57d5440b	Change default Manifold bucket to mononoke_prod in blobimport Summary: Change the default bucket for blobimport to be mononoke_prod, a higher capacity bucket than the previous mononoke bucket. Also make it possible to specify the bucket via the CLI rather than hardcoding it. Reviewed By: jsgf Differential Revision: D6073745 fbshipit-source-id: 11dcf0c8bbef0b7c3f5971cf0676cf6325f276a6	2017-10-17 00:36:28 -07:00
Stanislau Hlebik	024650d13d	mononoke: fix blobimporting warnings Reviewed By: kulshrax Differential Revision: D5995891 fbshipit-source-id: d8e396da8689cc1d07856a21e0f023a83e222877	2017-10-12 04:07:45 -07:00
Lukas Piatkowski	0a76eb6f45	blobimport: add optional thrift service and stats counters Reviewed By: farnz Differential Revision: D6030720 fbshipit-source-id: 26971f13061e9a3a1e65d339fc7bdc444b5165fd	2017-10-11 10:58:08 -07:00
Lukas Piatkowski	a3579ee80d	blobimport: add more descriptive error messages on manifold failure Reviewed By: kulshrax Differential Revision: D6022078 fbshipit-source-id: 4504524cb49dbd9b013ac44229107be5024dfcad	2017-10-11 05:50:50 -07:00
Lukas Piatkowski	565afd9615	blobimport: use drain from slog-glog-fmt for logging Summary: the glog drain does not swallow f.e. backtrace of error_chain errors, so it is a bit easier to debug the tool Reviewed By: farnz Differential Revision: D6021671 fbshipit-source-id: 32bfe01bfd77d85c37a2a446cb3e5d000763c689	2017-10-11 05:50:46 -07:00
Siddharth Agarwal	db28a15305	add deny(warnings)/allow(deprecated) to a few crates Summary: Realized that we were missing a few crates from the Tokio cleanup because those crates didn't have `#![deny(warnings)]`. This also caused a bunch of files to be rustfmted, which is fine. Reviewed By: kulshrax Differential Revision: D6024628 fbshipit-source-id: 55032d20f3676c92ef124d861e1edcd34126ab55	2017-10-10 15:23:25 -07:00
Stanislau Hlebik	c474b57a9e	blobimport: add an option to postpone compaction Summary: Compaction can slow down blobimporting a lot. Let's add an option to postpone it till the end Reviewed By: farnz Differential Revision: D5882003 fbshipit-source-id: 0611a8e94b3d7331bdacf909d820526f547414a0	2017-09-22 08:33:25 -07:00
Siddharth Agarwal	de4b1f6c93	use BlobHash in RawNodeBlob Summary: Also ensure that `blobimport` doesn't use its own copy. Reviewed By: jsgf Differential Revision: D5847604 fbshipit-source-id: 5390848cd5fab8abd967ef9701720491d703c0f1	2017-09-18 00:35:52 -07:00
Jeremy Fitzhardinge	2f72747e85	mononoke: blobimport: more use of impl Future Summary: This seemed to run up against a compiler issue (https://users.rust-lang.org/t/impl-future-lost-fact-that-a-closure-is-fnonce/12870/16) that can be worked around by adding an apparently unnecessary `.boxed()`. Reviewed By: sid0 Differential Revision: D5843292 fbshipit-source-id: 0a82760bf6afbf7ba5f04541ca57bedcc935d411	2017-09-15 15:58:33 -07:00
Jeremy Fitzhardinge	8f701222aa	mononoke: blobimport: first use of impl Future Summary: Use `impl Future` rather than a boxed future. Reviewed By: sid0 Differential Revision: D5829773 fbshipit-source-id: 40c4339e96f7194544f416534952b78a23d93fa6	2017-09-15 15:24:05 -07:00
Jeremy Fitzhardinge	d584bc31b3	mononoke: blobimport: rustfmt Reviewed By: sid0 Differential Revision: D5831880 fbshipit-source-id: 9d9d7b18214241336e97219c48bf281dd08b5ab9	2017-09-15 15:24:05 -07:00
Arun Kulshreshtha	c9d8af3f1c	Add ManifoldBlob support to blobimport Summary: Add the `--blobstore manifold` option to blobimport to make it write blobs to Manifold. Reviewed By: jsgf Differential Revision: D5758930 fbshipit-source-id: a14a3c155b5d8d7b171ed7a4e53f8569539cb2e9	2017-09-05 13:38:47 -07:00
Jeremy Fitzhardinge	0745e97bfe	mononoke: don't use `:` in blob key names Summary: `:` is a reserved character for Windows paths, so Mercurial rejects them from being committed. Use `-` instead, so that we can commit file blob repo test fixtures. Reviewed By: kulshrax Differential Revision: D5731525 fbshipit-source-id: 8d14fc03f1b135cbc4d42aeaf2f3a0ae6d13f956	2017-09-04 15:20:06 -07:00
Siddharth Agarwal	c1a30e25c9	make repo error types require std::error::Error Summary: This gets us `Display` support as well. Reviewed By: lukaspiatkowski Differential Revision: D5734383 fbshipit-source-id: 1485cf80bb310cdd282b4546bed56c60082be8ec	2017-08-31 13:51:24 -07:00
Siddharth Agarwal	8202a344b4	rustfmt mercurial-types and blobimport Summary: Just a few minor changes that make our lives easier overall. Reviewed By: lukaspiatkowski Differential Revision: D5737854 fbshipit-source-id: da951d7872433bffa8fc64d15cd0e917f77144b5	2017-08-31 13:51:24 -07:00
Stanislau Hlebik	dbd845ba0e	mononoke blobimport: split work between threads by linkrev Summary: We want to avoid putting the same entries twice in the blobstore. And even more - we want to avoid generating list of these entries at all in the first place. The first approach was to add a `Mutex<HashSet>` that worker threads will use to filter out entries that were already imported. Turned out that this Mutex kills almost all the speedup from concurrency. But since we have linkrevs then for each entry we know in which commit this entry was created [1]. That means that all of the entries are already nicely split between the threads. So no synchronization is needed. It gives a good speedup - from ~7min to 2min of importing of hg upstream treemanifest repo using file blobstore. Note: there is still a lock contention - tree revlogs and file revlogs maps are protected by mutex. We can optimize it later if needed. [1] There is a well-known linkrev issue in mercurial. It shouldn't affect our case at all. Reviewed By: jsgf Differential Revision: D5650074 fbshipit-source-id: c4f9e2763127ffe4402417dd3963f1f450d7b325	2017-08-23 05:13:50 -07:00
Stanislau Hlebik	747e4a2ed9	mononoke: rustfmt on blobimport Reviewed By: sid0 Differential Revision: D5649665 fbshipit-source-id: 553ef550e7465ab5f3bb129cf4d0c282128fa24d	2017-08-23 05:13:50 -07:00
Stanislau Hlebik	b72949e3d1	mononoke: blob importing of tree manifest repo Summary: Main part is `get_stream_of_manifest_entries` that creates a stream of all tree manifest entries by recursively going through all of them. Reviewed By: jsgf Differential Revision: D5622490 fbshipit-source-id: 4a8b2707df0300a37931c465bafb1ed54d6d4d25	2017-08-18 09:51:17 -07:00
Stanislau Hlebik	c33e4afddf	mononoke: same storage keys for manifest entries as for the file entries Summary: A preparation step before blob importing of tree manifest repos to blobrepo. `get_parents()` method of BlobEntry reads parents from the blobstore. It works fine for file entries because file entries can stores its parents in the blobstore. With tree manifests BlobEntry can contain also tree manifest entries, and that means that tree manifest entries parents should also be stored somewhere in the blobstore. I suggest to use the same logic for the tree manifest entries as for the file entries. File and manifest entries have two blobstore entries - one stores hash of the content and parents, another stores the actual content. To do this I moved `RawNodeBlob` and `get_node()` to the separate module and made fields public. Reviewed By: jsgf Differential Revision: D5622342 fbshipit-source-id: c9f0c446107d4697b042544ff8b37a159064f061	2017-08-15 10:37:34 -07:00
Stanislau Hlebik	91d587e052	mononoke: change Path implementation Summary: Instead of storing `Vec<u8>`, let's store `Vec<PathComponent>`, where PathComponent is Vec<u8> without b'\'. To make sure len() is still `O(1)` let's store it too. Reviewed By: sid0 Differential Revision: D5573721 fbshipit-source-id: 91967809284d79bf0fcdcabcae9fd787a37c318b	2017-08-10 05:24:42 -07:00
facebook-github-bot	2b6af6b941	Initial commit fbshipit-source-id: f75baa4ff6aa71973f677b752d7aba582cf4927f	2017-07-27 18:00:19 -07:00

1 2 3 4 5

202 Commits