We've gotten reports of users receiving corrupt file blobs directly from the
server. The corruption doesn't enter the cache pool, and we don't get any
further reports of it, so I think it's a transient issue caused by certain
readers reading the file before the writer has finished writing it.
Let's use atomic renames so this can't happen.
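A minimal sketch of the write-to-temp-then-rename idea, assuming a plain on-disk cache path (the helper name is hypothetical, not the function this patch adds):

```python
import os
import tempfile

def atomicwrite(destpath, data):
    # Write to a temporary file in the same directory, then rename it into
    # place. rename() is atomic on POSIX, so readers see either the old
    # complete file or the new complete file, never a partial write.
    dirname = os.path.dirname(destpath)
    fd, temppath = tempfile.mkstemp(dir=dirname)
    try:
        f = os.fdopen(fd, 'wb')
        try:
            f.write(data)
        finally:
            f.close()
        os.rename(temppath, destpath)
    except Exception:
        # Clean up the temp file if anything went wrong before the rename.
        if os.path.exists(temppath):
            os.unlink(temppath)
        raise
```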
I saw some crazy-looking stack traces like this while testing an
improved implementation of our internal cacheprocess binary:
```
  File "/usr/lib/python2.6/site-packages/remotefilelog/remotefilelog.py", line 78, in read
    raw = self._read(hex(node))
  File "/usr/lib/python2.6/site-packages/remotefilelog/remotefilelog.py", line 283, in _read
    fileservice.prefetch([(self.filename, id)])
  File "/usr/lib/python2.6/site-packages/remotefilelog/fileserverclient.py", line 357, in prefetch
    missingids = self.request(missingids)
  File "/usr/lib/python2.6/site-packages/remotefilelog/fileserverclient.py", line 196, in request
    missingid = cache.receiveline()
  File "/usr/lib/python2.6/site-packages/remotefilelog/fileserverclient.py", line 105, in receiveline
    self.close()
  File "/usr/lib/python2.6/site-packages/remotefilelog/fileserverclient.py", line 76, in close
    self.pipei.write("exit\n")
ValueError: I/O operation on closed file
```
It looks like we are somehow re-entrant (maybe referenced from multiple generators?) and get tripped
up if we're not careful about checking for or catching issues during the close() method call.
So let's be a little more careful :-)
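A sketch of the kind of defensive close() this implies; the attribute names mirror the traceback above, but the guard logic is an assumption about the fix, not the patch itself:

```python
def close(self):
    # close() can be reached more than once (e.g. re-entrantly via
    # receiveline()), so tolerate an already-closed pipe instead of
    # raising "I/O operation on closed file".
    pipe = getattr(self, 'pipei', None)
    if pipe is None or pipe.closed:
        return
    try:
        pipe.write("exit\n")
        pipe.close()
    except (IOError, ValueError):
        # The cache process may already be gone, or the pipe already closed.
        pass
    finally:
        self.pipei = None
```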
Summary:
_walkstreamfiles() uses mercurial.store.decodedir(), so
mercurial.store needs to be imported.
Test Plan:
Confirmed that _walkstreamfiles() no longer throws an exception when cloning a
remote shallow repository.
Reviewers: durham, pyd, rmcelroy
Reviewed By: rmcelroy
Subscribers: net-systems-diffs@, exa, yogeshwer
Differential Revision: https://phabricator.fb.com/D2409648
Signature: t1:2409648:1441245825:00a758f6f0884b77572078589f18592ca6cb6fa4
Streaming clones were taking a while because apparently self.datafiles()
actually stats each .i file instead of just returning the list straight from
fncache. To fix this, let's not call datafiles() when we know the matcher is
going to reject everything anyway.
This significantly speeds up streaming clones.
Previously we'd just send one enormous batch for everything to the
server. This led to prolonged periods of no progress output for the
user. Now we send batches in smaller chunks (default is 100) which
gives the user some idea that things are working.
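Roughly the batching pattern described above, as a sketch; the config section/name and the callable for issuing one batch are assumptions:

```python
def prefetchinchunks(ui, requests, sendbatch, chunksize=None):
    # sendbatch: callable that issues one batch of requests to the server
    # (hypothetical; stands in for whatever the client uses to send a batch).
    if chunksize is None:
        # Note configint(), not config(): the value must be an integer.
        chunksize = ui.configint('remotefilelog', 'prefetchchunksize', 100)
    total = len(requests)
    for start in range(0, total, chunksize):
        sendbatch(requests[start:start + chunksize])
        # Report progress between chunks instead of going silent for the
        # duration of one enormous batch.
        ui.progress('prefetching', min(start + chunksize, total), total=total)
    ui.progress('prefetching', None)  # mark the progress topic finished
```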
Includes a trivial test, which doesn't really verify that the batching
logic is used as described, but at least prevents the boneheaded error
I had in an earlier (unmailed) version of this patch which forgot to
use configint() when loading the config setting.
Without this, the only way to report a failure of a file load in a
batched set of getfile requests is to fail the entire batch, which is
potentially painful. Instead, add our own error reporting in-band
which the client can then detect and raise.
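One way to picture the in-band error reporting; the status codes and the NUL-separated framing here are illustrative assumptions, not the actual wire format:

```python
from mercurial import error

def encodereply(code, data):
    # Server side: prefix each per-file reply with a status code so one
    # failed file doesn't have to fail the whole batch.
    # code 0 = success, nonzero = error; data is file contents or a message.
    return '%d\0%s' % (code, data)

def decodereply(reply):
    # Client side: detect the error code and raise for that file only.
    code, data = reply.split('\0', 1)
    if int(code) != 0:
        raise error.Abort('getfile failed: %s' % data)
    return data
```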
I'm not completely happy with the somewhat ad hoc error reporting here,
but we expect our server to have at least one additional error ("not
allowed to see file contents") which will require some special
handling on our end, so we need some level of flexibility in the error
reporting protocol so we can extend it later. Sigh.
Open question: should we reserve some range of error codes so that
it's easy for strange custom servers to have related monkeypatches to
client code for custom handling of unforeseen-by-remotefilelog
conditions?
I couldn't figure out how to actually get the client to try loading
file contents over http in the test, but the get-with-headers test at
least proves that the server responses look the way I expect.
We were not prefetching the potential dependent files for the filelog revisions
we received over the wire. This resulted in a lot of non-batched downloads,
which was super slow. This fixes it by batch downloading the parents and delta
parents of the incoming filelog revisions and adds a test.
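A sketch of collecting the parents and delta parents so they can be fetched in one batch; the shape of the incoming tuples and the prefetch call are assumptions based on the fileservice API seen elsewhere in these messages:

```python
from mercurial.node import nullid

def prefetchdependencies(fileservice, receivedrevs):
    # receivedrevs: iterable of (filename, node, p1, p2, deltabase) tuples
    # describing the filelog revisions that just arrived over the wire.
    wanted = set()
    for filename, node, p1, p2, deltabase in receivedrevs:
        for dep in (p1, p2, deltabase):
            if dep != nullid:
                wanted.add((filename, dep))
    # One batched download instead of one fetch per missing parent.
    fileservice.prefetch(sorted(wanted))
```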
This lets clients send many getfile requests in a single transaction.
Note that this requires 76fcf62accb0 be applied to your Mercurial, or
you'll be bitten by a bug[0] in Mercurial's wireproto batching. As a
result of this change, remotefilelog now effectively requires the
upcoming Mercurial 3.5 if you want to use a specific release.
0: http://bz.selenic.com/show_bug.cgi?id=4739
Right now, this is a naive fetch-one-file method. The next change will
mark the method as batchable and use a batch in the client so that
many files can be requested in a single RPC.
The way the protocol is defined for getfiles interleaves reading
filenames and sending file contents, which works fine over ssh but is
incompatible with http.
This change is probably not necessary now that remotefilelog
correctly checks for its own capability first, but it helped me debug
so I left it in for completeness.
If we instead wrap wireproto.capabilities, then our capabilities don't
get transmitted via the hello command, so not all clients will notice
the new capability unless we do the wrapping here.
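Roughly what that wrapping looks like, as a sketch assuming the usual extensions.wrapfunction pattern; the capability string is illustrative:

```python
from mercurial import extensions, wireproto

def _capabilities(orig, repo, proto):
    # Wrap _capabilities (not capabilities) so the extra capability shows
    # up both in the 'capabilities' command and in the ssh 'hello' reply.
    caps = orig(repo, proto)
    caps.append('remotefilelog')  # illustrative capability name
    return caps

def extsetup(ui):
    extensions.wrapfunction(wireproto, '_capabilities', _capabilities)
```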
Test output is in the test that previously demonstrated the
defect. Note that there's still a defect: we're advertising the
capability over http even though we have no hope of the getfiles
method working over http.
The magic string 'internal' causes Mercurial to never blame
remotefilelog for being broken. I had suspected that remotefilelog
might work with 3.4, but the tests fail against 3.4.1, so I'm just
making testedwith empty.
The rev graph building code was flawed because it didn't track second parents
correctly. This was caught when someone was developing an extension and
attempted to create a merge commit.
repo.sopener has been deprecated since hg 2.3, and repo.svfs replaces
it. Since it's been dead for so long, let's just use svfs and call it
good enough.
Summary:
The incominghook was meant to pregenerate any remotefilelog blobs that were
likely to be needed shortly. Unfortunately it actually just slows down pushes,
since in large repos the hook sometimes takes longer than the push itself.
So let's just remove it.
Test Plan: Apparently there were no tests for this :p
Reviewers: sid0, lcharignon, mitrandir, ericsumner, rmcelroy
Reviewed By: rmcelroy
Differential Revision: https://phabricator.fb.com/D2185894
Signature: t1:2185894:1435126819:e1e1125520411356eccff4baee31ab2938ebc0fe
Summary: I really don't think it should be in this list.
Test Plan: `hg`
Reviewers: durham, #sourcecontrol, rmcelroy
Reviewed By: durham, #sourcecontrol, rmcelroy
Subscribers: rmcelroy
Differential Revision: https://phabricator.fb.com/D1997655
Signature: t1:1997655:1429189594:aa8f355a6fc61e300f824be6b2fbd64a42dde2b5
Summary:
When adjustlinkrevs got moved to the filectx upstream, we incorrectly
moved it to the remotefilectx inside remotefilelog. We don't actually use
remotefilectx on the server, so wrapping it did nothing.
The fix is to move the wrapping to be in remotefilelogserver.py so it is
executed on the server side.
Test Plan:
Did a checkout with my shallow client pointed at a full repo with no
blob cache. Verified it went quickly (minutes, instead of hours).
Reviewers: pyd
Differential Revision: https://phabricator.fb.com/D2097851
Summary:
Since we only prefetch things that are in the sparse checkout, copy tracing
(which touches everything in the manifest diff) would do individual file
downloads for every file. Let's just remove those files from the copy tracing
check entirely since the user probably doesn't care if they're outside the
sparse checkout.
Test Plan: Added a test
Reviewers: sid0, rmcelroy, lcharignon, pyd
Differential Revision: https://phabricator.fb.com/D2083768
Summary:
Match with the latest version of core to pass the test.
There were a couple of changes in core that broke the extension; I matched
those changes to make the test pass.
Test Plan: The tests are all passing
Reviewers: durham
Differential Revision: https://phabricator.fb.com/D2053958
Upstream now has a matcher on _computeforwardmissing which will allow us to only
prefetch the necessary parts of a sparse checkout.
Since we're now handed an iterator, we need to convert it to a list
because we both iterate over it and return it.
Summary:
Previously remotefilelog would prefetch every file in a commit. With the sparse
checkout extension we want to only prefetch things in the sparse checkout.
This commit makes remotefilelog aware of the possible existence of a sparse
matcher.
Test Plan: Added tests
Reviewers: sid0, rmcelroy, pyd, lcharignon
Subscribers: kang
Differential Revision: https://phabricator.fb.com/D1967207
Summary:
Per @pyd's review of D1933267, we need to check for the linknode in cl.nodemap,
not in cl (whose __contains__ method only looks for revs and doesn't even check
for visibility... lolz).
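The distinction in code, as a tiny sketch (the helper name is hypothetical):

```python
def linknodeexists(repo, linknode):
    # Wrong: 'linknode in repo.changelog' goes through __contains__, which
    # expects a rev number, so it asks the wrong question entirely.
    # Right: membership of a binary node is a nodemap lookup.
    return linknode in repo.changelog.nodemap
```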
Test Plan: ran tests
Reviewers: durham, sid0, pyd, ericsumner, lcharignon, davidsp, mitrandir
Reviewed By: mitrandir
Subscribers: akushner, daviser, pyd
Differential Revision: https://phabricator.fb.com/D1934941
Tasks: 6573011
Signature: t1:1934941:1427130649:b084635db9bfcd28c4d4a1bcf12a7500c06b323c
Summary:
The new version of adjust linknodes wasn't accounting for the fact that some
ancestries contained nodes that no longer exist. Check for that before looking
for common ancestors.
The old version of this code survived by luck. We were catching KeyErrors as one
base case, and it just happens that LookupError from the changelog is also a
KeyError, so it was getting caught and eaten.
Test Plan:
We should probably add a test, but I have to leave shortly and this is pretty
broken, so we'll have to take a rain check.
Reviewers: rmcelroy, pyd, sid0
Differential Revision: https://phabricator.fb.com/D1933267
Summary:
The new fixmappinglinknodes function was using recursion to traverse the file
history, but this would break for files with history that was extremely long
(stack overflow). Switch to using a manual stack approach.
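The general shape of the recursion-to-explicit-stack conversion; this is an illustrative traversal, not the actual fixmappinglinknodes logic:

```python
from mercurial.node import nullid

def walkfilehistory(filelog, startnode, visit):
    # Depth-first walk over file history using an explicit stack, so very
    # long histories can't blow Python's recursion limit.
    stack = [startnode]
    seen = set()
    while stack:
        node = stack.pop()
        if node in seen:
            continue
        seen.add(node)
        visit(node)
        for parent in filelog.parents(node):
            if parent != nullid:
                stack.append(parent)
```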
Test Plan: Ran the tests (I'd added a test to cover this logic before).
Reviewers: sid0, davidsp, mitrandir, lcharignon, pyd, rmcelroy
Reviewed By: rmcelroy
Subscribers: michaelbarton
Differential Revision: https://phabricator.fb.com/D1931944
Signature: t1:1931944:1426884986:3a0ef144fb55b8c0533e5c5de90699a1823b891f
Summary: I'm going to add a new parameter upstream. Make this more generic so that we don't have to try to support both the old and the new versions.
Test Plan: Ran tests with both old and new hg.
Reviewers: davidsp, rmcelroy, akushner, pyd, daviser, mitrandir, ericsumner, durham
Reviewed By: durham
Differential Revision: https://phabricator.fb.com/D1920172
Signature: t1:1920172:1426615175:d90bda3b3cc30f6e5f3149af82ae9e43dee39455
Summary:
Previously remotefilelog did not produce all the necessary local data blobs
when doing a peer push/pull if the incoming changegroup had two manifests
that referred to the same file revision. We would only create a file blob
containing the history for the first occurrence, then if the user tried to
access the file history for other occurrences they got an exception.
The fix is to add linkrev fixup logic, similar to the adjustlinkrev() method
from core Mercurial's filectx. Now, if no valid local file blob can be found, we
will compute a valid history by reading the changelog.
We might be able to write this data to disk in the future as well to prevent
having to repeatedly compute this.
Test Plan: Added a test
Reviewers: sid0, rmcelroy, pyd, mitrandir, lcharignon
Differential Revision: https://phabricator.fb.com/D1904453
Summary:
For hg-git conversions we're going to create commits without actually updating to the base. Currently, this causes lots of individual fetches.
The test demonstrates the issue -- without this patch it fetches the 2 files over 2 fetches, but with it, it fetches them in 1 fetch.
Test Plan: Ran the tests.
Reviewers: davidsp, rmcelroy, akushner, pyd, daviser, mitrandir, ericsumner, durham
Reviewed By: durham
Differential Revision: https://phabricator.fb.com/D1893721
Tasks: 6390769
Signature: t1:1893721:1425624679:5651f71d5023919e9321646275b681b573847c44
Upstream has refactored the copy logic to compute the file lists in separate
functions, so we no longer need to compute the file lists ourselves.
Update the README's Mercurial min-version since this change depends on new APIs
inside Mercurial.
Summary:
Upstream has moved _adjustlinkrev from being a global function to one
on the filectx. Let's do the same.
Test Plan: Ran the tests
Reviewers: mitrandir
Differential Revision: https://phabricator.fb.com/D1825043
Summary:
adjustlinkrev makes ancestor reading orders of magnitude slower,
so we need to avoid using it. Since adjustlinkrev already returns the linkrev in
certain cases, let's just force it to always return that during file blob
creation.
Test Plan:
Generated a few thousand blobs for www and fbcode using the old and new
methods and verified that they were byte-for-byte identical.
Reviewers: sid0, pyd, mpm, rmcelroy
Differential Revision: https://phabricator.fb.com/D1782400
Summary:
If the remotefilelog server was not specified in the hgrc, or if the project
hgrc wasn't trusted, it would throw an obtuse error about a NoneType string.
This fixes it to give a more informative error explaining the problem.
Test Plan: Added a test
Reviewers: sid0, pyd, mitrandir, ericsumner, rmcelroy
Reviewed By: rmcelroy
Differential Revision: https://phabricator.fb.com/D1774743
Signature: t1:1774743:1420830544:5122a8e11f668ee8c35996e0f4395883a31ce8b0
Summary:
There are reports of the local cache becoming invalid when stored on disk. This
adds an option that will do some basic validation and remediation for those
entries, and log some data to disk.
This is optional, since it incurs some performance overhead. We just want to use
it long enough to track down the issue.
Test Plan: Added a test
Reviewers: sid0, pyd, ericsumner, rmcelroy, mitrandir
Reviewed By: mitrandir
Differential Revision: https://phabricator.fb.com/D1774724
Signature: t1:1774724:1420827432:06ace9d1dc078f469e0f61ebd7f604fc3b606f6d
Summary:
We've gotten reports of corrupt cache files, and the error message is pretty
obtuse (ValueError for converting a string to an int). This refactors the size
check into a function and provides a better error message.
Test Plan: Added a test
Reviewers: sid0, pyd, mitrandir, ericsumner, rmcelroy
Reviewed By: rmcelroy
Differential Revision: https://phabricator.fb.com/D1774721
Signature: t1:1774721:1420830671:afd54dde8fdc00e08ed1c6cb73bf9fdc7fac2327
Summary: We were forgetting to pass these arguments on to the child function.
Test Plan: Visual inspection.
Reviewers: durham, davidsp, rmcelroy, akushner, pyd, daviser, mitrandir, ericsumner
Reviewed By: ericsumner
Differential Revision: https://phabricator.fb.com/D1773782
Signature: t1:1773782:1420765574:d73be08ab25265e4769d8bf70671f2ea1c13f8dd
Mercurial upstream does some fancy stuff inside introrev now to provide the
correct introrev. It relies on having the filelog though, so we need to avoid
it. Remotefilelog has perfect history knowledge, so we can just return the
correct linkrev.
Summary:
We're seeing some weird cache corruption errors when writing the cache to disk.
My best guess is that multiple writes are colliding and causing bad data, so let's
do atomic renames.
Test Plan: Ran the test suite
Reviewers: sid0, pyd, davidsp, rmcelroy
Reviewed By: rmcelroy
Subscribers: ericsumner, mitrandir
Differential Revision: https://phabricator.fb.com/D1747190
Signature: t1:1747190:1418865586:0a07e5243dfe9c1d5ea24f81874910d1080f24e2
It is part of the revlog API and some extensions like tortoisehg rely on it. The
default implementation is the same as size, so we can safely mimic that here.
A recent fix to make ancestor maps work with changeset evolution actually caused
a pretty serious regression. The ancestormap validation code was returning
ancestormaps with hidden ancestors if the first commit in the history was a
hidden node. This resulted in lots of invalid ancestries being returned.
Instead we only want to allow hidden ancestors in the map if the relativeto
commit has been explicitly set to a hidden node.
Summary: Last bits needed to get remotefilelog over bundle2 working. Includes tests.
Test Plan: Ran tests, including with `--extra-config-opt experimental.bundle2-exp=True`
Reviewers: davidsp, akushner, pyd, rmcelroy, daviser, durham
Reviewed By: durham
Differential Revision: https://phabricator.fb.com/D1671738
Tasks: 5568731
Signature: t1:1671738:1415676482:b9e7a1f308919526b0c41fee54d89da876518ec7
Certain filectx constructions used a rev number for self._changeid. We
need to convert that to a node before using it. This was breaking blame. I've
now added a blame test too.
Bundlerepos work by providing a fake revlog layer above an existing revlog.
Since remotefilelog doesn't use revlogs for filelogs, bundlerepos did not work.
This commit fixes it such that you can now hg pull from a bundle, as long as
that bundle is shallow (i.e. contains no file contents). This will work for the
common use case of trying to recover data from .hg/strip-backups.
For reference, shallow bundles don't contain any file data because we never
delete any file data from .hg/store/data when using remotefilelog, even after
the commits have been stripped.
Upstream Mercurial has moved localrepo.pull into exchange.pull. This moves our
wrapping of that command out of shallowrepo and into __init__. Exchange is
becoming an increasingly important class, so we may want to think about moving
all exchange wrapper logic out to a separate module in remotefilelog.
repo.revs() no longer returns an object that can be indexed, so we can't use []
on it anymore; let's call list() on it first.
The bookmark output from upstream Mercurial has also changed, so we need to
update the tests.
Summary:
When doing 'hg uncommit foo.txt' with Changeset Evolution enabled, uncommit will
first prune the commit, then try to read the filelog history to determine if any
renames need to be undone. Since the commit is now pruned, remotefilelog fails
to find any valid histories.
This fixes it to allow hidden histories if the filectx commit is hidden. It
also tweaks remotefilectx to produce commit-relative histories when possible,
which will result in more accurate histories.
Test Plan:
Ran hg uncommit in the evolve repo that had problems before. Verified
it now worked.
Reviewers: pyd, sid0
Differential Revision: https://phabricator.fb.com/D1587306
Summary:
Pull-prefetch would not download file versions from the server if the file
version already existed in the local cache or the local store data.
Unfortunately, if someone landed their commit, then later stripped their local
version, the local store data file version might become invalid and no local
cache version would exist, meaning things like 'commit' might fail when offline.
This changes prefetch to always fetch from the server when dealing with files it
knows are from revs on the server.
Test Plan:
Added a test that makes local commits that already exist on the
server, and verifies that a pull-prefetch fetches the server file version,
despite that same version existing locally.
Reviewers: sid0, pyd, davidsp
Subscribers: orip
Differential Revision: https://phabricator.fb.com/D1607260
Summary:
The ResponseError exception expects a second argument. Otherwise the code
handling it crashes.
Test Plan: The handling of the response error stops crashing.
Reviewers: durham
Differential Revision: https://phabricator.fb.com/D1581574
Summary:
If the orig function crashes before the fileservice is installed, the finally
clause explodes, shadowing the original error. This fixes that.
Test Plan:
Crashes are no longer shadowed by a crash in the finally clause.
Reviewers: durham
Differential Revision: https://phabricator.fb.com/D1581562
Summary: API change
Test Plan: @durham ran an amend.
Reviewers: durham
Reviewed By: durham
Subscribers: durham
Differential Revision: https://phabricator.fb.com/D1569510
Summary:
Upstream Mercurial changed the way merging works and added
revlog.commonancestorsheads. This changes remotefilelog to implement the same
API.
Previously we were able to use ancestors.genericancestors to do the graph
traversal. Upstream Mercurial has deleted that function though (since it is now
unused), so remotefilelog must now build a temporary rev graph in order to use
the ancestors.* apis.
Test Plan: Added a test. It failed without the fix, it passes with the fix.
Reviewers: sid0, davidsp, pyd
Differential Revision: https://phabricator.fb.com/D1566787
Summary: This was broken by recent changes.
Test Plan: Ran test suite.
Reviewers: durham
Reviewed By: durham
Differential Revision: https://phabricator.fb.com/D1558890
Tasks: 5170539
Summary:
With recent versions of Mercurial (>= 3.2, 4dfcf21a6aa7), revert uses status
information to determine the files that need to be touched. It then offers a
simple handle for extensions that need to prefetch.
Test Plan:
Ran the tests. Certain tests depended on the old revert behavior (of
prefetching everything), so they required slight changes.
Reviewers: pyd, sid0, davidsp
Differential Revision: https://phabricator.fb.com/D1551059
Summary:
Changegroups have been refactored upstream and we need to update our
remotefilelog monkey patching accordingly.
Also fix an issue with the tests where 'function foo()' was not considered valid
on certain systems.
Test Plan: Ran the tests
Reviewers: pyd, sid0, davidsp
Differential Revision: https://phabricator.fb.com/D1551019
Summary:
Previously, if pullprefetch was set, we'd perform a prefetch of the
entire manifest of the specified revs (usually the public bookmarks). This
involved stat-ing all the relevant files in the cache to see if they already
existed, which added an extra 6 seconds or so to every pull.
Now we only prefetch the files that are different from our working copy. We
assume we already have all the files that are in our working copy. This reduces
the pullprefetch overhead significantly.
Test Plan:
Did a pull on my laptop. Verified it didn't hang for 6 seconds at the
prefetch stage. Also updated a test
Reviewers: davidsp, pyd, sid0
Reviewed By: sid0
Differential Revision: https://phabricator.fb.com/D1505841
Tasks: 4608894
Summary:
Previously, pullprefetch was executed during the repo.pull stage. This happens
before the bookmarks have been moved, so revsets like 'bookmark()' would
prefetch the wrong commits.
This change moves the pullprefetch logic to after the pull command is completely
finished. Updated a test to make sure this is caught.
Also fixes a bug where we were using linkrevs to read a manifest rev entry. We
should be using the manifest rev instead.
Test Plan: Added a test. Ran it.
Reviewers: sid0, pyd, davidsp
Differential Revision: https://phabricator.fb.com/D1483345
Summary: These commands (well, not the debug one) were visible in the shortlist that showed up when you type `hg`. They're not basic commands.
Test Plan: Ran `hg` with the extension enabled, didn't see those commands.
Reviewers: durham
Reviewed By: durham
Differential Revision: https://phabricator.fb.com/D1454931
Summary:
Due to a change in upstream mercurial, hg log with patterns was no longer
working. This fixes it by forcing hg log to take the slow path when using
patterns.
It also updates the warning messages to work when running hg log <file> from
within a subdirectory.
Test Plan: Ran the new tests
Reviewers: sid0
Differential Revision: https://phabricator.fb.com/D1450193
Summary:
Adds a remotefilelog.pullprefetch config option that accepts a revset. Whenever
a pull is run, the revs matched by that revset will be prefetched. The most
common value for this will be '(bookmark() + heads(all())) & public()', since it will download
almost everything necessary to work offline.
Test Plan: Added a test. Ran it.
Reviewers: davidsp, pyd, sid0
Reviewed By: sid0
Differential Revision: https://phabricator.fb.com/D1419420
Summary:
Expands environment variables in the cacheprocess and cachepath config options,
so users can specify something like remotefilelog.cachepath=$HOME/.hgcache
Test Plan:
Set my cachepath to $HOME/.hgcache on my laptop and manually
performed a shallow clone. Verified data was put in ~/.hgcache
Reviewers: sid0
Differential Revision: https://phabricator.fb.com/D1342174
Summary: Pulling from a local non-remotefilelog repo to a remotefilelog repo was broken. This fixes it.
Test Plan: `hg pull` from a local non-remotefilelog repo to a remotefilelog repo.
Reviewers: durham
Reviewed By: durham
Differential Revision: https://phabricator.fb.com/D1341059
Summary:
Recent changes to upstream Mercurial have moved localrepo.getbundle and
localrepo.addchangegroupfiles to changegroup.py. remotefilelog wraps these
functions, and thus needs to be updated.
Applyupdate also had a function signature change, which is fixed here.
Minor fix to a test as well, which had a hard coded time instead of a glob.
Test Plan: ./run-tests.py --with-hg=/data/users/durham/hg/hg
Reviewers: sid0, davidsp, pyd, dschleimer
Differential Revision: https://phabricator.fb.com/D1260737
Previously shallow clones only worked using the streaming clone protocol. With
this change they work for the standard getbundle protocol as well. This is what
the majority of Mercurial users use, so we need to support that.
The current local cache is just files on disk, and this implementation detail
was spread across the extension. This change refactors it to hide the
implementation inside a class so that we can replace it with other
implementations (such as a sqlite local cache) later.
Previously the file service client was a global object that all repos could
share. This was a bit hacky and is no longer needed. Now the file service
client exists per repo instance.
This is part of a series of changes to abstract the local caching and remote
file service in such a way that we can plug and play implementations.
If the memcache process exited early, remotefilelog was throwing an exception
instead of falling back to the server. This change makes it fall back to the
server, and also print a warning that the cache connection closed early.
Summary:
hg bundle was producing shallow bundles. This change makes it produce full
sized bundles so they can be used in other repos.
Test Plan: Added a test
Reviewers: sid0
Reviewed By: sid0
CC: keegancsmith
Differential Revision: https://phabricator.fb.com/D1167462
Summary:
Previously requesting remotefilelog file blobs from the server required write
access in order to write the blob to the cache. This changes it to not abort
entirely if the user doesn't have write access.
Test Plan:
cd tests
./run-tests.py --with-hg=/data/users/durham/hg/hg test-permissions.t
Also ran the test without the fix and verified it fails.
Reviewers: sid0, davidsp, pyd, dschleimer
Reviewed By: dschleimer
Differential Revision: https://phabricator.fb.com/D1145976
Task ID: 3601184
Summary:
Adds a 'hg prefetch' command to remotefilelog for prepopulating the
local cache. Supports specifying revsets and file patterns to limit what is
downloaded.
Test Plan: ./run-tests.py test-prefetch.t --with-hg=/data/users/durham/hg/hg
Reviewers: dschleimer, sid0, davidsp, pyd, mpm
CC: kunalb, minyoung
Differential Revision: https://phabricator.fb.com/D1129942
The alternate lookup code was mistakenly looking for only the last digit
instead of looking at the entire prefix. This meant files with more than 10
alternates would start failing to find histories, which breaks rebase.
When falling back to the master server for cache misses, we only kept two
requests in flight at any time. Over high latency connections (like across
oceans) this resulted in very slow downloads.
This change increases the request size to 10,000 keys at once. This will keep
the size of the request lower than the tcp buffer size, while allowing us to
maximize our throughput.
Previously we sent the entire list of files to the fallback repo in a single ssh
write/flush. If the size of this write exceeded the tcp buffer on the receiving
end, the call would hang until the buffer had room. The problem is that the
receiving end (the server) is hung trying to send data back to the
client. Therefore it deadlocked.
The fix is to send and receive requests one at a time. We always have the next
request in flight while receiving so we shouldn't be waiting on requests too
often.
Enables specifying a name for a repo that is used in the cache key.
This allows multiple repos on a machine to share a cache without the
risk of keys overlapping.
Previously we used a global variable to track if the incoming connection was
from a shallow remote (based on whether the network command was a *_shallow command).
This is hacky and overall a bad idea. The new implementation stores the shallow
flag as a bundlecapability passed to the getbundle command.
A side effect of this is remotefilelog won't work with versions of mercurial
that don't use the getbundle command.
The previous algorithm thought that if the system cache had the file rev, it was
guaranteed to be valid. This isn't true in the case of a machine in which
multiple people share the cache (one person may have pulled a rev but the other
hasn't).
The new algorithm is more explicit. It checks, in order (see the sketch after this list):
- system cache
- local cache
- local cache fallbacks
- remote cache
- master server
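A sketch of that lookup order; the per-source get() API is a hypothetical stand-in for however each store is actually queried:

```python
def lookupblob(sources, filename, node):
    # sources: (systemcache, localcache, localcachefallbacks,
    #           remotecache, masterserver), in that order.
    for source in sources:
        data = source.get(filename, node)  # hypothetical lookup API
        if data is not None:
            return data
    raise KeyError('%s@%s not found in any store' % (filename, node))
```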
Adds a cache client implementation using the opensource python-memcached
library. It's more of an educational example than a production-ready one since
it doesn't perform the requests asynchronously. It does however split up large
files into smaller chunks for you.
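A sketch of the chunking idea using python-memcached; the chunk size and key scheme are assumptions:

```python
import memcache

CHUNKSIZE = 900 * 1024  # stay under memcached's default 1MB value limit

def setblob(client, key, data):
    # client: e.g. memcache.Client(['127.0.0.1:11211'])
    # Split large blobs into numbered chunks, plus a count entry so the
    # reader knows how many pieces to reassemble.
    chunks = [data[i:i + CHUNKSIZE] for i in range(0, len(data), CHUNKSIZE)]
    client.set(key + ':count', len(chunks))
    for i, chunk in enumerate(chunks):
        client.set('%s:%d' % (key, i), chunk)

def getblob(client, key):
    count = client.get(key + ':count')
    if count is None:
        return None
    parts = [client.get('%s:%d' % (key, i)) for i in range(count)]
    if any(p is None for p in parts):
        return None  # treat a missing chunk as a cache miss
    return ''.join(parts)
```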
The remotefilelog extension currently doesn't work with tags. Adding include and
exclude patterns allows users to specify which files they want to treat as
shallow and which they want to download the entire history for. By excluding
.hgtags from being shallow, this enables tags to work in a mostly shallow repo.
This also enables largefile like scenarios where most files are full and only a
few large ones are kept remote.
A rare bug can occur where the local file blob might not exist, but a valid old
version of that blob does exist. This refactors the linknode logic in ancestormap
to check the old versions if the server fetch fails to find the blob.
It still prints an ugly warning message from the server, but this whole issue is
quite rare anyway.
When the cache is stored on a filesystem, excessive stat calls can slow
mercurial updates down dramatically. This reduces it to a single open call for
the cache location and if that fails, a single open call for the local location.
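The open-instead-of-stat pattern, sketched with illustrative paths and a hypothetical helper name:

```python
def readblob(cachepath, localpath):
    # Skip the exists()/stat() dance: just try to open the cache copy and
    # fall back to the local copy on failure. One open call per location, max.
    for path in (cachepath, localpath):
        try:
            f = open(path, 'rb')
        except (IOError, OSError):
            continue
        try:
            return f.read()
        finally:
            f.close()
    return None
```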
A workingctx produces manifest entries with nullid+'a' or nullid+'m'
for any added or modified files. The extension was trying to prefetch
these but they didn't exist and caused an error. Luckily they are length
42 so we can check for them and not prefetch them.
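A sketch of that filter; the 42-character check comes straight from the text above, while the helper and the shape of the input pairs are hypothetical:

```python
def filterprefetchable(fileids):
    # A workingctx manifest reports added/modified files with an 'a' or 'm'
    # marker appended to a null node. Per the note above, those show up here
    # as 42-character ids that don't correspond to real file revisions, so
    # don't try to prefetch them.
    return [(filename, fileid) for filename, fileid in fileids
            if len(fileid) != 42]
```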