sapling

mirror of https://github.com/facebook/sapling.git synced 2024-10-11 17:27:53 +03:00

Author	SHA1	Message	Date
Siddharth Agarwal	27976ad2dc	revlog: compute length of compressed deltas along with chain length In upcoming patches to the revlog, we're going to split up the notions of bounding I/O and bounding CPU.	2014-11-11 19:54:36 -08:00
Siddharth Agarwal	6e115e5383	revlog: store fulltext when compressed delta is bigger than it This is a very silly case and not particularly likely to happen in the wild, but it turns out we can hit it in a couple of places. As we tune the storage parameters we're likely to hit more such cases. The affected test cases all have smaller revlogs now.	2014-11-11 21:41:12 -08:00
Siddharth Agarwal	e5d387f47e	revlog: make a predicate clearer with parens	2014-11-11 21:39:56 -08:00
Pierre-Yves David	a818f5fbca	transaction: extract backupentry registration in a dedicated function We are about to use the 'backupentry' mechanism to allow cleaning up transaction-related temporary files (such as 'changelog.i.a'). We start by extracting the entry registration into its own method for easy reuse. At that point, I would like to rename the backup-file related variable to something generic but I'm a bit short of ideas.	2014-11-05 13:06:24 +00:00
Pierre-Yves David	8259127ccb	transaction: pass the transaction to 'postclose' callback This mirrors the API for 'pending' and 'finalize' callbacks. I do not have immediate usage planned for it, but I'm sure some callback will be happy to access transaction related data.	2014-11-08 16:35:15 +00:00
Pierre-Yves David	29f854f61a	transaction: pass the transaction to 'finalize' callback The callback will likely need to perform some operation related to the transaction (eg: registering file update). So we better pass the current transaction as the callback argument. Otherwise callback that needs it has to rely on horrible weak reference trick. This allow already allow us to slay a wild weak reference usage.	2014-11-08 16:31:38 +00:00
Pierre-Yves David	92bf4dcbdc	transaction: pass the transaction to 'pending' callback The callback will likely need to perform some operation related to the transaction (eg: backing files up). So we better pass the current transaction as the callback argument. Otherwise callback that needs it has to rely on horrible weak reference trick. The first foreseen user of this is changelog._writepending. We would like it to register the temporary file it create for cleanup purpose.	2014-11-08 16:27:50 +00:00
Pierre-Yves David	6ab03ec965	transaction: gather backupjournal logic together in the __init__ The initialisation of file-backup related variable were a bit scattered, we gather them together.	2014-11-05 10:22:17 +00:00
Pierre-Yves David	b91f1df8cd	transaction: handle missing file in backupentries (instead of using entries) The case where a backup of a missing file was requested was previously handled by the 'entries' list. As the 'backupentries' is about to gain ability to backup files outside of '.hg/store', we want it to be able to handle the missing file too. Reminder: using 'addbackup' on a missing file means that such file needs to be deleted if we rollback the transaction.	2014-11-05 01:38:48 +00:00
Pierre-Yves David	c436094d45	test-hup: use ls instead of echo Having one file entry per line makes it easier to analyse diffs.	2014-11-05 01:23:40 +00:00
FUJIWARA Katsunori	3f4cab3fa5	largefiles: move "copyalltostore" invocation into "markcommitted" Before this patch, while "hg convert", largefiles avoids copying largefiles in the working directory into the store area by combination of setting "repo._isconverting" in "mercurialsink{before\|after}" and checking it in "copytostoreabsolute". This avoiding is needed while "hg convert", because converting doesn't update largefiles in the working directory. But this implementation is not efficient, because: - invocation in "markcommitted" can easily ensure updating largefiles in the working directory "markcommitted" is invoked only when new revision is committed via "commit" of "localrepository" (= with files in the working directory). On the other hand, "commitctx" may be invoked directly for in-memory committing. - committing without updating the working directory (e.g. "import --bypass") also needs this kind of avoiding For efficiency of this kind of avoiding, this patch does: - move "copyalltostore" invocation into "markcommitted" - remove meaningless procedures below: - hooking "mercurialsink{before\|after}" to (un)set "repo._isconverting" - checking "repo._isconverting" in "copytostoreabsolute" This patch invokes "copyalltostore" also in "_commitcontext", because "_commitcontext" expects that largefiles in the working directory are copied into store area after "commitctx". In this case, the working directory is used as a kind of temporary area to write largefiles out, even though converted revisions are committed via "commitctx" (without updating normal files).	2014-11-08 00:48:41 +09:00
FUJIWARA Katsunori	9e83bdcdeb	largefiles: avoid printing messages while transplanting by "_lfstatuswriters" Putting "lambda msg, *opts: None" (= avoid printing messages always) into "_lfstatuswriters" while transplanting makes explicit passing "printmessage = False" for "updatelfiles()" useless. This patch also removes setting/unsetting "repo._istransplanting" in "overridetransplant", because there is no code path referring it.	2014-11-08 00:48:41 +09:00
FUJIWARA Katsunori	ba0a7a0792	largefiles: update standins only at the 1st commit of "transplant --continue" Before this patch, "hg transplant --continue" may record incorrect standins, because largefiles extension always avoid updating standins while transplanting, even though largefiles in the working directory may be modified manually at the 1st commit of "hg transplant --continue". But, on the other hand, updating standins should be avoided at subsequent commits for efficiency reason. To update standins only at the 1st commit of "hg transplant --continue", this patch uses "automatedcommithook", which updates standins by "lfutil.updatestandinsbymatch()" only at the 1st commit of resuming. Even after this patch, "repo._istransplanting = True" is still needed to avoid some status report while updating largefiles in "lfcommands.updatelfiles()". This is reason why this patch omits not "repo._istransplanting = True" in "overriderebase" but examination of "getattr(repo, "_istransplanting", False)" in "updatestandinsbymatch".	2014-11-08 00:48:41 +09:00
FUJIWARA Katsunori	5e13e41d95	largefiles: avoid redundant "updatelfiles" invocation in "overridetransplant" At "hg transplant --merge REV", largefiles newly coming from the 2nd parent (= REV) are marked as "a"(dded) by "patch.patch()", and have to be marked as "n"(ormal) after commit. But until changeset 978713c45992, such largefiles were still marked as "a" unexpectedly even after commit, because no additional entry is added to filelog of such largefiles and they aren't listed in "repo[newnode].files()" in this case: "newnode" is one of newly committed changeset (= result of "repo.commit()"). "updatelfiles" invocation in "overridetransplant" shadows this problem by forcibly synchronizing lfdirstate to dirstate. Now, "updatelfiles" invocation in "overridetransplant" is redundant, because changeset 978713c45992 made "markcommitted" use "ctx.files()" to get targets of "synclfdirstate" instead of "repo[newnode].files()".	2014-11-08 00:48:38 +09:00
Matt Mackall	4aa4e6191e	merge with stable	2014-11-12 15:18:30 -06:00
Yuya Nishihara	b2ed607f5e	util.system: remove unused handling of onerr=ui In our code, onerr is None or util.Abort. It smells bad to overload ui and exception class.	2014-11-08 13:14:19 +09:00
Yuya Nishihara	ceda6fbba9	util.system: use ui.system() in place of optional ui.fout parameter	2014-11-08 13:06:22 +09:00
Yuya Nishihara	c3aa617f2e	ui: introduce util.system() wrapper to make sure ui.fout is used This change is intended to avoid future problem of data corruption under command server. out=ui.fout is mandatory as long as command server uses stdout as IPC channel.	2014-11-08 12:57:42 +09:00
Yuya Nishihara	990e90eeb6	hook: remove redundant code to redirect http hook output to client stream out=ui and out=ui.fout should be the same here. ui.fout was introduced at 7d217b512ae2, which was not available when out=ui was added at 5716839ccf7d.	2014-11-12 22:21:51 +09:00
Yuya Nishihara	2a05713142	hgk: forward command output to ui.fout consistently Nobody would want to run hgk in command server, but it should work in principle. This fixes possible data corruption of command-server channel.	2014-11-12 21:53:44 +09:00
Matt Mackall	9620de068c	merge with stable	2014-11-11 18:43:19 -06:00
Augie Fackler	7a3a996d0f	run-tests: use a try/except ladder instead of looking for a specific version This ensures we get json instead of simplejson in as many places as possible.	2014-11-10 13:20:56 -05:00
Augie Fackler	c2e4fc2371	hghave: use a less brittle have-json check	2014-11-10 13:27:25 -05:00
Sean Farley	93b998c77a	sortdict: add insert method Future patches will allow extensions to choose which order a namespace should output in the log, so we add a way for sortdict to insert to a specific location.	2014-10-15 12:39:19 -07:00
Sean Farley	8ea5f6192f	sortdict: add iteritems method Future patches will start using sortdict for log operations where order is important. Adding iteritems removes the headache of having to remember to use items() if the object is a sortdict.	2014-11-09 13:15:28 -08:00
Martin von Zweigbergk	98fbfc44f9	addremove: add back forgotten files (BC) After running "hg forget README && hg addremove", README will still be reported as removed, while "hg forget README && hg add README" adds it back so it gets reported as clean. It seems like they should behave the same. Furthermore, it seems like no files should remain untracked after 'hg addremove && hg commit' (or 'hg commit -A'). For these reasons, change the behavior of addremove so it does add forgotten files back. The problem is with scmutil._interestingfiles(), which reports the file as removed, so scmutil.addremove() does not add it. Fix by teaching _interestingfiles() to report forgotten files separately from removed files and make addremove() add forgotten files back. However, do not treat forgotten files as sources for rename detection. Note that since removed and forgotten files are treated the same before this change, forgotten files were considered sources for rename detection. Also update the other caller, marktouched(), in the same way as addremove().	2014-11-08 23:13:39 -08:00
Martin von Zweigbergk	42c14b3ed1	add: add back forgotten files even when not matching exactly (BC) I accidentally did 'hg forget .' and tried to undo the operation with 'hg add .'. I expected the files to be reported as either modified or clean, but they were still reported as removed. It turns out that forgotten files are only added back if they are listed explicitly, as shown by the following two invocations. This makes it hard to recover from the mistake of forgetting a lot of files. $ hg forget README && hg add README && hg status -A README C README $ hg forget README && hg add . && hg status -A README R README The problem lies in cmdutil.add(). That method checks that the file isn't already tracked before adding it, but it does so by checking the dirstate, which does have an entry for forgotten files (state 'r'). We should instead be checking whether the file exists in the workingctx. The workingctx is also what we later call add() on, and that method takes care of transforming the add() into a normallookup() on the dirstate. Since we're changing repo.dirstate into wctx, let's also change repo.walk into wctx.walk for consistency (repo.walk calls wctx.walk, so we're simply inlining the call).	2014-11-10 14:51:18 -08:00
Martin von Zweigbergk	abccb5c328	context.status: explain "caching reasons" more fully Where we "load earliest manifest first for caching reasons", elaborate on what "caching reasons" refers to. Text provided by Matt in http://thread.gmane.org/gmane.comp.version-control.mercurial.devel/73235/focus=73578.	2014-11-11 10:16:54 -08:00
Augie Fackler	10400a5ca1	localrepo: rename revlog.maxchainlen to format.maxchainlen This is more consistent with other option names, as spotted by Pierre-Yves. Thanks!	2014-11-11 10:35:06 -05:00
Mateusz Kwapich	3433abb6a8	revlog: add config variable for limiting delta-chain length The current heuristic for deciding between storing delta and full texts is based on ratio of (sizeofdeltas)/(sizeoffulltext). In some cases (for example a manifest for ahuge repo) this approach can result in extremely long delta chains (~30,000) which are very slow to read. (In the case of a manifest ~500ms are added to every hg command because of that). This commit introduces "revlog.maxchainlength" configuration variable that will limit delta chain length.	2014-11-06 14:20:05 -08:00
Mateusz Kwapich	1a554418d5	debugrevlog: fix computing chain length in debugrevlog -d The chain length was computed correctly only when generaldelta feature was enabled. Now it's fixed. When generaldelta is disabled the base revision in revlog index is not the revision we have delta against - it's always previous revision. Instead of incorrect chainbaseandlen in command.py we are now using two single-responsibility functions in revlog.py: - chainbase(rev) - chainlen(rev) Only chainlen(rev) was missing so it was written to mimic the way the chain of deltas is actually found during file reconstruction.	2014-11-06 14:08:25 -08:00
Pierre-Yves David	1d1e3a5b9c	transaction: factorise append-only file registration The addition is done in two different places but differs slightly. We factorise this addition to ensure it is consistent in all places.	2014-11-05 10:13:01 +00:00
Pierre-Yves David	b9cd014175	transaction: document `tr.add`	2014-11-05 13:00:48 +00:00
Pierre-Yves David	9a1f491fee	transaction: drop backupentries logic from startgroup and endgroup The `startgroup` and `endgroup` methods are used in a very specific context to wrap a very specific operation (revlog truncation). It does not make sense to perform any other operations during such a "group" (eg:file backup). There is currently no user of backupfile during a "group" so we drop the group-specific code and restrict authorized operations during "group".	2014-11-05 10:05:38 +00:00
Pierre-Yves David	483eb136e7	transaction: document startgroup and endgroup These enigmatic methods are only used in repair. We document them to clarify there purpose and user.	2014-11-05 10:00:15 +00:00
Pierre-Yves David	36a4f98617	transaction: mark backup-related attributes private As the transaction is gaining more functions and attributes, it is important to clarify what is part of the public API.	2014-11-05 09:31:57 +00:00
Pierre-Yves David	63c82fbc8a	transaction: document the contents of `tr.backupentries` Now that all items are known we can document it.	2014-11-05 01:30:29 +00:00
Pierre-Yves David	9d5b590e75	transaction: drop the third item in `tr.backupentries` This third item is always None and never used.	2014-11-05 01:33:16 +00:00
Matt Mackall	816fd34333	merge with stable	2014-11-10 17:29:15 -06:00
Siddharth Agarwal	3e8587d071	changegroup.cg2packer: lookup 'group' via inheritance chain This lets extensions insert themselves in the class hierarchy.	2014-11-07 17:54:59 -08:00
Martin von Zweigbergk	f07d2059f7	context.status: only filter suspect symlinks in the dirstate status We don't care about filtering out symlinks that have already been committed with full content, only those that have been accidentally resolved in the working directory.	2014-10-23 17:00:38 -07:00
Martin von Zweigbergk	e1831ee6df	context.status: inline _poststatus() By inlining _poststatus() into _buildstatus(), it becomes clearer that it is only called for the workingctx.	2014-10-23 16:19:56 -07:00
Martin von Zweigbergk	9901573e36	context.status: remove now-empty _prestatus()	2014-10-12 00:06:40 -07:00
Martin von Zweigbergk	22a4f35851	context.status: call _dirstatestatus() from within _buildstatus() By making the call to _dirstatestatus() within _buildstatus(), it becomes clearer that it's called only for the workingctx.	2014-10-11 23:30:08 -07:00
Martin von Zweigbergk	f41e08e268	context.status: move manifest caching trick to _buildstatus() In basectx._buildstatus(), we read the manifests for the two revisions being compared. For "caching reasons" unknown to me, it is better to read the earlier manifest first, which basectx._prestatus() takes care of. However, if the 'self' context is a committablectx and the 'other' context is the parent of the working directory (as in the very common case of plain "hg status"), there is no need to read any manifests at all -- all that's needed is the dirstate status. To avoid reading the manifests, _prestatus() is overridden in committablectx and avoids calling its super method, and _buildstatus() calls its super method only if the 'other' context is not the parent of the working directory. It seems easier to follow what's happening if we move the pre-fetching to _buildstatus() just before the place where the manifests are fetched. We just need to add an extra check that the revision is not None to handle the case that was previously handled by subclass overriding. That also makes it safe for committablectx._prestatus() to call its parent, although the latter now becomes empty, so we won't bother.	2014-10-12 00:00:13 -07:00
Martin von Zweigbergk	7d724075bb	context.status: remove unused arguments from _matchstatus()	2014-10-11 23:18:53 -07:00
Martin von Zweigbergk	3e01ff74de	context.status: remove overriding in workingctx The workingctx method simply calls the super method. The only effect it has is that it uses a different default argument for the 'other' argument. The only in-tree caller is patch.diff, which always passes an argument to the method, so it should be safe to remove the overriding. Having the default argument depend on the type seems rather dangerous anyway.	2014-10-23 13:43:20 -07:00
Mike Edgar	536f09de36	synthrepo: when adding files, ensure new path is not a directory	2014-10-20 14:20:43 -04:00
Mike Edgar	b7b380cf6e	synthrepo: synthesized dates must be positive, fit in 32-bit signed ints	2014-10-20 13:59:13 -04:00
Mads Kiilerich	f7f618d52d	discovery: test coverage for issue4438 / 475a22a41c55 / a720a37e15a3 The randomness in the discovery protocol made this problem hard to reproduce. The test mocks random.sample to make sure we hit the problem every time. The set iteration order also made the output unstable ... but with the issue fixed, it is stable.	2014-11-06 01:48:29 +01:00

... 3 4 5 6 7 ...

23292 Commits