sapling

mirror of https://github.com/facebook/sapling.git synced 2024-10-10 08:47:12 +03:00

Author	SHA1	Message	Date
Mads Kiilerich	835157e77d	branchmap: use revbranchcache when updating branch map The revbranchcache is read on demand before it will be used for updating the branch map. It is written back when the branchmap is written and it will thus use the same locking as branchmap. The revbranchcache instance is short-lived; it is only stored in the branchmap from .update() is invoked and until .write() is invoked. Branchmap already assume that the repo is locked in that case. The use of revbranchcache for branch map updates will make sure that the revbranchcache "always" is kept up-to-date. The perfbranchmap benchmark is somewhat bogus, especially when we can see that the caching makes a significant difference between the realistic case of a first run and the rare case of rerunning it with a full cache. Here are some 'base' numbers on mozilla-central: Before: ! wall 6.912745 comb 6.910000 user 6.840000 sys 0.070000 (best of 3) After - initial, cache is empty: ! wall 7.792569 comb 7.790000 user 7.720000 sys 0.070000 (best of 3) After - cache is full: ! wall 0.879688 comb 0.880000 user 0.870000 sys 0.010000 (best of 4) The overhead when running with empty cache comes from checking, missing and updating it every time. Most of the performance improvement comes from not having to extract the branch info from the changelog. The last doubling of performance comes from no longer having to convert all branch names to local encoding but reuse the few already converted branch names. On the hg repo: Before: ! wall 0.715703 comb 0.710000 user 0.710000 sys 0.000000 (best of 14) After: ! wall 0.105489 comb 0.110000 user 0.110000 sys 0.000000 (best of 87)	2015-01-08 00:01:03 +01:00
Mads Kiilerich	1b3892318f	branchcache: introduce revbranchcache for caching of revision branch names It is expensive to retrieve the branch name of a revision. Very expensive when creating a changectx and calling .branch() every time - slightly less when using changelog.branchinfo(). Now, to speed things up, provide a way to cache the results on disk in an efficient format. Each branchname is assigned a number, and for each revision we store the number of the corresponding branch name. The branch names are stored in a dedicated file which is strictly append only. Branch names are usually reused across several revisions, and the total list of branch names will thus be so small that it is feasible to read the whole set of names before using the cache. It will however do that it might be more efficient to use the changelog for retrieving the branch info for a single revision. The revision entries are stored in another file. This file is usually append only, but if the repository has been modified, the file will be truncated and the relevant parts rewritten on demand. The entries for each revision are 8 bytes each, and the whole revision file will thus be 1/8 of 00changelog.i. Each revision entry contains the first 4 bytes of the corresponding node hash. This is used as a check sum that always is verified before the entry is used. That check is relatively expensive but it makes sure history modification is detected and handled correctly. It will also detect and handle most revision file corruptions. This is just a cache. A new format can always be introduced if other requirements or ideas make that seem like a good idea. Rebuilding the cache is not really more expensive than it was to run for example 'hg log -b branchname' before this cache was introduced. This new method is still unused but promise to make some operations several times faster once it actually is used. Abandoning Python 2.4 would make it possible to implement this more efficiently by using struct classes and pack_into. The Python code could probably also be micro optimized or it could be implemented very efficiently in C where it would be easy to control the data access.	2015-01-08 00:01:03 +01:00
Anton Shestakov	6fe7d43de3	hgweb: move archive entries outside of <li> in monoblue style archiveentry already includes surrounding <li></li>, so putting archive entries inside <li> element produced incorrect markup.	2015-01-09 22:53:38 +08:00
Anton Shestakov	8bdbccf3bf	hgweb: add searchhint to templates/coal/map coal style uses every template (except header.tmpl) directly from paper style, but doesn't use paper/map file. Elements defined in such map files are used in templates as you would expect. For example, paper/search.tmpl contains '{searchhint}' and template engine replaces that with the actual hint. But when coal style reuses paper/search.tmpl, it needs to define searchhint in its map file as well, or template engine will not find it. So let's copy it from paper/map to coal/map. Before this change, if the coal style was selected, the hint for the search field in page header was present, but it was completely empty. Although the absence of searchhint in coal/map produced no error.	2015-01-09 15:24:55 +08:00
Gregory Szorc	73fa937531	cmdutil.jsonchangeset: properly compute added and removed files jsonchangeset._show() was computing the reverse status of the current changeset. As a result, added files were showing up as removed and removed files were showing up as adds. There were existing tests for this code and they were flat out wrong.	2015-01-05 22:18:55 -08:00
Yuya Nishihara	b67b0a75ea	revset: drop pre-lazyset optimization for stringset of subset == entire repo It was introduced at deb42ca4dd93, where spanset.__contains__() did not exist. Nowadays, we have to pay huge penalty for len(subset). The following example showed that OR operation could be O(n * m^2) (n: len(repo), m: number of OR operators, m >= 2) probably because of filteredset.__len__. revset #0: 0\|1\|2\|3\|4\|5\|6\|7\|8\|9 0) wall 8.092713 comb 8.090000 user 8.090000 sys 0.000000 (best of 3) 1) wall 0.445354 comb 0.450000 user 0.430000 sys 0.020000 (best of 22) 2) wall 0.000389 comb 0.000000 user 0.000000 sys 0.000000 (best of 7347) (0: 3.2.4, 1: 3.1.2, 2: this patch)	2015-01-03 10:25:08 +09:00
Matt Harbison	b391094100	largefiles: properly sync lfdirstate after removing largefiles The more aggressive synchronization of lfdirstate that was backed out in 4fe80f20ab06 masked the problem where lfdirstate would hold an 'R' for a largefile that was added and then removed without a commit between. We could just conditionally call lfdirstate.drop() or lfdirstate.remove() here, but this also properly updates lfdirstate if the standin doesn't exist for the file somehow (i.e. call drop instead of remove). Without this change, the precommit status in the commit command immediately after the test change lists the removed (and never committed) largefile as 'R'. It can also lead to situations where the status command reports the same, long after the commit [1]. [1] http://www.selenic.com/pipermail/mercurial-devel/2015-January/065153.html	2015-01-04 15:26:26 -05:00
Matt Mackall	e50959fc84	Added signature for changeset 4fe80f20ab06	2015-01-01 16:30:11 -06:00
Mads Kiilerich	b438c7467e	largefiles: backout d20af8be6a14 - linear updates handle m -> a differently d20af8be6a14 introduced a significant performance regression: All largefiles were marked 'normallookup' by linear (or noop) updates and had to be rehashed by the next command. The previous change introduced a different solution to the problem d20af8be6a14 solved and we can thus back it out again.	2014-12-31 14:46:03 +01:00
Mads Kiilerich	6cfab77588	largefiles: mark lfile as added in lfdirstate when the standin is added This is an alternative solution to the problem addressed by d20af8be6a14. This implementation has the advantage that it doesn't mark clean largefiles as normallookup. We can thus avoid repeated rehashing of all largefiles when d20af8be6a14 is backed out. This implementation use the existing 'lfmr' actions that 4f7f7352d9d0 introduced for handling another part of the same cases.	2014-12-31 14:46:02 +01:00
Mads Kiilerich	d0a915efcf	tests: add test coverage for lfdirstate invalidation of linear update d20af8be6a14 introduced a significant performance regression: All largefiles are marked 'normallookup' in lfdirstate by linear (or noop) updates and has to be rehashed by the next command. To avoid such regressions, keep an eye on the dirstate content after a plain 'hg up'.	2014-12-31 14:45:02 +01:00
Matt Harbison	efebc89ab9	largefiles: enable subrepo support for remove Previously, remove failed when operating on a largefile in a subrepo, stating that the file is untracked.	2014-12-30 21:12:52 -05:00
Pierre-Yves David	171c6e502b	transplant: properly skip empty changeset (issue4423) If resolving a merge conflict result in an empty changesets, we now properly skip the changeset instead of crashing. Original patch from Robert Collins <robertc@robertcollins.net>.	2015-01-08 21:36:12 -08:00
Mathias De Maré	1521d2118c	color: add support for colorizing git subrepo diffs	2015-01-08 21:30:22 +01:00
Matt Harbison	5aac6fe79c	tests: fix test-casefolding.t output This change started with 59dfa285ee25.	2015-01-08 23:33:56 -05:00
Matt Mackall	bf17f56a67	sshpeer: more thorough shell quoting This fixes an issue spotted by Jesse Hertz.	2014-12-29 14:27:02 -06:00
FUJIWARA Katsunori	ef3bad18bd	i18n-ja: synchronized with 04d70d13ef9d	2014-12-29 15:59:56 +09:00
Wagner Bruna	1546a1d329	i18n-pt_BR: synchronized with 480699610f70	2014-12-26 17:21:08 -02:00
Augie Fackler	ffd2cf1dba	demandimport: blacklist distutils.msvc9compiler (issue4475) This module depends on _winreg, which is windows-only. Recent versions of setuptools load distutils.msvc9compiler and expect it to ImportError immediately when on non-Windows platforms, so we need to let them do that. This breaks in an especially mystifying way, because setuptools uses vars() on the imported module. We then throw an exception, which vars doesn't pick up on well. For example: In [3]: class wat(object): ...: @property ...: def __dict__(self): ...: assert False ...: In [4]: vars(wat()) --------------------------------------------------------------------------- TypeError Traceback (most recent call last) <ipython-input-4-2781ada5ffe6> in <module>() ----> 1 vars(wat()) TypeError: vars() argument must have __dict__ attribute Which is similar to the problem we run into.	2014-12-22 17:27:31 -05:00
Matt Harbison	93382de245	largefiles: fix a spurious missing file warning with forget (issue4053) If an uncommitted and deleted file was forgotten, a warning would be emitted, even though the operation was successful. See the previous patch for 'remove -A' for the exact circumstances, and details about the cause.	2014-12-21 15:06:54 -05:00
Matt Harbison	8edc086e01	largefiles: fix a spurious missing file warning with 'remove -A' (issue4053) The bug report doesn't mention largefiles, but the given recipe doesn't fail unless the largefiles extension is loaded. The problem only affected normal files, whether or not any largefiles are committed, and only files that have not been committed yet. (Files with an 'a' state are dropped from dirstate, not marked removed.) Further, if the named normal file never existed, the warning would be printed out twice. The problem is that the core implementation of remove() calls repo.status(), which eventually triggers a dirstate.walk(). When the file isn't seen in the filesystem during the walk, the exception handling finds the file in dirstate, so it doesn't complain. However, the largefiles implementation called status() again with all of the original files (including the normal ones, just dropped). This time, the exception handler doesn't find the file in dirstate and does complain. This simply excludes the normal files from the second repo.status() call, which the largefiles extension has no interest is processing anyway.	2014-12-21 15:04:13 -05:00
Matt Harbison	572104a185	largefiles: introduce the 'composelargefilematcher()' method This is a copy/paste (with the necessary tweaks) of the composenormalfilematcher method currently on default, which does the inverse- this trims the normal files out of the matcher. It will be used in the next patch.	2014-12-21 14:42:46 -05:00
Matt Mackall	fcca668c77	Added signature for changeset ac36055e3c9c	2014-12-18 14:59:28 -06:00
Matt Mackall	8b83dfc480	pathauditor: check for Windows shortname aliases	2014-12-18 14:18:28 -06:00
Augie Fackler	13206234f4	pathauditor: check for codepoints ignored on OS X	2014-12-16 13:08:17 -05:00
Augie Fackler	c1fe89f64f	darwin: omit ignorable codepoints when normcase()ing a file path This lets us avoid some nasty case collision problems in OS X with invisible codepoints.	2014-12-16 13:07:10 -05:00
Augie Fackler	3c9e7fcc66	encoding: add hfsignoreclean to clean out HFS-ignored characters According to Apple Technote 1150 (unavailable from Apple as far as I can tell, but archived in several places online), HFS+ ignores sixteen specific unicode runes when doing path normalization. We need to handle those cases, so this function lets us efficiently strip the offending characters from a UTF-8 encoded string (which is the only way it seems to matter on OS X.)	2014-12-16 13:06:41 -05:00
Mike Edgar	3171ae0735	synthrepo: new filenames must not also be new directories, and vice-versa When generating many new files into a set of many possible new directories, there is the possibility that the same path is chosen as both file and directory. How likely this is depends on the size of the dictionary used, the generated directory structure and the number of generated files.	2014-12-12 17:42:14 +00:00
Martin von Zweigbergk	01d503fc7e	status: don't override _buildstatus() in workingcommitctx Now that the caching into _status is done in workingctx._dirstatestatus(), which workingcommitctx._dirstatestatus() does not call, there is no caching to prevent in _buildstatus(), so stop overriding it.	2015-01-08 13:29:06 -08:00
Martin von Zweigbergk	370c0e4b47	status: cache dirstate status in _dirstatestatus() Since it's only the dirstate status we cache, it makes more sense to cache it in the _dirstatestatus() method. Note that this change means the dirstate status will also be cached when status is requested between the working copy and some other revision, while we currently only cache the result if exactly the status between the working copy and its parent is requested.	2015-01-08 13:12:44 -08:00
Augie Fackler	ab5a564dd5	test-casefolding.t: demonstrate a bug with HFS+ ignoring some codepoints	2014-12-11 15:42:49 -05:00
Augie Fackler	1a7f98d9da	manifest: disallow setting the node id of an entry to None manifest.diff() uses None as a special value to denote the absence of a file, so setting a file node to None means you then can't trust manifest.diff(). This should also make future manifest work slightly easier.	2014-12-12 13:40:44 -05:00
Augie Fackler	dbb0b3f2fe	context: stop setting None for modified or added nodes Instead use a magic value, so that we can identify modified or added nodes correctly when using manifest.diff(). Thanks to Martin von Zweigbergk for catching that we have to update _buildstatus as well. That part eluded my debugging for some time.	2014-12-12 15:29:54 -05:00
Matt Harbison	f96d6adc63	largefiles: don't actually remove largefiles in an addremove dry run The addlargefiles() method already properly handled dry runs.	2014-12-13 13:33:48 -05:00
Durham Goode	fd373c16b0	log: fix log revset instability The log/graphlog revset was not producing stable results since it was iterating over a dict. Now we sort before iterating to guarantee a fixed order. This fixes some potential flakiness in the tests.	2014-12-08 15:41:54 -08:00
Durham Goode	74a5156e21	log: fix log -f slow path to actually follow history The revset created when -f was used with a slow path (for patterns and directories) did not actually contain any logic to enforce follow. Instead it was depending on the passed in subset to already be limited (which was limited to :. but not ::.). This fixes it by adding a '& ::.' to any -f log revset. hg log -f <file> is still broken, in that it can return results that aren't actually ancestors of the current file, but fixing that has major perf implications, so we'll deal with it later.	2014-12-05 14:27:32 -08:00
Martin von Zweigbergk	7a9312d969	update: don't overwrite untracked ignored files on update When looking for untracked files that would conflict with a tracked file in the target revision (or the remote side of a merge), we explcitly exclude ignored files. The code was added in f1db75422e70 (merge: refactor unknown file conflict checking, 2012-02-09), but it seems like only unknown, not ignored, files were considered since the beginning of time. Although ignored files are mostly build outputs and backup files, we should still not overwrite them. Fix by simply removing the explicit check.	2014-11-16 23:41:44 -08:00
Martin von Zweigbergk	dbe58b6bd9	update: add tests for untracked local file We don't seem to have any tests for updating to another revision when there are untracked files on the local side that conflict with the those on the remote side, so let's add tests. This shows how we overwrite untracked ignored files when updating to a revision that tracks the file.	2014-12-02 17:11:01 -08:00
Pierre-Yves David	8c62cbc185	rebase: ignore negative state when updating back to original wc parent The state mapping also contains some magic negative values (detached parent, ignored revision). Blindly reading the state thus lead to unfortunate usage of the negative value as an update destination. We now filter them out. We do a minor alteration of the test to catch this.	2014-12-02 11:06:38 -08:00
Matt Mackall	f37609509f	merge with i18n	2014-12-01 18:51:10 -06:00
Matt Mackall	57db67f921	Added signature for changeset 99b29d2bd5ed	2014-12-01 18:39:19 -06:00
Wagner Bruna	3b892bfd69	i18n-pt_BR: synchronized with a05dbabaa7d7	2014-12-01 11:21:30 -02:00
Mads Kiilerich	4c4b1f5ddd	merge: before cd/dc prompt, check that changed side really changed Before, merging would in some cases ask "wrong" questions about "changed/deleted" conflicts ... and even do it before the resolve phase where they can be postponed, re"resolved" or answered in bulk operations. Instead, check that the content of the changed file really did change. Reading and comparing file content is expensive and should be avoided before the resolve phase. Prompting the user is however even more expensive. Checking the content here is thus better. The 'f in ancestors[0]' should not be necessary but is included to be extra safe.	2014-12-01 02:30:21 +01:00
Mads Kiilerich	0619283698	largefiles: don't show largefile/normal prompts if one side is unchanged	2014-12-01 02:11:29 +01:00
Mads Kiilerich	99f8557b66	tests: add test-issue3084.t cases for 'changed but same' as for 'unchanged' Use suffix -same for cases where file changed but content is the same - that is the case where manifestmerge doesn't detect that a file is unchanged. (The suffix -id is already used for cases where the file didn't change - that is the trivial case where manifestmerge detects that the file is unchanged.) These new tests are good but the results are bad. There shouldn't be any merge conflicts or prompts when one side didn't change.	2014-12-01 02:11:17 +01:00
Mads Kiilerich	ad85c54a44	tests: clean-up of largefiles tests in test-issue3084.t Prepare for adding more test cases to the systematic testing, moving the test from dcd04c26c0f8 to another section.	2014-12-01 02:10:57 +01:00
Pierre-Yves David	1e785e1322	pushkey: gracefully handle prepushkey hook failure (issue4455) This allow to gracefully report the failure of the bookmark push and carry on. Before this change set. Local push would plain quit and wireprotocol would failed in various ungraceful way.	2014-11-29 19:17:47 -08:00
Pierre-Yves David	6cf24b99de	hook: raise a more specialized HookAbort exception when a hook fails We need to gracefully handle some aborts for pushkey, especially because it leads to a user-facing crash over the wireprotocols. So we need a more specialized exception to catch.	2014-11-29 19:13:35 -08:00
Matt Mackall	c8d5dc5d96	help: fix typo in files help	2014-12-01 15:04:34 -06:00
Pierre-Yves David	56b039c98c	revset: fix first and last for generatorset (issue4465) The code was just plain wrong.	2014-12-01 05:18:12 -08:00

... 2 3 4 5 6 ...

23753 Commits