sapling

mirror of https://github.com/facebook/sapling.git synced 2024-10-11 09:17:30 +03:00

Author	SHA1	Message	Date
Pierre-Yves David	ff08bc7b93	obsstore: disable garbage collection during initialization (issue4456) Python garbage collection is triggered by container creation. So code that creates a lot of tuples tends to trigger GC a lot. We disable the gc during obsolescence marker parsing and associated initialization. This provides an interesting speedup (25%). Load marker function on my 58758 markers repo: before: 0.468247 seconds after: 0.344362 seconds The benefit is a bit less visible overall. With python2.6 on my system I see: after: 0.60 before: 0.53 The difference is probably explained by the delaying of a costly GC. (but there is still a win). Marking involved tuples, lists and dicts as ignorable by the garbage collector should give us more benefit. But this is another adventure. Thanks goes to Siddharth Agarwal for the lead.	2014-11-26 16:58:31 -08:00
Pierre-Yves David	dd01dca5ec	dirstate: use the 'nogc' decorator Now that we have a generic way to disable the gc, we use it. however, we have too use it in a baroque way. See inline comment for details.	2014-12-04 05:43:15 -08:00
Pierre-Yves David	a756e8c469	util: add a 'nogc' decorator to disable the garbage collection Garbage collection behave pathologically when creating a lot of containers. As we do that more than once it become sensible to have a decorator for it. See inline documentation for details.	2014-12-04 05:43:40 -08:00
Martin von Zweigbergk	72e9071545	merge: make 'keep' message more descriptive Most merge action messages don't describe the action itself, they describe the reason the action was taken. The only exeption is the 'k' action, for which the message is just "keep" and instead there is a code comment folling it that says "remote unchanged". Let's move that comment into the merge action message.	2014-12-03 14:03:20 -08:00
Matt Harbison	e5bd4011a8	addremove: restore the relative path printing when files are named This fixes the previously mentioned issue with 7d5fcea60c78, and undoes its corresponding test change. The test change demonstrates the correctness when a file is specified (i.e. the glob is required on Windows because relative paths use '\' and absolute paths use '/'). It is admittedly very subtle, but there will be a more robust test in the addremove -S v3 series.	2014-12-04 23:08:27 -05:00
Matt Harbison	25fa81d27a	match: introduce uipath() to properly style a file path Several methods print files relative to the repo root, unless files are named on the command line, in which case they are printed relative to cwd. Since the check relies on the 'pats' parameter, which needs to be replaced by a matcher when adding subrepo support, this logic gets folded into the matcher to tidy up the callers. Prior to 7d5fcea60c78, this style decision was based off of whether or not the 'pats' list was empty. That change altered the check to test match.anypats() instead, in order to make paths printed consistent when -I/-X is specified. That however, changed the style when a file is given to the command. So now we test the pattern list to get the old behavior for files, as well as test -I/-X to get the consistency for patterns.	2014-12-04 23:04:55 -05:00
Matt Mackall	004a613006	merge with stable	2014-12-05 12:10:56 -06:00
Martin von Zweigbergk	7526f0e8b8	merge: don't overwrite conflicting file in locally renamed directory When the local side has renamed a directory from a/ to b/ and added a file b/c in it, and the remote side has added a file a/c, we end up overwriting the local file b/c with the contents of remote file a/c. Add a check for this case and use the merge ('m') action in this case instead of the directory rename get ('dg') action.	2014-12-03 11:02:52 -08:00
Martin von Zweigbergk	24dc017945	merge: don't ignore conflicting file in remote renamed directory When the remote side has renamed a directory from a/ to b/ and added a file b/c in it, and the local side has added a file a/c, we end up moving a/c to b/c without considering the remote version of b/c. Add a check for this case and use the merge ('m') action in this case instead of the directory rename ('dm') action.	2014-12-03 10:56:07 -08:00
Martin von Zweigbergk	0483da3f64	merge: duplicate 'if f in copied' into each branch	2014-11-23 15:08:50 -08:00
Martin von Zweigbergk	1f5f9eefce	merge: branch code into {n1 and n2, n1, n2} top-level cases There are three high-level cases that are of interest in manifestmerge(): 1) The file exists on both sides, 2) The file exists only on the local side, and 3) The file exists only on the remote side. Let's make this clearer in the code. The 'if f in copied' case will be broken up into the two applicable branches in the next patch.	2014-11-23 14:09:10 -08:00
Ryan McElroy	cf43c21202	bookmarks: factor out repository lookup from writing bookmarks file This will allow the share extension to extend bookmarks functionality to share bookmarks between repositories.	2014-12-02 20:31:19 -08:00
Mike Edgar	343ab73738	parsers: ensure revlog index node tree is initialized before insertion Currently, the revlog index C implementation assumes its node tree will be initialized before a new element is inserted by revnum. For example, revlog.py executes 'self.index.insert(-1, e)' in _addrevision(). This is only safe because the node tree has been initialized by a "node in self.nodemap" check made in addrevision(). (For context, this was discovered while developing an experimental revlog mixin which stores "elided nodes" via a separate code path from _addrevision(); that new code path segfaults without this patch.)	2014-12-04 12:02:02 -05:00
Yuya Nishihara	72198a2056	revset: add 'only' to DoS-safe symbols There is no reason to disable it in hgweb because the same query can be written without using 'only()'.	2014-12-03 22:56:42 +09:00
Yuya Nishihara	f62323518b	revset: move 'only' so that functions are sorted alphabetically	2014-12-03 22:52:54 +09:00
John Coomes	310ded0ea2	add: use lexists so that broken symbolic links are added This restores the add behavior prior to 83750c0d774f and matches the behavior of addremove.	2014-12-03 14:33:29 -08:00
Ryan McElroy	468da3b11a	bookmarks: factor out bookmark file opening for easier extensibility	2014-11-27 00:24:25 -08:00
Siddharth Agarwal	c24b872cf4	diff: explicitly honor all diffopts 'hg diff' should naturally honor all diffopts.	2014-11-18 22:18:05 -08:00
Siddharth Agarwal	21ee5b69d0	annotate: explicitly only honor whitespace diffopts The whitespace ones are the only ones the annotate logic cares about anyway, so there's no visible impact.	2014-11-18 22:16:46 -08:00
Siddharth Agarwal	bf23516938	jsonchangeset: don't honor whitespace and format-changing diffopts JSON is meant to be parsed by computers, and format changes can break them.	2014-11-18 22:13:05 -08:00
Martin von Zweigbergk	81df1b38d5	merge: display modify/delete conflict prompts in sorted order The order is determined by manifest.diff(), which currently is not sorted. There are currently no tests for this, but we will soon add some that would be flaky without this patch.	2014-11-26 10:25:27 -08:00
Eric Sumner	973f0c3522	bundle2-push: provide transaction to reply unbundler This patch series is intended to allow bundle2 push reply part handlers to make changes to the local repository; it has been developed in parallel with an extension that allows the server to rebase incoming changesets while applying them. This diff adds an experimental config option "bundle2.pushback" which provides a transaction to the reply unbundler during a push operation. This behavior is opt-in because of potential security issues: the response can contain any part type that has a handler defined, allowing the server to make arbitrary changes to the local repository.	2014-11-21 15:50:38 -08:00
Eric Sumner	154e42b303	bundle2.processbundle: let callers request default behavior This patch series is intended to allow bundle2 push reply part handlers to make changes to the local repository; it has been developed in parallel with an extension that allows the server to rebase incoming changesets while applying them. The default transaction getter for processbundle is a private function that raises an exception; this diff lets calling code pass None as the transaction getter to explicitly request this default behavior. The next diff will check a config option to determine whether to provide a transaction to the reply bundle processor. If one shouldn't be provided, the code needs a way to specify that the default behavior should be used.	2014-11-24 16:04:44 -08:00
Eric Sumner	bd16762201	push: elevate phase transaction to cover entire operation This patch series is intended to allow bundle2 push reply part handlers to make changes to the local repository; it has been developed in parallel with an extension that allows the server to rebase incoming changesets while applying them. Most pushes already open a transaction in order to sync phase information. This diff replaces that transaction with one that spans the entire push operation. This transaction will be used in a later patch to guard repository changes made during the reply handler.	2014-11-21 15:06:38 -08:00
Eric Sumner	f1a430fbce	pull: extract transaction logic into separate object This patch series is intended to allow bundle2 push reply part handlers to make changes to the local repository; it has been developed in parallel with an extension that allows the server to rebase incoming changesets while applying them. Aside from the transaction logic, the pulloperation class is used primarily as a logic-free data structure for storing state information. This diff extracts the transaction logic into its own class that can be shared with push operations.	2014-11-21 14:32:57 -08:00
Siddharth Agarwal	113c428a6d	patch.difffeatureopts: add a feature for format-changing diffopts Typically, callers that are interested in this set are also interested in the others. We still add it as a first-class flag for convenience.	2014-11-18 17:22:03 -08:00
Siddharth Agarwal	542813a378	patch.difffeatureopts: add a feature for whitespace diffopts These aren't exactly format-breaking features -- just ones for which patches applied to a repo will produce incorrect commits, In any case, some commands like record and annotate only care about this feature.	2014-11-18 17:09:26 -08:00
Siddharth Agarwal	8c24c56a7e	patch.difffeatureopts: add a feature for diff.git This deserves to be its own feature -- indeed, this is often the only feature several commands care about.	2014-11-18 17:10:14 -08:00
Siddharth Agarwal	3864c7a9bb	patch: rename diffopts to diffallopts This is meant to be used when all features are explicitly requested. diffopts is kept as an alias for now -- it will go away soon.	2014-11-18 21:43:38 -08:00
Siddharth Agarwal	56b105c2c6	patch: add a new function to initialize diffopts by feature Not all callers are interested in all diffopts -- for example, commands like record (which use diff internally) break when diffopts like noprefix are enabled. This function will allow us to add flags that callers can use to enable only the features they're interested in.	2014-11-18 16:53:22 -08:00
Siddharth Agarwal	e42499560a	patch.diffopts: use a dict for initialization In upcoming patches we'll conditionally add to and remove from the dict.	2014-11-18 16:00:54 -08:00
Martin von Zweigbergk	242bbe216d	addremove: print relative paths when called with -I/-X (BC) For "hg addremove 'glob:.py'", we print any paths added or removed as relative to the current directory, but when "hg addremove -I 'glob:.py'" is used, we use the absolute path (relative from the repo root). It seems like they should be the same, so change it so we use relative paths in both cases. Continue to use absolute paths when no patterns are given.	2014-12-01 21:48:32 -08:00
Matt Mackall	289d6b53bc	merge with stable	2014-12-01 19:34:11 -06:00
Mathias De Maré	76a33af9ab	subrepo: add status support for ignored files in git subrepos Retrieving the status of a git subrepo did not show ignored files. Using 'git ls-files', we can retrieve these files and display the correct status.	2014-11-28 20:16:15 +01:00
Pierre-Yves David	c52ad3f03b	manifest: document the extra letter in working copy manifest node As the second developer to get confused by this in November, I'm adding some documentation for the next poor soul.	2014-11-26 15:37:01 -08:00
Mads Kiilerich	9389b9fa07	merge: 0 is a valid ancestor different from None Most internal functions can take either a hash or an integer. Merge did however not handle 0 as revision 0. Now it does.	2014-11-30 19:26:53 +01:00
Matt Mackall	4abfc94f18	merge with stable	2014-11-27 12:25:01 -06:00
Martin von Zweigbergk	bdbe0fa90a	merge: use None as filename for base in 'both created' conflicts Instead of using a file that we know is not in the common ancestor's maniffest, let's use None. This is safe as the only place that cares about the value (applyupdates) already checks if the item exists in the ancestor.	2014-11-24 16:17:02 -08:00
Martin von Zweigbergk	96d97f796c	merge: break out "both renamed a -> b" case We can further limit the scope of the 2-way merge case by breaking out the case where the file was not created from scratch on both sides but rather renamed in the same way (and is therefore a 3-way merge). This involves copying some code, but it makes it clearer which case the "Note:" in the code refers to.	2014-11-24 16:42:36 -08:00
Martin von Zweigbergk	dd435d36bb	merge: separate out "both created" cases When 'f' is not in 'ma', 'a' will be 'nullid' and all the if/elif conditions that check whether some one nodeid is equal to 'a' will fail, and the else-clause will instead apply. We can make that more explicit by creating a separate 'm' action for the case where 'a' is 'nullid'. While it does mean copying some code, perhaps it makes it a little clearer which codepaths are possible, and which cases the "Note:" in the code refers to. It also lets us make the debug action messages a little more specific.	2014-11-24 16:16:34 -08:00
Martin von Zweigbergk	b78125e6ba	merge: indent to prepare for next patch	2014-11-24 16:11:22 -08:00
Matt Harbison	6b20f5bf8e	add: check for the existence of a file matched inexactly before adding it The change in 02ecc94fb657 created a problem on Windows and OS X: --- /usr/local/mercurial/tests/test-issue660.t +++ /usr/local/mercurial/tests/test-issue660.t.err @@ -47,6 +47,8 @@ Should succeed - shadow removed: $ hg add b + adding b/b + b/b does not exist! Prior to the failing 'hg add', the file 'b/b' was added and committed, then 'b' was recursively deleted from the filesystem, file 'b' was created and the delete was recorded with 'hg rm --after'. This add is attempting to record the existence of file 'b'. A filesystem that is not case sensitive prevents dirstate.walk() from skipping its step 3, and step 3 has the effect of inserting removed files into the walk list. The Linux code doesn't run through step 3, and didn't exhibit the problem. It's not clear why a non case sensitive filesystem triggers step 3, given that the path normalization occurs in step 2. Prior to 02ecc94fb657, part of the check here was 'f not in repo.dirstate' instead of 'f not in wctx'. Files in the 'r' state are filtered out of context.__contains__() but not dirstate.__contains__(). Therefore the removed file name wasn't added to the list of files to add when checking against dirstate. That change was to allow removed files to be readded, but adding a file that doesn't exist is nonsensical. If the user specifies a missing file, it will be an exact match and will still fail.	2014-11-19 22:27:55 -05:00
Martin von Zweigbergk	663d394fe9	merge: remove obsolete check for untracked files in 'dm' action Since 4a56fba99974 (merge: don't use unknown(), 2012-02-09), untracked files are no longer included in the manifest diff, so there is no need to check exclude them when renaming files for directory moves with the 'dm' action.	2014-11-24 09:50:27 -08:00
Martin von Zweigbergk	52b3a1afd7	merge: remove dead assignment in applyupdates()	2014-11-23 23:10:34 -08:00
Martin von Zweigbergk	3e76cdec1c	merge: move calculateupdates() before applyupdated() calculateupdates() happens before applyupdates(), so move it before in the code. That also moves it close to manifestmerge(), which is a good location as calculateupdates() is the only caller of manifestmerge().	2014-11-21 13:06:04 -08:00
Martin von Zweigbergk	527ed28755	merge: remove unused variables from _checkcollision()	2014-11-24 11:28:46 -08:00
Matt Mackall	174e7f793d	merge with stable	2014-11-22 17:09:04 -06:00
Martin von Zweigbergk	d69dae068a	merge: consistently use single quotes for non-user-facing strings Because I'm getting tired of searching for both 'O' and "O".	2014-11-20 16:39:32 -08:00
Pierre-Yves David	9fc6abae03	localrepo: add a currenttransaction method This method returnx the current transaction or None: it will allow a cache writer to hook in an existing transaction.	2014-11-13 11:12:47 +00:00
Pierre-Yves David	8f926bef9a	repoview: extract actual hidden cache writing in its own function This will allow the generation of this cache within the transaction. Relying on the transaction will reduce the chance of reader seeing bad cache.	2014-11-13 11:11:17 +00:00
Martin von Zweigbergk	c71ba3444e	dirstate: speed up repeated missing directory checks In a mozilla repo with tip at bb3ff09f52fe, hg update tip~1000 && time hg revert -nq -r tip . displays ~4:20 minutes. With tip~100, it runs in ~11 s. With revision 100000, it did not finish in 12 minutes. Revert calls dirstate.status() with a matcher that matches each file in the target revision. The main problem [1] lies in dirstate._walkexplicit(), which looks for matching deleted directories by checking whether each path is prefix of any path in the dirstate. With m files in the dirstate and n files in the target revision that are not in the dirstate, this is clearly O(m*n). Let's improve by keeping a lazily initialized set of all the directories in the dirstate, so the time becomes O(m+n). After this patch, the 4:20 minutes become 5.5 s, while for a single missing path, it slows down from 1.092 s to 1.150 s (best of 4). The >12 min case becomes 5.8 s. [1] A narrower optimization would be to make revert take the fast path for '.' and '--all'.	2014-11-19 23:15:07 -08:00
Martin von Zweigbergk	2916dab85b	revert: access status fields by name rather than index For better readability.	2014-11-19 17:07:27 -08:00
FUJIWARA Katsunori	4a1c867054	subrepo: remove "_getstorehashcachepath" referred by no other code paths	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	ba433385a7	subrepo: replace direct file APIs around "writelines" by "vfs.writelines" This patch also replaces "self._getstorehashcachepath" (building absolute path up) by "self._getstorehashcachename" (building relative path up), because "vfs.writelines" requires relative path.	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	86bc73ffa9	vfs: add "writelines" This patch allows "writelines" to take "mode" and "notindexed" arguments, because subsequent patch for subrepo requires both.	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	f60bafa1b3	vfs: add "notindexed" argument to invoke "ensuredir" with it in write mode This patch uses "False" as default value of "notindexed" argument, even though "vfs.makedir()" uses "True" for it, because "os.mkdir()" doesn't set "_FILE_ATTRIBUTE_NOT_CONTENT_INDEXED" attribute to newly created directories.	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	59f23cabee	subrepo: replace direct file APIs around "readlines" by "vfs.tryreadlines" This patch also replaces "self._getstorehashcachepath" (building absolute path up) by "self._getstorehashcachename" (building relative path up), because "vfs.tryreadlines" requires relative path. This patch makes "_readstorehashcache()" return "[]" (returned by "vfs.tryreadlines()"), when cache file doesn't exist, even though "_readstorehashcache()" returned '' (empty string) in such case before this patch. "_readstorehashcache()" is invoked only by the code path below in "_storeclean()": for filehash in self._readstorehashcache(path): if filehash != itercache.next(): clean = False break In this case, "[]" and '' don't differ from each other, because both of them cause avoiding iteration of "for loop".	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	b1ff97d24c	vfs: add "readlines" and "tryreadlines" This patch allows "readlines" and "tryreadlines" to take "mode" argument, because "subrepo" requires to read files not in "rb" (binary, default for vfs) but in "r" (text) mode in subsequent patch.	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	38f496a6bb	subrepo: add "_cachestorehashvfs" to handle cache store hash files via vfs This "vfs" object will be used by subsequent patches to handle cache store hash files without direct file APIs. This patch decorates "_cachestorehashvfs" with "@propertycache" to delay vfs creation, because it is used only for cooperation with other repositories. In this patch, "/" is used as the path separator, even though "self._repo.join" uses platform specific path separator (e.g. "\\" on Windows). But it is reasonable enough, because "store" and other management file handling already include such implementation, and they work well.	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	82551da514	subrepo: remove "_calcfilehash" referred by no other code paths	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	d95af73d10	subrepo: replace "_calcfilehash" invocation by "vfs.tryread" "_calcfilehash" can be completely replaced by simple "vfs.tryread" invocation. def _calcfilehash(filename): data = '' if os.path.exists(filename): fd = open(filename, 'rb') data = fd.read() fd.close() return util.sha1(data).hexdigest() Building absolute path "absname" up by "self._repo.join" for files in "filelist" is avoided, because "vfs.tryread" does so internally.	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	617a0e35a6	subrepo: replace "os.path.exists" by "exists" via wvfs of the parent Existance of specified "path" should be examined by "exists" via wvfs of the parent repository, because the working directory of the parent repository may be in UTF-8 mode. Wide API should be used via wvfs in such case. In this patch, "/" is used as the path separator, even though "path" uses platform specific path separator (e.g. "\\" on Windows). But it is reasonable enough, because "store" and other management file handling already include such implementation, and they work well.	2014-11-19 18:35:14 +09:00
FUJIWARA Katsunori	42cf1cdb87	subrepo: avoid redundant "util.makedirs" invocation "util.makedirs" for the (sub-)repository root of "hgsubrepo" is also executed in the constructor of "localrepository", if "create" is True and ".hg" of it doesn't exist. This patch avoids redundant "util.makedirs" invocation in the constructor of "hgsubrepo".	2014-11-19 18:35:14 +09:00
Martin von Zweigbergk	1d09a87f4e	merge: remove confusing comment about --force manifestmerge() has a piece of code that's roughly: if not force and different: abort else: # if different: old untracked f may be overwritten and lost ... The comment only talks about what happens when 'different' is true, and in combination with the if-block above, that must mean that it is only about what happens when 'force and different'. It seems quite fine that files are overwritten when 'force' is true, so let's remove the comment. As it stands, it can easily be interpreted as a TODO (which is how I interpreted it at first).	2014-11-19 08:50:08 -08:00
Pierre-Yves David	160c394fe7	phases: read pending data when appropriate If we are called by a hook and pending data exists, read those.	2014-10-17 22:23:06 -07:00
Pierre-Yves David	4012eb31b0	bookmark: read pending data when appropriate If we are called by a hook and pending data exists, read it.	2014-09-28 21:27:48 -07:00
Pierre-Yves David	3ace7493d7	transaction: write pending generated files Such file are generated with a .pending prefix. It is up to the reader to implement the necessary logic for reading pending files. We add a test to ensure pending files are properly cleaned-up in both success and error cases.	2014-10-17 22:19:05 -07:00
Pierre-Yves David	58e32f1eeb	transaction: have _generatefile return a boolean The function returns True if any files were generated. This will be used to know if any pending files have been written.	2014-10-17 21:57:32 -07:00
Pierre-Yves David	81a1fe4d5b	transaction: allow generating files with a suffix This will allow us to generate temporary pending files. Files generated with a suffix are assumed temporary and will be cleaned up at the end of the transaction.	2014-09-29 01:29:08 -07:00
Matt Mackall	3f845e51cb	transaction: fix some docstring grammar	2014-11-19 09:52:05 -06:00
Pierre-Yves David	ecac877d99	transaction: accept a 'location' argument for registertmp This will allow generation of temporary files outside of store. This will be useful for bookmarks.	2014-11-12 14:57:41 +00:00
Martin von Zweigbergk	f29370d747	update: remove unnecessary check for unknown files with --check As far as I and the test suite can tell, the checks in manifestmerge() already report the errors (whether or not --check is given), so we don't need to call merge.checkunknown(). Since this is the last call to the method, also remove the method.	2014-11-18 16:14:32 -08:00
Siddharth Agarwal	fba9f14547	setdiscovery: avoid a full changelog graph traversal We were definitely being suboptimal here: we were constructing two full sets, one with the full set of common nodes (i.e. a graph traversal) and one with all nodes. Then we subtract one set from the other. This whole process is O(commits) and causes discovery to be significantly slower than it should be. Instead, keep track of common incrementally and keep undecided as small as possible. This makes discovery massively faster on large repos: on one such repo, 'hg debugdiscovery' over SSH with one commit missing on the client and five on the server went from 4.5 seconds to 1.5. (An 'hg debugdiscovery' with no commits missing on the client, i.e. connection startup time, was 1.2 seconds.)	2014-11-16 00:40:29 -08:00
Siddharth Agarwal	1a87e8b8c3	ancestor: add a way to remove ancestors of bases from a given set This and missingancestors can share state, which will turn out to be perfect for set discovery.	2014-11-14 19:40:30 -08:00
Siddharth Agarwal	0d3efeefd2	ancestor: add a way to add to bases of a missing ancestor object This will be useful for setdiscovery, since with that we incrementally add to our knowledge of common nodes.	2014-11-14 17:21:00 -08:00
Siddharth Agarwal	8c7869477d	ancestor: add a way to test whether a missing ancestor object has bases This is pretty trivial so there's no unit test coverage for it. This will be used by setdiscovery.	2014-11-16 00:39:29 -08:00
Siddharth Agarwal	078961d745	ancestor: remove now-unused missingancestors function Callers should use revlog.incrementalmissingrevs instead.	2014-11-14 16:53:40 -08:00
Siddharth Agarwal	2d669c474b	revlog: switch findmissing* methods to incrementalmissingrevs This will allow us to remove ancestor.missingancestors in an upcoming patch.	2014-11-14 16:52:40 -08:00
Siddharth Agarwal	5692148f49	revlog: add a method to get missing revs incrementally This will turn out to be useful for discovery.	2014-11-16 00:39:48 -08:00
Siddharth Agarwal	b865bfbae7	ancestor.missingancestors: turn into a state-keeping class This allows multiple efficient missing ancestor queries against the same set of bases. In upcoming patches we'll also define ways to grow the set of bases. The fact that the test output hasn't changed establishes this patch's correctness.	2014-11-14 23:44:38 -08:00
Siddharth Agarwal	1ff25a7ef5	ancestor.missingancestors: calculate start point after filtering revsvisit Any revs that are filtered out are also in basesvisit, which means they wouldn't be returned in the missing list anyway. There's no need to explore such revs or their ancestors. The 'if not revsvisit' check moves down because we can't call max() on an empty set.	2014-11-14 13:47:25 -08:00
Siddharth Agarwal	6cb962f9f8	ancestor.missingancestors: don't discard from basesvisit We only actually care about whether revsvisit is empty, so we can let basesvisit grow to arbitrary size. It turns out that this actually helps performance. For a large repo with hundreds of thousands of commits, hg perfrevset 'only(0, tip)' (basically the worst case, involving a full DAG traversal) goes from 1.63 seconds to 1.50. hg perfrevset 'only(tip, 0)' remains unchanged at 1.98 seconds.	2014-11-14 11:33:52 -08:00
Siddharth Agarwal	7103eb28ea	ancestor.lazyancestors: take parentrevs function rather than changelog Principle of least privilege, and it also brings this in line with missingancestors.	2014-11-14 14:36:25 -08:00
Matt Harbison	8766c8079f	remove: avoid a bogus warning about no tracked files when removing '.' Previously, any files relative to the root of the repo that match the -I patterns would be deleted, but the command exited with 1 after printing a warning: $ hg remove -S -I 're:.*.txt' . removing sub1/sub2/folder/test.txt removing sub1/sub2/test.txt not removing .: no tracked files	2014-11-16 00:24:23 -05:00
Matt Harbison	c0a7d4f5bc	remove: support remove with explicit paths in subrepos	2014-11-12 23:15:20 -05:00
Matt Harbison	17e7835237	remove: recurse into subrepositories with --subrepos/-S flag Like 'forget', git and svn subrepos are currently not supported. Unfortunately the name 'remove' is already used in the subrepo classes, so we break the convention of naming the subrepo function after the command.	2014-11-15 21:36:19 -05:00
Yuya Nishihara	1c7ce2bd17	cmdserver: protect pipe server streams against corruption caused by direct io Because pipe-mode server uses stdio as IPC channel, other modules should not touch stdio directly and use ui instead. However, this strategy is brittle because several Python functions read and write stdio implicitly. print 'hello' # should use ui.write() # => ch = 'h', size = 1701604463 'ello', data = '\n' This patch adds protection for such mistakes. Both stdio files and low-level file descriptors are redirected to /dev/null while command server uses them.	2014-11-15 13:50:43 +09:00
Yuya Nishihara	2808848bc1	cmdserver: postpone creation of pipe server until run() This makes it easy to swap file descriptors while running command server.	2014-11-15 13:04:41 +09:00
Yuya Nishihara	0c88d9a994	cmdserver: use given streams as pipe channels like other commands Because commandserver itself is an hg subcommand, it shouldn't use stdio directly in principle.	2014-11-15 12:43:35 +09:00
Siddharth Agarwal	b3be1bdf41	revset.only: use cl.findmissingrevs ancestor.missingancestors is really an implementation detail.	2014-11-14 16:38:58 -08:00
Martin von Zweigbergk	edf3f3461a	manifestmerge: use already existing fl2 synonym for m2.flags(f) Probably not a noticeable performance gain, but shortens the code slightly.	2014-11-14 09:33:28 -08:00
Martin von Zweigbergk	6584c8e690	merge: drop underscore prefix from _checkunknown() The method has been called from commands.py since 8d9ca2ac2fe8 (update: just merge unknown file collisions, 2012-02-09), so drop the underscore prefix that suggests that it's private.	2014-11-13 23:12:15 -08:00
Pierre-Yves David	46df0be40b	transaction: drop special handling for phases and bookmarks generation We are still doing double backups, but now that we have proper location handling this is less of an issue. Dropping this simplifies the code before we add some pending-related logic. This also ensures we actually test the new 'location' mechanism.	2014-11-12 14:47:48 +00:00
Pierre-Yves David	b9594d1c53	transaction: use 'location' instead of 'vfs' objects for file generation The argument is now a location name. The location must be present in the 'vfsmap' provided to the transaction at creation time.	2014-10-17 20:53:42 -07:00
Pierre-Yves David	c1cac6ba41	transaction: use 'location' instead of 'vfs' in the addbackup method This unlock the backup of file outside of store (eg: bookmarks).	2014-11-05 01:59:32 +00:00
Pierre-Yves David	414fa72b29	addbackup: handle file in subdirectory The current naming scheme ('journal.backups.<file>') resulted is bad directory name when 'file' was in a subdirectory. We now extract the directory name and create the backupfile within it. We plan to use file in a subdirectory for cachefile.	2014-11-14 00:14:23 +00:00
Pierre-Yves David	70e9add6df	addbackup: use the vfs for the backup destination too The backup file location was always computed using the opener, bypassing the 'location' setting. (And making the feature broken.)	2014-11-14 14:54:55 +00:00
Pierre-Yves David	cec71977a9	transaction: set backupentries version to proper value Now that all mechanisms are in place, we can advertise it with a proper new version.	2014-11-13 11:17:36 +00:00
Pierre-Yves David	4c03465397	transaction: support cache file in backupentries We do not want to abort if anything wrong happen while handling a cache file. Cache file have way to be invalidated and if old/bad version stay no misbehavior will happen. Proper value will eventually be computed and the wrong will be righten. This changeset use the transaction reporter (usually writing on stderr) to write details about failed cache handling. This will only apply to write operation using a transaction. The usual update during read only operation will stay a debug message. I was on the way to bring these message back to debug level when I realised it could be a feature. People with write access to the repository are likely to have the power to fix error related to cache (and it is valuable to fix them). So let the things as is for now.	2014-11-13 11:17:09 +00:00
Pierre-Yves David	382c5bbb8d	transaction: use the location value when doing backup We finally use the 'location' value coupled with the 'vfsmap' to restore backup for the right file.	2014-10-17 21:04:35 -07:00
Pierre-Yves David	53a1a60278	transaction: pass a vfs map to the transaction The goal is to allow access to file outside ofthe store directory from the transaction. The obvious target are the `bookmarks` file. But we can envision usage for cache too. We keep passing a main opener explicitly because a lot of code rely on this default opener. The main opener (operating on store) is using an empty key ''.	2014-10-17 20:49:39 -07:00
Pierre-Yves David	d772e961ee	transaction: change the on disk format for backupentries We need to store new data to improve the current transaction logic: - location: We want to generate and backup file outside of the 'store' (eg: bookmarks, or various cache files). This requires knowing and preserving where each file is located. The value of this new field is a string. It will be used as a key for a vfs mapping. - cache: We would like to handle cache file in the transaction code. This Will help to have cache consistent with the repository state and avoid performance issue on big repository like Mozilla. However, failure to handle cache file should not result in a transaction failure. We add a new field that carry this information. The value is boolean, A True value mean any error while handling this file can be ignored. Those two mechanisms are not implemented yet, but they are now persisted in the on disk file. Support for new mechanisms is coming in later changeset. We update the file format now and will introduce the new features in later changeset. The format version is set to 0 until we actually support the new feature. This will prevent misunderstanding between incomplete and final client. Support for reading both version 1 and (future) version 2 could be achieved (using default value when reading version 1) but has not been seen as necessary for now.	2014-11-05 01:52:46 +00:00
Siddharth Agarwal	8354d9169f	revlog: cache chain info after calculating it for a rev (issue4452) This dumb cache works surprisingly well: on a repository with typical delta chains ~50k in length, unbundling a linear series of 5000 revisions (changelogs and manifests only) went from 60 seconds to 3.	2014-11-13 21:36:38 -08:00
Martin von Zweigbergk	455810026c	manifest: add matches() method Move the code in context._manifestmatches() into a new manifest.matches(). It's a natural place for the code to live and it allows other callers to easily use it. It should also make it easier to optimize the new method in alternative implementations of the manifest (same reasoning as with manifest.diff()).	2014-10-22 21:38:30 -07:00
Martin von Zweigbergk	cfd2a961bd	context.status: pass status tuple into _buildstatus By passing a status tuple (instead of the current list), we can access the status fields by name and make it a little more readable.	2014-11-12 22:20:36 -08:00
Martin von Zweigbergk	f2e6b819eb	context.status: avoid de- and reconstructing status tuple We can just modify the status tuple we got from dirstate.status() instead of deconstructing it and constructing a new instance, thereby simplifying the code a little.	2014-11-12 22:07:31 -08:00
Martin von Zweigbergk	7de43e79f4	context.status: make _dirstatestatus() return an status tuple Letting _dirstatestatus() return an scmutil.status instance also means that _buildstatus() will always return such an instance, so we can remove the conversion from the call sites.	2014-11-12 16:51:11 -08:00
Martin von Zweigbergk	38e47fc778	context.status: wipe deleted/unknown/ignored fields when reversed It makes no sense to request reverse status (i.e. changes from the working copy to its parent) and then look at the deleted, unknown or ignored fields. If you do, you would get the result from the forward status (changes from parent to the working copy). Instead of giving a nonsensical answer to a nonsensical question, it seems a little saner to return empty lists. It might be best if we could prevent the caller accessing these lists, but it's doubtful it's worth the trouble.	2014-11-12 21:19:07 -08:00
Siddharth Agarwal	48ef0179c1	patch.trydiff: add support for noprefix	2014-11-12 23:50:21 -08:00
Siddharth Agarwal	ce2b0fd888	mdiff.unidiff: add support for noprefix	2014-11-12 23:29:14 -08:00
Siddharth Agarwal	4d6e85ce0c	diff: add a --noprefix option See previous patch descriptions for the motivation. The tests reflect the current state of the world -- as we add support we'll see changes in the test output.	2014-11-13 00:13:48 -08:00
Siddharth Agarwal	37a8021dad	patch.diffopts: add support for noprefix In an upcoming patch we'll enable support as an option to 'hg diff' as well. The tests reflect the current state of the world -- as we add support we'll see changes in the test output.	2014-11-13 00:08:44 -08:00
Siddharth Agarwal	ccd6611ec9	patch.diffopts: allow a setting to be forced in plain mode Upcoming patches will add an option that will almost certainly break diff output parsers when enabled. Add support for forcing an option to something in plain mode, as a fallback. Options passed in via the CLI are not affected, though -- it is assumed that any script passing the option in explicitly knows what it is doing.	2014-11-12 23:47:25 -08:00
Siddharth Agarwal	dabae3afed	patch.diffopts: break get function into if statements We're going to add another condition here, and with the current structure that becomes just too confusing.	2014-11-12 23:44:17 -08:00
Siddharth Agarwal	535dc77081	mdiff.diffopts: add a new noprefix option By popular demand, we introduce an option to disable the 'a/' and 'b/' prefixes in diff output. This makes copying and pasting filenames from diff output easier. This option will be implemented and documented in upcoming patches. To ensure that existing scripts that parse output don't break, we will ensure that this prefix is disabled in plain mode. A straight 'hg export \| hg import' without HGPLAIN=1 will still be broken though, but there's little that can be done about that.	2014-11-12 23:25:32 -08:00
Siddharth Agarwal	3deaceac88	mdiff.diffopts: add doc comment for nobinary	2014-11-12 23:19:44 -08:00
Pierre-Yves David	500808d844	changelog: register changelog.i.a as a temporary file The file is registered to make sure the transaction is cleaned up in all cases.	2014-11-08 17:08:09 +00:00
Pierre-Yves David	84cb2b6041	transaction: allow registering a temporary transaction file During the transaction, files may be created to store or expose data involved in the transaction (eg: changelog index data are written in a 'changelog.i.a' for hooks). But we do not have an official way to record such file creation and make sure they are cleaned up. The lack of clean-up is currently okay because there is a single file involved and a single producer/consumer. However, as we want to expose more data (bookmarks, phases, obsmarker) we need something more solid. The 'backupentries' mechanism could handle that. Temporary files can be encoded as a backup of nothing '('', <temporarypath>)'. We "need" to attach it to the same mechanism as we use to be able to use temporary transaction files outside of .'store/' and 'backupentries' is expected to gain such feature. This changeset makes it clear that we should rename 'backupentries' to something more generic.	2014-11-05 09:27:08 +00:00
Pierre-Yves David	82498ec61b	transaction: always generate file on close The conditionnal was buggy and file were only generated if "onclose" was defined. By luck, "onclose" was always defined.	2014-11-13 10:22:47 +00:00
Matt Harbison	7fc45849db	remove: move most of the implementation into cmdutils.remove() This will allow access to the reusable parts from subrepos, similar to add(), forget(), etc.	2014-11-09 12:31:34 -05:00
Siddharth Agarwal	1acd4cfca4	revlog: increase I/O bound to 4x the amount of data consumed This doesn't affect normal clones since they'd be bound by the CPU bound below anyway -- it does, however, improve generaldelta clones significantly. This also results in better deltaing for generaldelta clones -- in generaldelta clones, we calculate deltas with respect to the closest base if it has a higher revision number than either parent. If the base is on a significantly different branch, this can result in pointlessly massive deltas. This reduces the number of bases and hence the number of bad deltas. Empirically, for a highly branchy repository, this resulted in an improvement of around 15% to manifest size.	2014-11-11 20:08:19 -08:00
Siddharth Agarwal	fe51051ee5	revlog: bound based on the length of the compressed deltas This is only relevant for generaldelta clones.	2014-11-11 20:01:19 -08:00
Siddharth Agarwal	27976ad2dc	revlog: compute length of compressed deltas along with chain length In upcoming patches to the revlog, we're going to split up the notions of bounding I/O and bounding CPU.	2014-11-11 19:54:36 -08:00
Siddharth Agarwal	6e115e5383	revlog: store fulltext when compressed delta is bigger than it This is a very silly case and not particularly likely to happen in the wild, but it turns out we can hit it in a couple of places. As we tune the storage parameters we're likely to hit more such cases. The affected test cases all have smaller revlogs now.	2014-11-11 21:41:12 -08:00
Siddharth Agarwal	e5d387f47e	revlog: make a predicate clearer with parens	2014-11-11 21:39:56 -08:00
Pierre-Yves David	a818f5fbca	transaction: extract backupentry registration in a dedicated function We are about to use the 'backupentry' mechanism to allow cleaning up transaction-related temporary files (such as 'changelog.i.a'). We start by extracting the entry registration into its own method for easy reuse. At that point, I would like to rename the backup-file related variable to something generic but I'm a bit short of ideas.	2014-11-05 13:06:24 +00:00
Pierre-Yves David	8259127ccb	transaction: pass the transaction to 'postclose' callback This mirrors the API for 'pending' and 'finalize' callbacks. I do not have immediate usage planned for it, but I'm sure some callback will be happy to access transaction related data.	2014-11-08 16:35:15 +00:00
Pierre-Yves David	29f854f61a	transaction: pass the transaction to 'finalize' callback The callback will likely need to perform some operation related to the transaction (eg: registering file update). So we better pass the current transaction as the callback argument. Otherwise callback that needs it has to rely on horrible weak reference trick. This allow already allow us to slay a wild weak reference usage.	2014-11-08 16:31:38 +00:00
Pierre-Yves David	92bf4dcbdc	transaction: pass the transaction to 'pending' callback The callback will likely need to perform some operation related to the transaction (eg: backing files up). So we better pass the current transaction as the callback argument. Otherwise callback that needs it has to rely on horrible weak reference trick. The first foreseen user of this is changelog._writepending. We would like it to register the temporary file it create for cleanup purpose.	2014-11-08 16:27:50 +00:00
Pierre-Yves David	6ab03ec965	transaction: gather backupjournal logic together in the __init__ The initialisation of file-backup related variable were a bit scattered, we gather them together.	2014-11-05 10:22:17 +00:00
Pierre-Yves David	b91f1df8cd	transaction: handle missing file in backupentries (instead of using entries) The case where a backup of a missing file was requested was previously handled by the 'entries' list. As the 'backupentries' is about to gain ability to backup files outside of '.hg/store', we want it to be able to handle the missing file too. Reminder: using 'addbackup' on a missing file means that such file needs to be deleted if we rollback the transaction.	2014-11-05 01:38:48 +00:00
Yuya Nishihara	b2ed607f5e	util.system: remove unused handling of onerr=ui In our code, onerr is None or util.Abort. It smells bad to overload ui and exception class.	2014-11-08 13:14:19 +09:00
Yuya Nishihara	ceda6fbba9	util.system: use ui.system() in place of optional ui.fout parameter	2014-11-08 13:06:22 +09:00
Yuya Nishihara	c3aa617f2e	ui: introduce util.system() wrapper to make sure ui.fout is used This change is intended to avoid future problem of data corruption under command server. out=ui.fout is mandatory as long as command server uses stdout as IPC channel.	2014-11-08 12:57:42 +09:00
Yuya Nishihara	990e90eeb6	hook: remove redundant code to redirect http hook output to client stream out=ui and out=ui.fout should be the same here. ui.fout was introduced at 7d217b512ae2, which was not available when out=ui was added at 5716839ccf7d.	2014-11-12 22:21:51 +09:00
Sean Farley	93b998c77a	sortdict: add insert method Future patches will allow extensions to choose which order a namespace should output in the log, so we add a way for sortdict to insert to a specific location.	2014-10-15 12:39:19 -07:00
Sean Farley	8ea5f6192f	sortdict: add iteritems method Future patches will start using sortdict for log operations where order is important. Adding iteritems removes the headache of having to remember to use items() if the object is a sortdict.	2014-11-09 13:15:28 -08:00
Martin von Zweigbergk	98fbfc44f9	addremove: add back forgotten files (BC) After running "hg forget README && hg addremove", README will still be reported as removed, while "hg forget README && hg add README" adds it back so it gets reported as clean. It seems like they should behave the same. Furthermore, it seems like no files should remain untracked after 'hg addremove && hg commit' (or 'hg commit -A'). For these reasons, change the behavior of addremove so it does add forgotten files back. The problem is with scmutil._interestingfiles(), which reports the file as removed, so scmutil.addremove() does not add it. Fix by teaching _interestingfiles() to report forgotten files separately from removed files and make addremove() add forgotten files back. However, do not treat forgotten files as sources for rename detection. Note that since removed and forgotten files are treated the same before this change, forgotten files were considered sources for rename detection. Also update the other caller, marktouched(), in the same way as addremove().	2014-11-08 23:13:39 -08:00
Martin von Zweigbergk	42c14b3ed1	add: add back forgotten files even when not matching exactly (BC) I accidentally did 'hg forget .' and tried to undo the operation with 'hg add .'. I expected the files to be reported as either modified or clean, but they were still reported as removed. It turns out that forgotten files are only added back if they are listed explicitly, as shown by the following two invocations. This makes it hard to recover from the mistake of forgetting a lot of files. $ hg forget README && hg add README && hg status -A README C README $ hg forget README && hg add . && hg status -A README R README The problem lies in cmdutil.add(). That method checks that the file isn't already tracked before adding it, but it does so by checking the dirstate, which does have an entry for forgotten files (state 'r'). We should instead be checking whether the file exists in the workingctx. The workingctx is also what we later call add() on, and that method takes care of transforming the add() into a normallookup() on the dirstate. Since we're changing repo.dirstate into wctx, let's also change repo.walk into wctx.walk for consistency (repo.walk calls wctx.walk, so we're simply inlining the call).	2014-11-10 14:51:18 -08:00
Martin von Zweigbergk	abccb5c328	context.status: explain "caching reasons" more fully Where we "load earliest manifest first for caching reasons", elaborate on what "caching reasons" refers to. Text provided by Matt in http://thread.gmane.org/gmane.comp.version-control.mercurial.devel/73235/focus=73578.	2014-11-11 10:16:54 -08:00
Augie Fackler	10400a5ca1	localrepo: rename revlog.maxchainlen to format.maxchainlen This is more consistent with other option names, as spotted by Pierre-Yves. Thanks!	2014-11-11 10:35:06 -05:00
Mateusz Kwapich	3433abb6a8	revlog: add config variable for limiting delta-chain length The current heuristic for deciding between storing delta and full texts is based on ratio of (sizeofdeltas)/(sizeoffulltext). In some cases (for example a manifest for ahuge repo) this approach can result in extremely long delta chains (~30,000) which are very slow to read. (In the case of a manifest ~500ms are added to every hg command because of that). This commit introduces "revlog.maxchainlength" configuration variable that will limit delta chain length.	2014-11-06 14:20:05 -08:00
Mateusz Kwapich	1a554418d5	debugrevlog: fix computing chain length in debugrevlog -d The chain length was computed correctly only when generaldelta feature was enabled. Now it's fixed. When generaldelta is disabled the base revision in revlog index is not the revision we have delta against - it's always previous revision. Instead of incorrect chainbaseandlen in command.py we are now using two single-responsibility functions in revlog.py: - chainbase(rev) - chainlen(rev) Only chainlen(rev) was missing so it was written to mimic the way the chain of deltas is actually found during file reconstruction.	2014-11-06 14:08:25 -08:00
Pierre-Yves David	1d1e3a5b9c	transaction: factorise append-only file registration The addition is done in two different places but differs slightly. We factorise this addition to ensure it is consistent in all places.	2014-11-05 10:13:01 +00:00
Pierre-Yves David	b9cd014175	transaction: document `tr.add`	2014-11-05 13:00:48 +00:00
Pierre-Yves David	9a1f491fee	transaction: drop backupentries logic from startgroup and endgroup The `startgroup` and `endgroup` methods are used in a very specific context to wrap a very specific operation (revlog truncation). It does not make sense to perform any other operations during such a "group" (eg:file backup). There is currently no user of backupfile during a "group" so we drop the group-specific code and restrict authorized operations during "group".	2014-11-05 10:05:38 +00:00
Pierre-Yves David	483eb136e7	transaction: document startgroup and endgroup These enigmatic methods are only used in repair. We document them to clarify there purpose and user.	2014-11-05 10:00:15 +00:00
Pierre-Yves David	36a4f98617	transaction: mark backup-related attributes private As the transaction is gaining more functions and attributes, it is important to clarify what is part of the public API.	2014-11-05 09:31:57 +00:00
Pierre-Yves David	63c82fbc8a	transaction: document the contents of `tr.backupentries` Now that all items are known we can document it.	2014-11-05 01:30:29 +00:00
Pierre-Yves David	9d5b590e75	transaction: drop the third item in `tr.backupentries` This third item is always None and never used.	2014-11-05 01:33:16 +00:00
Matt Mackall	816fd34333	merge with stable	2014-11-10 17:29:15 -06:00
Siddharth Agarwal	3e8587d071	changegroup.cg2packer: lookup 'group' via inheritance chain This lets extensions insert themselves in the class hierarchy.	2014-11-07 17:54:59 -08:00
Martin von Zweigbergk	f07d2059f7	context.status: only filter suspect symlinks in the dirstate status We don't care about filtering out symlinks that have already been committed with full content, only those that have been accidentally resolved in the working directory.	2014-10-23 17:00:38 -07:00
Martin von Zweigbergk	e1831ee6df	context.status: inline _poststatus() By inlining _poststatus() into _buildstatus(), it becomes clearer that it is only called for the workingctx.	2014-10-23 16:19:56 -07:00
Martin von Zweigbergk	9901573e36	context.status: remove now-empty _prestatus()	2014-10-12 00:06:40 -07:00
Martin von Zweigbergk	22a4f35851	context.status: call _dirstatestatus() from within _buildstatus() By making the call to _dirstatestatus() within _buildstatus(), it becomes clearer that it's called only for the workingctx.	2014-10-11 23:30:08 -07:00
Martin von Zweigbergk	f41e08e268	context.status: move manifest caching trick to _buildstatus() In basectx._buildstatus(), we read the manifests for the two revisions being compared. For "caching reasons" unknown to me, it is better to read the earlier manifest first, which basectx._prestatus() takes care of. However, if the 'self' context is a committablectx and the 'other' context is the parent of the working directory (as in the very common case of plain "hg status"), there is no need to read any manifests at all -- all that's needed is the dirstate status. To avoid reading the manifests, _prestatus() is overridden in committablectx and avoids calling its super method, and _buildstatus() calls its super method only if the 'other' context is not the parent of the working directory. It seems easier to follow what's happening if we move the pre-fetching to _buildstatus() just before the place where the manifests are fetched. We just need to add an extra check that the revision is not None to handle the case that was previously handled by subclass overriding. That also makes it safe for committablectx._prestatus() to call its parent, although the latter now becomes empty, so we won't bother.	2014-10-12 00:00:13 -07:00
Martin von Zweigbergk	7d724075bb	context.status: remove unused arguments from _matchstatus()	2014-10-11 23:18:53 -07:00
Martin von Zweigbergk	3e01ff74de	context.status: remove overriding in workingctx The workingctx method simply calls the super method. The only effect it has is that it uses a different default argument for the 'other' argument. The only in-tree caller is patch.diff, which always passes an argument to the method, so it should be safe to remove the overriding. Having the default argument depend on the type seems rather dangerous anyway.	2014-10-23 13:43:20 -07:00
Pierre-Yves David	4cc6125cd7	exchange: use the postclose API on transaction As with changegroup, we should wait for the transaction to be really closed before scheduling hook execution.	2014-10-28 14:58:36 +01:00
Pierre-Yves David	130c63f6e2	changegroup: use the 'postclose' API on transaction The post-transaction hooks run after the lock release (because hooks may want to touch the repository), but they must only run if the transaction is successfully closed. We use the new 'addpostclose' method on transaction to register a callback installing this post-lock-release call.	2014-10-28 15:44:23 +01:00
Pierre-Yves David	bff90c5de3	transaction: allow registering a post-close callback The addchangegroup code considers the transaction done after a 'tr.close()' call and schedules the hook's execution for after lock release. In the nested transaction case, the transaction is not yet committed and we must delay this scheduling. We add an 'addpostclose' method (like the 'addpending' and 'addfinalize' ones) that registers code to be run if the transaction is successfully committed.	2014-10-28 14:24:43 +01:00
Mike Edgar	a86ac22655	exchange: swap "push" for "pull" in pulloperation docstring	2014-10-24 15:58:46 -04:00
Mike Edgar	2dc006a3bd	exchange: prepare kwargs for bundle2 part generation exactly once	2014-10-29 12:46:08 -04:00
Mike Edgar	dd8ab953f0	exchange: fix indentation in _pullchangeset	2014-10-25 00:40:51 -04:00
Mike Edgar	3ac3b11add	dagutil: fix id/ix typos in docstrings	2014-10-24 16:26:44 -04:00
Pierre-Yves David	70f41de9c6	bundle2: handle empty 'b2x:changegroup' value in push and pull Changeset d79feb65f3ee added advertising of supported changegroup version through the new 'b2x:changegroup' capability. However, this capability is not new and has been around since 3.1 with an empty value. This makes new clients unable to push to 3.2 servers through bundle2 as they cannot find a common changegroup version to use from and empty list. Treating empty 'b2x:changegroup' value as old client fixes it.	2014-11-06 09:52:57 +00:00
Pierre-Yves David	1da866fd04	bundle2: drop duplicated definition of 'b2x:exchange' This bundle2 capability is going to be dynamically computed in 'getrepocaps'. We do not need to include it in the static value.	2014-11-06 10:05:43 +00:00
Pierre-Yves David	8803fc197d	changelog: rely on transaction for finalization Instead of calling 'cl.finalize()' by hand (possibly at a bogus time) we register it in the transaction during 'delayupdate' and rely on 'tr.close()' to call it at the right time.	2014-10-18 01:09:41 -07:00
Pierre-Yves David	7ae09eecbb	transaction: allow registering a finalization callback The new 'addfinalize' method allows people to register a callback to be triggered when the transaction is closed. This aims to get rid of explicit calls to 'changelog.finalize'. This also obsoletes the 'onclose' function but removing it is not in the scope of this series.	2014-10-17 22:28:09 -07:00
Pierre-Yves David	d6b8860637	changelog: handle writepending in the transaction The 'delayupdate' method now takes a transaction object and registers its '_writepending' method for execution in 'transaction.writepending()'. The hook can then use 'transaction.writepending()' directly. At some point this will allow the addition of other file creation during writepending.	2014-10-17 21:55:31 -07:00
Pierre-Yves David	16a2a58b80	transaction: add 'writepending' logic The contents of the transaction must be flushed to disk before running a hook. But it must be flushed to a special file so that the normal reader does not use it. This logic is currently in the changelog only. We add some facility to register such operations in the transaction itself.	2014-10-17 21:19:54 -07:00
Pierre-Yves David	71f171494e	changelog: rework the delayupdate mechanism The current way we use the 'delayupdate' mechanism is wrong. We call 'delayupdate' right after the transaction retrieval, then we call 'finalize' right before calling 'tr.close()'. The 'finalize' call will -always- result in a flush to disk, making the data available to all readers. But the 'tr.close()' may be a no-op if the transaction is nested. This would result in data: 1) exposed to reader too early, 2) rolled back by other part of the transaction after such exposure So we need to end up in a situation where we call 'finalize' a single time when the transaction actually closes. For this purpose we need to be able to call 'delayupdate' and '_writepending' multiple times and 'finalize' once. This was not possible with the previous state of the code. This changeset refactors the code to makes this possible. We buffer data in memory as much as possible and fall-back to writing to a ".a" file after the first call to '_writepending'.	2014-10-18 01:12:18 -07:00
Matt Mackall	20462719e9	merge with stable	2014-11-05 12:41:12 -06:00
Matt Mackall	c575219e98	merge with stable	2014-11-05 11:16:31 -06:00
Sune Foldager	7cb0f8602d	changegroup: introduce cg2packer/unpacker cg2 supports generaldelta in changegroups, to be used in bundle2. Since generaldelta is handled directly in cg2, reordering is switched off by default.	2014-10-17 14:41:11 +02:00
Pierre-Yves David	660a827d16	push: send highest changegroup format supported by both side When using bundle2, we find the common subset of supported changegroup-packers and we pick the max of them. This allow to use generaldelta aware changegroups through bundle2.	2014-10-17 12:19:24 -07:00
Pierre-Yves David	f6830ffac2	getbundle: send highest changegroup format supported by both side When using bundle2, we find the common subset of supported changegroup-packers and we pick the max of them. This allow to use generaldelta aware changegroup through bundle2.	2014-10-17 14:59:10 +02:00
Sune Foldager	efbba1affa	changegroup: allow use of different cg#packer in getchangegroupraw This will allow the use of general delta aware changegroup formats.	2014-10-17 14:41:21 +02:00
Sune Foldager	e8de499479	changegroup: introduce "raw" versions of some commands The commands getchangegroup, getlocalchangegroup and getsubset now each have a version ending in -raw. The raw versions return the chunk generator from the changegroup packer directly, without wrapping it in a chunkbuffer and unpacker. This avoids extra chunkbuffers in the bundle2 code path. Also, the raw versions can be extended to support alternative packers in the future, to be used from bundle2.	2014-10-17 14:41:02 +02:00
Matt Mackall	01aca04608	merge with stable	2014-11-04 16:43:33 -06:00
Pierre-Yves David	80ae5ba5ab	bundle2: support a "version" argument in `changegroup` part When included, this mandatory parameter (mandatory == cannot be ignored) lets the part handler select the right cgunpacker class.	2014-09-24 21:33:12 -07:00
Pierre-Yves David	553f12b081	bundle2caps: advertise the available versions for changegroup packer This will let the bundle2 client and server detect what packer they should be using. This detection part is not done. I expect it to be done with the addition of the second packer (with generaldelta support).	2014-09-24 21:28:54 -07:00
Pierre-Yves David	7e87948427	changegroup: add a "packermap" dictionary to track different packer versions We only have "01" right now, but we should get general delta in soon. Bundle2 is expected to make use of this to advertise and select the right packer to use on both sides.	2014-09-24 21:24:06 -07:00
Pierre-Yves David	515b17f5ef	revset-matching: call 'getset' on a 'fullreposet' Calling 'baseset(repo.changelog)' builds a list for all revisions in the repo. And we already have the lazy and efficient 'fullreposet' class for this purpose. This gives us the usual benefits of the fullreposet but it is less visible because the matching process itself is very expensive: revset) matching(100) before) wall 6.413281 comb 6.420000 user 5.910000 sys 0.510000 (best of 3) after) wall 6.173608 comb 6.170000 user 5.750000 sys 0.420000 (best of 3) However for some complex list, this provide a massive speedup revset) matching(parents(100)) before) wall 23.890740 comb 23.890000 user 23.450000 sys 0.440000 (best of 3) after) wall 6.382280 comb 6.390000 user 5.930000 sys 0.460000 (best of 3)	2014-10-16 23:15:35 -07:00
Pierre-Yves David	7b5edd0cf5	revset-parentspec: call 'getset' on a 'fullreposet' Calling 'baseset(repo.changelog)' builds a list for all revisions in the repo. And we already have the lazy and efficient 'fullreposet' class for this purpose. This gives us the usual benefits of the fullreposet: revset) 100^1 before) wall 0.002694 comb 0.000000 user 0.000000 sys 0.000000 (best of 897) after) wall 0.000997 comb 0.000000 user 0.000000 sys 0.000000 (best of 2324) revset) parents(100)^1 before) wall 0.003832 comb 0.000000 user 0.000000 sys 0.000000 (best of 587) after) wall 0.001034 comb 0.000000 user 0.000000 sys 0.000000 (best of 2309) revset) (100^1)^1 before) wall 0.005616 comb 0.000000 user 0.000000 sys 0.000000 (best of 405) after) wall 0.001030 comb 0.000000 user 0.000000 sys 0.000000 (best of 2258)	2014-10-16 23:15:06 -07:00
Pierre-Yves David	f1982bfd1d	revset-children: call 'getset' on a 'fullreposet' Calling 'baseset(repo.changelog)' builds a list for all revisions in the repo. And we already have the lazy and efficient 'fullreposet' class for this purpose. This gives us the usual benefits of the fullreposet: revset) children(tip~100) before) wall 0.007469 comb 0.010000 user 0.010000 sys 0.000000 (best of 338) after) wall 0.003356 comb 0.000000 user 0.000000 sys 0.000000 (best of 755)	2014-10-16 23:14:17 -07:00
Pierre-Yves David	593610531e	revset-ancestorspec: call 'getset' on a 'fullreposet' Calling 'baseset(repo.changelog)' builds a list for all revisions in the repo. And we already have the lazy and efficient 'fullreposet' class for this purpose. This gives us the usual benefits of the fullreposet: revset) 100~5 before) wall 0.002712 comb 0.000000 user 0.000000 sys 0.000000 (best of 918) after) wall 0.000996 comb 0.000000 user 0.000000 sys 0.000000 (best of 2493) revset) parents(100)~5 before) wall 0.003812 comb 0.010000 user 0.010000 sys 0.000000 (best of 667) after) wall 0.001038 comb 0.000000 user 0.000000 sys 0.000000 (best of 2361) revset) (100~5)~5 before) wall 0.005614 comb 0.000000 user 0.000000 sys 0.000000 (best of 446) after) wall 0.001035 comb 0.000000 user 0.000000 sys 0.000000 (best of 2424)	2014-10-16 23:11:25 -07:00
Pierre-Yves David	985922c845	revset-rangeset: call 'getset' on a 'fullreposet' Calling 'baseset(repo.changelog)' builds a list for all revisions in the repo. And we already have the lazy and efficient 'fullreposet' class for this purpose. This gives us the usual benefit ofs the fullreposet: revset) 10:100 before) wall 0.002774 comb 0.000000 user 0.000000 sys 0.000000 (best of 797) after) wall 0.001977 comb 0.000000 user 0.000000 sys 0.000000 (best of 1244) revset) parents(10):parents(100) before) wall 0.005054 comb 0.000000 user 0.000000 sys 0.000000 (best of 481) after) wall 0.002060 comb 0.000000 user 0.000000 sys 0.000000 (best of 1056)	2014-10-16 23:10:44 -07:00
Matt Mackall	462cd9a28d	merge with stable	2014-11-03 16:56:32 -06:00
Matt Harbison	4db0fae74a	filemerge: split the logic for finding an external tool to its own function This will be used by extdiff in an subsequent patch.	2014-11-02 14:58:50 -05:00
Mads Kiilerich	79172c85c8	config: move mergetools configuration from contrib to default configuration The merge tool configuration is an essential part of a good initial user experience. 'make osx' installers and direct 'make' installation did not have merge tool configuration. Now they have. Note: The installer fixes for windows have been done blindly and might require additional changes.	2014-10-19 03:22:23 +02:00
Mads Kiilerich	7f5af5ba4c	config: introduce "built-in" default configuration settings in default.d This helps providing a more consistent user experience on all platforms and with all packaging. The exact location of default.d depends on how Mercurial is installed and whether it is 'frozen'. The exact location should never be relevant to users and is intentionally not explained in details in the documentation. It will however always be next to the help and templates files. Note that setting HGRCPATH also disables these defaults. I don't know if that should be considered a bug or a feature.	2014-09-04 21:36:35 +02:00
Mads Kiilerich	523c87c1fe	spelling: fixes from proofreading of spell checker issues	2014-04-17 22:47:38 +02:00
Pierre-Yves David	1b8f2c7e41	setdiscovery: limit the size of all sample (issue4411) Further digging on this issue show that the limit on the sample size used in discovery never works for heads. Here is a quote from the code itself: desiredlen = size - len(always) if desiredlen <= 0: # This could be bad if there are very many heads, all unknown to the # server. We're counting on long request support here. The long request support never landed and evolution make the "very many heads, all unknown to the server" case quite common. We implement a simple and stupid hard limit of sample size for all query. This should prevent HTTP 414 error with the current state of the code.	2014-11-01 23:52:53 +00:00
Pierre-Yves David	fddc78071a	hook: protect commit hooks against stripping of temporary commit (issue4422) History rewriting commands like histedit tend to use temporary commits. They may schedule hook execution on these temporary commits for after the lock has been released. But temporary commits are likely to have been stripped before the lock is released (and the hook run). Hook executed for missing revisions leads to various crashes. We disable hooks execution for revision missing in the repo. This provides a dirty but simple fix to user issues.	2014-11-01 23:17:50 +00:00
Pierre-Yves David	2463533597	addset: fix `first` and `last` on sorted addset (issue4426) The lazy sorting were not enforced on addset. This was made visible through MQ.	2014-11-01 22:58:30 +00:00
Matt Mackall	738a03c0f6	clone: properly mark branches closed with --uncompressed (issue4428) On streaming clone, we were priming the local branch cache with the remote branchmap, without checking which heads were closed. This fixes an issue introduced in: changeset: 17740:f8d7aaf86507 user: Tomasz Kleczek <tomasz.kleczek@fb.com> date: Wed Oct 03 13:19:53 2012 -0700 summary: branchcache: fetch source branchcache during clone (issue3378) that was exposed in 2.9 by: changeset: 20192:6c385e85aa05 user: Brodie Rao <brodie@sf.io> date: Mon Sep 16 01:08:29 2013 -0700 summary: branches: simplify with repo.branchmap().iterbranches()	2014-11-01 17:30:57 -05:00
Matt Mackall	19a76cd0ae	debuglocks: add missing usage summary	2014-11-01 13:13:04 -05:00
Mads Kiilerich	a42a10599a	help: don't crash on help for 'sections' with multiple '.'	2014-11-01 19:02:31 +01:00
Mads Kiilerich	ddd482eeb0	help: fix config description of ui.reportoldssl	2014-11-01 18:28:54 +01:00
Durham Goode	a9c8623638	clone: fix copying bookmarks in uncompressed clones (issue4430) 8a92e6790099 broke bookmarks getting copied during uncompressed clones. Since most of the pull logic has been moved into exchange.py, lets just call exchange.pull to fix up the repo with the latest bits after the streaming clone has bootstrapped the repo. This keeps us from having to duplicate the bookmark logic.	2014-10-31 12:56:25 -07:00
Martin von Zweigbergk	ef6448aa8b	revset: don't recreate matcher for every revision The matcher variable 'm' in checkstatus() is reset to None on each call, so the caching of the matcher no longer happens as it was intended. This seems to be a regression in 6b9fbae54476 (revset: added lazyset implementation to checkstatus, 2014-01-03). Fix by moving the cached matcher into the enclosing function so it's actually cached across calls. This speeds up hg log -r 'modifies(mercurial/context.py)' >/dev/null from 7.5s to 4s. Also see similar fix in 5ff5c5c9e69f (revset: avoid recalculating filesets, 2014-10-22).	2014-10-31 10:41:36 -07:00
FUJIWARA Katsunori	65a438a0c5	help: use ":hg:`command`" instead of incorrect ":hg:'command'" notation	2014-11-01 02:43:08 +09:00
FUJIWARA Katsunori	13d9d50897	i18n: add i18n comment to error messages of filesets predicates	2014-11-01 02:43:08 +09:00
FUJIWARA Katsunori	3bf105df13	i18n: add i18n comment to error messages of template functions	2014-11-01 02:43:08 +09:00
FUJIWARA Katsunori	45bd2bfacb	help: refer ":merge3" instead of "internal:merge3" According to warning message (introduced by 727c196b0843) in filemerge.py, the former should be used as official name.	2014-11-01 02:43:08 +09:00
FUJIWARA Katsunori	bf44489048	i18n: make hint message of exception translatable	2014-11-01 02:43:08 +09:00
FUJIWARA Katsunori	18a0e8a463	help: use "hg files" instead of "hg locate" in "hg help filesets" The latter command is already deprecated.	2014-11-01 02:43:08 +09:00
FUJIWARA Katsunori	42fbbbc7eb	files: refer "hg help filesets" instead of "hg help revsets" in help text "specifying FILE patterns" should refer the former.	2014-11-01 02:41:18 +09:00
Pierre-Yves David	34fb3a3cdd	transaction: only generate file when we actually close the transaction Before this change, the file were written for every call to `tr.close()` exposing data to reader far too early.	2014-10-17 21:25:48 -07:00
Pierre-Yves David	245e001eb5	transaction: extract file generation into its own function We extract the code generating files into its own function. We are about to move this code around to fix a bug. We'll need it in a function soon to reuse it for "pending" logic. So we move the code into a function instead of moving it twice.	2014-09-29 00:59:25 -07:00
Matt Harbison	ebc17dec9c	amend: abort early if no username is configured with evolve enabled (issue4211) Amend will reuse the original username if a new one is not provided with -U, but obsolete.createmarkers() only considers ui.username() for the obsolete marker's metadata. Allowing the metadata field to be spoofed seems to defeat the point of the field in the first place. This covers 'evolve amend' and 'ci --amend' with evolve enabled. Without this, the transaction aborts but the parent changeset is set to -1. The corresponding test will be added to evolve separately.	2014-10-28 23:05:19 -04:00
Durham Goode	c9e0ce83ec	revset: fix O(2^n) perf regression in addset hg log -r 1 ... -r 100 was never returning due to a regression in the way addset computes __nonzero__. It used 'bool(self._r1 or self._r2)' which required executing self._r1.__nonzero__ twice (once for the or, once for the bool). hg log with a lot of -r's happens to build a one sided addset tree of N length, which ends up being 2^N performance. This patch fixes it by converting to bool before or'ing. This problem can be repro'd with something as simple as: hg log `for x in $(seq 1 50) ; do echo "-r $x "; done` Adding '1 + 2 + ... + 20' to the revsetbenchmark.txt didn't seem to repro the problem, so I wasn't able to add a revset benchmark for this issue.	2014-10-28 14:06:06 -07:00
Siddharth Agarwal	c9db5b4295	util.fspath: use a dict rather than a linear scan for lookups Previously, we'd scan through the entire directory listing looking for a normalized match. This is O(N) in the number of files in the directory. If we decide to call util.fspath on each file in it, the overall complexity works out to O(N^2). This becomes a problem with directories a few thousand files or larger. Switch to using a dictionary instead. There is a slightly higher upfront cost to pay, but for cases like the above this is amortized O(1). Plus there is a lower constant factor because generator comprehensions are faster than for loops, so overall it works out to be a very small loss in performance for 1 file, and a huge gain when there's more. For a large repo with around 200k files in it on a case-insensitive file system, for a large directory with over 30,000 files in it, the following command was tested: ls \| shuf -n $COUNT \| xargs hg status This command leads to util.fspath being called on $COUNT files in the directory. COUNT before after 1 0.77s 0.78s 100 1.42s 0.80s 1000 6.3s 0.96s I also tested with COUNT=10000, but before took too long so I gave up.	2014-10-24 11:39:39 -07:00
Jordi Gutiérrez Hermoso	d3b97d5baf	doc: change 'revision or range' to 'revision or revset' The phrase "revision or range" comes from a pre-revset era. Since the documentation for ranges now is under the revset docs, and as a helpful hint nudging users towards revsets, I think it's better to say "revision or revset"	2014-10-24 13:50:00 -04:00
Mads Kiilerich	9a3561b211	changelog: use headrevsfiltered 5d1adb6683fa introduced use of the new filtering headrevs C implementation. It caught TypeError to detect when to fall back to the implementation that was compatible with old extensions. That method was however not reliable. Instead, use the new headrevsfiltered function when passing a filter. It will reliably fail with AttributeError when an old extension that predates headrevsfiltered is used.	2014-10-26 12:14:12 +01:00
Mads Kiilerich	40c407ae08	parsers: introduce headrevsfiltered in C extension All extensions that have this function do support filtering. The existing headrevs function may support filtering but we cannot reliably detect whether it does.	2014-10-26 12:14:10 +01:00
Matt Harbison	e8e311feb1	httppeer: close the temporary bundle file after two-way streaming it This fixes several push tests in test-bundle2-exchange.t that were failing on Windows with messages like the following: $ hg -R main push http://localhost:$HGPORT2/ -r 32af7686d403 \ --bookmark book_32af pushing to http://localhost:$HGPORT2/ searching for changes remote: adding changesets remote: adding manifests remote: adding file changes remote: added 1 changesets with 1 changes to 1 files remote: 1 new obsolescence markers updating bookmark book_32af abort: The process cannot access the file because it is being used by another process: 'C:\path\to\tmp\bundle.hg' [255]	2014-10-25 21:34:49 -04:00
Martin von Zweigbergk	a97f944a97	status: make 'hg status --rev' faster when there are deleted files In order not to avoid listing files as both added and deleted, for example, we check for every file in the manifest if it is in the _list_ of deleted files. This can get quite slow when there are many deleted files. Change it to a set to make the containment check faster. On a somewhat contrived example of the Mozilla repo with the entire testing/ directory deleted (~14k files), this makes 'hg status --rev .^' go from 26s to 2s.	2014-10-24 14:24:28 -07:00
Pierre-Yves David	e107a615ed	setdiscovery: limit the size of the initial sample (issue4411) The set discovery start by sending a "known" command with all local heads. When the number of local heads is massive (eg: using hidden changesets) such request becomes too large. This lead to 414 error over http, aborting the whole process. We limit the size of the sample used by the first query to fix this. The test are impacted because they do test massive number of heads. But they do not test it over real world http setup.	2014-10-27 17:52:33 +01:00
Pierre-Yves David	d1263d8d84	setdiscovery: extract sample limitation in a `_limitsample` function We need to reuse this logic for the initial query. We extract it in a function to unsure sample limiting is applied consistently in all cases.	2014-10-27 17:40:32 +01:00
Gregory Szorc	d0818afbd4	exchange: don't report failure from identical bookmarks 48062b2d0f30 regressed the behavior of pushing an unchanged bookmark to a remote. Before that commit, pushing a unchanged bookmark would result in "exporting bookmark @" being printed. After that commit, we now see an incorrect message "bookmark %s does not exist on the local or remote repository!" This patch fixes the regression introduced by 48062b2d0f30 by having the bookmark error reporting code filter identical bookmarks and adds a test for the behavior.	2014-10-24 17:24:46 -05:00
Gregory Szorc	ae20b321ce	bookmarks: explicitly track identical bookmarks bookmarks.compare() previously lumped identical bookmarks in the "invalid" bucket. This patch adds a "same" bucket. An 8-tuple for holding this state is pretty gnarly. The return value should probably be converted into a class to increase readability. But that is beyond the scope of a patch intended to be a late arrival to stable.	2014-10-24 10:40:37 -07:00
Martin von Zweigbergk	061743c247	context.status: remove incorrect swapping of added/removed in workingctx The comment in workingctx.status() says that "calling 'super' subtly reveresed the contexts", but that is simply not true, so we should not be swapping added and removed fields.	2014-10-24 15:52:20 -05:00
Yuya Nishihara	740a18d819	revset: avoid O(n) lookup of invalid revision in rev() 0cc5c10d5dc7 was not the final version of that patch. It was really slow because `l not in repo.changelog` iterates revisions up to `l`. Instead, rev() should utilize spanset.__contains__(). revset #0: rev(210000) 0) wall 0.000039 comb 0.000000 user 0.000000 sys 0.000000 (best of 67978) 1) wall 0.002721 comb 0.000000 user 0.000000 sys 0.000000 (best of 1055) 2) wall 0.000059 comb 0.000000 user 0.000000 sys 0.000000 (best of 45599) (0: 3.2-rc, 1: 0cc5c10d5dc7, 2: this patch) Note that the benchmark result described in 0cc5c10d5dc7 is wrong because it is the one of the initial version.	2014-10-23 21:53:37 +09:00
Wagner Bruna	779ceca4ff	i18n: add hint to digest mismatch message	2014-10-23 12:35:10 -02:00
Wagner Bruna	6791cdb7aa	help/config: fix typo	2014-10-23 12:28:00 -02:00
Wagner Bruna	8b3ed4dd05	files: fix example list syntax	2014-10-23 12:27:57 -02:00
Mads Kiilerich	20e288b0f3	parsers: use 'k' format for Py_BuildValue instead of 'n' because Python 2.4 'n' was introduced in Mercurial in 5d1adb6683fa and broke Python 2.4 support in mysterious ways that only showed failure in test-glog.t. Py_BuildValue failed because of the unknown format and a TypeError was thrown ... but it never showed up on the Python side and it happily continued processing with wrong data. Quoting https://docs.python.org/2/c-api/arg.html : n (integer) [Py_ssize_t] Convert a Python integer or long integer to a C Py_ssize_t. New in version 2.5. k (integer) [unsigned long] Convert a Python integer or long integer to a C unsigned long without overflow checking. This will use unsigned long instead of Py_ssize_t. That is not a good solution, but good is not an option when we have to support Python 2.4.	2014-10-23 02:42:57 +02:00
Ryan McElroy	365c7718eb	amend: fix amending rename commit with diverged topologies (issue4405) This addresses the bug described in issue4405: when obsolescence markers are enabled, amending a commit with a file move can lead to the copy information being lost. However, the bug is more general and can be reproduced without obsmarkers as well, as demonstracted by Pierre-Yves and put into the updated test. Specifically, graph topology divergences between the filelogs and the changelog can cause copy information to be lost during amends.	2014-10-16 06:35:06 -07:00
Augie Fackler	0e58e63b9c	hgweb: disable SSLv3 serving (BC) Because of recent attacks[0] on SSLv3, let's just drop support entirely. 0: http://googleonlinesecurity.blogspot.com/2014/10/this-poodle-bites-exploiting-ssl-30.html	2014-10-21 17:09:37 -04:00
Augie Fackler	e19e1a783c	sslutil: only support TLS (BC) In light of the POODLE[0] attack on SSLv3, let's just drop the ability to use anything older than TLSv1 entirely. This only fixes the client side. Another commit will fix the server side. There are still a few SSLv[23] constants hiding in httpclient, but I'll fix those separately upstream and import them when we're not in a code freeze. 0: http://googleonlinesecurity.blogspot.com/2014/10/this-poodle-bites-exploiting-ssl-30.html	2014-10-21 17:01:23 -04:00
Pierre-Yves David	ad1ec3399d	bundle2: transmit exception during part generation If an exception is raised during a bundle2 part payload generation it is now recorded in the bundle. If such exception occurs, we capture it, transmit an abort exception through the bundle, cleanly close the current part payload and raise it again. This allow to generate valid bundle even in case of exception so that the consumer does not wait forever for a dead producer. This also allow to raise the exception during unbundling at the exact point it happened during bundling make debugging easier.	2014-10-15 03:52:20 -07:00
Pierre-Yves David	38561c6027	bundle2: add a interrupt mechanism It is now possible to emit a single part in the middle of a payload production. This part will be processed with limitation (only access to a `ui` object). The goal is to let the server raise exception and output while a part is being processed. The source motivation is to transmit exception that occurs while generating a part. This change is was the motivation to bump the bundle2 format from HG2X to HG2Y. Somehow, the format bump made it into 3.2 without it. So this change go on stable. It is low risk as bundle2 is still disabled by default.	2014-10-14 10:47:47 -07:00
Durham Goode	fd796ba36d	transactions: change backupfiles format to use newlines Previously the journal.backupfiles file was delimited by \0. Now we delimit it using \n (same as the journal file). This allows us to change the number of values in each line more easily, rather than relying on the count of \0's.	2014-10-21 12:38:28 -07:00
Durham Goode	646f35dbec	transactions: add version number to journal.backupfiles The transaction format will be changing a bit over the next releases, so let's go ahead and add a version number to make backwards compatibility easier. This whole file format was broken prior to 3.2 (see previous patch), so changing it now is pretty low risk.	2014-10-21 11:37:29 -07:00
Durham Goode	faf9d65282	transactions: fix hg recover with fncache backups The transaction backupfiles logic was broken for 'hg recover'. The file format is XXX\0XXX\0YYY\0YYY\0 but the parser did a couple things wrong. 1) It went one step beyond the final \0 and tried to read past the end of the array. 2) array[i:i+1] returns a single item, instead of two items as intended. Added a test to catch it, which turns out to be the first actual 'hg recover' test.	2014-10-20 16:53:56 -07:00
Yuya Nishihara	bac0595bd4	revset: have rev() drop out-of-range or filtered rev explicitly (issue4396) The recent optimization of "and" operation relies on the assumption that the rhs set does not contain invalid revisions. So rev() has to remove invalid revisions. This is still faster than using `.filter(lambda r: r == l)`. revset #0: rev(25) 0) wall 0.026341 comb 0.020000 user 0.020000 sys 0.000000 (best of 113) 1) wall 0.000038 comb 0.000000 user 0.000000 sys 0.000000 (best of 66567) 2) wall 0.000062 comb 0.000000 user 0.000000 sys 0.000000 (best of 43699) (0: 428fa22fb2d1^, 1: 3.2-rc, 2: this patch)	2014-10-19 16:48:33 +09:00
Matt Mackall	56b374dd4e	revset: avoid recalculating filesets This fixes a regression in ea41ca830940 that moved matcher building into a callback, thus causing it be rebuilt for each revision matched against.	2014-10-22 15:47:27 -05:00
Yuya Nishihara	df336dc3c8	ui: separate option to show prompt echo, enabled only in tests (issue4417) The problem in commandserver was addressed by 766cfbe766dc, but it is tricky to reuse ui.nontty option to disable echo back. Instead, this patch introduces new option to enable echoing of prompt response. Prompt echoing is changed to be off by default, which should avoid possible breakage of output parsing in user scripts. For now, this option is undocumented because it exists for internal use.	2014-10-21 23:16:46 +09:00
Pierre-Yves David	b3b7700935	hook: schedule run "b2x-transactionclose" for after lock release Hooks that run after the transaction need to be able to touch the repository. So we need to run them after the lock release. This is similar to what the "changegroup" hook is doing in the `addchangegroup` function.	2014-10-17 02:17:36 -07:00
Pierre-Yves David	2658f42db0	repoview: issue a special message when filtering hidden changesets Hidden changesets are by far the most common error case and is the only one[1] that can reach the user. We move to a friendlier message with a hint about how to access the data anyway. We should probably point to a help topic instead but we do not have such a topic yet. Example of the new output abort: hidden revision '4'! (use --hidden to access hidden revisions) [1] Actually, filtering from "served" can also reach the user during certain exchange operations.	2014-10-17 15:25:32 -07:00
Pierre-Yves David	1dd313bfc1	repoview: include the filter name in filtered revision error messages This will help user to debug. A more precise message will be issued for the most common case ("visible" filter) in the next changesets. example output: - abort: filtered revision '4'! + abort: filtered revision '4' (not in 'visible' subset)!	2014-10-17 15:54:43 -07:00
Mads Kiilerich	489de2727b	ssl: only use the dummy cert hack if using an Apple Python (issue4410) The hack for using certificate store in addition to the provided CAs resides in Apple's OpenSSL. Apple's own Pythons will use it, but other custom built Pythons might use a custom built OpenSSL without that hack and will fail when exposed to the dummy cacert introduced in ee8b7fe5e119. There do not seem to be a simple way to check from Python if we are using a patched OpenSSL or if it is an Apple OpenSSL. Instead, check if the Python executable resides in /usr/bin/python* or in /System/Library/Frameworks/Python.framework/ and assume that all Pythons found there will be native Pythons using the patched OpenSSL. Custom built Pythons will not get the benefit of using the CAs from the certificate store.	2014-10-17 18:56:12 +02:00
Yuya Nishihara	a9b00f4522	cmdserver: include pid of server handling requests in hello message Because unix-mode server forks child process per connection, client does not know the pid of the server that will handle requests. The pid is necessary to interrupt hung process: 1. client connects to socket server 2. server accepts the connection, forks, and tells pid 3. client requests "runcommand pull" .. hung .. 4. client sends SIGINT to the (forked) server 5. server returns from I/O wait Note that getsockopt(SO_PEERCRED) of Linux cannot be used because the server fork()s after accept().	2014-10-18 12:24:50 +09:00
Siddharth Agarwal	146e346b87	lock: while releasing, unlink lockfile even if the release function throws Consider a hypothetical bug in the release function that causes it to raise an exception. Also consider the bisect command, which saves its state in a finally clause. Saving the state requires acquiring the wlock. If we don't unlink the lockfile when the exception is thrown, we'll try to acquire the wlock again. We're going to try and acquire a lock again while our old lockfile is on disk. The PID on disk is our own, and of course we're still running, so we won't take over the lock. Hence we'll be stuck waiting for a lock that we left behind ourselves. To avoid this, always unlink the lockfile. This preserves the invariant that self.held > 0 is equivalent to the lockfile existing on disk.	2014-10-16 19:15:51 -07:00
Augie Fackler	69caabb3af	i18n: cache the result of every gettext call In looking at profiler output for 'hg log' on mozilla-central, I noticed we spent a _huge_ amount of time in gettext relative to what it's doing. Caching provides a roughly 15% performance improvement even on repositories as small as hg. == hg repo on linux == Before: % cumulative self time seconds seconds name 5.05 0.19 0.19 i18n.py:62:gettext 4.84 0.18 0.18 revlog.py:88:decompress 2.95 0.17 0.11 changelog.py:201:node 2.32 0.09 0.09 ui.py:577:write 2.11 0.08 0.08 i18n.py:72:gettext 2.11 0.08 0.08 obsolete.py:196:_fm0readmarkers 1.89 0.07 0.07 obsolete.py:569:_load 1.68 0.63 0.06 localrepo.py:29:__get__ real 0m4.026s user 0m3.993s sys 0m0.034s After: % cumulative self time seconds seconds name 8.05 0.26 0.26 revlog.py:88:decompress 2.68 0.22 0.09 color.py:395:write 2.20 0.07 0.07 obsolete.py:196:_fm0readmarkers 1.95 0.06 0.06 obsolete.py:174:_fm0readmarkers 1.95 0.06 0.06 ui.py:577:write 1.95 0.06 0.06 util.py:1228:datestr 1.71 0.06 0.06 utf_8.py:16:decode 1.71 0.06 0.06 revlog.py:273:__len__ real 0m3.519s user 0m3.447s sys 0m0.073s == mozilla-central repo on linux == Before: % cumulative self time seconds seconds name 7.72 2.35 2.35 revlog.py:88:decompress 4.46 1.36 1.36 i18n.py:62:gettext 2.22 0.67 0.67 i18n.py:72:gettext 2.19 1.14 0.67 changelog.py:201:node 2.16 0.66 0.66 ui.py:577:write 1.96 0.60 0.60 utf_8.py:16:decode 1.93 1.97 0.59 color.py:395:write 1.85 0.81 0.56 changelog.py:136:tip real 0m30.822s user 0m30.660s sys 0m0.149s After: % cumulative self time seconds seconds name 9.82 2.49 2.49 revlog.py:88:decompress 2.67 1.31 0.68 localrepo.py:29:__get__ 2.57 0.65 0.65 utf_8.py:16:decode 2.48 1.01 0.63 changelog.py:201:node 2.10 0.82 0.53 changelog.py:136:tip 2.01 0.51 0.51 ui.py:577:write 1.91 0.49 0.49 util.py:1232:datestr 1.85 1.65 0.47 color.py:395:write real 0m25.619s user 0m25.446s sys 0m0.166s == cpython repo on os x = Before: % cumulative self time seconds seconds name 5.05 1.35 1.35 cmdutil.py:982:_show 4.59 1.22 1.22 revlog.py:274:__len__ 3.98 1.06 1.06 i18n.py:62:gettext 3.91 1.04 1.04 revlog.py:1016:revision 3.68 0.98 0.98 revlog.py:337:parents 3.45 0.92 0.92 revlog.py:88:decompress 2.91 0.78 0.78 revlog.py:309:rev 2.62 0.70 0.70 revlog.py:1033:revision real 0m30.414s user 0m28.145s sys 0m0.541s After: % cumulative self time seconds seconds name 7.98 1.66 1.66 cmdutil.py:982:_show 6.83 1.42 1.42 changelog.py:46:decodeextra 5.18 1.08 1.08 revlog.py:274:__len__ 3.94 0.82 0.82 revlog.py:1016:revision 3.41 0.71 0.71 revlog.py:309:rev 3.32 0.69 0.69 revlog.py:88:decompress 2.99 0.63 0.62 revlog.py:1033:revision 2.69 0.56 0.56 revlog.py:341:start real 0m22.811s user 0m21.883s sys 0m0.397s	2014-10-17 13:52:10 -04:00
Yuya Nishihara	e7ee70da05	util.system: avoid buffering of subprocess output when it is piped util.system() copies subprocess' output through pipe if output file is not stdout. Because a file iterator has internal buffering, output won't be flushed until enough data is available. Therefore, it could easily miss important messages such as "waiting for lock".	2014-08-30 17:38:14 +02:00
Mike Hommey	a290d1f800	bundle2: client side support for a part to import external bundles Bundle2 opens doors to advanced features allowing to reduce load on mercurial servers, and improve clone experience for users on unstable or slow networks. For instance, it could be possible to pre-generate a bundle of a repository, and give a pointer to it to clients cloning the repository, followed by another changegroup with the remainder. For significantly big repositories, this could come as several base bundles with e.g. 10k changesets, which, combined with checkpoints (not part of this change), would prevent users with flaky networks from starting over any time their connection fails. While the server-side support for those features doesn't exist yet, it is preferable to have client-side support for this early-on, allowing experiments on servers only requiring a vanilla client with bundle2 enabled.	2014-10-17 09:57:05 +09:00
Mike Hommey	541e928ae7	bundle2: remove _getbundleextrapart Extensions can now add new part generators with the getbundle2partsgenerator decorator instead.	2014-09-25 11:53:28 +09:00
Matt Mackall	be7ae55914	glossary: add deprecated and experimental	2014-10-17 12:34:27 -05:00
Martin von Zweigbergk	6d828e9c66	resolve: run happily after conflict-free merge	2014-10-08 22:30:04 -07:00
Pierre-Yves David	0d2e3a1dee	revset-phases: prefetch attributes in phasesrelated revsets Pre-fetching attributes gives a significant performance boost. Such is Python. draft() 0) wall 0.011661 comb 0.010000 user 0.010000 sys 0.000000 (best of 205) 1) wall 0.009804 comb 0.000000 user 0.000000 sys 0.000000 (best of 231) draft() - ::bookmark() 0) wall 0.014173 comb 0.010000 user 0.010000 sys 0.000000 (best of 177) 1) wall 0.012966 comb 0.010000 user 0.010000 sys 0.000000 (best of 182)	2014-10-16 17:46:58 -07:00
Pierre-Yves David	8347f164c0	revset-phases: do not cache phase-related filters The phase retrieval is fast enough to not require caching the result of the functions. draft() 0) wall 0.017209 comb 0.020000 user 0.020000 sys 0.000000 (best of 149) 1) wall 0.011654 comb 0.010000 user 0.010000 sys 0.000000 (best of 186) public() 0) wall 0.018687 comb 0.010000 user 0.010000 sys 0.000000 (best of 128) 1) wall 0.013290 comb 0.010000 user 0.010000 sys 0.000000 (best of 181) secret() 0) wall 0.017464 comb 0.020000 user 0.020000 sys 0.000000 (best of 127) 1) wall 0.011499 comb 0.000000 user 0.000000 sys 0.000000 (best of 196) draft() - ::bookmark() 0) wall 0.020099 comb 0.020000 user 0.020000 sys 0.000000 (best of 127) 1) wall 0.014399 comb 0.020000 user 0.020000 sys 0.000000 (best of 169)	2014-10-11 01:21:47 -07:00
Pierre-Yves David	7fc829de42	changectx: issue a FilteredRepoLookupError when applicable We capture FilteredxxxError and issue a FilteredRepoLookupError instead with a sightly different messsge. The message will likely get more improvement in the future. error: filtered revision '4'	2014-10-15 20:37:44 -07:00
Pierre-Yves David	5cf4921b01	repoview: add a FilteredRepoLookupError This exception is a more precise RepoLookupError that will allow us to issue a special message when we end up accessing a filtered revision.	2014-10-16 02:06:12 -07:00
Pierre-Yves David	37d7d2958f	repoview: add a FilteredLookupError class This exception is a more precise LookupError that will allow us to issue a special message when we end up accessing a filtered revision.	2014-10-16 02:05:06 -07:00
Pierre-Yves David	ea3e835124	repoview: add a FilteredIndexError class This exception is a more precise IndexError that will allow us to issue a special message when we end up accessing a filtered revision.	2014-10-15 17:02:44 -07:00
Pierre-Yves David	de4a821442	changectx: move `IndexError` handling in the top level try except This one can be handled by the top level catching.	2014-10-15 16:14:50 -07:00
Pierre-Yves David	4b3ae249b1	changectx: wrap the `changeid` processing in a try/except We are going to introduce more precise exception classes for filtered nodes. So we will have to upgrade them to the `RepoLookupError` level here. We wrap the whole thing into a try/except to ease this future catching. Some of the current exception catching will be moved in this one. But the current changeset focuses on code movement only.	2014-10-15 16:05:24 -07:00
Pierre-Yves David	a4c1167319	bundle2: detect and disallow a negative chunk size We have no usage planned for 2/3 of them and the support for the planned usecase is not here yet. So we raise a BundleValueError when encountered	2014-10-15 03:27:25 -07:00
Pierre-Yves David	2ecc46cd04	bundle2: add an UnsupportedPartError We need the BundleValueError for format errors not related to part support. So we add a specific class for part-support errors.	2014-10-15 03:22:47 -07:00
Pierre-Yves David	15397993b6	bundle2: change header size and make them signed (new format) We are changing all integers that denote the size of a chunk to read to int32. There are two main motivations for that. First, we change everything to the same width (32 bits) to make it possible for a reasonably agnostic actor to forward a bundle2 without any extra processing. With this change, this could be achieved by just reading int32s and forwarding chunks of the size read. A bit a smartness would be logic to detect the end of stream but nothing too complicated. Second, we need some capacity to transmit special information during the bundle processing. For example we would like to be able to raise an exception while a part is being read if this exception happend while this part was generated. Having signed integer let us use negative numbers to trigger special events during the parsing of the bundle. The format is renamed for B2X to B2Y because this breaks binary compatibility. The B2X format support is dropped. It was experimental to allow this kind of things. All elements not directly related to the binary format remain flagged "b2x" because they are still compatible.	2014-10-01 23:40:23 -05:00
Pierre-Yves David	4a9c5fcd2c	bundle2: extract processing of part into its own function This is code movement only. This will be useful to have it separated for reuse purposes. We plan to introduce a new feature to the bundle format that allow inserting a part in the middle of another part payload. This will be useful to transmit a exception raised during a part generation.	2014-10-14 02:32:26 -07:00
Pierre-Yves David	fc1793eb9f	dagwalker: drop a useless intermediate variable The variable used to contain revs.set() but as the `.set()` has been removed it is now useless.	2014-10-08 02:43:51 -07:00
Pierre-Yves David	4de6496309	revset-node: speedup by a few hundred fold Instead of checking all elements of the subset against a single rev, just check if this rev is in the subset. The old way was inherited from when the subset was a list. Non surprise, this provide massive speedup. id("b7dc31e4baa4") before) wall 0.008205 comb 0.000000 user 0.000000 sys 0.000000 (best of 302) after) wall 0.000069 comb 0.000000 user 0.000000 sys 0.000000 (best of 34518) revset #1: public() and id("b7dc31e4baa4") before) wall 0.019763 comb 0.020000 user 0.020000 sys 0.000000 (best of 124) after) wall 0.000101 comb 0.000000 user 0.000000 sys 0.000000 (best of 20130)	2014-10-11 01:39:20 -07:00
Pierre-Yves David	7268257aae	revset-only: use `subset &` instead of filtering The & version is more likely to be optimised. only(.) before) wall 0.003216 comb 0.000000 user 0.000000 sys 0.000000 (best of 768) after) wall 0.001086 comb 0.000000 user 0.000000 sys 0.000000 (best of 2231) only(default, stable) before) wall 0.018469 comb 0.020000 user 0.020000 sys 0.000000 (best of 138) after) wall 0.015888 comb 0.010000 user 0.010000 sys 0.000000 (best of 156)	2014-10-10 17:28:18 -07:00
Pierre-Yves David	4e015a4853	revset-_ancestor: use & instead of filter The & operation is more likely optimised. ::10 before) wall 0.028189 comb 0.030000 user 0.030000 sys 0.000000 (best of 100) after) wall 0.001050 comb 0.000000 user 0.000000 sys 0.000000 (best of 2326) ::tip before) wall 0.081132 comb 0.080000 user 0.080000 sys 0.000000 (best of 100) after) wall 0.055418 comb 0.050000 user 0.050000 sys 0.000000 (best of 100)	2014-09-30 15:03:54 -05:00
Pierre-Yves David	629a6bc7d0	obsstore: record data as floating point in fm0 format For python struct module, "d" is double. But for python string formating, "d" is integer. We want to preserve the floating point nature of the data, so we store it in the metadata as floating point. We use "%r" to make sure we get as many significant digitis as necessary to restore the float to the exact same value on the other side. The fm1 is transmitting the information as float. The lack of this made fm1-stored markers not survive a round-trip to fm0 leading to duplicated markers (or two markers very alike).	2014-10-16 14:46:37 -07:00
Pierre-Yves David	a56876db03	bundle2: add a comment about addchangegroup source and url	2014-10-14 01:27:25 -07:00
Pierre-Yves David	2c1a6817be	pull: call the `b2x-transactionclose` hook when closing the transaction We need a wider set of hooks to process all the changes that happened during the pull transaction. We reuse the experimental `b2x-transactionclose` hook set from server's unbundle for consistency. This hook is experimental and will not remains as-is forever, but this will open the door for experimentation in 3.2.	2014-10-15 04:19:37 -07:00
Pierre-Yves David	98eb7704b3	revset-only: use __nonzero__ to check if a revset is empty For some smartsets, computing length is more expensive than checking if the set is empty.	2014-10-08 02:45:21 -07:00
Pierre-Yves David	4ec24e2ba6	_spanset: drop __getitem__ implementation It is expensive and not part of the official smartset API.	2014-10-15 12:38:47 -07:00
Pierre-Yves David	b634e6fb8f	filteredset: drop __getitem__ implementation It is expensive and not part of the official smartset API.	2014-10-15 12:38:32 -07:00
Pierre-Yves David	8f9f017c39	generatorset: implement __len__ It was the only smartset class without a `__len__` implementation.	2014-10-15 04:28:55 -07:00
Pierre-Yves David	d4bf12d496	revset: make __len__ part of the offical API It is common for code to ask for the length of a revset. In fact, all but generatorset already implement it.	2014-10-15 04:26:23 -07:00
Yuya Nishihara	0baf7a676d	cmdserver: add service that listens on unix domain socket and forks process Typical use case of 'unix' mode is a background hg daemon. $ hg serve --cmdserver unix --cwd / -a /tmp/hg-`id -u`.sock Unlike 'pipe' mode in which parent process keeps stdio channel, 'unix' server can be detached. So clients can freely connect and disconnect from server, saving Python start-up time. It might be better to write "--cmdserver socket -a unix:/sockpath" instead of "--cmdserver unix -a /sockpath" in case hgweb gets the ability to listen on unix domain socket.	2014-10-04 16:46:50 +09:00
Yuya Nishihara	754a73fea4	cmdserver: make server streams switchable In 'unix' mode, server instance will be created per connection, and fin/fout are set to socket files.	2014-09-27 15:10:14 +09:00
Yuya Nishihara	11d619a47e	cmdserver: switch service objects by mode server class will be changed to accept fin/fout pair instead of mode string so that it can interact with socket files.	2014-09-27 15:04:46 +09:00
Yuya Nishihara	84c7e5f2c1	cmdserver: wrap 'pipe' mode server by service object This is the stub for new mode that will listen for connections on unix domain socket. Though --daemon option is not banned in 'pipe' mode, it is useless because the detached 'pipe' mode server exits immediately due to null stdin. Should it abort if --daemon is specified with --cmdserver pipe or --stdio?	2014-09-27 14:52:09 +09:00
FUJIWARA Katsunori	7120dc2e96	encoding: avoid cyclic dependency around "parsers" in pure Python build 39fbe33f95fa brought "asciilower" and "import parsers" into "encoding.py". This works fine with "parsers" module in C implementation, but doesn't with one in pure Python implementation, because the latter causes cyclic dependency below and aborting execution: util => i18n => encoding => parsers => util This patch delays importing "parsers" module until it is really needed, to avoid cyclic dependency around "parsers" in pure Python build.	2014-10-17 02:07:04 +09:00
Pierre-Yves David	830b7b0082	pull: add source information to the transaction The source information can, should be applied once when opening the transaction for the pull. This will lets element processed within a bundle2 be aware of them and open the door to running a set of hooks when closing this pull transaction. This is similar to what is done in server's unbundle call.	2014-10-14 01:40:04 -07:00
Pierre-Yves David	03cb1a74e8	changegroup: store source and url in the `hookargs` dict We store the source and url of the current data into `transaction.hookargs` this let us inherit it from upper layers that may have created a much wider transaction. We have to modify bundle2 at the same time to register the source and url in the transaction. We have to do it in the same patch otherwise, the `addchangegroup` call would fill these values and the hook calling will crash because of the duplicated 'source' and 'url' arguments passed to the hook call.	2014-10-14 00:06:46 -07:00
Pierre-Yves David	ce86284532	prechangegroup: use hook argument from the transaction There can be useful data in there (eg: bundle2 related one)	2014-10-14 00:43:20 -07:00
Pierre-Yves David	0e7fe9a947	addchangegroup: call `prechangegroup` hook after transaction retrieval We want to reused some possible information stored in the transaction `hookargs` dict that may be stored by something handling the transaction at an upper level (eg: bundle2) So we move the running of the hooks after transaction creation. This has no visible effects (but an empty transaction roolback if the hook fails) because nothing had happened in the transaction yet.	2014-10-14 00:09:25 -07:00
Pierre-Yves David	9e19dbeaf9	addchangegroup: get the `node` argument of `incoming` hook from transaction The transaction is now carrying hook-related informations. So we use it to retrieve the `node` argument. This will also carry around all kinds of other useful informations (like: "are we in a bundle2 processing")	2014-10-14 00:03:03 -07:00
Martin von Zweigbergk	11304c2090	dicthelpers: delete now that they are no longer used	2014-10-14 23:53:35 -07:00
Martin von Zweigbergk	9f2b4a3510	manifest: transpose pair of pairs from diff() It makes more sense for the file nodeids and returned from diff() to be ((n1,fl1),(n2,fl2)) than ((n1,n2),(fl1,fl2)), so change it to the former.	2014-10-14 23:18:07 -07:00
Martin von Zweigbergk	a7638ac991	manifest: for diff(), only iterate over files, not flags From manifest.diff(), we return a dict from filename to pairs of pairs of file nodeids and flags (values of the form ((n1,n2),(fl1,fl2))). To create this dict, we currently generate one dict for files (with (n1,n2) values) and one for flags (with (fl1,fl2) values) and then join these dicts. Missing files are represented by None and missing flags by '', but due to the dict joining, the inner pairs themselves can also be None. The only caller, merge.manifestmerge(), then unpacks these values while checking for None values. By inlining the calls to dicthelpers and simplifying it to only iterate over files (ignoring flags-only differences), we can simplify life for our caller.	2014-10-14 22:48:44 -07:00
Martin von Zweigbergk	3ccf5a82f8	manifest: repurpose flagsdiff() into (node-and-flag)diff() The manifestdict class already has a method for diff flags between two manifests (presumably because there is no full access to the private _flags field). The only caller is merge.manifestmerge(), which also wants a diff of files between the same manifests. Let's combine the code for diffing files and flags into a single method on manifestdict. This puts all the manifest diffing in one place and will allow for further simplification. It might also be useful for it to be encapsulated in manifestdict if we later decide to to shard manifests. The docstring is intentionally unclear about missing entries for now.	2014-10-14 17:09:16 -07:00
Mike Hommey	d2b17ca844	util: add a file handle wrapper class that does hash digest validation It is going to be used for the remote-changegroup feature in bundle2.	2014-10-16 17:03:21 +09:00
Mike Hommey	6acd9847bf	util: add a helper class to compute digests It is going to be used for the remote-changegroup feature in bundle2.	2014-10-16 17:02:51 +09:00
Mike Hommey	9962a81363	bundle2: merge return values when bundle contains multiple changegroups A bundle2 may contain multiple parts adding changegroups, in which case there are multiple operation records for changegroups, each with its own return value. Those multiple return values are aggregated in a single cgresult value for the whole operation. As can be seen in the associated test case, the situation with hooks is not really the best, but without deeper thoughts and changes, we can't do much better. Hopefully, things will be improved before bundle2 is enabled by default. In the meanwhile, multiple changegroups is not expected to be in widespread use, and even less expected to be used for pushes. Also, not many clients cloning or pulling bundle2 with multiple changesets are not expected to have changegroup hooks anyways.	2014-10-16 16:03:04 +09:00
Mike Hommey	14669879bf	changegroup: use a copy of hookargs when invoking the changegroup hook addchangegroup creates a runhook function that is used to invoke the changegroup and incoming hooks, but at the time the function is called, the contents of hookargs associated with the transaction may have been modified externally. For instance, bundle2 code affects it with obsolescence markers and bookmarks info. It also creates problems when a single transaction is used with multiple changegroups added (as per an upcoming change), whereby the contents of hookargs are that of after adding a latter changegroup when invoking the hook for the first changegroup.	2014-10-16 15:54:53 +09:00
Mike Hommey	9741dad0cc	util: move md5 back next to sha1 and allow to call it without an argument This effectively backs out changeset 7582042d6cce. The API change is done so that both util.sha1 and util.md5 can be called the same way. The function is moved in order to use it for md5 checksumming for an upcoming bundle2 feature.	2014-09-24 16:00:47 +09:00
Durham Goode	73a4415474	obsolete: prevent options from being used without createmarkers exchange and allowunstable should only be enabled if createmarkers is enabled, so check for that and raise an exception if that's not the case.	2014-10-14 13:27:00 -07:00
Durham Goode	bbb89aae08	obsolete: add exchange option This adds an option that enables obsolete marker exchange.	2014-10-14 13:26:01 -07:00
Durham Goode	faa1c33f8b	obsolete: add allowunstable option This option allows the creation of unstable commits. This allows things like amending in the middle of a stack of commits, etc.	2014-10-14 13:25:13 -07:00
Durham Goode	18f3737f3a	obsolete: add createmarkers option The basic obsolete option is allowing the creation of obsolete markers. This does not enable other features, such as allowing unstable commits or exchanging obsolete markers.	2014-10-14 13:23:52 -07:00
Durham Goode	122b7f93b8	obsolete: add readonly flag to obstore constructor Previously, obstore read the obsolete._enabled flag to determine whether to allow writes to the obstore. Since obsolete._enabled will be moving into a repo specific config, we can't read it globally, and therefore must pass the information into the constructor.	2014-10-14 13:20:31 -07:00
Durham Goode	a10734d5ff	obsolete: add isenabled function for option checking Previously, obsolete used the module level _enabled flag to determine whether it was on or off. We need a bit more granular control, so we'll be introducing toggle options. The isenabled() function is how you check if a particular option is enabled for the given repository. Future patches will add options such as 'createmarkers', 'allowunstable', and 'exchange' to enable various features of obsolete markers.	2014-10-14 13:17:35 -07:00
Durham Goode	2cf7601f2f	obsstore: fix defaultformat option passing The obsstore format passing was not actually being passed to the obsstore. This fixes it.	2014-10-15 12:52:10 -07:00
Augie Fackler	93ba5c156b	checklink: always close the NamedTemporaryFile This fixes test-patchbomb.t when using pypy with --pure.	2014-10-15 15:13:43 -04:00
Mads Kiilerich	4353d6acbb	revset: better naming of variables containing the value of a single argument Calling them args is not helpful.	2014-10-15 04:08:06 +02:00
Augie Fackler	351b53f5d8	manifest: add docstring to text() method	2014-10-14 14:42:25 -04:00
Augie Fackler	9afa9a909c	manifest: rename ambiguously-named set to setflag Just makes it a little clearer what this method does.	2014-10-10 14:09:37 -04:00
Pierre-Yves David	2a99e9b2bc	bookmarks: inform transaction-related hooks that some bookmarks were moved We do not have enough information to provide any finer data, but this is still useful information.	2014-10-12 08:29:31 -07:00
Pierre-Yves David	578b6cd317	phases: inform transaction-related hooks that a phase was moved We do not have enough information to provide finer data, but this is still useful information.	2014-10-12 08:03:20 -07:00
Pierre-Yves David	7e076db46b	pull: use `stepsdone` instead of `todosteps` The push process uses a `stepsdone` attribute instead of a `todosteps` one (with the logic swapped). We unify the two process by picking the `stepsdone` version. I feel like `stepsdone` better fits extensions that would want to extend the push exchange process.	2014-10-12 06:40:36 -07:00
Pierre-Yves David	b60194b959	pull: make discovery phase extensible We apply the same approach as for push and make the discovery extensible. There is only one user in core right now, but we already know we'll need something smarter for obsmarkers. In fact the evolve extension could use this to cleanly extend discovery. The main motivation for this change is consistency between push and pull.	2014-09-27 00:29:06 -07:00
Yuya Nishihara	187868d5fe	sshpeer: forward stdout of remote "hg init" to appropriate output channel Otherwise, commandserver channel could be corrupted.	2014-10-14 21:59:39 +09:00
Mike Edgar	ba052f742a	revlog: support importing censored file revision tombstones This change allows a revision log to not fail integrity checks when applying a changegroup delta (eg from a bundle) results in a censored file tombstone. The tombstone is inserted as-is, so future integrity verification will observe the tombstone. Deltas based on the tombstone will also remain correct. The new code path is encountered for exactly the cases where _addrevision is importing a tombstone from a changegroup. When committing a file containing the "magic" tombstone text, the "text" parameter will be non-empty and the checkhash call is not executed (and when committing, the node will be computed to match the "magic" tombstone text).	2014-09-03 16:34:29 -04:00
Mike Edgar	49d296f5b7	verify: report censored nodes if configured policy is abort	2014-10-14 16:16:04 -04:00
Mike Edgar	7403ba126b	context: handle censored data in an on-disk file context based on config Two possible behaviors are defined for handling censored data: abort, and ignore. When we ignore censored data we return an empty file to callers requesting the file data.	2014-10-14 15:46:16 -04:00
Augie Fackler	d6a88c8e25	manifest: add fastdelta method to manifestdict This is another step closer to alternate manifest implementations that can offer different hashing algorithms.	2014-10-08 15:20:14 -04:00
Augie Fackler	6e25316a67	manifest: move _search to module level and rename to _msearch The rename is intended to provide a slight hint that it is manifest-specific.	2014-10-08 15:21:59 -04:00
Augie Fackler	6d53ff9d24	manifest: move manifestdict-to-text encoding to manifest class A future patch will introduce a new format, with a new class.	2014-10-08 14:47:30 -04:00
Martin von Zweigbergk	7a97abf550	localrepo: access status fields by name rather than index	2014-10-03 13:22:31 -07:00
Martin von Zweigbergk	70f1c6600d	subrepo: use separate instances of empty lists in status We do modify the lists that make up the status in several places, so it seems risky to use the same instance of a list for several different status types. Use a separate empty list for each type instead.	2014-10-11 22:43:14 -07:00
Martin von Zweigbergk	85328e2d45	summary: make status code more readable In commands.summary(), we currently zip a list of labels with a list of statuses. This means the order of the status list has to match the list of the labels, which in turn means the status elements have to be inserted into specific places in the list. Let's instead group the labels and status data we want to display in a single list of pairs.	2014-10-03 09:29:48 -07:00
Martin von Zweigbergk	05e97c1d55	fileset: access status fields by name rather than index	2014-10-04 21:58:01 -07:00
Martin von Zweigbergk	011cf39cde	context: store status class instead of plain tuple in self._status This improves readability a bit by allowing us to refer to statuses by name rather than index.	2014-10-04 21:05:41 -07:00
Martin von Zweigbergk	8b968ecfe2	status: update and move documentation of status types to status class The various status types are currently documented on the dirstate.status() method. Now that we have a class for the status types, it makese sense to document the status types there instead. Only leave the bits related to lookup/unsure in the status() method documentation.	2014-10-10 10:14:35 -07:00
Martin von Zweigbergk	8961a5a15c	status: update various other methods to return new class	2014-10-14 00:52:27 -05:00
Martin von Zweigbergk	41a4138ec7	status: create class for status lists Callers of various status() methods (on dirstate, context, repo) get a tuple of 7 elements, where each element is a list of files. This results in lots of uses of indexes where names would be much more readable. For example, "status.ignored" seems clearer than "status[4]" [1]. So, let's introduce a simple named tuple containing the 7 status fields: modified, added, removed, deleted, unknown, ignored, clean. This patch introduces the class and updates the status methods to return instances of it. Later patches will update the callers. [1] Did you even notice that it should have been "status[5]"? (tweaked by mpm to introduce the class in scmutil and only change one user)	2014-10-10 14:32:36 -07:00
Martin von Zweigbergk	1a4e0a3d51	dirstate: separate 'lookup' status field from others The status tuple returned from dirstate.status() has an additional field compared to the other status tuples: lookup/unsure. This field is just an optimization and not something most callers care about (they want the resolved value of 'modified' or 'clean'). To prepare for a single future status type, let's separate out the 'lookup' field from the rest by having dirstate.status() return a pair: (lookup, status).	2014-10-03 21:44:10 -07:00
Martin von Zweigbergk	ff5a03e917	commit: update file nodeid and flags in the same place Now that we have a separate variable for the original 'm1' manifest, we can safely update the nodeid of the file in the new manifest in the same place as we update the flags.	2014-10-13 14:18:47 -07:00
Martin von Zweigbergk	00bbada8de	commit: use separate variable for p1 manifest and new manifest In localrepo.commitctx(), p1's manifest is copied and used as the basis for the manifest that is about to be committed. The way the copy is updated makes it safe to use it where the original p1's manifest is wanted. For readability, though, a separate variable for each purpose would be clearer. Make it so.	2014-10-13 14:11:47 -07:00
Martin von Zweigbergk	d74577dbf4	commit: remove dead initialization of 'lock' The 'lock' variable is initialized to None, but before it's ever read, it's assigned again.	2014-10-13 14:34:53 -07:00
Martin von Zweigbergk	28d3f795f0	commit: reduce scope of 'removed' variable The variable is closely related to 'added' and 'changed', so it makes sense to have it declared next to them.	2014-10-13 16:43:37 -07:00
Matt Mackall	74cbfe3cf7	graft: use merge.graft	2014-10-13 17:12:31 -05:00
Matt Mackall	0327807819	merge: add merge.graft helper This will help unify all the open-coded graft/rebase operations.	2014-10-13 17:12:12 -05:00
Matt Mackall	f663e5fc01	duplicatecopies: move from cmdutil to copies This is in preparation for moving its primary caller into merge.py, which would be a layering violation in the current location.	2014-10-13 14:33:13 -05:00
Matt Mackall	32d16e5457	graft: move rebase cleanup code next to actual rebase This is prep for refactoring the rebase logic.	2014-10-13 13:21:03 -05:00
Matt Mackall	8e8234eecc	dirstate: merge falls through to otherparent This lets us more correctly fix the state when we use setparents, as demonstrated in the change in test-graft.t.	2014-10-11 14:05:09 -05:00
Matt Mackall	f7a8e82c62	dirstate: use 'm' state in otherparent to reduce ambiguity In rebase-like operations where we abandon the second parent, we can correctly fix up the state in setparents.	2014-10-10 13:31:06 -05:00
Matt Mackall	a44416ab0f	dirstate: properly clean-up some more merge state on setparents	2014-10-10 13:05:50 -05:00
Durham Goode	030db2699f	phases: move root phase assignment to it's own function This moves the initial root phase assignment to it's own function. Future patches which make phase calculations lazy will use this function to pre-fill certain phases which can be deduced from the roots.	2014-10-07 11:42:37 -07:00
Durham Goode	0d0869f7ae	phases: add invalidate function Phase cache invalidation was spread all over the place. Let's add a function to unify it. Later more will be added to this function.	2014-10-07 11:37:54 -07:00
Durham Goode	1b30e7ed86	phases: change phase command change detection A future patch is going to make phase computation lazy, so the phase command can no longer read and diff the entire phase list directly. This changes the phase command to build it's own list for diff purposes.	2014-10-12 23:30:04 -07:00
Pierre-Yves David	551481efc6	spanset: remove `.set()` definition All my friends are dead.	2014-10-10 13:09:22 -07:00
Pierre-Yves David	cdaf453077	generatorset: remove `.set()` definition All my friends are dead.	2014-10-10 13:08:49 -07:00
Pierre-Yves David	9531e16b0d	addset: remove `.set()` definition All my friends are dead.	2014-10-10 13:08:28 -07:00
Pierre-Yves David	f957be2403	filteredset: remove `.set()` definition All my friends are dead.	2014-10-10 13:08:10 -07:00
Pierre-Yves David	fcdeb29add	baseset: remove `set()` definition All my friends are dead.	2014-10-10 13:07:35 -07:00
Pierre-Yves David	a9fcdb25c5	abstractsmartset: remove `set()` method definition Now that all usages have been removed, we can drop this not so useful part of the API. We can note that the name was wrong all along...	2014-10-10 11:27:57 -07:00
Pierre-Yves David	70851c278d	match: check if an object is a baseset using `isascending` instead of `set` The `set()` method is going away.	2014-10-10 14:27:05 -07:00
Pierre-Yves David	0de25934dc	getset: check if an object is a baseset using `isascending` instead of `set` The `set()` method is going away.	2014-10-10 14:22:23 -07:00
Pierre-Yves David	c249a728eb	fullreposet: detect smartset using "isascending" instead of "set" The `.set()` function is going away.	2014-10-10 13:24:57 -07:00

... 5 6 7 8 9 ...

12975 Commits