sapling

mirror of https://github.com/facebook/sapling.git synced 2024-10-07 15:27:13 +03:00

Author	SHA1	Message	Date
Durham Goode	e750755eba	revlog: add exception when linkrev == nullrev When we deployed the latest crew mercurial to our users, a few of them had issues where a filelog would have an entry with a -1 linkrev. This caused operations like rebase and amend to create a bundle containing the entire repository, which took a long time. I don't know what the issue is, but adding this check should prevent repos from getting in this state, and should help us pinpoint the issue next time it happens.	2013-06-17 19:44:00 -07:00
Sune Foldager	6bd4fdfe9d	bundle-ng: move group into the bundler No additional semantic changes made.	2013-05-10 21:03:01 +02:00
Alexander Plavin	48936f264c	revlog: fix a regression with null revision Introduced in the patch which fixes issue3497 Part of that patch was erroneously submitted and it shouldn't be in the code	2013-04-18 16:46:09 +04:00
Alexander Plavin	829cf92d16	log: fix behavior with empty repositories (issue3497) Make output in this special case consistent with general case one.	2013-04-17 00:29:54 +04:00
Bryan O'Sullivan	512383d40e	revlog: don't cross-check ancestor result against Python version	2013-04-16 10:08:20 -07:00
Bryan O'Sullivan	c6b9f1099d	parsers: a C implementation of the new ancestors algorithm The performance of both the old and new Python ancestor algorithms depends on the number of revs they need to traverse. Although the new algorithm performs far better than the old when revs are numerically and topologically close, both algorithms become slow under other circumstances, taking up to 1.8 seconds to give answers in a Linux kernel repo. This C implementation of the new algorithm is a fairly straightforward transliteration. The only corner case of interest is that it raises an OverflowError if the number of GCA candidates found during the first pass is greater than 24, to avoid the dual perils of fixnum overflow and trying to allocate too much memory. (If this exception is raised, the Python implementation is used instead.) Performance numbers are good: in a Linux kernel repo, time for "hg debugancestors" on two distant revs (24bf01de7537 and c2a8808f5943) is as follows: Old Python: 0.36 sec New Python: 0.42 sec New C: 0.02 sec For a case where the new algorithm should perform well: Old Python: 1.84 sec New Python: 0.07 sec New C: measures as zero when using --time (This commit includes a paranoid cross-check to ensure that the Python and C implementations give identical answers. The above performance numbers were measured with that check disabled.)	2013-04-16 10:08:20 -07:00
Bryan O'Sullivan	59b785a485	revlog: choose a consistent ancestor when there's a tie Previously, we chose a rev based on numeric ordering, which could cause "the same merge" in topologically identical but numerically different repos to choose different merge bases. We now choose the lexically least node; this is stable across different revlog orderings.	2013-04-16 10:08:19 -07:00
Bryan O'Sullivan	4a3a46aff6	ancestor: a new algorithm that is faster for nodes near tip Instead of walking all the way to the root of the DAG, we generate a set of candidate GCA revs, then figure out which ones will win the race to the root (usually without needing to traverse all the way to the root). In the common case of nodes that are close to each other in both revision number and topology, this is usually a big win: it makes "hg --time debugancestors" up to 9 times faster than the more general ancestor function when measured on heads of the linux-2.6 hg repo. Victory is not assured, however. The older function can still win by a large margin if one node is much closer to the root than the other, or by a much smaller amount if one is an ancestor of the other. For now, we've also got a small paranoid harness function that calls both ancestor functions on every input and ensures that they give equivalent answers. Even without the checker function, the old ancestor function needs to stay alive for the time being, as its generality is used by context.filectx.merge.	2013-04-16 10:08:18 -07:00
Benoit Boissinot	41300c28e0	revlog: document v0 format	2013-02-09 12:08:02 +01:00
Siddharth Agarwal	4d560304bb	revlog: move ancestor generation out to a new class This refactoring is to prepare for implementing lazy membership.	2012-12-18 10:14:01 -08:00
Siddharth Agarwal	cca6ff3076	revlog: remove incancestors since it is no longer used	2012-12-17 15:08:37 -08:00
Siddharth Agarwal	6d5198a5a3	revlog.ancestors: add support for including revs This is in preparation for an upcoming refactoring. This also fixes a bug in incancestors, where if an element of revs was an ancestor of another it would be generated twice.	2012-12-17 15:13:51 -08:00
Pierre-Yves David	9ac120f569	revlog: allow reverse iteration with revlog.revs We often need to perform rev iteration in reverse order. This changeset makes it possible to do so, in order to avoid costly reverse or reversed() calls later.	2012-11-21 00:42:05 +01:00
Siddharth Agarwal	b05b94a300	revlog: add rev-specific variant of findmissing This will be used by rebase in an upcoming commit.	2012-11-26 10:48:24 -08:00
Siddharth Agarwal	76a23a18f8	revlog: switch findmissing to use ancestor.missingancestors This also speeds up other commands that use findmissing, like incoming and merge --preview. With a large linear repository (>400000 commits) and with one incoming changeset, incoming is sped up from around 4-4.5 seconds to under 3.	2012-11-26 11:02:48 -08:00
Durham Goode	cce0517fb6	commit: increase perf by avoiding unnecessary filteredrevs check When commiting to a repo with lots of history (>400000 changesets) the filteredrevs check (added with 373606589de5) in changelog.py takes a bit of time even if the filteredrevs set is empty. Skipping the check in that case shaves 0.36 seconds off a 2.14 second commit. A 17% gain.	2012-11-16 15:39:12 -08:00
Pierre-Yves David	6d6a3d27a5	clfilter: split `revlog.headrevs` C call from python code Make the pure python implementation of headrevs available to derived classes. It is important because filtering logic applied by `revlog` derived class won't have effect on `index`. We want to be able to bypass this C call to implement our own.	2012-09-03 14:19:45 +02:00
Pierre-Yves David	23fb63d637	clfilter: handle non contiguous iteration in `revlov.headrevs` This prepares changelog level filtering. We can't assume that any revision can be heads because filtered revisions need to be excluded. New algorithm: - All revisions now start as "non heads", - every revision we iterate over is made candidate head, - parents of iterated revisions are definitely not head. Filtered revisions are never iterated over and never considered as candidate head.	2012-09-03 14:12:45 +02:00
Pierre-Yves David	6981326b92	clfilter: make the revlog class responsible of all its iteration This prepares changelog level filtering. We need the algorithms used in revlog to work on a subset of revisions. To achieve this, the use of explicit range of revision is banned. `range` and `xrange` calls are replaced by a `revlog.irevs` method. Filtered super class can then overwrite the `irevs` method to filter out revision.	2012-09-20 19:00:59 +02:00
Mads Kiilerich	2f4504e446	fix trivial spelling errors	2012-08-15 22:38:42 +02:00
Matt Mackall	5b06da939f	backout 94ae81a4e338 This may have allowed unbounded I/O sizes with the current chunk retrieval code.	2012-07-12 14:20:34 -05:00
Martin Geisler	c52341ae3f	merge with main	2012-07-12 10:03:50 +02:00
Friedrich Kastner-Masilko	a6245a11d3	revlog: fix for generaldelta distance calculation The decision whether or not to store a full snapshot instead of a delta is done based on the distance value calculated in _addrevision.builddelta(rev). This calculation traditionally used the fact of deltas only using the previous revision as base. Generaldelta mechanism is changing this, yet the calculation still assumes that current-offset minus chainbase-offset equals chain-length. This appears to be wrong. This patch corrects the calculation by means of using the chainlength function if Generaldelta is used.	2012-07-11 12:38:42 +02:00
Bryan O'Sullivan	26f2c363fd	revlog: make compress a method This allows an extension to optionally use a new compression type based on the options applied by the repo to the revlog's opener. (decompress doesn't need the same treatment, as it can be replaced using extensions.wrapfunction, and can figure out which compression algorithm is in use based on the first byte of the compressed payload.)	2012-06-25 13:56:13 -07:00
Joshua Redstone	09130c5cf2	revlog: remove reachable and switch call sites to ancestors This change does a trivial conversion of callsites to ancestors. Followon diffs will switch the callsites over to revs.	2012-06-08 08:39:44 -07:00
Joshua Redstone	70aeee0070	revlog: add incancestors, a version of ancestors that includes revs listed ancestors() returns the ancestors of revs provided. This func is like that except it also includes the revs themselves in the total set of revs generated.	2012-06-08 07:59:37 -07:00
Thomas Arendsen Hein	01adf7776d	merge heads	2012-06-07 15:55:12 +02:00
Brad Hall	f20c06750f	revlog: zlib.error sent to the user (issue3424) Give the user the zlib error message instead of a backtrace when decompression fails.	2012-06-04 14:46:42 -07:00
Joshua Redstone	e38b770424	revlog: add optional stoprev arg to revlog.ancestors() This will be used as a step in removing reachable() in a future diff. Doing it now because bryano is in the process of rewriting ancestors in C. This depends on bryano's patch to replace *revs with revs in the declaration of revlog.ancestors.	2012-06-01 15:44:13 -07:00
Bryan O'Sullivan	141bd09daa	revlog: descendants(*revs) becomes descendants(revs) (API) Once again making the API more rational, as with ancestors.	2012-06-01 12:45:16 -07:00
Bryan O'Sullivan	6ba97b40c1	revlog: ancestors(revs) becomes ancestors(revs) (API) Accepting a variable number of arguments as the old API did is deeply ugly, particularly as it means the API can't be extended with new arguments. Partly as a result, we have at least three different implementations of the same ancestors algorithm (!?). Most callers were forced to call ancestors(somelist), adding to both inefficiency and ugliness.	2012-06-01 12:37:18 -07:00
Bryan O'Sullivan	abdf4a8227	util: subclass deque for Python 2.4 backwards compatibility It turns out that Python 2.4's deque type is lacking a remove method. We can't implement remove in terms of find, because it doesn't have find either.	2012-06-01 17:05:31 -07:00
Bryan O'Sullivan	bef5b61512	cleanup: use the deque type where appropriate There have been quite a few places where we pop elements off the front of a list. This can turn O(n) algorithms into something more like O(n**2). Python has provided a deque type that can do this efficiently since at least 2.4. As an example of the difference a deque can make, it improves perfancestors performance on a Linux repo from 0.50 seconds to 0.36.	2012-05-15 10:46:23 -07:00
Bryan O'Sullivan	a49ea963d7	revlog: switch to a C version of headrevs The C implementation is more than 100 times faster than the Python version (which is still available as a fallback). In a repo with 330,000 revs and a stale .hg/cache/tags file, this patch improves the performance of "hg tip" from 2.2 to 1.6 seconds.	2012-05-19 19:44:58 -07:00
Matt Mackall	42c30757a2	revlog: don't handle long for revision matching The underlying C code doesn't support indexing by longs, there are no legitimate reasons to use a long, and longs should generally be converted to ints at a higher level by context's constructor.	2012-05-21 16:36:09 -05:00
Brodie Rao	a7ef0a0cc5	cleanup: "not x in y" -> "x not in y"	2012-05-12 16:00:57 +02:00
Bryan O'Sullivan	058dfb801d	revlog: speed up prefix matching against nodes The radix tree already contains all the information we need to determine whether a short string is an unambiguous node identifier. We now make use of this information. In a kernel tree, this improves the performance of "hg log -q -r24bf01de75" from 0.27 seconds to 0.06.	2012-05-12 10:55:08 +02:00
Matt Mackall	a97dbbe308	revlog: backout df8c4d732869 This regresses performance of 'hg branches', presumably because it's visiting the revlog in the wrong order. This suggests we either need to fix the branch code or add some read-behind to mitigate the effect.	2012-04-27 13:07:29 -05:00
Patrick Mezard	e9454c243f	revlog: fix partial revision() docstring (from f4a6c9197dbd)	2012-04-13 10:14:59 +02:00
Matt Mackall	a6546db90e	revlog: drop some unneeded rev.node calls in revdiff	2012-04-13 22:55:46 -05:00
Bryan O'Sullivan	62554752c6	revlog: avoid an expensive string copy This showed up in a statprof profile of "hg svn rebuildmeta", which is read-intensive on the changelog. This two-line patch improved the performance of that command by 10%.	2012-04-12 20:26:33 -07:00
Matt Mackall	4e0b41f193	revlog: increase readahead size	2012-04-13 21:35:48 -05:00
Bryan O'Sullivan	dc46676e81	parsers: use base-16 trie for faster node->rev mapping This greatly speeds up node->rev lookups, with results that are often user-perceptible: for instance, "hg --time log" of the node associated with rev 1000 on a linux-2.6 repo improves from 0.3 seconds to 0.03. I have not found any instances of slowdowns. The new perfnodelookup command in contrib/perf.py demonstrates the speedup more dramatically, since it performs no I/O. For a single lookup, the new code is about 40x faster. These changes also prepare the ground for the possibility of further improving the performance of prefix-based node lookups.	2012-04-12 14:05:59 -07:00
Matt Mackall	055cba03a8	revlog: allow retrieving contents by revision number	2012-04-08 12:38:02 -05:00
Matt Mackall	30645d82e7	revlog: add hasnode helper method	2012-04-07 15:43:18 -05:00
Pierre-Yves David	15ab7ccd15	revlog: make addgroup returns a list of node contained in the added source This list will contains any node see in the source, not only the added one. This is intended to allow phase to be move according what was pushed by client not only what was added.	2012-01-13 01:29:03 +01:00
Pierre-Yves David	a51dc67424	revlog: improve docstring for findcommonmissing	2012-01-09 04:15:31 +01:00
Steven Brown	3ebdb5ed19	revlog: clarify strip docstring "readd" -> "re-add" I misread it as "read".	2012-01-10 22:35:25 +08:00
Matt Mackall	864ce9da04	misc: adding missing file close() calls Spotted by Victor Stinner <victor.stinner@haypocalc.com>	2011-11-03 11:24:55 -05:00
Greg Ward	bc1dfb1ac9	atomictempfile: make close() consistent with other file-like objects. The usual contract is that close() makes your writes permanent, so atomictempfile's use of close() to discard writes (and rename() to keep them) is rather unexpected. Thus, change it so close() makes things permanent and add a new discard() method to throw them away. discard() is only used internally, in __del__(), to ensure that writes are discarded when an atomictempfile object goes out of scope. I audited mercurial., hgext., and ~80 third-party extensions, and found no one using the existing semantics of close() to discard writes, so this should be safe.	2011-08-25 20:21:04 -04:00

1 2 3 4 5 ...

433 Commits