sapling/lib
Jun Wu 75b4f92c44 indexedlog: support different checksum functions for Log entries (BC)
Summary:
Previously, the format of an Log entry is hard-coded - length, xxhash, and
content. The xxhash always takes 8 bytes.

For small (ex. 40-byte) entries, xxhash32 is actually faster and takes less
disk space.

Introduce the "entry flags" concept so we can store some metadata about what
checksum function to use. The concept could be potentially used to support
other new format changes at per entry level in the future.

As we're here, also support data without checksums. That can be useful for
content with its own checksum, like a blob store with its own SHA1 integrity
check.

Performance-wise, log insertion is slower (but the majority insertaion overhead
would be on the index part), iteration is a little bit faster, perhaps because
the log can use less data.

Before:

  log insertion                  15.874 ms
  log iteration (memory)          6.778 ms
  log iteration (disk)            6.830 ms

After:

  log insertion                  18.114 ms
  log iteration (memory)          6.403 ms
  log iteration (disk)            6.307 ms

Reviewed By: DurhamG, markbt

Differential Revision: D13051386

fbshipit-source-id: 629c251633ecf85058ee7c3ce7a9f576dfac7bdf
2018-12-06 14:57:52 -08:00
..
argparse codemod: add copyright headers 2018-10-26 15:09:12 -07:00
bookmarkstore bookmark: Turn BookmarkStore into indexed-log backed 2018-11-28 10:21:26 -08:00
cdatapack hg: disable check-code tests for C code 2018-06-05 19:21:43 -07:00
clib Started Eden for Windows and integrated hg store with it. 2018-08-21 17:51:26 -07:00
commitcloudsubscriber codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
configparser tests: add a test-check test that runs fix-code.py 2018-11-15 18:54:06 -08:00
encoding encoding: use Cow for returned types that may be references 2018-10-30 04:07:02 -07:00
hg_watchman_client codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
hgpython hgmain: use correct slashes in canonicalized paths 2018-11-28 09:20:02 -08:00
indexedlog indexedlog: support different checksum functions for Log entries (BC) 2018-12-06 14:57:52 -08:00
linelog linelog: update README 2018-11-08 12:34:36 -08:00
lz4-pyframe codemod: add copyright headers 2018-10-26 15:09:12 -07:00
minibench codemod: add copyright headers 2018-10-26 15:09:12 -07:00
mononokeapi mononokeapi: support both http and https 2018-12-03 17:46:51 -08:00
mpatch codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
mpatch-sys codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
nodemap nodemap: add key iteration 2018-12-06 11:47:41 -08:00
pathmatcher codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
radixbuf codemod: add copyright headers 2018-10-26 15:09:12 -07:00
revisionstore tests: add a test-check test that runs fix-code.py 2018-11-15 18:54:06 -08:00
third-party xdiff: backport upstream changes 2018-04-13 21:51:48 -07:00
treestate codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
types codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
vlqencoding vlqencoding: don't require Sized for Read or Write traits 2018-10-29 04:10:46 -07:00
watchman_client codemod: use explicit versions in Cargo.toml 2018-11-15 18:54:06 -08:00
zstdelta rust: upgrade rust to 1.30.0 and bump zstd-sys version 2018-11-06 18:13:20 -08:00
Cargo.toml nodemap: introduce rust bidirectional node map 2018-12-06 11:47:41 -08:00
README.md READMEs: tweaks based on feedback 2018-01-12 12:35:52 -08:00

lib

Any native code (C/C++/Rust) that Mercurial (either core or extensions) depends on should go here. Python code, or native code that depends on Python code (e.g. #include <Python.h> or use cpython) is disallowed.

As we start to convert more of Mercurial into Rust, and write new paths entrirely in native code, we'll want to limit our dependency on Python, which is why this barrier exists.

See also hgext/extlib/README.md, mercurial/cext/README.mb.

How do I choose between lib and extlib (and cext)?

If your code is native and doesn't depend on Python (awesome!), it goes here.

Otherwise, put it in hgext/extlib (if it's only used by extensions) or mercurial/cext (if it's used by extensions or core).