sapling

mirror of https://github.com/facebook/sapling.git synced 2024-10-13 02:07:31 +03:00

Author	SHA1	Message	Date
Jun Wu	a4322ca904	logrotate: make writable_log private Summary: The `writable_log` API can be misused to "flush" a Log, bypassing the check about whether it should be rotated or not. The real need of `writable_log` is to get accesses to indexes on the "writable" (or "latest") log. Therefore let's just expose that instead. Practically, the only use case of querying the index on the "latest" log is to make sure dependent content are written to a same Log. That also requires a "flush_filter" to be provided. Therefore add an assertion about it. Differential Revision: D14866022 fbshipit-source-id: f6c07a498597b6f0f07d7cc3130e9033ba8b9be4	2019-04-10 19:50:01 -07:00
Jun Wu	f364cd1420	logrotate: add flush_filter Summary: Introduce the "flush filter" that can replace content to be written. This would be useful to make sure delta chains are self-contained. For LogRotate, flush_filter is trigger not only when the log file was modified, but also when rotation happens, Differential Revision: D14866024 fbshipit-source-id: f417200d3ae573e9ac82985ad6afd082412b358d	2019-04-10 19:50:01 -07:00
Jun Wu	2e300dba51	indexedlog: add a flush_filter function to Log Summary: The flush filter allows mutating entries being flushed. It can be used to avoid inserting duplicated data. Differential Revision: D14866023 fbshipit-source-id: ecf6cf60a0a97cf8110ef9c957e7e3bbab5855fc	2019-04-10 19:50:00 -07:00
Jun Wu	0cd1d8ce9d	indexedlog: error out if the primary log does not match metadata Summary: Previously the code allows the "log" file to be longer than the metadata, intended to allow advanced usecases that replaces the "meta" file to get a read-only view in the past. That implies we trust the length of "log" file. But it's in theory easy to mess up - when appending to the "log" file, the process might be killed. Data integrity is first priority. Therefore let's just error out if the file length does not match the metadata. To support read-only views in the past, we can use potentially use file names other than "meta" or support in-memory metadata instead. Differential Revision: D14866025 fbshipit-source-id: bbf0061a6448375a2de06fbf31f2b9838c749be0	2019-04-10 19:50:00 -07:00
Jun Wu	12b98e1e96	indexedlog: use failure for error handling Summary: Failure makes it easier to chain errors, and backtraces. Use it. There is probably still room for improvement, by chainning errors and avoiding exposing low-level errors for APIs, and/or provide more context in error messages. But it should be already much better than before. Differential Revision: D14759305 fbshipit-source-id: b1d3a8ec959dde575f06533ea9e4cd0757057051	2019-04-05 12:17:28 -07:00
Jun Wu	dbfad715b8	logrotate: reduce max_log_count to u8 range Summary: Practically there are many issues with a large max_log_count: - The directory scan would be slower. - The index would be slower. Let's reduce it to u8 range to address the issues. This also makes the directory name short. Differential Revision: D14717896 fbshipit-source-id: d39f008abe576991e14d444c37a049a6132df507	2019-04-03 22:16:33 -07:00
Jun Wu	cf82cb6340	indexedlog: replace atomicwrites with tempfile Summary: Some tests added by upcoming diffs were timing out while they don't seem that expensive. I tracked it down to the use of `fsync` in atomicwrites. In our case, we don't need `fsync`. `fsync` is useful for making sure the order of file writes is desired even in case of system crash. For example, making sure the "primary" log file is written before writing the "meta" file. That's too expensive (esp. on filesystems like ext4) for our usecase. Indexedlog is designed to make sure data corruption can be detected, and there can be a "reasonable" way to recover (ex. by deleting all indexes, scanning through entries and re-inserting them in a new log), not to fight against OS crashes. `cargo bench` change on a btrfs filesystem: Before: index flush 42.570 ms log flush 7.712 ms After: index flush 36.485 ms log flush 1.609 ms Differential Revision: D14759304 fbshipit-source-id: 66b95d10040cf1480367b767811dfabee5e27ffe	2019-04-03 22:16:33 -07:00
Jun Wu	7cb1663ae0	indexedlog: migrate to Rust 2018 Summary: Used `cargo fix --edition`. Removed some `mut`s according to rustc warnings. Differential Revision: D14718308 fbshipit-source-id: 94e3c3f8e47143ede767fe883fdb5e9602b12854	2019-04-03 22:16:33 -07:00
Jun Wu	acc0aaea7d	indexedlog: migrate from tempdir to tempfile Summary: `tempdir` is deprecated. Use `tempfile` instead. Differential Revision: D14690867 fbshipit-source-id: f5df77708078538a0832bd941726f280ed97355f	2019-04-01 17:16:18 -07:00
Jun Wu	1e59d25e17	indexedlog: add OpenOptions::index Summary: Make it a bit easier to define indexes. Before: OpenOptions::new() .index_defs(vec![IndexDef::new("first-byte", \|_\| { vec![IndexOutput::Reference(0..1)] })]) After: OpenOptions::new() .index("first-byte", \|_\| vec![IndexOutput::Reference(0..1)]) Reviewed By: kulshrax Differential Revision: D14690357 fbshipit-source-id: 6e80a91f4279f960d9f41369c228e79023b5164c	2019-04-01 17:16:17 -07:00
Jun Wu	88fb64a6ee	indexedlog: use monospace font for links to code Summary: The Rust stdlib uses this pattern. This is done by: sed -i 's/\[\([A-Z][a-zA-Z:]\)\]/[`\1`]/g' .rs Unfortunately it seems only rustdoc nightly can linkify things correctly. More context: https://github.com/rust-lang/rust/issues/43466 Reviewed By: kulshrax Differential Revision: D14689887 fbshipit-source-id: ba2b5968bdaad06f39dc43962430906ee80692fd	2019-04-01 17:16:17 -07:00
Jun Wu	7c74b40bc1	logrotate: de-dup logic in OpenOptions Summary: rotate::OpenOptions is a superset of log::OpenOptions. Change the code to reuse logic in log::OpenOptions as much as possible. Reviewed By: kulshrax Differential Revision: D14689888 fbshipit-source-id: a6958723c49f9d41b03100f01283a8c3fb37a1ab	2019-04-01 17:16:17 -07:00
Jun Wu	277d25b581	indexedlog: move checksum_type to OpenOptions Summary: The motivation of this is, LogRotate might copy dirty (non-flushed) entries from one Log to another, and it cannot preserve the checksum type for those entries. There are 2 solutions: - Make `iter_dirty` return checksum type. - Make checksum type known by Log directly. The second choice provides a simpler public API. `append_advanced` can be removed, then `iter_dirty` is still consistent with `iter`. Therefore this change. Differential Revision: D14688174 fbshipit-source-id: 09e07d64c886a5ce9bc48dce8e29d036af1c0381	2019-04-01 17:16:16 -07:00
Jun Wu	8fc9742997	indexedlog: make Log own OpenOptions Summary: A later diff adds another field to OpenOptions that Log needs access to. Differential Revision: D14688171 fbshipit-source-id: 33170a2b74639ba0fd8a9c86207d840fb6427580	2019-04-01 17:16:16 -07:00
Jun Wu	341b3dad6f	logrotate: make flush delete old logs Summary: This is the final piece to make space usage bounded. Differential Revision: D14688179 fbshipit-source-id: a6e0058b9022789fcf036c4427d29eab19144b53	2019-04-01 17:16:16 -07:00
Jun Wu	b1b92b8def	logrotate: make flush handle "latest" change Summary: If "latest" pointer has changed, we should write to the new "latest" Log, instead of the stale one. Differential Revision: D14688180 fbshipit-source-id: eab8df8ddb8f311e472361ecc2b1bc4155f2aba4	2019-04-01 17:16:15 -07:00
Jun Wu	c23508dcd9	indexedlog: add Log::iter_dirty Summary: This API iterates entries that are in-memory only. It is useful to extract entries and store them elsewhere. Differential Revision: D14688178 fbshipit-source-id: 6ace51d859ba6886aeb94689f6c45162b9c6958e	2019-04-01 17:16:15 -07:00
Jun Wu	f38bbfd92e	logrotate: partially implement flush Summary: Implement the basic flush logic. Missing bits are listed as TODO items. Differential Revision: D14688177 fbshipit-source-id: 3613009ec2c216398af6eaff44487a20ceeb97ef	2019-04-01 17:16:15 -07:00
Jun Wu	cd1750f06b	indexedlog: make Log::flush return the new file size Summary: The file size will be used to decide whether the Log needs "rotate" in upcoming changes. Reviewed By: kulshrax Differential Revision: D14688169 fbshipit-source-id: b273abcc870b96650d2c76e6e742a3141ce48f13	2019-04-01 17:16:15 -07:00
Jun Wu	ec90e8db57	logrotate: implement append and lookup Summary: These methods just delegate to `Log` structures. Unfortunately, the key has to be copied so it can be used by the iterator to query remaining logs. Differential Revision: D14688172 fbshipit-source-id: fd581f7256031a0622ec0533c84daaab89f9bb82	2019-04-01 17:16:14 -07:00
Jun Wu	aecd9edae9	logrotate: implement open Summary: Implement the open logic. Reviewed By: kulshrax Differential Revision: D14688170 fbshipit-source-id: df3d39040e2268b3eddb131b2ae1b1f76d3e4311	2019-04-01 17:16:14 -07:00
Jun Wu	f160f31cde	logrotate: add a LogRotate structure Summary: Start implementing the "log rotate" idea by markbt. It is similar to logrotate, with plain text log files replaced by indexedlog. This implementation also avoids renaming, which can be troublesome on Windows, by just increasing the number (ex. to rotate "1/", "2/", create "3/", and delete "1/", without renaming "2/"). The main use case would be LRU key-value cache on disk. Reviewed By: kulshrax Differential Revision: D14688176 fbshipit-source-id: 3bf7917e06386ebf85d8d6deeea850c58f4875e8	2019-04-01 17:16:14 -07:00
Jun Wu	a7371c96d3	indexedlog: add create option to Log::OpenOptions Summary: One of the future need is to open a `Log` without creating it by default. The newly added `create` option can be disabled to prevent that. This also changes the code path so we no longer take a directory lock unconditionally during `open`. Differential Revision: D14688173 fbshipit-source-id: 88795d5637a1a5135d4014434b2cf828540c0333	2019-04-01 17:16:13 -07:00
Jun Wu	6555afa621	indexedlog: add Log::OpenOptions Summary: One of the upcoming changes is to add an option to avoid creating Log on demand at open time. To avoid `open` being too complicated, add an `OpenOptions` struct. This is consistent with `index` and `std::fs`. Differential Revision: D14688175 fbshipit-source-id: bb7f1556a32f1f7b15c64a23c5aee7493dd40ce6	2019-04-01 17:16:13 -07:00
Jun Wu	64db96a4b7	indexedlog: make IndexDef clone-able Summary: It's hard to clone a `Fn`. But `fn` can be cloned. Change the API to use `fn` instead. Cloning `IndexDef` allows the same index definition to be used by multiple Logs. It's used by upcoming diffs. Differential Revision: D14688181 fbshipit-source-id: 6fda03a5f744dc90ee5d7ad3f36c243602f33510	2019-03-30 08:59:13 -07:00
Jun Wu	69a6c18747	indexedlog: normalize benchmarks to use 204800 entries Summary: This makes it easier to compare benchmark results between abstractions. A sample of the result is listed below. Comparing to radixbuf, which is highly optimized and less flexible, indexedlog is about 10x slower on insertion, and about 3x slower on lookup. indexedlog: index insertion (owned key) 90.201 ms index insertion (referred key) 81.567 ms index flush 50.285 ms index lookup (memory) 25.201 ms index lookup (disk, no verify) 31.325 ms index lookup (disk, verified) 46.893 ms log insertion 18.421 ms log insertion (no checksum) 12.106 ms log insertion with index 110.143 ms log flush 8.783 ms log iteration (memory) 6.444 ms log iteration (disk) 6.719 ms raidxbuf: index insertion 11.874 ms index lookup 8.495 ms Differential Revision: D14635330 fbshipit-source-id: 28b3f33b87f4e882cb3839c37a2a11b8ac80d3e9	2019-03-27 16:29:58 -07:00
Jun Wu	1568a30c9a	indexedlog: add a benchmark inserting entries without checksum Summary: This is just a trivial test case showing the overhead of xxhash. log insertion 18.359 ms log insertion (no checksum) 7.835 ms Differential Revision: D14635329 fbshipit-source-id: adc2629c0c41aaab48d29d467849e4d96eb01c51	2019-03-27 16:29:58 -07:00
Jun Wu	09b26ed273	indexedlog: resolve a warning Summary: `std::fs` is only needed for Windows. Do not "use" it on *nix systems. Reviewed By: sfilipco Differential Revision: D14634779 fbshipit-source-id: 9fd9a29ae27e13f00b4adbc83a74bd92a1b1658c	2019-03-26 21:19:46 -07:00
Jun Wu	1348cf45f5	indexedlog: make fields in IndexDef private Summary: Change fields in IndexDef to private. Provide a public constructor method and switch users to use that instead. This makes it possible to change the IndexDef struct in the future (ex. having extra optional fields about whether the index is backed by radix tree or something different). Differential Revision: D14608955 fbshipit-source-id: 62a413268d97ba96b2c4efd2ce67cd4fa0ff4293	2019-03-26 21:19:46 -07:00
Jun Wu	b6631c103d	indexedlog: fix tests on Windows Summary: Windows disallows rewriting or truncating mmaped files. Fix the tests by either dropping the mmap, or skipping the test. Reviewed By: sfilipco Differential Revision: D14572119 fbshipit-source-id: dccafdc66db3830c2919232d899ba31365120066	2019-03-22 11:37:35 -07:00
Jun Wu	8021c26449	indexedlog: avoid filesystem race on log creation Summary: The `load_or_create_meta` function is subject to filesystem races. Solve it by always taking a lock. This hurts performance a little bit. But `open()` should not be in a hot loop. So it should probably be fine. Reviewed By: sfilipco Differential Revision: D14568122 fbshipit-source-id: d9b28555ab94252da4717de709b780b361e1dda7	2019-03-22 11:37:35 -07:00
Jun Wu	c74e894aa1	indexedlog: make directory locking work on Windows Summary: On Windows it's impossible to open (2) a directory. Therefore add a utility function that creates `lock` file automatically on Windows and open that file instead. Reviewed By: sfilipco Differential Revision: D14568117 fbshipit-source-id: bc7ae7046be654560c38fbd98ec4dd58c071b1dc	2019-03-22 11:37:35 -07:00
Jun Wu	9361d54b04	indexedlog: make sure meta file is created Summary: Previously, `load_or_create_meta` could return without actually creating the meta file. That leads to problems when `load_or_create_meta` is called a second time via `flush()`, it rewrites the primary file incorrectly. On Windows, it will fail to rewrite the primary file. Fix it by actually writing a meta file before returning. Reviewed By: sfilipco Differential Revision: D14568118 fbshipit-source-id: da3ad42bf48a923d732b1719839ca1953bd2b06c	2019-03-22 11:37:35 -07:00
Jun Wu	7e77bf81f0	indexedlog: rustfmt files Summary: As the title. Reviewed By: sfilipco Differential Revision: D14568120 fbshipit-source-id: 7f2d8fb31e8f6c57976996e41f7a61503f5873a5	2019-03-22 11:37:35 -07:00
Jun Wu	b3893b3d3c	indexedlog: add methods on Log to do prefix lookups Summary: This exposes the underlying lookup functions from `Index`. Alternatively we can allow access to `Index` and provide an `iter_started_from` method on `Log` which takes a raw offset. I have been trying to avoid exposing raw offsets in public interfaces, as they would change after `flush()` and cause problems. Reviewed By: markbt Differential Revision: D13498303 fbshipit-source-id: 8b00a2a36a9383e3edb6fd7495a005bc985fd461	2018-12-20 15:50:55 -08:00
Jun Wu	3237b77e4c	indexedlog: add APIs to lookup by prefix Summary: This is the missing API before `indexedlog::Index` can fit in the `changelog.partialmatch` case. It's actually more flexible as it can provide some example commit hashes while the existing revlog.c or radixbuf implementation just error out saying "ambiguous prefix". It can be also "abused" for the semantics of sorted "sub-keys". By replace "key" with "key + subkey" when inserting to the index. Looking up using "key" would return a lazy result list (`PrefixIter`) sorted by "subkey". Note: the radix tree is NOT efficient (both in time and space) when there are common prefixes. So this use-case needs to be careful. Reviewed By: markbt Differential Revision: D13498301 fbshipit-source-id: 637856ebd761734d68b20c15866424b1d4518ad6	2018-12-20 15:50:55 -08:00
Jun Wu	562b7a1704	indexedlog: add a function to convert base16 to base256 Summary: This will be used in prefix lookups. Reviewed By: markbt Differential Revision: D13498300 fbshipit-source-id: 3db7a21d6f35a18699d9dc3a0eca71a5410e0e61	2018-12-20 15:50:55 -08:00
Jun Wu	443a8f33b3	indexedlog: move binary indexedlog_dump out Summary: It makes testing duplicated - now `cargo test` would try running tests on 2 entry points: lib.rs and indexedlog_dump.rs. Move it to a separate crate to solve the issue. Reviewed By: markbt Differential Revision: D13498266 fbshipit-source-id: 8abf07c1272dfa825ec7701fd8ea9e0d1310ec5f	2018-12-18 08:17:21 -08:00
Jun Wu	61b1a5f475	indexedlog: fix rustc warnings Summary: `write!` result needs to be used. Reviewed By: markbt Differential Revision: D13471967 fbshipit-source-id: d48752bcac05dd33b112679d7faf990eb8ddd651	2018-12-17 12:10:52 -08:00
Jun Wu	421c7b3f45	indexedlog: add a tool to dump indexedlog content Summary: The tool can dump indexedlog content. Useful for manually investigating issues. Reviewed By: DurhamG Differential Revision: D13051387 fbshipit-source-id: 8687a1aa9dfb54776e80f184208c49da2492c34d	2018-12-06 14:57:52 -08:00
Jun Wu	54dc931140	indexedlog: use inlined leaf entries to further reduce index size Summary: Add a new entry type - INLINE_LEAF, which embeds the EXT_KEY and LINK entries to save space. The index size for referred keys is significantly reduced with little overhead: index insertion (owned key) 3.732 ms index insertion (referred key) 3.604 ms index flush 11.868 ms index lookup (memory) 1.159 ms index lookup (disk, no verify) 2.175 ms index lookup (disk, verified) 4.303 ms index size (5M owned keys) 216626039 index size (5M referred keys) 96616431 11.87s user 2.96s system 98% cpu 15.107 total The breakdown of the "5M referred keys" size is: type count bytes radixes 1729472 33835772 inline_leafs 5000000 62780651 There are no other kinds of entries stored. Previously, the index size of referred keys is: index size (5M referred keys) 136245815 bytes So it's 136MB -> 96MB, 40% decrease. Reviewed By: DurhamG Differential Revision: D13036801 fbshipit-source-id: 27e68e4b6c332c1dc419abc6aba69271952e4b3d	2018-12-06 14:57:52 -08:00
Jun Wu	a4958163ee	indexedlog: optimize size of radix entries (BC) Summary: Replace the 20-byte "jump table" with 3-byte "flag + bitmap". This saves space for indexes less than 4GB. There are some reserved bits in the "flag" so if we run into space issues when indexes are larger than 4GB, we can try adding 6-byte integer, or VLQ back without breaking backwards-compatibility. It seems to hurt flush performance a bit, because we have to scan the child array twice. However, lookup (the most important performance) does not change much. And the index is more compact. After: index flush 19.644 ms index lookup (disk, no verify) 2.220 ms index lookup (disk, verified) 4.067 ms index size (5M owned keys) 216626039 bytes index size (5M referred keys) 136245815 bytes Before: index flush 16.764 ms index lookup (disk, no verify) 2.205 ms index lookup (disk, verified) 4.030 ms index size (5M owned keys) 240838647 bytes index size (5M referred keys) 160458423 bytes For the "referred key" case, it's 160->136MB, 17% decrease. A detailed break down of components of index is: After: type count bytes (using owned keys) radixes 1729472 33835772 links 5000000 27886336 leafs 5000000 44629384 keys 5000000 110000000 type count bytes (using referred keys) radixes 1729472 33835772 links 5000000 27886336 leafs 5000000 44629384 ext_keys 5000000 29894315 Before: type count bytes (using owned keys) radixes 1729472 58048380 links 5000000 27886336 leafs 5000000 44903923 keys 5000000 110000000 type count bytes (using referred keys) radixes 1729472 58048380 links 5000000 27886336 leafs 5000000 44629384 ext_keys 5000000 29894315 Leaf nodes are taking too much space. It seems the next big optimization might be inlining ext_keys into leafs. Reviewed By: DurhamG, markbt Differential Revision: D13028196 fbshipit-source-id: 6043b16fd67a497eb52d20a17e153fcba5cb3e81	2018-12-06 14:57:52 -08:00
Jun Wu	d8117b3b04	indexedlog: increase key count for size test Summary: Since the size test only runs once, we can use a larger number of keys. This is closer to some production use-cases. `cargo bench size` shows: index size (5M owned keys) 240838647 index size (5M referred keys) 160458423 It currently uses 32 bytes per key for 5M referred keys. Reviewed By: markbt Differential Revision: D13027880 fbshipit-source-id: 726f5fb2da056e77ab93d82fda9f1afa500d0a8d	2018-12-06 14:57:52 -08:00
Jun Wu	55b6331aa4	indexedlog: add more benchmarks Summary: Add benchmarks about index sizes, and a benchmark of insertion using key references. An example `cargo bench` result running on my devserver looks like: index insertion (owned key) 3.551 ms index insertion (referred key) 3.713 ms index flush 20.648 ms index lookup (memory) 1.087 ms index lookup (disk, no verify) 2.041 ms index lookup (disk, verified) 4.347 ms index size (owned key) 886010 index size (referred key) 534298 Reviewed By: markbt Differential Revision: D13027879 fbshipit-source-id: 70644c504026ffee2122d857d5035f5b7eea4f42	2018-12-06 14:57:52 -08:00
Jun Wu	d7129256d4	indexedlog: switch checksum table to little endian (BC) Summary: For checksum values like xxhash, there is no benefit using big endian. Switch to little endian so it's slightly slightly faster on the major platforms we care about. This is a breaking change. However, the format is not used in production yet. So there is no migration code. Reviewed By: markbt Differential Revision: D13015465 fbshipit-source-id: ca83d19b3328370d089b03a33e848e64b728ef2a	2018-12-06 14:57:52 -08:00
Jun Wu	75b4f92c44	indexedlog: support different checksum functions for Log entries (BC) Summary: Previously, the format of an Log entry is hard-coded - length, xxhash, and content. The xxhash always takes 8 bytes. For small (ex. 40-byte) entries, xxhash32 is actually faster and takes less disk space. Introduce the "entry flags" concept so we can store some metadata about what checksum function to use. The concept could be potentially used to support other new format changes at per entry level in the future. As we're here, also support data without checksums. That can be useful for content with its own checksum, like a blob store with its own SHA1 integrity check. Performance-wise, log insertion is slower (but the majority insertaion overhead would be on the index part), iteration is a little bit faster, perhaps because the log can use less data. Before: log insertion 15.874 ms log iteration (memory) 6.778 ms log iteration (disk) 6.830 ms After: log insertion 18.114 ms log iteration (memory) 6.403 ms log iteration (disk) 6.307 ms Reviewed By: DurhamG, markbt Differential Revision: D13051386 fbshipit-source-id: 629c251633ecf85058ee7c3ce7a9f576dfac7bdf	2018-12-06 14:57:52 -08:00
Jun Wu	049cd99f05	indexedlog: use non-VLQ encoding for xxhash (BC) Summary: Xxhash result won't usually have leading zeros. So VLQ encoding is not an efficient choice. Use non-VLQ encoding instead. Performance wise, this is noticably faster than before: log insertion 14.161 ms log insertion with index 102.724 ms log flush 11.336 ms log iteration (memory) 6.351 ms log iteration (disk) 7.922 ms 10.18s user 3.66s system 97% cpu 14.218 total log insertion 13.377 ms log insertion with index 97.422 ms log flush 11.792 ms log iteration (memory) 6.890 ms log iteration (disk) 7.139 ms 10.20s user 3.56s system 97% cpu 14.117 total log insertion 14.573 ms log insertion with index 94.216 ms log flush 18.993 ms log iteration (memory) 7.867 ms log iteration (disk) 7.567 ms 9.85s user 3.73s system 96% cpu 14.073 total log insertion 15.526 ms log insertion with index 98.868 ms log flush 19.600 ms log iteration (memory) 7.533 ms log iteration (disk) 7.150 ms 10.13s user 4.02s system 96% cpu 14.647 total log insertion 14.629 ms log insertion with index 100.449 ms log flush 20.997 ms log iteration (memory) 7.299 ms log iteration (disk) 7.518 ms 10.14s user 3.65s system 96% cpu 14.274 total This is a format-breaking change. Fortunately we haven't really use the old format in production yet. Reviewed By: DurhamG, markbt Differential Revision: D13015463 fbshipit-source-id: 6e7e4f7a845ea8dbf0904b3902740b65cc7467d5	2018-12-06 14:57:52 -08:00
Jun Wu	42c3ef6eb6	indexedlog: add benchmark for "log" Summary: Some simple benchmark for "log". The initial result running from my devserver looks like: log insertion 33.146 ms log insertion with index 106.449 ms log flush 9.623 ms log iteration (memory) 10.644 ms log iteration (disk) 11.517 ms 13.75s user 3.61s system 97% cpu 17.778 total log insertion 27.906 ms log insertion with index 107.683 ms log flush 19.204 ms log iteration (memory) 10.239 ms log iteration (disk) 11.118 ms 12.89s user 3.55s system 97% cpu 16.924 total log insertion 31.645 ms log insertion with index 109.403 ms log flush 9.416 ms log iteration (memory) 10.226 ms log iteration (disk) 10.757 ms 13.07s user 3.02s system 97% cpu 16.423 total log insertion 31.848 ms log insertion with index 109.332 ms log flush 18.345 ms log iteration (memory) 10.709 ms log iteration (disk) 11.346 ms 13.12s user 3.70s system 97% cpu 17.276 total log insertion 29.665 ms log insertion with index 106.041 ms log flush 16.159 ms log iteration (memory) 10.367 ms log iteration (disk) 11.110 ms 12.99s user 3.27s system 97% cpu 16.717 total Reviewed By: markbt Differential Revision: D13015464 fbshipit-source-id: 035fee6c8b6d0bea4cfe194eed3d58ba4b5ebcb8	2018-12-06 14:57:52 -08:00
Durham Goode	1a3a0bcd72	nodemap: add key iteration Summary: An upcoming diff will need the ability to iterate over all the keys in the store. So let's expose that functionality. Reviewed By: quark-zju Differential Revision: D13062575 fbshipit-source-id: a173fcdbbf44e2d3f09f7229266cca6f3e67944b	2018-12-06 11:47:41 -08:00
Durham Goode	668ba5165c	indexedlog: add an iterator function for iterating over keys Summary: You can currently iterate over indexlog entries, but there's no way to iterate over the keys without keeping a copy of the index function with you. Let's add a key iterator function. Reviewed By: quark-zju Differential Revision: D13010744 fbshipit-source-id: 1fcaf959ae82417e5cbafae7c1927c3ae8f8e76a	2018-12-06 11:47:41 -08:00

1 2 3

137 Commits