speedscope

mirror of https://github.com/jlfwong/speedscope.git synced 2024-11-22 12:53:23 +03:00

Author	SHA1	Message	Date
Jamie Wong	3f3da22853	Update README.md - Add link to Hermes import instructions - Re-order import sources higher in the list - Update first line	2023-12-27 10:48:07 -05:00
Jamie Wong	bea0ef6913	1.18.0	2023-12-26 17:12:59 -05:00
Zachary Marion	4feb1e564d	Add hermes-specific support for the trace event format (#458 ) Profiles that are transformed into the hermes trace event format are guaranteed to have specific arguments that supply metadata that is useful for debugging - namely the filename, line + col number that the function call originated from, with sourcemaps applied. This PR adds specific support for this information to the trace event importer. This means that we can have the Frame name be just the name of the function, since we know all the information we want to be displayed in the UI is captured in the frame info, which makes the traces cleaner to look at. \| Before \| After \| \|-----\|-----\| \| <img width="1728" alt="Screenshot 2023-12-26 at 2 40 01 PM" src="https://github.com/jlfwong/speedscope/assets/9957046/f9dff608-5df3-4098-b1f8-91a69185d906"> \| <img width="1726" alt="Screenshot 2023-12-26 at 2 39 13 PM" src="https://github.com/jlfwong/speedscope/assets/9957046/b8ff360e-a316-4bef-8ebc-620c9ff1a998"> \| \| <img width="1728" alt="Screenshot 2023-12-26 at 2 41 03 PM" src="https://github.com/jlfwong/speedscope/assets/9957046/127a49b5-458e-4ac8-934a-202e565cb20f"> \| <img width="1728" alt="Screenshot 2023-12-26 at 2 41 29 PM" src="https://github.com/jlfwong/speedscope/assets/9957046/ebb285ce-6b33-4535-8e45-b9ada4e4d97f"> \|	2023-12-26 17:00:18 -05:00
Jamie Wong	88f4fe6105	Update README-ADMINS.md with npm login instructions (#457 )	2023-12-25 22:52:46 -05:00
Jamie Wong	60f1812e51	1.17.0	2023-12-25 22:40:40 -05:00
Jamie Wong	1717fecafb	Upgrade prettier, update prettier & react-hooks eslint plugins (#456 ) Re-ran prettier with latest version	2023-12-25 22:37:45 -05:00
Jamie Wong	c296f530c7	Upgrade typescript & eslint to latest, fix resulting errors (#455 ) Also updated the ci.yml node test versions to 18.x and 20.x, given current support: https://endoflife.date/nodejs	2023-12-25 22:23:33 -05:00
Jamie Wong	8e0fa58d65	Re-enable eslint prettier rule after being accidentally disabled for 3 years (#454 ) It looks like in #267 (which was 3 years ago!), I accidentally disabled prettier linting altogether 😱 https://github.com/jlfwong/speedscope/pull/267/files#diff-e2954b558f2aa82baff0e30964490d12942e0e251c1aa56c3294de6ec67b7cf5 There's no comment in that PR about this being an intentional thing, so I have to assume this was a dumb mistake.	2023-12-25 21:22:56 -05:00
Zachary Marion	b21480494e	Support the chrome JSON trace format (allows viewing of hermes traces) (#453 )	2023-12-25 21:11:45 -05:00
Zachary Marion	dfd3a0dfb3	Fix bug in selectQueueToTakeFromNext for trace profiles (#450 ) I have been taking a lot of profiles using the Hermes profiler, but I noticed that they sometimes do not show up properly. After debugging what exactly was going on, I realized it was because the logic in `selectQueueToTakeFromNext` only checks for name, instead of the key for the event. I had a bunch of events with the name `anonymous` that were getting improperly exited before they should have been due to this logic. This fix makes the code more robust if there are added "args" which differentiate one event from another (as is the case in Hermes profiles); however, it would still be an issue if the key just defaults to the name. Example profile before: <img width="1728" alt="Screenshot 2023-12-15 at 12 54 04 AM" src="https://github.com/jlfwong/speedscope/assets/9957046/345f556e-f944-40f1-b59c-748133acb950"> What it should look like (in Perfetto): <img width="1051" alt="Screenshot 2023-12-15 at 8 51 38 AM" src="https://github.com/jlfwong/speedscope/assets/9957046/7473cdf8-95f1-49de-a0c7-ef4ac081ff85"> After the fix: <img width="1728" alt="Screenshot 2023-12-15 at 12 54 29 AM" src="https://github.com/jlfwong/speedscope/assets/9957046/56b0870a-538b-4916-acc8-de2b7dfd78eb">	2023-12-16 00:27:03 -08:00
Jamie Wong	ac4a015559	Add bounds checking for sampleTypeIndex (#449 ) Wow this was surprising. As reported in #448, the `simple.speedscope.json` file failed import. This was surprising to me because there's a test that covers that file to ensure it imports correctly. The file provided in #448, however, is from a version of speedscope from 5 years ago before the file had been pretty printed. It turns out that when this particular file is not pretty-printed, it's a schematically valid pprof profile. The fix is to do some bounds checks and return null. After the change, the file imports as you'd expect after realizing it's not actually a valid pprof profile. Fixes #448	2023-12-07 18:43:47 -08:00
byronhe	de17f128d0	Update README-zh_CN.md (#442 ) fix url	2023-09-04 09:54:35 -07:00
Jamie Wong	304ccf33ab	Update publish-and-deploy to remove automated release creation (#440 ) Turns out the `--attach` command was hallucinated by GPT!	2023-07-16 15:31:31 -07:00
Jamie Wong	650e505b55	1.16.0	2023-07-16 03:04:29 -07:00
Jamie Wong	8dad28e5e2	Automate more of the release process (#439 ) The publish, deploy, and release process is annoying enough at the moment that I avoid doing it frequently. Let's automate most of it to reduce the friction	2023-07-16 03:01:50 -07:00
Nguyễn Văn Đức	f62519ab69	Improve profile builder performance (#437 ) Follow up on PR #435. Currently, it takes roughly 22 seconds to load my 1.3GB file. After inspecting the profiler, I found that a large chunk of time is spent in Frame.getOrInsert. I figured we could reduce the number of invocations by half. This reduces the load time to roughly 18 seconds. I also tested with a smaller file (~350MB), and it shows similar gains, about 15-20%	2023-06-28 22:11:12 -07:00
Nguyễn Văn Đức	984bf1296a	Fix crash when importing big linux perf tool files (#435 ) Currently, importing files generated by the linux perf tool in which some blocks exceed the V8 string length limit can crash the application. This issue is similar to the one in #385. This PR fixes it by changing parseEvents to work directly with lines instead of chunking lines into blocks first. Fixes #433	2023-06-28 01:30:08 -07:00
Nguyễn Văn Đức	26884c116c	Improve splitLines: return iterator instead (#434 ) The current behavior of splitLines is to eagerly split all the lines and return an array of strings. This PR improves this by returning an iterator instead, which will emit lines. This lets callers decide how to best use the splitLines function (i.e. lazily enumerate over lines) Relates to #433	2023-06-27 23:27:15 -07:00
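A minimal sketch of the lazy line-splitting idea described in the entry above, assuming a plain in-memory string as input. The name `splitLinesLazily` and the CRLF handling are illustrative assumptions, not the actual speedscope implementation.

```typescript
// Hypothetical sketch: yield lines one at a time instead of eagerly
// building an array with every line of the input.
function* splitLinesLazily(text: string): Generator<string> {
  let start = 0
  while (start < text.length) {
    const newline = text.indexOf('\n', start)
    const end = newline === -1 ? text.length : newline
    let line = text.slice(start, end)
    // Assumption: treat CRLF line endings the same as LF.
    if (line.endsWith('\r')) line = line.slice(0, -1)
    yield line
    start = end + 1
  }
}
```

Because the function is a generator, a caller can stop iterating early (for example after inspecting the first few lines to detect a file format) without paying for the rest of the split.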
Jamie Wong	bb063e49e3	Fix trimTextMid (#431 ) There was a subtle bug in `trimTextMid` caused by calling substring methods with non-integer values. This happens because `findValueBisect` returns non-integer values, and there was no special handling of this. The bug results in strings cutting off many of the last few characters in a string, rather than always displaying it when possible. Before: <img width="368" alt="image" src="https://github.com/jlfwong/speedscope/assets/150329/754a25f1-a6f7-46f1-8e34-059503d9e4cf"> After: <img width="386" alt="image" src="https://github.com/jlfwong/speedscope/assets/150329/b2688ca1-54af-4d2e-b704-9f3322d2e5b4"> Fixes #411	2023-06-23 13:22:53 -07:00
xieve	693545b77a	Added support for Papyrus profiles (#428 ) Fixed #427	2023-06-23 13:02:08 -07:00
Jamie Wong	b3b4b1492a	1.15.2	2023-06-21 17:28:24 -07:00
Jamie Wong	9cdceede15	Support showing pprof lines from the pprof Line object (take 2) (#430 ) The previous behavior was to use the StartLine of a function as the line number to show in speedscope. However, the Line object has more precise line information, and we should only fallback to StartLine if we don't have this more detailed information. Looking at the [documentation for the pprof proto](https://github.com/google/pprof/tree/main/proto#general-structure-of-a-profile), this is more how it intends to interpret line information: > location: A unique place in the program, commonly mapped to a single instruction address. It has a unique nonzero id, to be referenced from the samples. It contains source information in the form of lines, and a mapping id that points to a binary. > function: A program function as defined in the program source. It has a unique nonzero id, referenced from the location lines. It contains a human-readable name for the function (eg a C++ demangled name), a system name (eg a C++ mangled name), the name of the corresponding source file, and other function attributes. Here is a sample profile that had line-level info on the Line object of the profile: Before: <img width="449" alt="Screen Shot 2022-11-03 at 11 17 11 AM" src="https://user-images.githubusercontent.com/618615/199760730-712daa70-6cfb-4e90-b037-b571809c26d9.png"> After: <img width="449" alt="Screen Shot 2022-11-03 at 11 17 22 AM" src="https://user-images.githubusercontent.com/618615/199760777-1c0d5581-7b29-42b7-b642-6035f7d25405.png">	2023-06-21 15:25:16 -07:00
Manuel Correa	741fdeb427	Stackprof: weight on-cpu samples by period rather than timestamp delta (#425 ) This attempts to improve the quality of the on-CPU profiles stackprof provides. Rather than weighting samples by their timestamp deltas, which, in our opinion, are only valid in wall-clock mode, this weights callchains by: ``` S = number of samples P = sample period in nanoseconds W = S * P ``` The difference after this change is quite substantial, especially in profiles that previously were showing up with heavy IO frames: * Total profile weight is down by almost 90%, which actually makes sense for an on-CPU profile if the app is relatively idle * Certain callchains that blocked in syscalls / IO are now much lower weight. This was what I was expecting to find. Here is an example of the latter point. In delta mode, we see an io select taking a long time; it is a significant portion of the profile: <img width="1100" alt="236936508-709bee01-d616-4246-ba74-ab004331dcd3" src="https://github.com/dalehamel/speedscope/assets/4398256/39140f1e-50a9-4f33-8a61-ec98b6273fd4"> But in period scaling mode, it is only a couple of sample periods ultimately: <img width="206" alt="236936693-9d44304e-a1c2-4906-b3c8-50e19e6f9f27" src="https://github.com/dalehamel/speedscope/assets/4398256/7d19077f-ef25-4d79-980b-cfa1775d928d">	2023-06-17 20:50:19 -07:00
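The period-based weighting in the entry above (W = S * P) can be sketched as follows. The `CallchainSamples` shape and the function names are hypothetical and exist only for illustration; they are not taken from the stackprof importer.

```typescript
// Hypothetical shape: how many samples were observed for one callchain.
interface CallchainSamples {
  stack: string[]
  sampleCount: number // S in the formula
}

// W = S * P, where P is the sample period in nanoseconds.
function onCpuWeightNanos(sampleCount: number, periodNanos: number): number {
  return sampleCount * periodNanos
}

// Total on-CPU weight of a profile under period scaling.
function totalWeightNanos(chains: CallchainSamples[], periodNanos: number): number {
  return chains.reduce((sum, c) => sum + onCpuWeightNanos(c.sampleCount, periodNanos), 0)
}
```

Under this scheme a callchain that blocked in IO for a long wall-clock interval but was sampled only twice contributes just two sample periods, which matches the behaviour the entry describes.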
Jake Zimmerman	e9133be353	Use `frame.name?.startsWith` for stackprof (#419 ) Sometimes, stackprof frames don't get generated with a `name` in the frame. I think it's probably worth tracking down why that is, but in the mean time, speedscope simply crashes with a method call on `undefined`. The crash is bad because it only shows up in the console--there's no visible message saying that speedscope failed to parse and load a profile. For more information, see #378 This fixes the crash by simply skipping the logic in demangle if the name field isn't present on a frame. That's probably a fine tradeoff? Because in this case, stackprof is generating ruby frames, which means that C++ name demangling won't apply. I have tested this by running the scripts/prepare-test-installation.sh script and verifying that `bin/cli.js` can now successfully load the included profile. Before these changes, I verified that speedscope failed with the behavior mentioned in #378. I've also included a snapshot test case, but it seems that the Jest test harness only tests the parsing, not the rendering (correct me if I'm wrong). So I haven't actually been able to create an automated test that would catch a regression. Please let me know if there's a better way to have written this test. I've staged the commits on this branch so that the second commit (`dcb9840`) showcases the minimal diff to a stackprof file that reproduces the bug. That is, rather than look at the thousands of new lines in the stackprof profile, you can view the second commit to see the salient part of the file.	2023-06-15 01:30:53 -07:00
Dave Vasilevsky	fcc1fa5689	fix pprof defaultSampleType (#424 ) Fixes #415 * Interprets pprof's defaultSampleType as an index into the string table [as documented in the proto](`0414c2f617/src/import/profile.proto (L85)`), not as an index into the sample types repeated-field. This allows parsing to succeed when the string-table index is not a valid sampleTypes index, which is common on allocation profiles. * Update the pprof snapshot. This is necessary because we were previously interpreting an empty defaultSampleType as truthy-but-zero when long.js is present, i.e. the first sample-type. But Speedscope-in-the-browser doesn't seem to include long.js, so our tests were disagreeing with in-browser behavior. With this PR, that should be fixed.	2023-06-15 01:14:47 -07:00
Jamie Wong	0414c2f617	1.15.1	2023-06-04 04:14:07 -07:00
Jamie Wong	f23f65b3af	Callgrind: Subposition compression and weight correction (#423 ) This fixes a number of bugs with callgrind import. Dealing with this file format is a big pain because the documentation on https://www.valgrind.org/docs/manual/cl-format.html doesn't contain enough examples to disambiguate some of the behaviour, and because there's a fundamental impedance mismatch between call-trees and call-graphs. In any case, after this PR, the behavior of callgrind file import is much better. The file provided in #414 now imports correctly and, as far as I can tell, displays the same weights as what I see in KCacheGrind. Some of the key changes: - Implementing subposition compression. This was just a TODO in the code that was never implemented - Fixing a misinterpretation of how `fe` and `fi` were intended to be used. Previously, I was using it to change the filename of a symbol, meaning that an `fi` or an `fe` line in the middle of a block describing costs for an `fn` would split a node in the call-graph into multiple nodes causing all manners of problems - Fixing a bug where `cfn` was persisting beyond a single call, also resulting in call graph nodes being split when they shouldn't be Fixes #414	2023-06-04 04:06:22 -07:00
Jamie Wong	8da9088ec1	Fix import from Chrome Devtools performance tab in Chrome >= 114 (#422 ) The file format used by the Chrome Devtools performance tab periodically changes. It uses the Chrome trace event format (https://docs.google.com/document/d/1CvAClvFfyA5R-PhYUmn5OOQtYMH4h6I0nSsKchNAySU/preview). This format, however, has two different types: one is `TraceEvent[]`, the other is `{traceEvents: TraceEvent[]}`. The importer for non-Chrome devtools profiles already handled this, but the one for Chrome Devtools didn't because Chrome < 114 never used it. It seems like they changed the file format. This PR addresses that change. Fixes #420	2023-06-03 22:35:13 -07:00
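The two container shapes mentioned in the entry above can be normalized with a small helper. This is a sketch under assumptions: the `TraceEvent` interface here lists only a few illustrative fields, and `getTraceEvents` is a hypothetical name rather than the importer's real function.

```typescript
// Illustrative subset of a trace event; real events carry more fields.
interface TraceEvent {
  name?: string
  ph: string
  ts?: number
}

// The format allows either a bare array or an object wrapping the array.
type TraceFile = TraceEvent[] | {traceEvents: TraceEvent[]}

// Return the event list regardless of which container shape was used.
function getTraceEvents(file: TraceFile): TraceEvent[] {
  return Array.isArray(file) ? file : file.traceEvents
}
```

Normalizing once at the entry point means the rest of an importer only has to deal with `TraceEvent[]`.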
Jamie Wong	81a6f29ad1	1.15.0	2022-10-22 00:18:24 +08:00
Jamie Wong	263f7d513e	Update package.json to use upstream version of uint8array-json-parser (#408 ) In #385, I introduced a dependency on the `uint8array-json-parser` npm package, but used a fork because of a typescript error. This was resolved in evanw/uint8array-json-parser#1 and published as part of `uint8array-json-parser@0.0.2`. Let's use the upstream. This also conveniently fixes a new typechecking error that was preventing deployment. The error looked like this: ``` src/import/utils.ts(2,26): error TS2306: File '\''/Users/jlfwong/code/speedscope/node_modules/uint8array-json-parser/uint8array-json-parser.ts'\'' is not a module.' ``` After updating to the upstream, the problem is fixed.	2022-10-22 00:12:45 +08:00
Jamie Wong	1bce806933	Replace fuzzy matching with exact substring matching for finding matching frames (#407 ) In #297, I re-used the fuzzy matching logic I implemented in #282 for profile selection. Based on feedback from several people in #352, this is surprising behavior. Upon reflection, this should have been obvious -- I hijacked the Ctrl/Cmd+F browser behaviour, so I should try to replicate the expected behaviour there as closely as possible. Given more patience, I also would've done some user research :) This PR updates this logic to try to more closely match browser behaviour. This means case-insensitive, exact-substring matching. I've left the fuzzy matching alone for profile selection since that doesn't attempt to mimic browser behaviour. The non-fuzzy matching feels slightly odd to me given the filtering behaviour on the sandwich view, but I think consistency across this find UI is important. Here are the before & after results when searching for the string "ca" in the example profile. \|Before\|After\| \|-\|-\| \|<img width="1791" alt="image" src="https://user-images.githubusercontent.com/150329/197232741-6d1d7a8a-8b8c-4a4f-98e3-2c043fd7efd5.png">\|<img width="1789" alt="image" src="https://user-images.githubusercontent.com/150329/197232694-82697b68-ca15-49e7-887b-2606646ee5e9.png">\| Fixes #352 Supersedes #403	2022-10-21 23:53:46 +08:00
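A minimal sketch of the case-insensitive, exact-substring matching described in the entry above. `findMatchRanges` is a hypothetical name, and the choice to return non-overlapping half-open ranges is an assumption made for illustration.

```typescript
// Return [start, end) index ranges of every non-overlapping, case-insensitive
// occurrence of `query` in `text`. An empty query matches nothing.
function findMatchRanges(text: string, query: string): [number, number][] {
  if (query.length === 0) return []
  // Assumption: lowercasing preserves string length, which holds for ASCII
  // but not for every Unicode character.
  const haystack = text.toLowerCase()
  const needle = query.toLowerCase()
  const ranges: [number, number][] = []
  let from = 0
  while (true) {
    const index = haystack.indexOf(needle, from)
    if (index === -1) break
    ranges.push([index, index + needle.length])
    from = index + needle.length
  }
  return ranges
}
```

Unlike fuzzy matching, a query such as "ca" here only matches frames that contain the letters "c" and "a" adjacent to each other, which is what a browser's find-in-page does.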
Jamie Wong	6493c5f66f	Update deploy script to python3	2022-07-30 23:20:36 -07:00
Evan Wallace	639dae322b	Add support for cycle-based Instruments deep copy (#400 ) Unlike the Time Profiler, the CPU Profiler in Instruments uses `cycles` for units instead of `ms`: <img width="872" src="https://user-images.githubusercontent.com/406394/175755999-289cb7c0-f29a-44b1-b00e-b55ef17ee303.png"> Currently Speedscope fails to import the data with the following error in the console: ``` Failed to load format Error: Unrecognized units Gc ``` This PR adds support for `cycles` as a unit to the Instruments deep copy importer as well as `Kc`, `Mc`, and `Gc`, which I'm assuming are increasing in multiples of 1000. Hopefully I've added support for this correctly and this PR is helpful.	2022-07-02 22:01:08 -04:00
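The unit handling described in the entry above might look like the sketch below. It follows the entry's own stated assumption that `Kc`, `Mc`, and `Gc` increase in multiples of 1000; the table and function names are hypothetical.

```typescript
// Assumed multipliers, following the entry's guess that each unit is 1000x the last.
const CYCLE_UNIT_MULTIPLIERS: Record<string, number> = {
  cycles: 1,
  Kc: 1e3,
  Mc: 1e6,
  Gc: 1e9,
}

// Convert a value expressed in one of the units above into raw cycles.
function toCycles(value: number, unit: string): number {
  const multiplier = CYCLE_UNIT_MULTIPLIERS[unit]
  if (multiplier === undefined) {
    throw new Error(`Unrecognized units ${unit}`)
  }
  return value * multiplier
}
```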
Jamie Wong	33a8f3f313	1.4.0	2022-05-19 01:37:56 -07:00
Jamie Wong	7ae545a6c3	Improve HoverTip placement logic (#395 ) This changes the HoverTip placement logic to use measurements from the actual DOM node rather than basing everything on the maximum sizes. This avoids some counter-intuitive behaviour, most importantly situations where the label would overflow off the left side of the screen for no obvious reason. Fixes #394 Fixes #256	2022-05-17 13:42:13 -07:00
David Judd	48d692c2a3	Add a hash param to control view-mode (#362 ) e.g. "view=left-heavy", "view=sandwich" Fixes #355	2022-05-17 00:15:51 -07:00
Alex Coco	ca8fcb48cc	Support stackprof object mode (#391 ) This PR attempts to support stackprof's object mode which tracks the number of allocated objects. This differs from the other modes (cpu and wall) by taking samples every time a Ruby object is allocated using Ruby's [`NEWOBJ` tracepoint](`df24b85953/ext/stackprof/stackprof.c (L198-L199)`). When importing an object mode profile into speedscope today it still works but what you see is a profile using time units. The profile will only have samples for when an object was allocated which means even if time is reported, the profile is not really meaningful when looking at time. To address this I've done three things when `mode` is `object`: + adjusted the total size of the `StackListProfileBuilder` to use the number of samples (since each sample is one allocation) + adjusted the weight of each sample to be `nSamples` (which I believe is always `1` but I'm not positive) + do not set the value formatter to a time formatter Here's what it looks like before and after my changes (note the units and weight of samples): wall (before) \| object (before) \| object (after) -- \| -- \| -- <img width="1624" alt="Screen Shot 2022-05-11 at 4 51 31 PM" src="https://user-images.githubusercontent.com/898172/167945635-2401ca73-4de7-4559-b884-cf8947ca9738.png"> \| <img width="1624" alt="Screen Shot 2022-05-11 at 4 51 34 PM" src="https://user-images.githubusercontent.com/898172/167945641-ef302a60-730b-4afd-8e44-5f02e54b3cb7.png"> \| <img width="1624" alt="Screen Shot 2022-05-11 at 4 51 42 PM" src="https://user-images.githubusercontent.com/898172/167945643-5611b267-f8b2-4227-a2bf-7145c4030aa2.png"> <details> <summary>Test code</summary> ```ruby require 'stackprof' require 'json' def do_test 5.times do make_a_word end end def make_a_word ('a'..'z').to_a.shuffle.map(&:upcase).join end StackProf.start(mode: :object, interval: 1, raw: true) do_test StackProf.stop File.write('tmp/object_profile.json', JSON.generate(StackProf.results)) 
StackProf.start(mode: :wall, interval: 1, raw: true) do_test StackProf.stop File.write('tmp/wall_profile.json', JSON.generate(StackProf.results)) ``` </details>	2022-05-17 00:05:49 -07:00
Dan Vanderkam	63f3bc0395	Support relative URLs (#357 ) Fixes #312 This turns out not to be very deep: you have to pass an optional second parameter to the [`URL` constructor](https://developer.mozilla.org/en-US/docs/Web/API/URL/URL) to resolve relative URLs. ``` > new URL('/path/to/file#hashcode').pathname VM252:1 Uncaught TypeError: Failed to construct 'URL': Invalid URL at <anonymous>:1:1 (anonymous) @ VM252:1 > new URL('/path/to/file#hashcode', 'http://example.com/').pathname "/path/to/file" ```	2022-05-17 00:01:02 -07:00
Tobias Koppers	1ac88cc09a	add file and line information (#365 ) * add file and line to tooltips * add file and line to anonymous methods ## Before: ![image](https://user-images.githubusercontent.com/1365881/134863173-0d5635e8-1884-4276-a2cc-b0b7af5a579b.png) ![image](https://user-images.githubusercontent.com/1365881/134863456-f56aca3b-2742-4194-ba1e-823ff871a316.png) While you were able to get this info by clicking in the "Time Order" and "Left Heavy" views, it was impossible to get in the sandwich view. ## After: ![image](https://user-images.githubusercontent.com/1365881/134863282-6719db20-e528-4eb5-b876-b35f99c34da4.png) ![image](https://user-images.githubusercontent.com/1365881/134863365-b6a550d4-56d8-4cf0-a17f-f682f1d2fe57.png)	2022-05-16 23:59:17 -07:00
Jamie Wong	9a2c2a270b	Add PHP import instructions (fixes #368 )	2022-05-16 23:24:14 -07:00
Jamie Wong	229d48eca5	Update README-zh_CN.md to reflect changes in `103db68`	2022-05-16 23:18:11 -07:00
Joe Rickerby	103db681d2	Add link to pyinstrument wiki page (#377 )	2022-05-16 23:14:25 -07:00
Jamie Wong	21167e69d8	Support importing profiles whose contents exceed V8s maximum string size (#385 ) Browsers have a limit on how big you can make strings. In Chrome on a 64 bit machine, this is around 512MB, which explains why in #340, a 600MB file fails to load. To work around this issue, we avoid making strings this large. To do so, we need two core changes: 1. Instead of sending large strings as the import mechanism to different file format importers, we introduce a new `TextFileContent` interface which exposes methods to get the lines in the file or the JSON representation. In the case of line splitting, we assume that no single line exceeds the 512MB limit. 2. We introduce a dependency on https://github.com/evanw/uint8array-json-parser to allow us to parse JSON files contained in `Uint8Array` objects To ensure that this code doesn't code rot without introducing 600MB test files or test file generation into the repository, we also re-run a small set of tests with a mocked maximum string size of 100 bytes. You can see that the chunked string representation code is getting executed via test coverage. Fixes #340	2022-05-16 23:11:13 -07:00
轩灵	e37f6fa7c3	Add `README-zh_CN.md` file (#364 )	2021-09-22 11:20:25 -07:00
Daniel Giger	6d02bf510f	Fix typo in README (#360 )	2021-08-09 23:20:20 -07:00
Jamie Wong	b71cef5db4	Bump TypeScript to 4.3.2 (#343 ) * Bump TypeScript to 4.3.2 * Bump eslint deps * Fix eslint errors from upgrade	2021-03-28 16:08:15 -07:00
Jamie Wong	e6351a3c22	Remove accidentally checked-in vscode settings	2021-03-28 15:32:09 -07:00
Gabriele N. Tornetta	36aebfbda6	Allow collapsed stacks with invalid lines (#336 ) Ingest files containing collapsed stacks and tolerate invalid lines, like FlameGraph does. Some files might contain lines starting with a # to add comments to the collected samples. Speedscope should still attempt to parse these files as collapsed stacks and only keep the samples that it can find. Only fail if there are no samples reported.	2021-03-28 15:30:36 -07:00
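A sketch of the tolerant collapsed-stack parsing described in the entry above, assuming the usual `frame;frame;frame weight` line layout. The function name, the `CollapsedSample` shape, and the decision to skip every line starting with `#` are illustrative assumptions, not the importer's actual code.

```typescript
// Hypothetical result shape for one parsed line.
interface CollapsedSample {
  stack: string[]
  weight: number
}

// Keep every line that parses as "a;b;c <count>", skip the rest, and only
// report failure (null) when no samples at all were found.
function parseCollapsedStacks(lines: Iterable<string>): CollapsedSample[] | null {
  const samples: CollapsedSample[] = []
  for (const line of lines) {
    // Assumption: lines beginning with '#' are comments.
    if (line.startsWith('#')) continue
    const match = /^(.*) (\d+)$/.exec(line)
    if (!match) continue // tolerate invalid lines instead of aborting
    samples.push({stack: match[1].split(';'), weight: parseInt(match[2], 10)})
  }
  return samples.length > 0 ? samples : null
}
```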
Jamie Wong	246fc3dd5d	Remove redux in favor of a small recoil-inspired "atom" library (#341 ) This is an experiment in replacing redux entirely with a tiny library I wrote for global application state management. Redux has been okay, but all of the redux actions in speedscope are setters, which always made me think there must be a simpler way. This is an attempt to find that simpler way. See `src/lib/atom.ts` for the library.	2021-03-28 02:44:43 -07:00
Gabriele N. Tornetta	d6f5efa06c	fix(search): allow paste in search box (#338 )	2021-03-27 15:05:16 -07:00

417 Commits