mirror of https://github.com/aaronallen8455/graph-trace.git synced 2024-07-07 08:16:19 +03:00

Trace the call graph of a Haskell program

debugging haskell starred-aaronallen8455-repo starred-repo

Go to file

Aaron Allen e17edc37a1 support 9.6		2023-03-16 21:07:38 -05:00
.github/workflows	support 9.6	2023-03-16 21:07:38 -05:00
exe	generate unboxed tuple/sum instances	2022-01-17 21:15:53 -06:00
graph-trace	support 9.6	2023-03-16 21:07:38 -05:00
graph-trace-dot	support 9.6	2023-03-16 21:07:38 -05:00
graph-trace-viz	support 9.6	2023-03-16 21:07:38 -05:00
images	fix view pat bug	2021-12-22 17:59:10 -06:00
.gitignore	add IP update for PatBinds	2021-12-08 21:35:04 -06:00
cabal.project	separate build targets	2021-11-25 15:49:18 -06:00
README.md	use kmett levity polymorphism trick	2022-05-28 13:02:31 -05:00
run-exe	nexus wip	2022-01-16 15:40:25 -06:00
Setup.hs	figuring stuff out	2021-10-09 14:55:57 -05:00
shell.nix	support 9.6	2023-03-16 21:07:38 -05:00

README.md

Graph Trace

This is a GHC plugin that instruments your Haskell program so that it creates a log file when run which can then be compiled to a graph using the accompanying graph-trace-viz utility. In contrast to the usual debug tracing a la Debug.Trace where all output is interleaved into a flat sequence of lines, the graph structure produced by this plugin takes into account the actual call graph of the program with individual function calls represented as edges in the graph.

Contents:

Demonstration
Quickstart
User's guide
- Graph.Trace plugin
- graph-trace-viz utility
Caveats

Demonstration

Consider this simplistic program that greets the user by name:

import Graph.Trace (TraceDeep, trace, traceM)
import Data.Char (toUpper, toLower)

main :: TraceDeep => IO ()
main = do
  firstName <- prompt "Enter your first name"
  lastName <- prompt "Enter your last name"
  greet firstName lastName

prompt :: String -> IO String
prompt str = do
  putStrLn str
  input <- getLine
  traceM $ "input: " <> input
  pure $ capitalize input

capitalize :: String -> String
capitalize [] = []
capitalize (x:xs) =
  let result = toUpper x : map toLower xs
   in trace ("result: " <> result) result

greet :: String -> String -> IO ()
greet firstName lastName =
  putStrLn $ "Hello, " <> firstName <> " " <> lastName <> "!"

Using the Graph.Trace plugin along with the graph-trace-viz utility, we can run this program to generate the following trace of the call graph:

Quickstart

Add the graph-trace package as a dependency to your project.
Enable the plugin by adding the following GHC options: -fplugin=Graph.Trace -fplugin-opt Graph.Trace:trace-all -fno-full-laziness -fno-cse. This can be placed in the ghc-options field of the cabal or package.yaml file (depending on whether you use cabal or stack to build). For example:

in package.yaml:
```
executables:
  my-exe:
    ...
    ghc-options:
      -fplugin=Graph.Trace
      -fplugin-opt Graph.Trace:trace-all
      -fno-full-laziness
      -fno-cse
```
or in foo.cabal:
```
executable my-exe
  ...
  ghc-options:
    -fplugin=Graph.Trace
    -fplugin-opt Graph.Trace:trace-all
    -fno-full-laziness
    -fno-cse
```
Build your project (cabal build all or stack build).
Running your program should now generate a file called <executable-name>.trace.
Install Graphviz
Install the graph-trace-viz utility by running cabal update && cabal install graph-trace-viz or stack update && stack install graph-trace-viz.
Invoke graph-trace-viz within the same directory as the trace file. There should now be a file called <executable-name>.html which can be viewed in your browser.

User's Guide

`Graph.Trace` plugin

To use this plugin simply add graph-trace as a package dependency and pass the -fplugin=Graph.Trace option to GHC. The main functionality of the Graph.Trace plugin is to automatically instrument your code so that it will emit trace logging to a *.trace file when run. There are two types of traces:

Function call
This trace is emitted when the term returned by a function is evaluated to WHNF. Each emission of an entry trace generates a unique ID for that particular function invocation. The trace also includes the ID of the code from which the function was called (if applicable) so that a graph edge can be constructed between the two. Entry traces are only emitted for functions that have signatures. If the function does not have a signature then it will simply inherit the ID of the calling context and will therefore not generate a new node in the call graph.
Debug trace
Debug traces are textual messages that the user can emit from the body of a function. When rendered, they appear as plain text in the body of a call graph node. The API for debug traces matches that of the familiar Debug.Trace module and is available through the Graph.Trace module. A debug trace will only be emitted when the thing it is applied to is evaluated to WHNF.

The plugin gives you some control over how and when traces are emitted. This comes in the form of various type class constraints that you can put on function signatures to control tracing, all of which are exported by Graph.Trace:

Trace
On its own, this constraint says that traces should be emitted for a given function using the name of that function as the identifier. Notably, function calls made from within the body of the function do not inherit this behavior and so will not emit traces unless otherwise instructed to do so.
TraceDeep
This is similar to Trace except that function calls made within the function body will also emit traces even if they don't have a trace constraint (unless they are being muted). For example, putting TraceDeep on the main function will result in the full execution the program being traced.
TraceMute
If a function has this constraint then any invocation of it as well as the function calls it makes will not emit any traces. This constraint overrides all the others and is inherited by function calls made from within its body.
TraceKey
This constraint is the same as Trace but it takes a type level string argument which will be used as the function identifier instead of the function's actual name. Notably, if the identifier of a called function is overridden to be the same as that of the function calling it then its output will be placed in the graph node of the calling function rather than generating its own node. There is also a TraceDeepKey constraint that does the same thing but for TraceDeep.

If you want every function in your program to emit traces, you can use the trace-all plugin option which effectively adds the Trace constraint to all function definitons automatically. To use this option, pass the -fplugin-opt Graph.Trace:trace-all option to GHC in addition to -fplugin=Graph.Trace.

Notably, if you don't use the trace-all plugin option and don't place a Trace or TraceDeep constraint on any functions, then no tracing will occur.

If a trace file already exists for the executable being traced, then new entries will be appended to that file. To start a new trace you'll need to rename or delete the old file.

`graph-trace-viz` utility

Once you've generated a *.trace file by compiling your program with the Graph.Trace plugin and running it, the graph-trace-viz utility can render the resulting graph as an interactive html document. It is dependent on Graphviz to render the graph, so you must install that on your system (there should be an executable called dot accessible through your $PATH). Simply invoke graph-trace-viz in the same directory as the *.trace file and it will write the resulting *.html document. By default it will read all trace files in the current directory but you can also specify the files by giving them as command line arguments instead.

Installing `graph-trace-viz`

To install, run cabal update && cabal install graph-trace-viz or stack update && stack install graph-trace-viz.

Viewing call graphs

Each node in the graph corresponds to the invocation of a function. It has the name of the function in its header as well as a body containing function calls made from within that function and also any trace messages the user has added. The children function calls will have an edge going out to the node representing that call. If a node doesn't have any entries in its body then it will only appear as an entry in its calling function rather than drawing an edge to an empty node.
You can mouse over the node headers and body entries to see a tooltip with their corresponding source code locations.
You can navigate the graph by clicking on the body entry of a function call to go to that call's node while clicking on the header of a node takes you to the node from which it was called. This feature is very useful for large call graphs.
By default each node represents a unique function call. You can change this behavior by passing the --nexus flag to graph-trace-viz which causes nodes that have identical content to be merged into a single node with multiple parents. This can help remove visual clutter in the case of a function being called repeatedly.

Caveats

There are several known caveats you should be aware of:

Undesirable optimisations
If you compile with the common sub-expression or full laziness optimisations, which are on by default for O1 and O2 settings, graph nodes that should be distinct can sometimes get merged into a single node. To prevent this, it is recommended that you turn these optimisations off using the -fno-cse and -fno-full-laziness GHC options when compiling with the graph-trace plugin.
Type signatures
Top level function bindings that don't have signatures won't be traced at all. Function bindings in where or let blocks that don't have type signatures won't emit function call traces, meaning they won't generate new nodes in the graph but traces emitted from them will still appear in the node body of the calling function.
Type class methods
For type class instance methods to be traced correctly, the class method definitions must be in a package compiled with the plugin and you'll also need to put a type signature on the instance method declarations, which requires the InstanceSigs GHC extension.
Impredicative types
If you have a function binding that takes a rank-n quantified type as a parameter, this can cause compilation with the plugin to fail. With GHC 9.2 and above, giving a type signature to the binding should resolve the issue. Another option is to use a newtype wrapper.
Performance
Since additional work must be done to instrument the code, compilation may be noticeably slower than normal. Additionally, the instrumented program may perform worse due to the additional overhead of emitting traces.
View patterns
Traces for function calls in view patterns get associated to the node one level up from the function using the view pattern.
Pattern synonyms
Function calls in pattern synonym matches do not get traced.
It probably goes without saying but you should not compile production deployments with this plugin.
The plugin does not support GHC versions less than 8.10.x