mirror of https://github.com/barrucadu/dejafu.git synced 2024-12-23 21:42:09 +03:00

Systematic concurrency testing meets Haskell.

concurrency haskell starred-barrucadu-repo starred-repo testing

Go to file

Michael Walker 76d5d2f77a Merge pull request #419 from barrucadu/fix Fix prerelease check		2024-12-11 22:11:49 +00:00
.github	Bump cachix/install-nix-action from 29 to 30	2024-10-04 08:21:49 +00:00
concourse	Fix prerelease check	2024-12-11 21:51:49 +00:00
concurrency	Fix formatting issues in CHANGELOGs	2023-10-19 20:01:59 +01:00
dejafu	dejafu-2.4.0.6	2024-12-11 21:19:25 +00:00
dejafu-tests	Bump tasty upper bound to <1.6	2023-09-11 19:56:10 +01:00
docs	Redirect old readthedocs site	2023-10-19 20:31:09 +01:00
hunit-dejafu	Fix formatting issues in CHANGELOGs	2023-10-19 20:01:59 +01:00
tasty-dejafu	Fix formatting issues in CHANGELOGs	2023-10-19 20:01:59 +01:00
.gitignore	Add new mdbook / GitHub Pages based website	2023-10-19 20:31:09 +01:00
.hlint.yaml	Add LTS-19.0 to build matrix & supported GHC versions doc	2022-08-22 14:40:35 +01:00
.readthedocs.yaml	Redirect old readthedocs site	2023-10-19 20:31:09 +01:00
.stylish-haskell.yaml	Use stylish-haskell to format import lists.	2017-04-08 05:42:25 +01:00
.weeder.yaml	Turn all tests into benchmarks	2018-06-24 11:34:31 +01:00
CHANGELOG.rst	Rewrite generic changelog	2018-02-16 18:59:20 +00:00
lint.sh	Warn, lint, & style tests	2018-02-16 20:04:54 +00:00
README.markdown	dejafu-2.4.0.6	2024-12-11 21:19:25 +00:00
stack.yaml	Add GHC 9.2 (LTS-20.0) to CI	2022-11-26 23:19:42 +00:00
style.sh	Warn, lint, & style tests	2018-02-16 20:04:54 +00:00

README.markdown

dejafu

[Déjà Fu is] A martial art in which the user's limbs move in time as well as space, […] It is best described as "the feeling that you have been kicked in the head this way before"

-- Terry Pratchett, Thief of Time

Installation
Quick start guide
Why Déjà Fu?
Contributing
Release notes
Questions, feedback, discussion
Bibliography
The website!

Déjà Fu is a unit-testing library for concurrent Haskell programs. Tests are deterministic and expressive, making it easy and convenient to test your threaded code.

Available on GitHub, Hackage, and Stackage.

Features:

An abstraction over the concurrency functionality in IO
Deterministic testing of nondeterministic code
Both complete (slower) and incomplete (faster) modes
A unit-testing-like approach to writing test cases
A property-testing-like approach to comparing stateful operations
Testing of potentially nonterminating programs
Integration with HUnit and tasty

There are a few different packages under the Déjà Fu umbrella:

	Version	Summary
concurrency	1.11.0.3	Typeclasses, functions, and data types for concurrency and STM.
dejafu	2.4.0.6	Systematic testing for Haskell concurrency.
hunit-dejafu	2.0.0.6	Deja Fu support for the HUnit test framework.
tasty-dejafu	2.1.0.1	Deja Fu support for the Tasty test framework.

See the latest package documentation.

Each package has its own README and CHANGELOG in its subdirectory.

There is also dejafu-tests, the test suite for dejafu. This is in a separate package due to Cabal being bad with test suite transitive dependencies.

Installation

Install from Hackage globally:

$ cabal install dejafu

Or add it to your cabal file:

build-depends: ...
             , dejafu

Or to your package.yaml:

dependencies:
  ...
  - dejafu

Quick start guide

Déjà Fu supports unit testing, and comes with a helper function called autocheck to look for some common issues. Let's see it in action:

import Control.Concurrent.Classy

myFunction :: MonadConc m => m String
myFunction = do
  var <- newEmptyMVar
  fork (putMVar var "hello")
  fork (putMVar var "world")
  readMVar var

That MonadConc is a typeclass abstraction over concurrency, but we'll get onto that shortly. First, the result of testing:

> autocheck myFunction
[pass] Never Deadlocks
[pass] No Exceptions
[fail] Consistent Result
    "hello" S0----S1--S0--

    "world" S0----S2--S0--
False

There are no deadlocks or uncaught exceptions, which is good; but the program is (as you probably spotted) nondeterministic!

Along with each result, Déjà Fu gives us a representative execution trace in an abbreviated form. Sn means that thread n started executing, and Pn means that thread n pre-empted the previously running thread.

Why Déjà Fu?

Testing concurrent programs is difficult, because in general they are nondeterministic. This leads to people using work-arounds like running their testsuite many thousands of times; or running their testsuite while putting their machine under heavy load.

These approaches are inadequate for a few reasons:

How many runs is enough? When you are just hopping to spot a bug by coincidence, how do you know to stop?
How do you know if you've fixed a bug you saw previously? Because the scheduler is a black box, you don't know if the previously buggy schedule has been re-run.
You won't get that much scheduling variety! Operating systems and language runtimes like to run threads for long periods of time, which reduces the variety you get (and so drives up the number of runs you need).

Déjà Fu addresses these points by offering complete testing. You can run a test case and be guaranteed to find all results with some bounds. These bounds can be configured, or even disabled! The underlying approach used is smarter than merely trying all possible executions, and will in general explore the state-space quickly.

If your test case is just too big for complete testing, there is also a random scheduling mode, which is necessarily incomplete. However, Déjà Fu will tend to produce much more schedule variety than just running your test case in IO the same number of times, and so bugs will tend to crop up sooner. Furthermore, as you get execution traces out, you can be certain that a bug has been fixed by simply following the trace by eye.

Contributing

See the "contributing" page on the website.

If you'd like to get involved with Déjà Fu, check out the "good first issue" label on the issue tracker.

Release notes

See the CHANGELOG.markdown file.

Questions, feedback, discussion

For general help talk to me in IRC (barrucadu in #haskell) or shoot me an email (mike@barrucadu.co.uk)
For bugs, issues, or requests, please file an issue.

Bibliography

These libraries wouldn't be possible without prior research, which I mention in the documentation. Haddock comments get the full citation, whereas in-line comments just get the shortened name:

[BPOR] Bounded partial-order reduction, K. Coons, M. Musuvathi, and K. McKinley (2013)
[RDPOR] Dynamic Partial Order Reduction for Relaxed Memory Models, N. Zhang, M. Kusano, and C. Wang (2015)
[Empirical] Concurrency Testing Using Schedule Bounding: an Empirical Study, P. Thompson, A. Donaldson, and A. Betts (2014)
[RMMVerification] On the Verification of Programs on Relaxed Memory Models, A. Linden (2014)

There are also a few papers on dejafu itself:

Déjà Fu: A Concurrency Testing Library for Haskell, M. Walker and C. Runciman (2015)

This details dejafu-0.1, and was presented at the 2015 Haskell Symposium.
Déjà Fu: A Concurrency Testing Library for Haskell, M. Walker and C. Runciman (2016)

This is a more in-depth technical report, written between the dejafu-0.2 and dejafu-0.3 releases.
Revealing Behaviours of Concurrent Functional Programs by Systematic Testing, M. Walker (2018)

This is my Ph.D thesis, which discusses dejafu and my other research projects.