mirror of https://github.com/ilyakooo0/roboservant.git synced 2024-09-17 11:17:39 +03:00

generate contextually sensible fuzz tests for servant apps

Go to file

Mark Wotton 77f565b993 wip number whatever		2020-08-24 17:30:25 -04:00
.github/workflows	ci tweaking	2020-06-06 11:24:32 -04:00
app	autogenerated stuff	2020-06-05 12:01:44 -04:00
scripts	tidying up	2020-06-06 10:24:56 -04:00
src	wip number whatever	2020-08-24 17:30:25 -04:00
test	wip number whatever	2020-08-24 17:30:25 -04:00
.gitignore	cleaned up some type level stuff, added some type level bits	2020-08-08 18:46:09 -04:00
ChangeLog.md	autogenerated stuff	2020-06-05 12:01:44 -04:00
LICENSE	autogenerated stuff	2020-06-05 12:01:44 -04:00
Makefile	wip	2020-08-23 11:58:10 -04:00
package.yaml	wip number whatever	2020-08-24 17:30:25 -04:00
README.md	coverage	2020-08-09 11:55:16 -04:00
roboservant.cabal	wip number whatever	2020-08-24 17:30:25 -04:00
Setup.hs	autogenerated stuff	2020-06-05 12:01:44 -04:00
stack.yaml	wip	2020-08-23 11:58:10 -04:00
TODO.md	write readme	2020-06-06 12:52:57 -04:00

README.md

roboservant

Automatically fuzz your servant apis in a contextually-aware way.

why?

Servant gives us a lot of information about what a server can do. We use this information to generate arbitrarily long request/response sessions and verify properties that should hold over them.

why not servant-quickcheck?

servant-quickcheck is a great package and I've learned a lot from it. Unfortunately, there's a lot of the state space it can't explore: modern webapps are full of pointer-like structures, whether they're URLs or database keys/uuids. servant-quickcheck demands that you be able to generate these without context via Arbitrary: good luck exploring an API that requires you to generate just the right UUID to hit non-trivial code.

roboservant avoids this by using quickcheck-state-machine, which models the dynamic state in such a way that we can use results of previous calls.

concept

we start with a servant api and a server that fulfills the type.

From that api, we should be able to summon up an empty type-indexed store with a key for each response type in the API.

We can then look at each callable endpoint, and eliminate any that require values that are empty in the type-indexed store. this allows a generative process where we can extend a sequence of calls indefinitely, by making a call to the concrete server and recording the result in the type-indexed store.

extensions

add some "starter" values to the store
- there may be a JWT that's established outside the servant app, for instance.
class Extras a where extras :: Gen [a]
- default implementation pure []
- selectively allow some types to create values we haven't seen from the api. newtype FirstName = FirstName Text, say.
break down each response type into its components
- if i have
  - data Foo = FBar Bar | FBaz Baz
  - an endpoint foo that returns a Foo
  - and an endpoint bar that takes a Bar
- I should be able to call foo to get a Foo, and if it happens to be an FBar Bar, I should be able to use that Bar to call bar.

applications

testing
- some properties should always hold (no 500s)
- there may be some other properties that hold contextually
  - healthcheck should be 200
  - test complex permissions/ownership/delegation logic - should never be able to get access to something you don't own or haven't been delegated access to.
coverage
- if you run the checker for a while and hpc suggests you still have bad coverage, your api is designed in a way that requires external manipulation and may be improvable.
benchmarking
- we can generate "big-enough" call sequences, then save the database & a sample call for each endpoint that takes long enough to be a reasonable test.
- from this we can generate tests that a given call on that setup never gets slower.