macaw

mirror of https://github.com/GaloisInc/macaw.git synced 2024-11-29 11:02:13 +03:00

Author	SHA1	Message	Date
Stanislav Lyakhov	616b266559	Fix refinement panic caused by duplicate cfg edges Refinement frequently tried to add duplicate edges to the cfg when encountering jump tables, since jump tables often have common targets (e.g. because of a `default` case). Instead of panicking, this is now handled as a benign case where no edge insertion is needed.	2024-05-10 15:40:11 -07:00
Ryan Scott	984f7cb368	Support building with GHC 9.6 This patch contains a handful of tweaks needed to make the libraries in the `macaw` repo build with GHC 9.6: * GHC 9.6 bundles `mtl-2.3.`, which no longer re-exports `Control.Monad`, `Control.Monad.Trans`, and similar modules from `mtl`-related modules. To accommodate this, various imports have been made more explicit. I have disambiguated a use of `Data.Parameterized.NatRepr.withKnownNat` in `macaw-aarch32` to avoid clashing with a newly exported function of the same name in `GHC.TypeNats`. * I have bumped various upper version bounds on `doctest`, `optparse-applicative`, and `what4` to allow building these libraries with GHC 9.6. * I have bumped the following submodules to bring in GHC 9.6–related changes: * `asl-translator`: GaloisInc/asl-translator#53 * `crucible`: GaloisInc/crucible#1102 * `dwarf`: GaloisInc/dwarf#6 * `elf-edit`: GaloisInc/elf-edit#38 * `flexdis86`: GaloisInc/flexdis86#54 * `grift`: GaloisInc/grift#9 * `llvm-pretty`: elliottt/llvm-pretty#112 * `llvm-pretty-bc-parser`: GaloisInc/llvm-pretty-bc-parser#225 * `semmc`: GaloisInc/semmc#80 * `what4`: GaloisInc/what4#235	2023-08-21 08:16:10 -04:00
Ryan Scott	7e346081a5	Fix -Wtype-equality-requires-operators warnings GHC 9.4 adds `-Wtype-equality-requires-operators` to `-Wall`, which warns about certain uses of type equalities that are not forward-compatible with planned changes in GHC. See [this section](https://gitlab.haskell.org/ghc/ghc/-/wikis/migration/9.4?version_id=b60e52482a666d25638d59cd7e86851ddf971dc1#-is-now-a-type-operator) of the GHC 9.4 Migration Guide. These warnings are easily fixed by enabling the `TypeOperators` extension.	2023-04-18 15:17:23 -04:00
Ryan Scott	36cade1690	Bundle more options into MemModelConfig This folds the menagerie of various configuration option arguments to `macawExtensions` into the `MemModelConfig` data type. The advantage to doing this is that is will make it easier to extend the memory model configuration options in the future without needlessly foisting breaking changes on all `macaw-symbolic` users. Unfortunately, it does require a breaking change to get to this point, but the migration path is straightforward for existing code. I have included this migration story in the `macaw-symbolic` changelog.	2023-03-14 13:27:07 -04:00
Ryan Scott	6e020bcde6	Fix -Wincomplete-uni-patterns warnings GHC 9.2 adds `-Wincomplete-uni-patterns` to `-Wall`, which uncovers a slew of previously unnoticed warnings in `macaw`. This patch fixes them, mostly by adding explicit fall-through cases.	2022-05-31 15:50:48 -04:00
Daniel Matichuk	5b4d16dd8a	generalize treatment for terminal statements in CFG slicing code (#278 ) * remove/generalize MacawBlockEnd from CFG slicing * expose functions in symbolic backend * hide bvLit from Backend import * add CI version to workflow	2022-04-20 10:08:15 -07:00
Tristan Ravitch	45f8af1e5a	[symbolic] Optimize how initial binary memories are populated (#267 ) Before, the API provided by macaw-symbolic asserted the initial value of each byte of memory individually. This was fairly expensive for large binaries, as each such assertion flushed the solver pipe. This change generates a large conjunction of assertions and sends them all at once. In unscientific testing, this saved half an hour on a large binary. API Changes: - Note that it introduces a minor API change. The optimization required that the `sym` parameter be concretely an `ExprBuilder`.	2022-03-10 16:43:00 -08:00
Rob Dockins	465a84ee49	Update with changes flowing from GaloicInc/crucible#945 . This mostly deals with the splitting of the old `sym` type into two: one for dealing with expression creation, and a new simulator backend type for dealing with control-flow and assertions.	2022-01-24 16:24:07 -08:00
Ryan Scott	729c65054b	Replace deprecated prettyprinter import with modern equivalent Also bump the lower version bounds on `prettyprinter`.	2022-01-10 16:40:23 -05:00
Ryan Scott	d3a53a6769	Update crucible, semmc submodules; adapt to GaloisInc/crucible#906 This updates the `crucible` submodule to include GaloisInc/crucible#906 (`Control granularity of reading uninitialized memory`), as well as the `semmc` submodule to bring in corresponding changes on its side (GaloisInc/semmc#69). Some additional `?memOpts :: MemOptions` constraints needed to be added to some functions in `macaw-symbolic` and `macaw-refinement` as a result.	2021-11-22 18:27:46 -05:00
Tristan Ravitch	952fe5578d	Submodule updates	2021-11-18 21:40:09 -08:00
Tristan Ravitch	2c85dce18e	Expose block classification in the ArchitectureInfo This change makes the block classifier heuristic part of the `ArchitectureInfo` structure. This enables clients and architecture backends to customize the block classification heuristics. This is most useful for architectures that have complex architecture-specific block terminators that require analysis to generate (e.g., conditional returns). It will also make macaw-refinement simpler in the future, as the SMT-based refinement is just an additional block classifier (but is currently implemented via a hacky side channel). This change introduces an ancillary change, which should not be very user-visible. It splits the Macaw.Discovery and Macaw.Discovery.State modules to break module import cycles in a way that enables us to expose the classifier. This should not be user-visible, as Macaw.Discovery still exports the same names (with one minor exception that should not appear in user code). It also moves the definition of the `ArchBlockPrecond` type family; the few affected places should be updated. User code should probably not be able to see this.	2021-11-05 18:25:03 -07:00
Brett Boston	a336895da7	Add optional override for MacawArchStmtExtensions to genArchVals (#230 ) This change adds an optional argument to `genArchVals` that allows client code to override the backend translation behavior of `MacawArchStmtExtension`s on a statement-by-statement basis. The new argument has type `Maybe (MacawArchStmtExtensionOverride arch)`, where `MacawArchStmtExtensionOverride` is a function that takes a statement and a crucible state, and returns an optional tuple containing the value produced by the statement, as well as an updated state. Returning 'Nothing' indicates that the backend should use its default handler for the statement. Client code that wishes to maintain the existing default behavior in all cases can simply pass `Nothing` for the new argument to `genArchVals`.	2021-09-14 18:24:47 -07:00
Tristan Ravitch	380d732d0e	Implement system call support for x86 (#226 ) Implement support for symbolically executing system calls in macaw-symbolic. To update code that does not need to symbolically execute system calls (i.e., most clients of macaw-symbolic), just pass the new `unsupportedSyscalls` default handler as the fifth argument of `macawExtensions`. The primary interface is via the new `LookupSyscallHandle` callback passed to `macawExtensions`. This callback inspects the environment and returns a Crucible `FunctionHandle` that models the behavior of the requested system call. Note that this mechanism only supports concrete system calls (i.e., system calls where the system call number is concrete). The x86 backend has been updated to support this new functionality. The representation of system calls in macaw is still architecture-specific (because there are interesting differences between system call instructions across architectures). The idea is that system calls are now treated in two steps: 1. A macaw-symbolic extension statement that looks up the override to invoke for the given syscall (returned as a Crucible FunctionHandle) 2. A call to that handle We need this two step approach because the handlers that interpret syntax extension statements cannot symbolically branch (and thus cannot call overrides). The extension interpreter just looks up the necessary handle and uses the standard call/override machinery to handle any branching required to support the system call model functionality. The major complication to this approach is that system calls need to update values in registers when they return. To capture these updates, the architecture-specific syntax extension needs to explicitly update any machine registers that could possibly be affected. The explicit updates are necessary because machine registers do not exist anymore at the macaw-symbolic level (at least within a block). To handle all of these constraints: 1. System calls are represented as extension functions at the macaw level when lifted from machine code. 2. During translation into crucible (via macaw-symbolic), the extension functions are translated into two statements: a function handle lookup and then a function call (with the return values being explicitly threaded through the Crucible function). 3. During symbolic execution, the lookup statement examines the environment to return the necessary function handle, while the handle is called via the normal machinery. Note that the feature is entirely controlled by the `LookupSyscallHandle` function, which determines the system call dispatch policy. No system call models are included with this change. Co-authored-by: Brett Boston <boston@galois.com>	2021-08-27 15:47:40 -07:00
Ryan Scott	7f7de2a59b	Adapt to GaloisInc/crucible#794 (#224 ) GaloisInc/crucible#794 increases the number of functions that use implicit `MemOptions`, including a handful of key LLVM memory model–related functions. As a result, many parts of `macaw` need to add implicit `?memOpts` parameters to accommodate to this change.	2021-08-23 20:39:08 -04:00
Tristan Ravitch	6b712c6280	Fix the default ppc32 ABI (#222 ) None of the common default ppc32 ABIs use a Table of Contents (TOC), so default our code to not assume it either. This has accompanying changes in macaw-loader-ppc, which also made incorrect assumptions about ppc32. Note that we may eventually need to support rarely-used ABIs that do use a TOC (or similar dedicated registers, e.g., the Small Data Area mode). When we do, we will probably want that to be a data-oriented decision rather than a type-level one, as each architecture supports multiple ABIs. We may also need to modify ppc64 to support ABIs without TOCs, but we'll do it when we need to.	2021-08-19 07:41:29 -07:00
Tristan Ravitch	df839de678	Update submodules This commit adapts to recent changes in crucible and raises the version bounds to admit newer versions of what4.	2021-07-15 21:22:30 -07:00
Kevin Quick	c9b06090d2	[refinement] add memory variable name for mkMemVar call.	2021-04-03 23:15:28 -07:00
Tristan Ravitch	3f85e01b67	Raise the upper bound on the optparse dependency (#195 )	2021-03-20 09:17:54 -07:00
Tristan Ravitch	92bf4c2595	Lift the version bound on the lumberjack dependency (#194 )	2021-03-19 18:38:58 -07:00
Tristan Ravitch	dbb4c83f08	Add a testing framework for macaw-symbolic (#184 ) The new test suites cover x86_64, PowerPC, and ARM. They test that the semantics are actually correct (rather than just seeing if symbolic execution produces any result). The `Data.Macaw.Symbolic.Testing` module in macaw-symbolic provides some common utilities for symbolic execution engine setup, while there are tailored test harnesses for each architecture. The semantics of the test harnesses are documented in each architecture test suite, but they: 1. Discover all of the test binaries (which are generated from the included makefiles) 2. Treat each function whose name begins with `test_` as a test entry point 3. Symbolically executes each test case with fully symbolic register states 4. Extracts the return value after symbolic execution, which is treated as the predicate to an assertion that must be proved - If the test case is in the `pass` subdirectory, it is proved and expected to hold - If the test case is in the `fail` subdirectory, it is proved and expected to not hold. Each test harness supports two options for debugging: - Dumping generated SMT queries - Dumping generated Macaw IR for inspection This testing uncovered a bug in the (previously untested) macaw-aarch32-symbolic code. It required a number of submodule updates to: - Adapt to some what4 changes - Fix a bug in the LLVM memory model that lets these tests pass - Adapt to changes to some crucible APIs This change also modifies the CI configuration to install SMT solvers earlier (which are now needed for all of the symbolic package tests).	2021-03-01 09:21:44 -08:00
robdockins	a58f1e25dd	Update to follow changes in What4. Nat is no longer a base type (#190 ) Nat is no longer a what4 base type, so we have to adapt various APIs to accommodate that. The template haskell in macaw-semmc is updated to remove Nat cases. Changes to the `SymFn` type required removing a type parameter. This commit also adds macaw-refinement to CI (which requires installing SMT solvers); that code had to be updated due to the what4 changes. Co-authored-by: Tristan Ravitch <tristan@galois.com>	2021-02-19 15:44:56 -08:00
Joe Hendrix	6d1e47623d	Provide jumptable layout info	2021-01-27 15:27:53 -08:00
Daniel Matichuk	cd5dfe8c65	macaw-symbolic: record original block endings when making a CFG slice (#185 ) * reify block exit in macaw extension * add return address to block end classification * use global variable to retain block endings * update to mkBlockSliceCFG signature * add haddocks for MacawBlockEnd Co-authored-by: Tristan Ravitch <tristan@galois.com>	2021-01-11 16:56:54 -08:00
Kevin Quick	2ba115324d	Bump upper bound for scheduler in refinement.	2020-12-06 10:54:47 -08:00
Brian Huffman	2a620d41de	Switch from `ansi-wl-pprint` to the `prettyprinter` package. This patch relies on the following submodule updates: - GaloisInc/what4#77 - GaloisInc/elf-edit#20 - GaloisInc/crucible#586 - GaloisInc/asl-translator#28 This patch updates the following packages: - macaw-base - macaw-symbolic - macaw-x86 - macaw-x86-symbolic - macaw-aarch32 - macaw-ppc - macaw-semmc - macaw-refinement	2020-12-02 11:38:19 -08:00
Joe Hendrix	5aad8ca32e	Upgrade to elf-edit 0.39 and other libraries.	2020-11-10 17:15:47 -08:00
Rob Dockins	99f8cb0bdf	Update to use new `HasLLVMAnn` API, which requires an action for recording (or discarding) annotations rather than a map.	2020-09-11 14:40:02 -07:00
Kevin Quick	fc419e4c18	[refinement] update for badBehaviorMap implicit parameter.	2020-08-10 13:47:44 -07:00
Kevin Quick	f695c4d4c1	[macaw-refinement] updates for app-refactor what4 changes.	2020-07-17 17:30:44 -07:00
Ben Selfridge	039b8497fc	updates what4, crucible, etc. (#146 ) * update to bv-sized branch of what4 and other things * removed parameterized-utils submodule completely * Updates submodules * Fixes macaw-symbolic w.r.t. crucible-llvm changes Co-authored-by: Ben Selfridge <ben@000548-benselfridge.local>	2020-06-16 16:49:55 -07:00
Kevin Quick	0f430d1b38	Update dependencies for GHC 8.4 support.	2020-05-21 23:42:41 -07:00
Daniel Wagner	4ffec20d0a	complete the merge	2020-04-03 22:49:34 -04:00
Daniel Wagner	d39ad7a024	Merge branch 'master' into wip/equiv	2020-04-03 00:20:53 -04:00
Andrei Stefanescu	0be59e5815	Update Macaw to use HasLLVMAnn. (#122 )	2020-04-02 17:58:47 -07:00
Tristan Ravitch	e024646860	macaw-refinement (#114 ) This commit updates macaw-refinement to work with the latest macaw/crucible and makes a few improvements along the way. The major changes involved in this are: * Block labels were removed from macaw, so we had to come up with an alternative approach to making synthetic blocks to represent dispatch resolved by macaw-refinement that is not really a jump table. We considered adding a new terminator that encoded "computed IP-based dispatch", but there was concern about the impact on client code. Instead, we added a field to the `DiscoveryFunInfo` that records "external" resolutions to indirect control flow (e.g., as by an SMT solver in macaw-refinement). The hook by which we feed SMT-based resolutions back into macaw was modified accordingly (`addDiscoveredFunctionBlockTargets`). * Solver invocation changed to allow solver selection and parallel solver application. * Logging is now done via the `lumberjack` library. * macaw-symbolic now uses the "external" resolutions in `DiscoveryFunInfo` while building crucible CFGs. * The path creation code in macaw-refinement was simplified significantly and the approach to path creation has been documented. * The run-refinement tool is now more featureful. * The test suite is a bit more structured and no longer depends on the printed output of the discovery process.	2020-03-12 17:15:08 -07:00
Daniel Wagner	10a1fbc24a	misc. build fixes	2019-10-17 16:38:16 -04:00
Kevin Quick	898e7bdb95	[refinement] Added newline to end of expected output file.	2019-02-26 14:51:49 -08:00
Kevin Quick	3fe6fdf949	Sort refinement results for test stability and visual convenience.	2019-02-23 21:33:40 -08:00
Kevin Quick	3e7bd01560	[refinement] Disabled PPC switching test. Attempting to refine the switching test for PPC executables ends up with a non-terminating Z3 process, so this test is disabled until this is diagnosed.	2019-02-08 17:32:21 -08:00
Kevin Quick	edb486c6b3	Added toCrucibleEndian in symbolic and use for memory setup in refinement. Requires updated macaw-loader BinaryFormat information.	2019-02-08 17:30:18 -08:00
Kevin Quick	d3228847f7	[refinement] add PPC tests expected results.	2019-02-08 14:26:17 -08:00
Kevin Quick	6cb0868d1b	[refinement] import cleanup	2019-02-08 14:20:29 -08:00
Kevin Quick	290e33279a	[refinement] Enable PPC testing.	2019-02-08 14:10:08 -08:00
Kevin Quick	3eeaed08a2	Merge branch 'master' of github.com:GaloisInc/macaw	2019-02-08 08:30:26 -08:00
Andrei Stefanescu	c67e474a19	[refinement] Move symbolic execution code in its own module.	2019-02-07 19:43:58 -08:00
Andrei Stefanescu	5b93187d5b	[refinement] Update .cabal file.	2019-02-07 17:25:52 -08:00
Andrei Stefanescu	45e4251bf3	[refinement] Add an unbounded memory allocation at the bottom of the allocation stack.	2019-02-07 17:23:02 -08:00
Kevin Quick	33540121f5	Merge branch 'master' of github.com:GaloisInc/macaw	2019-02-07 17:15:24 -08:00
Kevin Quick	3baf55376b	[refinement] update run-refinement for summary and verbose modes.	2019-02-07 17:15:00 -08:00

1 2 3

122 Commits