catala

mirror of https://github.com/CatalaLang/catala.git synced 2024-09-19 00:15:39 +03:00

Author	SHA1	Message	Date
Louis Gesbert	403156b36e	Computation and checking of module hashes This includes a few separate changes: - pass visibility information of declarations (depending on wether the declaration was in a ```catala-metadata block or not) - add reasonable hash computation functions to discriminate the interfaces. In particular: * Uids have a `hash` function that depends on their string, but not on their actual uid (which is not stable between runs of the compiler) ; the existing `hash` function and its uses have been renamed to `id`. * The `Hash` module provides the tools to properly combine hashes, etc. While we rely on `Hashtbl.hash` for the atoms, we take care not to use it on any recursive structure (it relies on a bounded traversal). - insert the hashes in the artefacts, and properly check and report those (for OCaml) Remains to do: - Record and check the hashes in the other backends - Provide a way to get stable inline-test outputs in the presence of module hashes - Provide a way to write external modules that don't break at every Catala update.	2024-05-28 11:43:50 +02:00
Louis Gesbert	b78cd9c29a	Improve message formatting throughout Ensuring messages don't print overlong lines still requires some manual work: - if they don't contain any `Format` directives (`%` or `@`), use `"%a" Format.pp_print_text` to turn word-wrapping on. - otherwise replace spaces with `@ ` to mark possible cutting points, as soon that it's possible the line will get over 80 chars (most often, this means starting before the first `%a`)	2024-04-12 17:07:36 +02:00
Louis Gesbert	def10e7f98	Message: further simplification	2024-04-10 19:26:23 +02:00
Louis Gesbert	98fc97a241	Rewriting message calls to use the new intf	2024-04-10 19:26:23 +02:00
Louis Gesbert	75bf768264	Reformat	2024-04-04 10:56:56 +02:00
Louis Gesbert	4eeb8221f4	Fix var bindings in desugared->scopelang	2024-04-04 10:24:18 +02:00
Louis Gesbert	7951661981	Turn subscope-vars into scope vars They are to become citizens of the same class if we want to allow output-subscopes (without unnecessary complications like deconstructing and reconstructing the same structure). And it's reasonable to assume that they share the same namespace. With this we should shortly collapse the (internal) ambiguity between - `subscope.subvar`: access to a variable within a subscope - `subscope.subfield`: access to a field of the output structure contained in a subscope variable With the subscope a variable, these should now become strictly equivalent, so the plan is that the first could be removed.	2024-04-04 10:24:18 +02:00
Louis Gesbert	0be2636e65	Typing defaults: making the compiler adhere to the new type discipline	2023-11-27 11:06:01 +01:00
Louis Gesbert	d09113163d	Fix bug in dependency analysis around scope call arguments	2023-10-13 15:09:36 +02:00
Louis Gesbert	4910158aea	Reformat	2023-09-01 16:24:27 +02:00
Louis Gesbert	544e18e110	Fixes related to environments and lookups	2023-08-31 18:31:48 +02:00
Louis Gesbert	8e33355ead	Reformat	2023-08-31 18:31:48 +02:00
Louis Gesbert	7db63e5f78	Simplification: store paths in Uids rather than scattered in structures The context is still hierarchical for defs though, so one needs to retrieve the path to lookup in the correct context for info. Exceptions are enums and struct defs, which are re-exposed at toplevel.	2023-08-31 18:31:48 +02:00
Louis Gesbert	17172cf47d	Some fixes & cleanup after early review	2023-08-31 17:55:36 +02:00
Louis Gesbert	72882f82df	Reformat	2023-08-31 17:55:36 +02:00
Louis Gesbert	9bac045d03	Implement module lookups for scopes, structs, and enums	2023-08-31 17:54:39 +02:00
Louis Gesbert	f2fc79f640	Add some helper functions in a wrapper `Map` module and use them throughout. No more `List.map fst (Map.bindings m)` ! Also adds some facilities for direct formatting without going through a list.	2023-07-12 11:51:15 +02:00
Denis Merigoux	6962761774	Merge branch 'master' into closure_conversion	2023-06-15 17:56:41 +02:00
Denis Merigoux	a3087ee163	Adding typing for closure env	2023-06-15 11:11:56 +02:00
Aminata-Dev	10d147a8b1	Messages renamed to Message (lighter syntax)	2023-06-13 11:50:56 +02:00
Denis Merigoux	0faa97b8fc	Abstract messages interface	2023-06-02 10:50:29 +02:00
Louis Gesbert	fc531777c0	Rework and normalise the `Marked` interface The module is renamed to `Mark`, and functions renamed to avoid redundancy: `Marked.mark` is now `Mark.add` `Marked.unmark` is now `Mark.remove` `Marked.map_under_mark` is now simply `Mark.map` etc. `Marked.same_mark_as` is replaced by `Mark.copy`, but with the arguments swapped (which seemed more convenient throughout) Since a type `Mark.t` would indicate a mark, and to avoid confusion, the type `Marked.t` is renamed to `Mark.ed` as a shorthand for `Mark.marked` ; this part can easily be removed if that's too much quirkiness.	2023-05-17 17:37:00 +02:00
Louis Gesbert	6388a4b79f	Improved some error messages (cyclic defs, def positions)	2023-03-08 15:27:48 +01:00
adelaett	e519b7f146	until desugared	2023-02-20 17:21:44 +01:00
Louis Gesbert	c3af0b4097	Toplevel definitions: branch cleanup - fix remaining warnings (mostly unused arguments) - renamings throughout for consistency and clarity	2023-02-13 18:02:09 +01:00
Louis Gesbert	9b0c7583ec	Add top-level definitions Only handled until before scalc at the moment.	2023-02-13 11:43:49 +01:00
Louis Gesbert	660e5775de	Rename utils to catala_utils	2022-11-28 16:38:09 +01:00
Louis Gesbert	b329afbbdb	Rename all Map/Set calls accordingly This is just a bunch of `sed` calls: ```shell sed -i 's/ScopeSet/ScopeName.Set/g' compiler/*/.ml* sed -i 's/ScopeMap/ScopeName.Map/g' compiler/*/.ml* sed -i 's/StructMap/StructName.Map/g' compiler/*/.ml* sed -i 's/StructSet/StructName.Set/g' compiler/*/.ml* sed -i 's/EnumMap/EnumName.Map/g' compiler/*/.ml* sed -i 's/EnumSet/EnumName.Set/g' compiler/*/.ml* sed -i 's/StructFieldName/StructField/g' compiler/*/.ml* sed -i 's/StructFieldMap/StructField.Map/g' compiler/*/.ml* sed -i 's/StructFieldSet/StructField.Set/g' compiler/*/.ml* sed -i 's/EnumConstructorMap/EnumConstructor.Map/g' compiler/*/.ml* sed -i 's/EnumConstructorSet/EnumConstructor.Set/g' compiler/*/.ml* sed -i 's/RuleMap/RuleName.Map/g' compiler/*/.ml* sed -i 's/RuleSet/RuleName.Set/g' compiler/*/.ml* sed -i 's/LabelMap/LabelName.Map/g' compiler/*/.ml* sed -i 's/LabelSet/LabelName.Set/g' compiler/*/.ml* sed -i 's/ScopeVarMap/ScopeVar.Map/g' compiler/*/.ml* sed -i 's/ScopeVarSet/ScopeVar.Set/g' compiler/*/.ml* sed -i 's/SubScopeNameMap/SubScopeName.Map/g' compiler/*/.ml* sed -i 's/SubScopeNameSet/SubScopeName.Set/g' compiler/*/.ml* ``` ... and reformat	2022-11-28 16:38:09 +01:00
Louis Gesbert	4ae392c900	AST refactoring Many changes got bundled in here and would be too tedious to separate. Closes #330 See changes in `shared_ast/definitions.ml` to check the main point. - the biggest change is a modification of the struct and enum types in expressions: they are now stored as `Map`s throughout passes, and no longer converted to indexed lists after scopelang. Their accessors are also changed, and tuples only exist in Lcalc (they're used for closure conversion). This implied adding some more information in the contexts, to keep the mapping between struct fields and scope output variables. It should also be much more robust (no longer relying on assumptions upon different orderings). - another very pervasive change is more cosmetic: the rewrite of the main AST to use inline records, labelling individual subfields. - moved the checks for correct definitions and accesses of structures from `Scope_to_dcalc` to `Typing` - defining some new shallow iterators in module `Shared_ast.Expr`, and factorising a few same-pass rewriting functions accordingly (closure conversion, optimisations, etc.) - some smaller style improvements (ensuring we use the proper compare/equal functions instead of `=` in a few `when` closes, for example)	2022-11-17 18:16:09 +01:00
Louis Gesbert	41d6d3cbe9	Make scopes directly callable Quite a few changes are included here, some of which have some extra implications visible in the language: - adds the `Scope of { -- input_v: value; ... }` construct in the language - handle it down the pipeline: * `ScopeCall` in the surface AST * `EScopeCall` in desugared and scopelang * expressions are now traversed to detect dependencies between scopes * transformed into a normal function call in dcalc - defining a scope now implicitely defines a structure with the same name, with the output variables of the scope defined as fields. This allows us to type the return value from a scope call and access its fields easily. * the implications are mostly in surface/name_resolution.ml code-wise * the `Scope_out` struct that was defined in scope_to_dcalc is no longer needed/used and the fields are no longer renamed (changes some outputs; the explicit suffix for variables with multiple states is ignored as well) * one benefit is that disambiguation works just like for structures when there are conflicts on field names * however, it's now a conflict if a scope and a structure have the same name (side-note: issues with conflicting enum / struct names or scope variables / subscope names were silent and are now properly reported) - you can consequently use scope names as types for variables as well. Writing literals is not allowed though, they can only be obtained by calling the scope. Remaining TODOs: - context variables are not handled properly at the moment - error handling on invalid calls - tests show a small error message regression; lots of examples will need tweaking to avoid scope/struct name or struct fields / output variable conflicts - add a `->` syntax to make struct field access distinct from scope output var access, enforced with typing. This is expected to reduce confusion of users and add a little typing precision. - document the new syntax & implications (tutorial, cheat-sheet) - a consequence of the changes is that subscope variables also can now be typed. A possible future evolution / simplification would be to rewrite subscopes as explicit scope calls early in the pipeline. That could also allow to manipulate them as expressions (bind them in let-ins, return them...)	2022-10-21 17:17:26 +02:00
Louis Gesbert	0fdefacf7c	Add marks to scopelang `Call`	2022-10-04 14:50:37 +02:00
Louis Gesbert	51f79af13e	Generalise the types to allow scopelang ASTs to be typed	2022-10-04 14:50:37 +02:00
Louis Gesbert	af9f497ffb	Implement typing of desugared/scopelang and lcalc terms Note that this is incomplete in the case of desugared/scopelang because we only have typing for expressions yet, and the scope/program structure is different. The code allows passing an environment of types for scope/subscope variables in order to resolve `ELocation` terms, but that's unused until we implement scopelang typing at the scope level.	2022-10-04 14:50:37 +02:00
Louis Gesbert	a9c8bab2b3	Same treatment for `typ` and `marked_typ`	2022-08-29 10:57:21 +02:00
Louis Gesbert	01cc957b3b	Used `shared_ast` for scopelang expressions	2022-08-26 11:31:14 +02:00
Louis Gesbert	49e37c71b4	Add scopelang / desugared cases to the shared AST expressions	2022-08-26 11:31:14 +02:00
Louis Gesbert	06dbab74d2	reformat	2022-08-22 19:28:27 +02:00
Louis Gesbert	2b6ee8dd4b	Leverage the shared AST: big cleanup (part I)	2022-08-22 19:28:21 +02:00
Emile Rolley	ba620fca28	ocamlformat: new break-infix rule	2022-08-05 10:55:48 +02:00
Louis Gesbert	af0ac95682	Propagate renaming of `Pos.mark` into module `Marked` this patch is just a bunch of `sed` commands ```shell cd compiler sed -i 's/Pos.marked/Marked.pos/g' .ml* */.ml* sed -i 's/Pos.unmark/Marked.unmark/g' .ml */.ml* sed -i 's/Pos\.get_position/Marked.get_mark/g' .ml */.ml* sed -i 's/Pos\.same_pos_as/Marked.same_mark_as/g' .ml */.ml* sed -i 's/Pos\.map_under_mark/Marked.map_under_mark/g' .ml */.ml* sed -i 's/Pos\.mark/Marked.mark/g' .ml */.ml* sed -i 's/Pos\.compare_marked/Marked.compare/g' .ml */.ml* ```	2022-07-11 16:51:54 +02:00
Louis Gesbert	f17875f90e	Formatting: some other personal preferences	2022-05-11 16:25:49 +02:00
Louis Gesbert	74c5629153	Formatting: reduce extra match-case indentation 2 is plenty enough, esp. for nested matches :)	2022-05-11 16:25:20 +02:00
Denis Merigoux	5bd66142a6	Big reformatting ocamlformat 0.19.0 -> 0.20.1 100 -> 80 columns per line Reestablished @emilerolley's smart fun break	2022-03-08 15:03:14 +01:00
Louis Gesbert	12ec65601d	Use format strings directly in debug/error/log functions This avoids many intermediate calls to e.g. `Format.asprintf`; should result in some cases in "more correct" use of `Format`¹, avoid the computation of unused debug strings, and make the code more readable. ¹ for `Format` to work as expected, all intermediate calls need to go through it. Some cases of formatting to an intermediate string then printing through Format again are still present, but this makes the situation better.	2022-03-08 13:04:27 +01:00
Denis Merigoux	bbd50747d9	Big renaming and dir reorg	2021-06-21 11:39:06 +02:00

45 Commits