catala

mirror of https://github.com/CatalaLang/catala.git synced 2024-11-08 07:51:43 +03:00

Author	SHA1	Message	Date
Louis Gesbert	f04e889173	Pass the "external module" info along passes	2024-05-28 11:43:50 +02:00
Louis Gesbert	709b51beb6	Fix hashing of submodule references	2024-05-28 11:43:50 +02:00
Louis Gesbert	403156b36e	Computation and checking of module hashes This includes a few separate changes: - pass visibility information of declarations (depending on whether the declaration was in a ```catala-metadata block or not) - add reasonable hash computation functions to discriminate the interfaces. In particular: * Uids have a `hash` function that depends on their string, but not on their actual uid (which is not stable between runs of the compiler) ; the existing `hash` function and its uses have been renamed to `id`. * The `Hash` module provides the tools to properly combine hashes, etc. While we rely on `Hashtbl.hash` for the atoms, we take care not to use it on any recursive structure (it relies on a bounded traversal). - insert the hashes in the artefacts, and properly check and report those (for OCaml) Remains to do: - Record and check the hashes in the other backends - Provide a way to get stable inline-test outputs in the presence of module hashes - Provide a way to write external modules that don't break at every Catala update.	2024-05-28 11:43:50 +02:00
Louis Gesbert	e19332522e	Output subscope: improvements following review Thanks @denismerigoux This renames the "ScopeDef" variant from `SubScope` to `SubScopeInput`, which is much clearer and avoids confusion with the `SubScope` elements in the surface AST (which are really subscopes and not variables at this point). And improves some error message by specialising depending on whether we are dealing with a subscope or an explicit structure.	2024-04-10 18:38:50 +02:00
Louis Gesbert	75bf768264	Reformat	2024-04-04 10:56:56 +02:00
Louis Gesbert	cde9a66295	Output scopes and subscope variable rework done	2024-04-04 10:24:18 +02:00
Louis Gesbert	4eeb8221f4	Fix var bindings in desugared->scopelang	2024-04-04 10:24:18 +02:00
Louis Gesbert	7951661981	Turn subscope-vars into scope vars They are to become citizens of the same class if we want to allow output-subscopes (without unnecessary complications like deconstructing and reconstructing the same structure). And it's reasonable to assume that they share the same namespace. With this we should shortly collapse the (internal) ambiguity between - `subscope.subvar`: access to a variable within a subscope - `subscope.subfield`: access to a field of the output structure contained in a subscope variable With the subscope a variable, these should now become strictly equivalent, so the plan is that the first could be removed.	2024-04-04 10:24:18 +02:00
Louis Gesbert	4cec981f62	Move global options of Cli to their own module This resolves a dependency cycle that would forbid `Cli` from using the module `File`, which was annoying.	2024-03-19 15:18:35 +01:00
Louis Gesbert	1ae955b504	Reformat	2023-11-30 23:53:38 +01:00
Louis Gesbert	3649f92975	Rework resolution of module elements This changes the `decl_ctx` to be toplevel only, with flattened references to uids for most elements. The module hierarchy, which is still useful in a few places, is kept separately. Module names are also changed to UIDs early on, and support for module aliases has been added (needs testing). This resolves some issues with lookup, and should be much more robust, as well as more convenient for most lookups. The `decl_ctx` was also extended for string ident lookups, which avoids having to keep the desugared resolution structure available throughout the compilation chain.	2023-11-30 21:14:12 +01:00
Louis Gesbert	f162f6e9bd	Improve handling of module name definitions and add some sanity-checks for consistency of used modules w.r.t. actually loaded modules.	2023-09-27 13:14:03 +02:00
Denis Merigoux	9cecf5587a	Register surface syntax language in program throughout the compilation chain	2023-09-22 18:05:26 +02:00
Louis Gesbert	7db63e5f78	Simplification: store paths in Uids rather than scattered in structures The context is still hierarchical for defs though, so one needs to retrieve the path to lookup in the correct context for info. Exceptions are enums and struct defs, which are re-exposed at toplevel.	2023-08-31 18:31:48 +02:00
Louis Gesbert	9bac045d03	Implement module lookups for scopes, structs, and enums	2023-08-31 17:54:39 +02:00
Louis Gesbert	f2fc79f640	Add some helper functions in a wrapper `Map` module and use them throughout. No more `List.map fst (Map.bindings m)` ! Also adds some facilities for direct formatting without going through a list.	2023-07-12 11:51:15 +02:00
Louis Gesbert	26c75af2ae	Allow declaration of toplevel values without definition	2023-06-15 17:57:01 +02:00
Denis Merigoux	8987d358e7	Implement the rest	2023-05-26 16:54:52 +02:00
Denis Merigoux	1f23f701bc	Move types	2023-05-26 16:03:26 +02:00
Louis Gesbert	209be6b758	Improve integration of marks into the main AST Two interdependent changes here: 1. Enforce all instances of Shared_ast.gexpr to use the generic type for marks. This makes the interfaces a tad simpler to manipulate: you now write `('a, 'm) gexpr` rather than `('a, 'm mark) gexpr`. 2. Define a polymorphic `Custom` mark case for use by pass-specific annotations. And leverage this in the typing module	2023-05-17 17:37:00 +02:00
Louis Gesbert	fc531777c0	Rework and normalise the `Marked` interface The module is renamed to `Mark`, and functions renamed to avoid redundancy: `Marked.mark` is now `Mark.add` `Marked.unmark` is now `Mark.remove` `Marked.map_under_mark` is now simply `Mark.map` etc. `Marked.same_mark_as` is replaced by `Mark.copy`, but with the arguments swapped (which seemed more convenient throughout) Since a type `Mark.t` would indicate a mark, and to avoid confusion, the type `Marked.t` is renamed to `Mark.ed` as a shorthand for `Mark.marked` ; this part can easily be removed if that's too much quirkiness.	2023-05-17 17:37:00 +02:00
Denis Merigoux	8d7666ee92	Interleave assertions with rules from desugared to scopelang	2023-04-28 14:15:43 +02:00
Denis Merigoux	57da622567	Refactoring changes after @altgr's suggestions	2023-04-18 10:31:44 +02:00
Denis Merigoux	3c364aa1fa	Progress on linting, buggy unused field detection	2023-03-30 18:52:29 +02:00
Raphaël Monat	d5cd5b206a	Show conflicting date rounding mode declarations when they happen	2023-03-16 18:51:01 +01:00
Raphaël Monat	7021c41f93	Add date rounding option within scopes	2023-03-16 16:55:55 +01:00
Louis Gesbert	fc5910e093	Fix multi-argument scopedefs, improve error message positions This adds a few positions to the parser, and tweaks some others, vastly improving the reporting of some errors (inconsistent functions definitions, but also exceptions cycles, etc.)	2023-02-28 15:53:50 +01:00
Louis Gesbert	2c97d5de14	Enforce consistency of function arguments naming	2023-02-28 15:53:50 +01:00
adelaett	e519b7f146	until desugared	2023-02-20 17:21:44 +01:00
Louis Gesbert	c3af0b4097	Toplevel definitions: branch cleanup - fix remaining warnings (mostly unused arguments) - renamings throughout for consistency and clarity	2023-02-13 18:02:09 +01:00
Louis Gesbert	9b0c7583ec	Add top-level definitions Only handled until before scalc at the moment.	2023-02-13 11:43:49 +01:00
Louis Gesbert	660e5775de	Rename utils to catala_utils	2022-11-28 16:38:09 +01:00
Louis Gesbert	b329afbbdb	Rename all Map/Set calls accordingly This is just a bunch of `sed` calls: ```shell sed -i 's/ScopeSet/ScopeName.Set/g' compiler/*/.ml* sed -i 's/ScopeMap/ScopeName.Map/g' compiler/*/.ml* sed -i 's/StructMap/StructName.Map/g' compiler/*/.ml* sed -i 's/StructSet/StructName.Set/g' compiler/*/.ml* sed -i 's/EnumMap/EnumName.Map/g' compiler/*/.ml* sed -i 's/EnumSet/EnumName.Set/g' compiler/*/.ml* sed -i 's/StructFieldName/StructField/g' compiler/*/.ml* sed -i 's/StructFieldMap/StructField.Map/g' compiler/*/.ml* sed -i 's/StructFieldSet/StructField.Set/g' compiler/*/.ml* sed -i 's/EnumConstructorMap/EnumConstructor.Map/g' compiler/*/.ml* sed -i 's/EnumConstructorSet/EnumConstructor.Set/g' compiler/*/.ml* sed -i 's/RuleMap/RuleName.Map/g' compiler/*/.ml* sed -i 's/RuleSet/RuleName.Set/g' compiler/*/.ml* sed -i 's/LabelMap/LabelName.Map/g' compiler/*/.ml* sed -i 's/LabelSet/LabelName.Set/g' compiler/*/.ml* sed -i 's/ScopeVarMap/ScopeVar.Map/g' compiler/*/.ml* sed -i 's/ScopeVarSet/ScopeVar.Set/g' compiler/*/.ml* sed -i 's/SubScopeNameMap/SubScopeName.Map/g' compiler/*/.ml* sed -i 's/SubScopeNameSet/SubScopeName.Set/g' compiler/*/.ml* ``` ... and reformat	2022-11-28 16:38:09 +01:00
Louis Gesbert	47799ea24f	Uniform naming of conversion modules across compilation passes	2022-11-22 12:08:18 +01:00
Louis Gesbert	e925ec1795	Swap boxing and annotations in expressions This was the only reasonable solution I found to the issue raised [here](https://github.com/CatalaLang/catala/pull/334#discussion_r987175884). This was a pretty tedious rewrite, but it should now ensure we are doing things correctly. As a bonus, the "smart" expression constructors are now used everywhere to build expressions (so another refactoring like this one should be much easier) and this makes the code overall feel more straightforward (`Bindlib.box_apply` or `let+` no longer need to be visible!) --- Basically, we were using values of type `gexpr box = naked_gexpr marked box` throughout when (re-)building expressions. This was done 99% of the time by using `Bindlib.box_apply add_mark naked_e` right after building `naked_e`. In lots of places, we needed to recover the annotation of this expression later on, typically to build its parent term (to inherit the position, or build the type). Since it wasn't always possible to wrap these uses within `box_apply` (esp. as bindlib boxes aren't a monad), here and there we had to call `Bindlib.unbox`, just to recover the position or type. This had the very unpleasant effect of forcing the resolution of the whole box (including applying any stored closures) to reach the top-level annotation which isn't even dependant on specific variable bindings. Then, generally, throwing away the result. Therefore, the change proposed here transforms - `naked_gexpr marked Bindlib.box` into - `naked_gexpr Bindlib.box marked` (aliased to `boxed_gexpr` or `gexpr boxed` for convenience) This means only 1. not fitting the mark into the box right away when building, and 2. accessing the top-level mark directly without unboxing The functions for building terms from module `Shared_ast.Expr` could be changed easily. 
But then they needed to be consistently used throughout, without manually building terms through `Bindlib.apply_box` -- which covers most of the changes in this patch. `Expr.Box.inj` is provided to swap back to a box, before binding for example. Additionally, this gives a 40% speedup on `make -C examples pass_all_tests`, which hints at the amount of unnecessary work we were doing --'	2022-10-07 18:00:23 +02:00
Louis Gesbert	af9f497ffb	Implement typing of desugared/scopelang and lcalc terms Note that this is incomplete in the case of desugared/scopelang because we only have typing for expressions yet, and the scope/program structure is different. The code allows passing an environment of types for scope/subscope variables in order to resolve `ELocation` terms, but that's unused until we implement scopelang typing at the scope level.	2022-10-04 14:50:37 +02:00
Louis Gesbert	76569bb1af	Fix position on type error concerning sub-scope variable	2022-09-26 14:11:25 +02:00
Louis Gesbert	84c78a234f	Make desugared and scopelang use the `'a mark` type for AST annotations This gives further uniformity in their interfaces and allows more common handling. The next step will be for all the `Expr.make_*` functions to work on expressions annotated with the `'a mark` type, correctly propagating type information when it is present. Then we could even imagine early propagation of type information (without complete inference), which could for example be used for overloaded operator disambiguation.	2022-08-29 11:29:24 +02:00
Louis Gesbert	7e0d24efd2	Make all supertypes use ('a, 't) gexpr as parameter instead of naked_gexpr	2022-08-29 11:29:24 +02:00
Louis Gesbert	5e9c3d630e	Same treatment for `typ` and `marked_typ`	2022-08-29 11:29:24 +02:00
Louis Gesbert	be58610061	Rename marked_expr -> expr, expr -> naked_expr throughout Since the marked kind is used throughout, this should be more clear	2022-08-29 11:29:23 +02:00
Louis Gesbert	8f7ba5ccaf	Rename marked_gexpr -> gexpr, gexpr -> naked_gexpr Since the marked kind is used throughout, this should be more clear	2022-08-29 11:29:23 +02:00
Louis Gesbert	ef36b18dfe	And finally the desugared AST as well	2022-08-26 11:31:14 +02:00
Louis Gesbert	01cc957b3b	Used `shared_ast` for scopelang expressions	2022-08-26 11:31:14 +02:00
Louis Gesbert	49e37c71b4	Add scopelang / desugared cases to the shared AST expressions	2022-08-26 11:31:14 +02:00
Louis Gesbert	06dbab74d2	reformat	2022-08-22 19:28:27 +02:00
Louis Gesbert	2b6ee8dd4b	Leverage the shared AST: big cleanup (part I)	2022-08-22 19:28:21 +02:00
Denis Merigoux	3895743f20	Cleaned code for generating exception graph, now correct and fully general Missing some encoding optimizations	2022-07-22 12:25:41 +02:00
Louis Gesbert	513647cd32	AST simplification: remove redundant annotations The AST structures track annotations (e.g., at the moment, source code position information) in a lot of places. This patch tidies up a bit and removes some duplication, ensuring a single level of annotation wrapping at each AST recursion level. This will be important when adding type information in these annotations, because there will be consistency constraints to be ensured and duplication is a likely source of mistakes.	2022-07-11 16:51:54 +02:00
Louis Gesbert	af0ac95682	Propagate renaming of `Pos.mark` into module `Marked` this patch is just a bunch of `sed` commands ```shell cd compiler sed -i 's/Pos.marked/Marked.pos/g' .ml* */.ml* sed -i 's/Pos.unmark/Marked.unmark/g' .ml */.ml* sed -i 's/Pos\.get_position/Marked.get_mark/g' .ml */.ml* sed -i 's/Pos\.same_pos_as/Marked.same_mark_as/g' .ml */.ml* sed -i 's/Pos\.map_under_mark/Marked.map_under_mark/g' .ml */.ml* sed -i 's/Pos\.mark/Marked.mark/g' .ml */.ml* sed -i 's/Pos\.compare_marked/Marked.compare/g' .ml */.ml* ```	2022-07-11 16:51:54 +02:00

60 Commits