catala

mirror of https://github.com/CatalaLang/catala.git synced 2024-11-09 22:16:10 +03:00

Author	SHA1	Message	Date
Louis Gesbert	51df581aba	Small cleanup/fixes following PR review on Syntax Changes	2022-12-20 16:03:41 +01:00
Louis Gesbert	f236e2cfb2	Replace the type conversion and rounding operators with overloads Ref. #366 Also updates `CONTRIBUTING.md`. This was pretty straight-forward :)	2022-12-13 15:32:49 +01:00
Louis Gesbert	9b939d07a4	New syntax for collection operations	2022-12-13 12:30:40 +01:00
Louis Gesbert	4a66848eb3	Clarify some bits of the overload handling code and address other remarks from the review of #365	2022-12-13 12:00:05 +01:00
Louis Gesbert	fea01cfe4c	Add overloaded operators for the common operations This uses the same disambiguation mechanism put in place for structures, calling the typer on individual rules on the desugared AST to propagate types, in order to resolve ambiguous operators like `+` to their strongly typed counterparts (`+!`, `+.`, `+$`, `+@`, `+$`) in the translation to scopelang. The patch includes some normalisation of the definition of all the operators, and classifies them based on their typing policy instead of their arity. It also adds a little more flexibility: - a couple new operators, like `-` on date and duration - optional type annotation on some aggregation constructions The `Shared_ast` lib is also lightly restructured, with the `Expr` module split into `Type`, `Operator` and `Expr`.	2022-12-13 11:55:24 +01:00
Louis Gesbert	5bcc0a65eb	Improve some messages on structure disambiguation	2022-12-13 11:47:21 +01:00
Louis Gesbert	8960e5dbbc	Add typing-based disambiguation pass after desugaring Some typing errors are changed a little, because they get triggered during the typing of the disambiguation pass, which does not specify the expected return type (it's an expected invariant that it should not be needed for disambiguation). It would be possible to still specify these types during disambiguation just to get the same errors, but since the newer ones don't appear to be clearly worse at the moment, it has not been done.	2022-11-28 16:38:09 +01:00
Louis Gesbert	01957c6698	Desugaring: inline the Fold predicate the intermediate variable made it much harder to type.	2022-11-28 16:38:09 +01:00
Louis Gesbert	3f2aa19e97	Add ambiguous StructAccess for desugared to be resolved in scopelang	2022-11-28 16:38:09 +01:00
Louis Gesbert	660e5775de	Rename utils to catala_utils	2022-11-28 16:38:09 +01:00
Louis Gesbert	b329afbbdb	Rename all Map/Set calls accordingly This is just a bunch of `sed` calls: ```shell sed -i 's/ScopeSet/ScopeName.Set/g' compiler/*/.ml* sed -i 's/ScopeMap/ScopeName.Map/g' compiler/*/.ml* sed -i 's/StructMap/StructName.Map/g' compiler/*/.ml* sed -i 's/StructSet/StructName.Set/g' compiler/*/.ml* sed -i 's/EnumMap/EnumName.Map/g' compiler/*/.ml* sed -i 's/EnumSet/EnumName.Set/g' compiler/*/.ml* sed -i 's/StructFieldName/StructField/g' compiler/*/.ml* sed -i 's/StructFieldMap/StructField.Map/g' compiler/*/.ml* sed -i 's/StructFieldSet/StructField.Set/g' compiler/*/.ml* sed -i 's/EnumConstructorMap/EnumConstructor.Map/g' compiler/*/.ml* sed -i 's/EnumConstructorSet/EnumConstructor.Set/g' compiler/*/.ml* sed -i 's/RuleMap/RuleName.Map/g' compiler/*/.ml* sed -i 's/RuleSet/RuleName.Set/g' compiler/*/.ml* sed -i 's/LabelMap/LabelName.Map/g' compiler/*/.ml* sed -i 's/LabelSet/LabelName.Set/g' compiler/*/.ml* sed -i 's/ScopeVarMap/ScopeVar.Map/g' compiler/*/.ml* sed -i 's/ScopeVarSet/ScopeVar.Set/g' compiler/*/.ml* sed -i 's/SubScopeNameMap/SubScopeName.Map/g' compiler/*/.ml* sed -i 's/SubScopeNameSet/SubScopeName.Set/g' compiler/*/.ml* ``` ... and reformat	2022-11-28 16:38:09 +01:00
Louis Gesbert	8a4462f6cc	Name resolution: small reformatting	2022-11-28 16:38:09 +01:00
Louis Gesbert	47799ea24f	Uniform naming of conversion modules across compilation passes	2022-11-22 12:08:18 +01:00
Louis Gesbert	4ae392c900	AST refactoring Many changes got bundled in here and would be too tedious to separate. Closes #330 See changes in `shared_ast/definitions.ml` to check the main point. - the biggest change is a modification of the struct and enum types in expressions: they are now stored as `Map`s throughout passes, and no longer converted to indexed lists after scopelang. Their accessors are also changed, and tuples only exist in Lcalc (they're used for closure conversion). This implied adding some more information in the contexts, to keep the mapping between struct fields and scope output variables. It should also be much more robust (no longer relying on assumptions upon different orderings). - another very pervasive change is more cosmetic: the rewrite of the main AST to use inline records, labelling individual subfields. - moved the checks for correct definitions and accesses of structures from `Scope_to_dcalc` to `Typing` - defining some new shallow iterators in module `Shared_ast.Expr`, and factorising a few same-pass rewriting functions accordingly (closure conversion, optimisations, etc.) - some smaller style improvements (ensuring we use the proper compare/equal functions instead of `=` in a few `when` closes, for example)	2022-11-17 18:16:09 +01:00
Louis Gesbert	3e004551fc	Callable scopes: fixes following review	2022-11-03 15:18:51 +01:00
Louis Gesbert	96b623c724	Subscope calls: fix handling of input variables with multiple states	2022-10-27 11:42:27 +02:00
Louis Gesbert	41d6d3cbe9	Make scopes directly callable Quite a few changes are included here, some of which have some extra implications visible in the language: - adds the `Scope of { -- input_v: value; ... }` construct in the language - handle it down the pipeline: * `ScopeCall` in the surface AST * `EScopeCall` in desugared and scopelang * expressions are now traversed to detect dependencies between scopes * transformed into a normal function call in dcalc - defining a scope now implicitely defines a structure with the same name, with the output variables of the scope defined as fields. This allows us to type the return value from a scope call and access its fields easily. * the implications are mostly in surface/name_resolution.ml code-wise * the `Scope_out` struct that was defined in scope_to_dcalc is no longer needed/used and the fields are no longer renamed (changes some outputs; the explicit suffix for variables with multiple states is ignored as well) * one benefit is that disambiguation works just like for structures when there are conflicts on field names * however, it's now a conflict if a scope and a structure have the same name (side-note: issues with conflicting enum / struct names or scope variables / subscope names were silent and are now properly reported) - you can consequently use scope names as types for variables as well. Writing literals is not allowed though, they can only be obtained by calling the scope. Remaining TODOs: - context variables are not handled properly at the moment - error handling on invalid calls - tests show a small error message regression; lots of examples will need tweaking to avoid scope/struct name or struct fields / output variable conflicts - add a `->` syntax to make struct field access distinct from scope output var access, enforced with typing. This is expected to reduce confusion of users and add a little typing precision. 
- document the new syntax & implications (tutorial, cheat-sheet) - a consequence of the changes is that subscope variables also can now be typed. A possible future evolution / simplification would be to rewrite subscopes as explicit scope calls early in the pipeline. That could also allow to manipulate them as expressions (bind them in let-ins, return them...)	2022-10-21 17:17:26 +02:00
Louis Gesbert	7267543ca1	Rename Expr.Box.inj to Expr.Box.lift it is more consistent with the naming of functions in Bindlib.	2022-10-21 15:35:49 +02:00
Louis Gesbert	f9f834e30a	Add a helper to fold on expressions	2022-10-21 15:35:49 +02:00
Louis Gesbert	e925ec1795	Swap boxing and annotations in expressions This was the only reasonable solution I found to the issue raised [here](https://github.com/CatalaLang/catala/pull/334#discussion_r987175884). This was a pretty tedious rewrite, but it should now ensure we are doing things correctly. As a bonus, the "smart" expression constructors are now used everywhere to build expressions (so another refactoring like this one should be much easier) and this makes the code overall feel more straightforward (`Bindlib.box_apply` or `let+` no longer need to be visible!) --- Basically, we were using values of type `gexpr box = naked_gexpr marked box` throughout when (re-)building expressions. This was done 99% of the time by using `Bindlib.box_apply add_mark naked_e` right after building `naked_e`. In lots of places, we needed to recover the annotation of this expression later on, typically to build its parent term (to inherit the position, or build the type). Since it wasn't always possible to wrap these uses within `box_apply` (esp. as bindlib boxes aren't a monad), here and there we had to call `Bindlib.unbox`, just to recover the position or type. This had the very unpleasant effect of forcing the resolution of the whole box (including applying any stored closures) to reach the top-level annotation which isn't even dependant on specific variable bindings. Then, generally, throwing away the result. Therefore, the change proposed here transforms - `naked_gexpr marked Bindlib.box` into - `naked_gexpr Bindlib.box marked` (aliased to `boxed_gexpr` or `gexpr boxed` for convenience) This means only 1. not fitting the mark into the box right away when building, and 2. accessing the top-level mark directly without unboxing The functions for building terms from module `Shared_ast.Expr` could be changed easily. 
But then they needed to be consistently used throughout, without manually building terms through `Bindlib.apply_box` -- which covers most of the changes in this patch. `Expr.Box.inj` is provided to swap back to a box, before binding for example. Additionally, this gives a 40% speedup on `make -C examples pass_all_tests`, which hints at the amount of unnecessary work we were doing --'	2022-10-07 18:00:23 +02:00
Louis Gesbert	0fdefacf7c	Add marks to scopelang `Call`	2022-10-04 14:50:37 +02:00
Louis Gesbert	41315dc650	Scopelang: add toplevel mark for convenience it allows to discriminate typed and non-typed ASTs	2022-10-04 14:50:37 +02:00
Louis Gesbert	9f7a0f6078	Fix naming error in interface Hmm I used too much `sed` and overlooked this one (patch looks big because of reformatting, but it's mostly just `naked_expr`→`expr`)	2022-10-04 14:50:37 +02:00
Louis Gesbert	51f79af13e	Generalise the types to allow scopelang ASTs to be typed	2022-10-04 14:50:37 +02:00
Louis Gesbert	af9f497ffb	Implement typing of desugared/scopelang and lcalc terms Note that this is incomplete in the case of desugared/scopelang because we only have typing for expressions yet, and the scope/program structure is different. The code allows passing an environment of types for scope/subscope variables in order to resolve `ELocation` terms, but that's unused until we implement scopelang typing at the scope level.	2022-10-04 14:50:37 +02:00
Louis Gesbert	d93b699a4c	Forward types in the `Expr.make_*` constructors Also add some safeguards against bad propagation of types (e.g. checking the arrow type of functions upon application); partly disabled at the moment since they don't pass yet but that'll be further work.	2022-10-04 14:50:37 +02:00
Louis Gesbert	76569bb1af	Fix position on type error concerning sub-scope variable	2022-09-26 14:11:25 +02:00
Louis Gesbert	84c78a234f	Make desugared and scopelang use the `'a mark` type for AST annotations This gives further uniformity in their interfaces and allows more common handling. The next step will be for all the `Expr.make_*` functions to work on expressions annotated with the `'a mark` type, correctly propagating type information when it is present. Then we could even imagine early propagation of type information (without complete inference), which could for example be used for overloaded operator disambiguation.	2022-08-29 11:29:24 +02:00
Louis Gesbert	7e0d24efd2	Make all supertypes use ('a, 't) gexpr as parameter instead of naked_gexpr	2022-08-29 11:29:24 +02:00
Louis Gesbert	5e9c3d630e	Same treatment for `typ` and `marked_typ`	2022-08-29 11:29:24 +02:00
Louis Gesbert	be58610061	Rename marked_expr -> expr, expr -> naked_expr throughout Since the marked kind is used throughout, this should be more clear	2022-08-29 11:29:23 +02:00
Louis Gesbert	8f7ba5ccaf	Rename marked_gexpr -> gexpr, gexpr -> naked_gexpr Since the marked kind is used throughout, this should be more clear	2022-08-29 11:29:23 +02:00
Louis Gesbert	ef36b18dfe	And finally the desugared AST as well	2022-08-26 11:31:14 +02:00
Louis Gesbert	01cc957b3b	Used `shared_ast` for scopelang expressions	2022-08-26 11:31:14 +02:00
Louis Gesbert	49e37c71b4	Add scopelang / desugared cases to the shared AST expressions	2022-08-26 11:31:14 +02:00
Louis Gesbert	4caf828e48	Additional cleanup/fixes on the compiler refactoring following review ^^	2022-08-23 00:13:02 +02:00
Louis Gesbert	06dbab74d2	reformat	2022-08-22 19:28:27 +02:00
Louis Gesbert	2b6ee8dd4b	Leverage the shared AST: big cleanup (part I)	2022-08-22 19:28:21 +02:00
Denis Merigoux	2ae0efa5dd	Fix subtle bug with the default false value for conditions	2022-08-15 16:47:36 +02:00
Emile Rolley	ba620fca28	ocamlformat: new break-infix rule	2022-08-05 10:55:48 +02:00
Denis Merigoux	85144c35fb	Simplify default tree encoding	2022-07-22 12:31:02 +02:00
Denis Merigoux	3895743f20	Cleaned code for generating exception graph, now correct and fully general Missing some encoding optimizations	2022-07-22 12:25:41 +02:00
Denis Merigoux	85cb2d5c8d	Starting debug	2022-07-22 12:11:34 +02:00
Louis Gesbert	7485c7f2ce	Reformat	2022-07-11 17:42:34 +02:00
Louis Gesbert	504f54c31e	Rework type error reporting	2022-07-11 17:42:34 +02:00
Louis Gesbert	513647cd32	AST simplification: remove redundant annotations The AST structures track annotations (e.g., at the moment, source code position information) in a lot of places. This patch tidies up a bit and removes some duplication, ensuring a single level of annotation wrapping at each AST recursion level. This will be important when adding type information in these annotations, because there will be consistency constraints to be ensured and duplication is a likely source of mistakes.	2022-07-11 16:51:54 +02:00
Louis Gesbert	af0ac95682	Propagate renaming of `Pos.mark` into module `Marked` this patch is just a bunch of `sed` commands ```shell cd compiler sed -i 's/Pos.marked/Marked.pos/g' .ml* */.ml* sed -i 's/Pos.unmark/Marked.unmark/g' .ml */.ml* sed -i 's/Pos\.get_position/Marked.get_mark/g' .ml */.ml* sed -i 's/Pos\.same_pos_as/Marked.same_mark_as/g' .ml */.ml* sed -i 's/Pos\.map_under_mark/Marked.map_under_mark/g' .ml */.ml* sed -i 's/Pos\.mark/Marked.mark/g' .ml */.ml* sed -i 's/Pos\.compare_marked/Marked.compare/g' .ml */.ml* ```	2022-07-11 16:51:54 +02:00
Denis Merigoux	34fd8a0bb2	Print a position when logging assignment of a default false for a condition	2022-06-03 17:50:03 +02:00
Louis Gesbert	cd70e16ea3	Compiler: simplify EDefault term at construction This allows to match on their structure further on.	2022-05-25 18:36:07 +02:00
Louis Gesbert	2d41f53300	Compiler: add a few helper comparison functions Adds syntactic comparison for some expressions, etc., allowing in particular to detect syntactically equal expressions. Positions are, obviously, ignored.	2022-05-25 18:35:57 +02:00
Louis Gesbert	f17875f90e	Formatting: some other personal preferences	2022-05-11 16:25:49 +02:00
Louis Gesbert	74c5629153	Formatting: reduce extra match-case indentation 2 is plenty enough, esp. for nested matches :)	2022-05-11 16:25:20 +02:00
Louis Gesbert	fa3693d813	Formatting: disable deprecated "smart" fun-decl option	2022-05-11 16:22:56 +02:00
Louis Gesbert	ac85d8dbb4	Upgrade ocamlformat version	2022-05-11 16:21:58 +02:00
Denis Merigoux	c59451751b	Fix logging operator location for default justifications	2022-03-16 11:44:34 +01:00
Denis Merigoux	5bd66142a6	Big reformatting ocamlformat 0.19.0 -> 0.20.1 100 -> 80 columns per line Reestablished @emilerolley's smart fun break	2022-03-08 15:03:14 +01:00
Louis Gesbert	12ec65601d	Use format strings directly in debug/error/log functions This avoids many intermediate calls to e.g. `Format.asprintf`; should result in some cases in "more correct" use of `Format`¹, avoid the computation of unused debug strings, and make the code more readable. ¹ for `Format` to work as expected, all intermediate calls need to go through it. Some cases of formatting to an intermediate string then printing through Format again are still present, but this makes the situation better.	2022-03-08 13:04:27 +01:00
Denis Merigoux	5ef7e45e11	Desugared to scope complete but untested [skip ci]	2022-03-06 14:15:09 +01:00
Denis Merigoux	cf8c6233d9	Finished translation of expressions [skip ci]	2022-03-01 20:41:01 +01:00
Denis Merigoux	5a0719b25d	Starting to translate expressions [skip ci]	2022-03-01 10:15:44 +01:00
Denis Merigoux	171e8932bc	Desugaring implemented [skip ci]	2022-02-28 18:34:32 +01:00
Denis Merigoux	518ff02696	Added desugared AST, necessary to distinguish Desugared.ScopeVar from Scopelang.ScopeVar [skip ci]	2022-02-28 17:19:39 +01:00
Denis Merigoux	1e11f6c1d5	Name resolution	2022-02-28 15:40:19 +01:00
Denis Merigoux	a140614069	Bring IO down in Scopelang AST	2022-02-10 10:09:58 +01:00
Denis Merigoux	e4be116ded	Add negative tests	2022-02-09 17:22:04 +01:00
Denis Merigoux	a8733e6a61	Fix tests [skip ci]	2022-02-09 15:56:57 +01:00
Denis Merigoux	cb04ef3f9d	OnlyInput variables no more thunked	2022-02-09 15:34:13 +01:00
Denis Merigoux	30061b8c86	Better encoding of inputs [skip ci]	2022-02-09 11:37:52 +01:00
Denis Merigoux	1c4a0bdd5d	Updated all examples with input/output decorations [skip ci]	2022-02-07 18:18:23 +01:00
Denis Merigoux	e3b5d2d0b6	Finer control over input IO (distinction only_input/reentrant) [skip ci]	2022-02-07 10:30:36 +01:00
Denis Merigoux	f4200bb638	Dcalc encoding good, missing error messages for bad cases	2022-02-06 18:52:18 +01:00
Denis Merigoux	d8c120bf97	Propagate visibility down, missing handling in scope_to_dcalc	2022-02-04 14:41:22 +01:00
Denis Merigoux	73ce2f142f	Fixed last bug around rule refactoring	2022-01-31 13:55:25 +01:00
Denis Merigoux	13b476d0a1	[skip ci] reorganized desugared to scope encoding, broke some invariants WIP: fixed some bugs and provided documentations but one thing missing	2022-01-28 17:31:31 +01:00
Denis Merigoux	d2977b48ce	Documentation pass	2022-01-19 10:54:16 +01:00
Denis Merigoux	82c09ee455	Fixed a bug [skip ci]	2022-01-05 15:37:34 +01:00
Denis Merigoux	3752328671	Code working but needs debugging [skip ci]	2022-01-05 10:42:46 +01:00
Denis Merigoux	9733f39653	Refactoring done except Desugared_to_scope.def_map_to_tree [skip ci]	2022-01-05 09:14:43 +01:00
Denis Merigoux	f6825668dd	Propagate labels in desugaring, not building desugared/ yet [skip-ci]	2022-01-04 18:19:15 +01:00
Emile Rolley	397b0e1d7c	fix(build/doc): remove warnings due to .ml* files	2022-01-02 14:53:51 +01:00
Denis Merigoux	d1b75b047b	Give a name to every rule and def, label or not	2021-11-28 13:09:44 +01:00
Denis Merigoux	bbd50747d9	Big renaming and dir reorg	2021-06-21 11:39:06 +02:00

232 Commits