catala

mirror of https://github.com/CatalaLang/catala.git synced 2024-09-19 16:28:12 +03:00

Author	SHA1	Message	Date
Louis Gesbert	b9156bb60e	Implement safe renaming of idents for backend printing Previously we had some heuristics in the backends trying to achieve this with a lot of holes ; this should be much more solid, relying on `Bindlib` to do the correct renamings. Note1: it's not plugged into the backends other than OCaml at the moment. Note2: the related, obsolete heuristics haven't been cleaned out yet Note3: we conservatively suppose a single namespace at the moment. This is required for e.g. Python, but it forces vars named like struct fields to be renamed, which is more verbose in e.g. OCaml. The renaming engine could be improved to support different namespaces, with a way to select how to route the different kinds of identifiers into them. Similarly, customisation for what needs to be uppercase or lowercase is not available yet. Note4: besides excluding keywords, we should also be careful to exclude (or namespace): - the idents used in the runtime (e.g. `o_add_int_int`) - the dynamically generated idents (e.g. `embed_`) Note5: module names themselves aren't handled yet. The reason is that they must be discoverable by the user, and even need to match the filenames, etc. In other words, imagine that `Mod` is a keyword in the target language. You can't rename a module called `Mod` to `Mod1` without knowing the whole module context, because that would destroy the mapping for a module already called `Mod1`. A reliable solution would be to translate all module names to e.g. `CatalaModule_`, which we can assume will never conflict with any built-in, and forbid idents starting with that prefix. We may also want to restrict their names to ASCII ? Currently we use a projection, but what if I have two modules called `Là` and `La` ?	2024-08-28 17:18:26 +02:00
vbot	7f4824bed6	adapt existing errors to changes	2024-07-31 14:01:55 +02:00
Louis Gesbert	a5278244ec	Debugging monomorphisation	2024-07-04 15:08:13 +02:00
Louis Gesbert	88f5e932c8	Remove RaiseEmpty and CatchEmpty from the AST	2024-07-04 15:08:13 +02:00
Louis Gesbert	293bcd3817	Replace HandleDefault* internal operators by HandleExceptions HandleExceptions only takes an array of exceptions, and returns Some if only one of them is Some, None if they are all None, or raises a conflict error otherwise. The compilation of default terms then wraps this in a match (for the result of HandleExceptions), and an if-then-else (for the justification-consequence in the None case). This avoids the complexity of having to handle thunked functions as arguments.	2024-07-04 15:08:13 +02:00
Louis Gesbert	c3b978bef8	Remove the now unused HandleDefault operator (we now only need HandleDefaultOpt)	2024-07-04 15:08:13 +02:00
Louis Gesbert	d073103578	Typecheck operator args bottom-up after monomorphisation (?)	2024-07-04 15:08:13 +02:00
Louis Gesbert	dc1b725e9b	Reformat	2024-06-21 12:23:01 +02:00
Louis Gesbert	0e05c75028	Internal typing: print the faulty program when debug is enabled Since the typing error happened on generated code, it's hard to debug without printing it.	2024-06-21 12:23:01 +02:00
Louis Gesbert	70c01749b2	Fix regression on internal typing errors reported as user errors	2024-06-21 12:23:01 +02:00
vbot	441dd54ec3	Refactor suggestions	2024-06-20 15:59:54 +02:00
vbot	e7beb0daad	Fix typing error message consistency	2024-06-19 14:52:31 +02:00
vbot	b2449f7b4c	Add multiple typing errors using delayed errors	2024-06-19 14:52:31 +02:00
Louis Gesbert	403156b36e	Computation and checking of module hashes This includes a few separate changes: - pass visibility information of declarations (depending on wether the declaration was in a ```catala-metadata block or not) - add reasonable hash computation functions to discriminate the interfaces. In particular: * Uids have a `hash` function that depends on their string, but not on their actual uid (which is not stable between runs of the compiler) ; the existing `hash` function and its uses have been renamed to `id`. * The `Hash` module provides the tools to properly combine hashes, etc. While we rely on `Hashtbl.hash` for the atoms, we take care not to use it on any recursive structure (it relies on a bounded traversal). - insert the hashes in the artefacts, and properly check and report those (for OCaml) Remains to do: - Record and check the hashes in the other backends - Provide a way to get stable inline-test outputs in the presence of module hashes - Provide a way to write external modules that don't break at every Catala update.	2024-05-28 11:43:50 +02:00
Louis Gesbert	c864f658c8	Cleaner message printing Aerates the printing of messages, removes extraneous noise, and more clearly attaches the various information tied to a given message.	2024-05-03 12:21:34 +02:00
Louis Gesbert	cee8e57d02	More precise positions for operators throughout	2024-04-30 16:35:08 +02:00
Louis Gesbert	9d07015864	Unify runtime error handling - Clearly distinguish Exceptions from Errors. The only catchable exception available in our AST is `EmptyError`, so the corresponding nodes are made less generic, and a node `FatalError` is added - Runtime errors are defined as a specific type in the OCaml runtime, with a carrier exception and printing functions. These are used throughout, and consistently by the interpreter. They always carry a position, that can be converted to be printed with the fancy compiler location printer, or in a simpler way from the backends. - All operators that might be subject to an error take a position as argument, in order to print an informative message without relying on backtraces from the backend	2024-04-26 18:31:26 +02:00
Louis Gesbert	eded54d2b5	Support for direct tuple member access As discussed in #549 NOTE: This implements only the direct tuple member access (syntax `foo.N` with N a number) - It seems more efficient to wait for the general pattern-matching rewrite to handle pattern-matching on tuples - Until then we keep the (now obsolete) `let (x, y) = pair in x` syntax, to leave time for updates, but we won't be documenting it	2024-04-13 09:37:03 +02:00
Louis Gesbert	a61ae7979f	Support for structure updates Closes #592 A new node is added in `desugared`, and translated into an exploded structure literal during translation to `scopelang`. The main reason to put it there is that it needs to be after disambiguation, since that is used to discover the type of the structure that is being updated.	2024-04-12 17:17:48 +02:00
Louis Gesbert	b78cd9c29a	Improve message formatting throughout Ensuring messages don't print overlong lines still requires some manual work: - if they don't contain any `Format` directives (`%` or `@`), use `"%a" Format.pp_print_text` to turn word-wrapping on. - otherwise replace spaces with `@ ` to mark possible cutting points, as soon that it's possible the line will get over 80 chars (most often, this means starting before the first `%a`)	2024-04-12 17:07:36 +02:00
Louis Gesbert	def10e7f98	Message: further simplification	2024-04-10 19:26:23 +02:00
Louis Gesbert	98fc97a241	Rewriting message calls to use the new intf	2024-04-10 19:26:23 +02:00
Louis Gesbert	75bf768264	Reformat	2024-04-04 10:56:56 +02:00
Louis Gesbert	0ec04c4925	Branch cleanup and test outputs update Lots of tests have a new warning because they were calling subscopes without using their outputs. A better solution could be to mark these subscopes as `output`, now that it's possible !	2024-04-04 10:55:21 +02:00
Louis Gesbert	cde9a66295	Output scopes and subscope variable rework done	2024-04-04 10:24:18 +02:00
Louis Gesbert	852bebd357	Completely remove subscope input vars (use local vars)	2024-04-04 10:24:18 +02:00
Louis Gesbert	f151a9a794	Wip: handling subscope calls as normal definitions - for the moment we still have specific vars for subscope inputs (in scopelang) - this causes trouble with default-embedding of scope calls	2024-04-04 10:24:18 +02:00
Louis Gesbert	4eeb8221f4	Fix var bindings in desugared->scopelang	2024-04-04 10:24:18 +02:00
Louis Gesbert	619cafebb8	Reformat	2024-03-20 14:41:06 +01:00
Louis Gesbert	4cec981f62	Move global options of Cli to their own module This resolves a dependency cycle that would forbid `Cli` from using the modue `File`, which was annoying.	2024-03-19 15:18:35 +01:00
Louis Gesbert	e308ff8d02	Generalise the definition of lists of nested binders	2024-02-09 18:33:41 +01:00
Louis Gesbert	a56d95d790	Typing: add a "assume operator types" mode This allows for retyping after monomorphisation: a new function just extracts the return type of the operator, without checking the operand types. Also to avoid multiplying function arguments around the typer, the flags have been gathered in a record that is included in the typing environment; it's ok to give them default values as long as these are the strictest.	2024-02-07 17:41:04 +01:00
Louis Gesbert	ea512bfd5b	The typer doesn't handle monomorphised code yet so don't retype after monomorphisation, which is now possible as the pass itself correctly preserves types. In time the typer will need a special case to get knowledge of the new types and modified operator types.	2024-02-06 18:00:45 +01:00
Denis Merigoux	a39b67bf12	Merge branch 'master' into c_backend	2024-01-26 17:43:07 +01:00
Louis Gesbert	371f9554b8	Support for list recombinations The primary use-case for this was to be able to run computations on a list of structures, then return an updated list with some fields in the structures modified : that is what we need for distribution of tax amounts among household members, for example. This patch has a few components: - Addition of a test as an example for tax distributions - Added a transformation, performed during desugaring, that -- where lists are syntactically expected, i.e. after the `among` keyword -- turns a (syntactic) tuple of lists into a list of tuples ("zipping" the lists) - Arg-extremum transformation was also fixed to use an intermediate list instead of computing the predicate twice - For convenience, allow to bind multiple variables in most* list operations (previously only `let in` and functions allowed it) - Fixed the printer for tuples to differentiate them from lists *Note: tuples are not yet allowed on the left-hand side of filters and arg-extremums for annoying syntax conflict reasons.	2024-01-26 11:29:33 +01:00
Louis Gesbert	bc90a7b890	Add (internally) a map2 operator	2024-01-26 11:22:12 +01:00
Denis Merigoux	8a139f6a3c	Work on arrays	2024-01-22 16:49:58 +01:00
Denis Merigoux	5310e47e5b	Fix monomorphization problems with [TAny] left	2024-01-17 16:03:20 +01:00
Denis Merigoux	0a8fdde7de	Fix monomorphization	2024-01-17 14:02:32 +01:00
Denis Merigoux	9f03b6b931	Merge branch 'master' into c_backend Some tests still failing...	2024-01-15 17:19:17 +01:00
Denis Merigoux	919dd2b812	Monomorphization properly done	2023-12-20 17:45:39 +01:00
Denis Merigoux	a9764d607e	Merge branch 'master' into c_backend	2023-12-19 17:39:02 +01:00
Louis Gesbert	5384394a72	Fix typing upon detuplification	2023-12-19 17:30:32 +01:00
Louis Gesbert	2823795f9f	AST change: more specific application As part of making tuples first-class citizens, expliciting the arity upon function application was needed (so that a function of two args can transparently -- in the surface language -- be applied to either two arguments or a pair). It was decided to actually explicit the whole type of arguments because the cost is the same, and this is consistent with lambda definitions. A related change done here is the replacement of the `EOp` node for operators by an "operator application" `EAppOp` node, enforcing a pervasive invariant that operators are always directly applied. This makes matches terser, and highlights the fact that the treatment of operator application is almost always different from function application in practice.	2023-12-19 17:27:40 +01:00
Louis Gesbert	e123d7eb95	Change type syntax of `collection` into `list of`	2023-12-19 15:26:44 +01:00
Denis Merigoux	4eead4850b	There are no more defaults in type after compile_with_exceptions...	2023-12-18 15:14:28 +01:00
Louis Gesbert	1ae955b504	Reformat	2023-11-30 23:53:38 +01:00
Louis Gesbert	3649f92975	Rework resolution of module elements This changes the `decl_ctx` to be toplevel only, with flattened references to uids for most elements. The module hierarchy, which is still useful in a few places, is kept separately. Module names are also changed to UIDs early on, and support for module aliases has been added (needs testing). This resolves some issues with lookup, and should be much more robust, as well as more convenient for most lookups. The `decl_ctx` was also extended for string ident lookups, which avoids having to keep the desugared resolution structure available throughout the compilation chain.	2023-11-30 21:14:12 +01:00
adelaett	4a5335162e	Typed default: implementing the type for handle defaults, as well as the compile without exceptions compilation pass, using the newly found invariant	2023-11-27 11:09:08 +01:00
Louis Gesbert	cc4e5339dd	Typed defaults: small simplification and fixes	2023-11-27 11:09:08 +01:00

1 2 3

149 Commits