juvix

mirror of https://github.com/anoma/juvix.git synced 2024-11-30 14:13:27 +03:00

Author	SHA1	Message	Date
Łukasz Czajka	6b38543ae3	Fix bug in instance termination checking (#2423 ) * Closes #2421	2023-10-04 12:00:48 +02:00
Łukasz Czajka	5fc0f33bb4	Add `Partial` trait (#2409 ) Adds a Partial trait to the standard library, similar to the one in PureScript: https://book.purescript.org/chapter6.html#nullary-type-classes This enables using partial functions in an encapsulated manner and without depending on the Debug module. Adds a trait: ``` trait type Partial := mkPartial { fail : {A : Type} -> String -> A }; runPartial {A} (f : {{Partial}} -> A) : A := f {{mkPartial Debug.failwith}}; ```	2023-10-02 15:14:40 +02:00
Łukasz Czajka	1260853583	Improve closure calls in the runtime (#2396 ) * Closes #1653 The running time improvement is 20-30% on benchmarks dominated by closure calls.	2023-09-29 14:20:00 +02:00
Łukasz Czajka	7c3b70459b	Fix delay instance resolution (#2393 ) * Closes #2348	2023-09-29 13:08:05 +02:00
Łukasz Czajka	e4a5ca7399	Record creation syntax with function definitions (#2369 ) * Closes #2280 * Record creation syntax uses normal function definition syntax like at the top-level or in lets. * It is now allowed to omit the result type annotation in function definitions (the `: ResultType` part) with `_` inserted by default. This is allowed only for simple definitions of the form `x := value` in lets and record creation, but not at the top level.	2023-09-28 17:07:38 +02:00
Łukasz Czajka	019579bba2	Fix case formatting (#2387 ) * Closes #2361 * Aligns `case` and `let`	2023-09-26 10:11:46 +02:00
Łukasz Czajka	e509a6e16e	Instance holes (#2384 ) * Closes #2383	2023-09-25 19:11:58 +02:00
Łukasz Czajka	a058487608	Support functions in instance parameters (#2385 ) * Closes #2371	2023-09-25 13:08:55 +01:00
Łukasz Czajka	487fc58ba6	Fix bug in isTrait (#2368 ) * Closes #2367	2023-09-14 17:50:38 +01:00
Jan Mas Rovira	8daca2ba0a	New fixity/iterator syntax (#2332 ) - Closes #2330 - Closes #2329 This pr implements the syntax changes described in #2330. It drops support for the old yaml-based syntax. Some valid examples: ``` syntax iterator for {init := 1; range := 1}; syntax fixity cons := binary {assoc := right}; syntax fixity cmp := binary; syntax fixity cmp := binary {}; -- debatable whether we want to accept empty {} or not. I think we should ``` # Future work This pr creates an asymmetry between iterators and operators definitions. Iterators definition do not require a constructor. We could add it to make it homogeneous, but it looks a bit redundant: ``` syntax iterator for := mkIterator {init := 1; range := 1}; ``` We could consider merging iterator and fixity declarations with this alternative syntax. ``` syntax XXX for := iterator {init := 1; range := 1}; syntax XXX cons := binary {assoc := right}; ``` where `XXX` is a common keyword. Suggestion by @lukaszcz XXX = declare --------- Co-authored-by: Łukasz Czajka <62751+lukaszcz@users.noreply.github.com> Co-authored-by: Lukasz Czajka <lukasz@heliax.dev>	2023-09-14 10:57:38 +02:00
Łukasz Czajka	95665283ac	Update the standard library to use the trait framework (#2359 ) * Updates standard library to use traits * Updates tests	2023-09-13 14:48:25 +02:00
Łukasz Czajka	08f123fa3d	Traits (#2320 ) * Closes #1646 Implements a basic trait framework. A simple instance search mechanism is included which fails if there is more than one matching instance at any step. Example usage: ``` import Stdlib.Prelude open hiding {Show; mkShow; show}; trait type Show A := mkShow { show : A → String }; instance showStringI : Show String := mkShow (show := id); instance showBoolI : Show Bool := mkShow (show := λ{x := if x "true" "false"}); instance showNatI : Show Nat := mkShow (show := natToString); showList {A} : {{Show A}} → List A → String \| nil := "nil" \| (h :: t) := Show.show h ++str " :: " ++str showList t; instance showListI {A} {{Show A}} : Show (List A) := mkShow (show := showList); showMaybe {A} {{Show A}} : Maybe A → String \| (just x) := "just (" ++str Show.show x ++str ")" \| nothing := "nothing"; instance showMaybeI {A} {{Show A}} : Show (Maybe A) := mkShow (show := showMaybe); main : IO := printStringLn (Show.show true) >> printStringLn (Show.show false) >> printStringLn (Show.show 3) >> printStringLn (Show.show [true; false]) >> printStringLn (Show.show [1; 2; 3]) >> printStringLn (Show.show [1; 2]) >> printStringLn (Show.show [true; false]) >> printStringLn (Show.show [just true; nothing; just false]) >> printStringLn (Show.show [just [1]; nothing; just [2; 3]]) >> printStringLn (Show.show "abba") >> printStringLn (Show.show ["a"; "b"; "c"; "d"]); ``` It is possible to manually provide an instance and to match on implicit instances: ``` f {A} : {{Show A}} -> A -> String \| {{mkShow s}} x -> s x; f' {A} : {{Show A}} → A → String \| {{M}} x := Show.show {{M}} x; ``` The trait parameters in instance types are checked to be structurally decreasing to avoid looping in the instance search. So the following is rejected: ``` type Box A := box A; trait type T A := mkT { pp : A → A }; instance boxT {A} : {{T (Box A)}} → T (Box A) := mkT (λ{x := x}); ``` We check whether each parameter is a strict subterm of some trait parameter in the target. This ordering is included in the finite multiset extension of the subterm ordering, hence terminating.	2023-09-08 12:16:43 +02:00
Łukasz Czajka	e8fa77a58d	Fix record update formatting (#2315 ) * Closes #2287	2023-08-28 13:11:26 +02:00
Łukasz Czajka	2baad15a41	Remove old function syntax (#2305 ) * Enables new function syntax in local let-declarations * Closes #2251	2023-08-24 16:24:47 +02:00
Łukasz Czajka	eebe961321	User-friendly operator declaration syntax (#2270 ) * Closes #1964 Adds the possibility to define operator fixities. They live in a separate namespace. Standard library defines a few in `Stdlib.Data.Fixity`: ``` syntax fixity rapp {arity: binary, assoc: right}; syntax fixity lapp {arity: binary, assoc: left, same: rapp}; syntax fixity seq {arity: binary, assoc: left, above: [lapp]}; syntax fixity functor {arity: binary, assoc: right}; syntax fixity logical {arity: binary, assoc: right, above: [seq]}; syntax fixity comparison {arity: binary, assoc: none, above: [logical]}; syntax fixity pair {arity: binary, assoc: right}; syntax fixity cons {arity: binary, assoc: right, above: [pair]}; syntax fixity step {arity: binary, assoc: right}; syntax fixity range {arity: binary, assoc: right, above: [step]}; syntax fixity additive {arity: binary, assoc: left, above: [comparison, range, cons]}; syntax fixity multiplicative {arity: binary, assoc: left, above: [additive]}; syntax fixity composition {arity: binary, assoc: right, above: [multiplicative]}; ``` The fixities are identifiers in a separate namespace (different from symbol and module namespaces). They can be exported/imported and then used in operator declarations: ``` import Stdlib.Data.Fixity open; syntax operator && logical; syntax operator \|\| logical; syntax operator + additive; syntax operator * multiplicative; ```	2023-08-09 18:15:51 +02:00
Łukasz Czajka	9718433a89	Enable builtin list syntax in the standard library (#2282 )	2023-08-08 17:33:43 +02:00
Jan Mas Rovira	98b3621ae1	Record patterns (#2271 ) - Closes #2269 Example: ``` type Sum (A B : Type) := \| inj1 { fst : A; snd : B } \| inj2 { fst : A; snd2 : B }; sumSwap {A B : Type} : Sum A B -> Sum B A \| inj1@{fst; snd := y} := inj2 y fst \| inj2@{snd2 := y; fst := fst} := inj1 y fst; ```	2023-08-08 16:01:20 +02:00
Jan Mas Rovira	f7382caeac	Record updates (#2263 ) - Closes #1642. This pr introduces syntax for convenient record updates. Example: ``` type Triple (A B C : Type) := \| mkTriple { fst : A; snd : B; thd : C; }; main : Triple Nat Nat Nat; main := let p : Triple Nat Nat Nat := mkTriple 2 2 2; p' : Triple Nat Nat Nat := p @Triple{ fst := fst + 1; snd := snd * 3 }; f : Triple Nat Nat Nat -> Triple Nat Nat Nat := (@Triple{fst := fst * 10}); in f p'; ``` We write `@InductiveType{..}` to update the contents of a record. The `@` is used for parsing. The `InductiveType` symbol indicates the type of the record update. Inside the braces we have a list of `fieldName := newValue` items separated by semicolon. The `fieldName` is bound in `newValue` with the old value of the field. Thus, we can write something like `p @Triple{fst := fst + 1;}`. Record updates `X@{..}` are parsed as postfix operators with higher priority than application, so `f x y @X{q := 1}` is equivalent to `f x (y @X{q := 1})`. It is possible the use a record update with no argument by wrapping the update in parentheses. See `f` in the above example.	2023-08-07 12:35:36 +02:00
Jan Mas Rovira	2c8a364143	New syntax for function definitions (#2243 ) - Closes #2060 - Closes #2189 - This pr adds support for the syntax described in #2189. It does not drop support for the old syntax. It is possible to automatically translate juvix files to the new syntax by using the formatter with the `--new-function-syntax` flag. E.g. ``` juvix format --in-place --new-function-syntax ``` # Syntax changes Type signatures follow this pattern: ``` f (a1 : Expr) .. (an : Expr) : Expr ``` where each `ai` is a non-empty list of symbols. Braces are used instead of parentheses when the argument is implicit. Then, we have these variants: 1. Simple body. After the signature we have `:= Expr;`. 2. Clauses. The function signature is followed by a non-empty sequence of clauses. Each clause has the form: ``` \| atomPat .. atomPat := Expr ``` # Mutual recursion Now identifiers do not need to be defined before they are used, making it possible to define mutually recursive functions/types without any special syntax. There are some exceptions to this. We cannot forward reference a symbol `f` in some statement `s` if between `s` and the definition of `f` there is one of the following statements: 1. Local module 2. Import statement 3. Open statement I think it should be possible to drop the restriction for local modules and import statements	2023-07-10 19:57:55 +02:00
Jan Mas Rovira	0cc689a2ef	Add syntax for builtin list (#2239 ) - Depends on #2219 - Closes #1643 This pr introduces a `list` as a new builtin so that we can use syntax sugar both in expressions and patterns. E.g. it is now possible to write `[1; 2; 3;]`.	2023-07-03 13:24:56 +02:00
Łukasz Czajka	ff940b427e	Local pragmas improvements (#2236 ) * Fixes the indices in `inline` and `specialize` for local lambda-lifted identifiers. * Adds the `specialize-by` pragma which allows to specialize a local function by some of its free variables, or specialize arguments by name. For example: ``` funa : {A : Type} -> (A -> A) -> A -> A; funa {A} f a := let {-# specialize-by: [f] #-} go : Nat -> A; go zero := a; go (suc n) := f (go n); in go 10; ``` If `funa` is inlined, then `go` will be specialized by the actual argument substituted for `f`.	2023-06-30 16:59:29 +02:00
Łukasz Czajka	b4347bdd23	Numeric range types (#2232 ) * Closes #2145	2023-06-28 17:24:08 +02:00
Łukasz Czajka	5b44b2e654	Use specialization pragmas in the standard library (#2230 ) * Adds `specialize` pragmas for folds, maps, etc. * Fixes two bugs in `specializeArgs`	2023-06-27 15:37:11 +02:00
Łukasz Czajka	2fff65030b	Case folding (#2229 ) Fold constant cases, i.e., convert ``` case C a b \| A := .. \| B x := .. \| C x y := M ``` to ``` let x := a; y := b; in M ```	2023-06-27 12:29:26 +02:00
Łukasz Czajka	7c039d687b	Specialization optimisation (#2164 ) * Closes #2147 Adds a `specialize` pragma which allows to specify (explicit) arguments considered for specialization. Whenever a function is applied to a constant known value for the specialized argument, a new version of the function will be created with the argument pasted in. For example, the code ```juvix {-# specialize: [1] #-} mymap : {A B : Type} -> (A -> B) -> List A -> List B; mymap f nil := nil; mymap f (x :: xs) := f x :: mymap f xs; main : Nat; main := length (mymap λ{x := x + 3} (1 :: 2 :: 3 :: 4 :: nil)); ``` will be transformed into code equivalent to ```juvix mymap' : (Nat -> Nat) -> List Nat -> List Nat; mymap' f nil := nil; mymap' f (x :: xs) := λ{x := x + 3} x :: mymap' xs; main : Nat; main := length (mymap' (1 :: 2 :: 3 :: 4 :: nil)); ```	2023-06-26 16:49:19 +02:00
Łukasz Czajka	f7118d485c	Local pragmas (#2222 ) * Closes #2196	2023-06-22 16:23:36 +02:00
Łukasz Czajka	50553758c2	Improve iterator formatting and syntax (#2204 ) * Closes #2202	2023-06-20 13:21:46 +02:00
Łukasz Czajka	4c0e0b13c8	Respect fixity in runtime printer (#2182 ) * Closes #2180	2023-06-07 11:44:41 +02:00
Łukasz Czajka	68b4b38f98	Improve parsing error messages (#2170 ) * Closes #2160 * Closes #2167 * Closes #2168	2023-06-05 12:06:05 +02:00
Łukasz Czajka	e0689801cf	Update standard library for better readability, efficiency and iterator use (#2153 ) This does not change any examples or documentation. The interface of the standard library remains unchanged (except the addition of new iterators), so this PR can be merged without side effects. * Closes #2146	2023-06-01 18:21:03 +02:00
Łukasz Czajka	e9284b3ef6	Iterator syntax (#2126 ) * Closes #1992 A function identifier `fun` can be declared as an iterator with ``` syntax iterator fun; ``` For example: ```haskell syntax iterator for; for : {A B : Type} -> (A -> B -> A) -> A -> List B -> List A; for f acc nil := acc; for f acc (x :: xs) := for (f acc x) xs; ``` Iterator application syntax allows for a finite number of initializers `acc := a` followed by a finite number of ranges `x in xs`. For example: ``` for (acc := 0) (x in lst) acc + x ``` The number of initializers plus the number of ranges must be non-zero. An iterator application ``` fun (acc1 := a1; ..; accn := an) (x1 in b1; ..; xk in bk) body ``` gets desugared to ``` fun \{acc1 .. accn x1 .. xk := body} a1 .. an b1 .. bk ``` The `acc1`, ..., `accn`, `x1`, ..., `xk` can be patterns. The desugaring works on a purely syntactic level. Without further restrictions, it is not checked if the number of initializers/ranges matches the type of the identifier. The restrictions on the number of initializers/ranges can be specified in iterator declaration: ``` syntax iterator fun {init: n, range: k}; syntax iterator for {init: 1, range: 1}; syntax iterator map {init: 0, range: 1}; ``` The attributes (`init`, `range`) in between braces are parsed as YAML to avoid inventing and parsing a new attribute language. Both attributes are optional.	2023-05-30 15:30:11 +02:00
Jan Mas Rovira	39b797ecfa	Add Bottom node (#2112 ) - Closes #2056 - Depends on #2103 I am not sure about the implementation of `isType` for `NBot`. (solved). The `Eq` instance returns `True` for every two `Bottom` terms, regardless of their type. --------- Co-authored-by: Jonathan Cubides <jonathan.cubides@uib.no> Co-authored-by: Lukasz Czajka <lukasz@heliax.dev>	2023-05-23 18:31:28 +02:00
Łukasz Czajka	6bfe727a0e	Add `syntax` keyword (#2107 ) * Closes #2007	2023-05-22 12:50:07 +02:00
Łukasz Czajka	8aa54ecc28	Inlining (#2036 ) * Closes #1989 * Adds optimization phases to the pipline (specified by `opt-phase-eval`, `opt-phase-exec` and `opt-phase-geb` transformations). * Adds the `-O` option to the `compile` command to specify the optimization level. * Functions can be declared for inlining with the `inline` pragma: ``` {-# inline: true #-} const : {A B : Type} -> A -> B -> A; const x _ := x; ``` By default, the function is inlined only if it's fully applied. One can specify that a function (partially) applied to at least `n` explicit arguments should be inlined. ``` {-# inline: 2 #-} compose : {A B C : Type} -> (B -> C) -> (A -> B) -> A -> C; compose f g x := f (g x); ``` Then `compose f g` will be inlined, even though it's not fully applied. But `compose f` won't be inlined. * Non-recursive fully applied functions are automatically inlined if the height of the body term does not exceed the inlining depth limit, which can be specified with the `--inline` option to the `compile` command. * The pragma `inline: false` disables automatic inlining on a per-function basis.	2023-05-15 17:27:05 +02:00
Jan Mas Rovira	3cbf6f5cb6	Fix ordering of statements in Abstract -> Internal (#2040 ) - Closes #2039 - Closes #2055 - Depends on #2053 Changes in this pr: - Local modules are removed (flattened) in the translation abstract -> internal. - In the translation abstract -> internal we group definitions in mutually recursive blocks. These blocks can contain function definitions and type definitions. Previously we only handled functions. - The translation of Internal has been enhanced to handle these mutually recursive blocks. - Some improvements the pretty printer for internal (e.g. we now print builtin tags properly). - A "hack" that puts the builtin bool definition at the beginning of a module if present. This was the easiest way to handle the implicit dependency of the builtin stringToNat with bool in the internal-to-core translation. - A moderately sized test defining a simple lambda calculus involving and an evaluator for it. This example showcases mutually recursive types in juvix. --------- Co-authored-by: Jonathan Cubides <jonathan.cubides@uib.no>	2023-05-15 13:02:09 +02:00
Jan Mas Rovira	2b0668b55a	Allow nested block comments (#2075 ) - Closes #2070	2023-05-15 09:57:23 +02:00
janmasrovira	2017bc0504	Add bank example (#2037 ) This PR adds the Juvix program for the `Bank` example provided in the [Leo workshop repository](https://github.com/AleoHQ/workshop/blob/master/basic_bank/src/main.leo).	2023-04-27 10:56:31 +02:00
janmasrovira	528eaa72d0	Substitute calls after lambda lifting (#2031 ) - Closes #2006 During lambda lifting, we now substitute the calls to the lifted functions before recursively applying lambda lifting. This will slightly increase the amount of captured variables. However, I think this is the only way since we need all identifiers to have a type when recursing.	2023-04-26 12:56:44 +02:00
Łukasz Czajka	46360a753c	Add: the 'seq' builtin (>>>) (#1982 ) * Closes #1913 * Closes #1976	2023-04-14 19:24:58 +02:00
Paul Cadman	ea09ec3068	Add builtin integer type to the surface language (#1948 ) This PR adds a builtin integer type to the surface language that is compiled to the backend integer type. ## Inductive definition The `Int` type is defined in the standard library as: ``` builtin int type Int := \| --- ofNat n represents the integer n ofNat : Nat -> Int \| --- negSuc n represents the integer -(n + 1) negSuc : Nat -> Int; ``` ## New builtin functions defined in the standard library ``` intToString : Int -> String; + : Int -> Int -> Int; neg : Int -> Int; * : Int -> Int -> Int; - : Int -> Int -> Int; div : Int -> Int -> Int; mod : Int -> Int -> Int; == : Int -> Int -> Bool; <= : Int -> Int -> Bool; < : Int -> Int -> Bool; ``` Additional builtins required in the definition of the other builtins: ``` negNat : Nat -> Int; intSubNat : Nat -> Nat -> Int; nonNeg : Int -> Bool; ``` ## REPL types of literals In the REPL, non-negative integer literals have the inferred type `Nat`, negative integer literals have the inferred type `Int`. ``` Stdlib.Prelude> :t 1 Nat Stdlib.Prelude> :t -1 Int :t let x : Int := 1 in x Int ``` ## The standard library Prelude The definitions of ``, `+`, `div` and `mod` are not exported from the standard library prelude as these would conflict with the definitions from `Stdlib.Data.Nat`. Stdlib.Prelude ``` open import Stdlib.Data.Int hiding {+;;div;mod} public; ``` * Closes https://github.com/anoma/juvix/issues/1679 * Closes https://github.com/anoma/juvix/issues/1984 --------- Co-authored-by: Lukasz Czajka <lukasz@heliax.dev>	2023-04-13 08:16:49 +01:00
Łukasz Czajka	dbd24f2f93	Check for the executable (WASM/native) pipeline prerequisites (#1970 ) * Closes #1959	2023-04-04 11:58:36 +02:00
Łukasz Czajka	63a2c5144d	Print quoted strings in the runtime (#1969 ) * Closes #1968	2023-04-04 10:29:02 +01:00
janmasrovira	ff495ef326	Support local modules (#1872 ) Support local modules. --------- Co-authored-by: Paul Cadman <git@paulcadman.dev>	2023-03-31 14:57:37 +01:00
Łukasz Czajka	58dbf62520	Fix removal of polymorphic type arguments (#1954 ) * Closes #1930	2023-03-30 19:56:07 +02:00
Łukasz Czajka	2b5524ded1	Preserve name and location information in Internal-to-Core (#1933 ) * Closes #1846 * Preserves location information for all created `Match` nodes so that match-to-case always has a location available.	2023-03-28 10:29:24 +02:00
Łukasz Czajka	c9b8cdd5e9	Pattern matching compilation (#1874 ) This implements a basic version of the algorithm from: Luc Maranget, [Compiling pattern matching to good decision trees](http://moscova.inria.fr/~maranget/papers/ml05e-maranget.pdf). No heuristics are used - the first column is always chosen. * Closes #1798 * Closes #1225 * Closes #1926 * Adds a global `--no-coverage` option which turns off coverage checking in favour of generating runtime failures * Changes the representation of Match patterns in JuvixCore to achieve a more streamlined implementation * Adds options to the Core pipeline	2023-03-27 10:42:27 +02:00
Łukasz Czajka	8d7e669f74	Fix bug with unregistered builtin bool (#1917 ) * Closes #1884	2023-03-24 10:29:57 +01:00
Łukasz Czajka	2a8585ede0	Fix bug in IO runtime (#1906 ) Co-authored-by: Paul Cadman <git@paulcadman.dev>	2023-03-21 14:34:46 +00:00
Łukasz Czajka	2803f3feee	Fix JuvixAsm validation (#1903 ) Co-authored-by: Paul Cadman <git@paulcadman.dev>	2023-03-20 12:01:35 +00:00
Paul Cadman	51978f947c	Fix registration of builtin inductive axioms (#1901 ) builtin inductive axioms must be registered in the same pass as inductive types becuase inductive types may use builtin inductives in the types of their constructors. ``` builtin string axiom String : Type; type BoxedString := \| boxed : String -> BoxedString; ``` The separate passes for processing functions and inductives was unnecessary. This commit combines `registerInductiveDefs` and `registerFunctionDefs` into a single pass over a modules statements	2023-03-20 10:00:22 +00:00
Paul Cadman	786ff9d075	internal-to-core: Fix index shifting of pattern arguments (#1900 )	2023-03-18 00:30:51 +01:00
janmasrovira	934a273e2d	Automatically detect and split mutually recursive blocks in let expressions (#1894 ) - Closes #1677	2023-03-17 11:05:55 +00:00
Łukasz Czajka	2d798ec31c	New compilation pipeline (#1832 ) * Depends on PR #1824 * Closes #1556 * Closes #1825 * Closes #1843 * Closes #1729 * Closes #1596 * Closes #1343 * Closes #1382 * Closes #1867 * Closes #1876 * Changes the `juvix compile` command to use the new pipeline. * Removes the `juvix dev minic` command and the `BackendC` tests. * Adds the `juvix eval` command. * Fixes bugs in the Nat-to-integer conversion. * Fixes bugs in the Internal-to-Core and Core-to-Core.Stripped translations. * Fixes bugs in the RemoveTypeArgs transformation. * Fixes bugs in lambda-lifting (incorrect de Bruijn indices in the types of added binders). * Fixes several other bugs in the compilation pipeline. * Adds a separate EtaExpandApps transformation to avoid quadratic runtime in the Internal-to-Core translation due to repeated calls to etaExpandApps. * Changes Internal-to-Core to avoid generating matches on values which don't have an inductive type. --------- Co-authored-by: Paul Cadman <git@paulcadman.dev> Co-authored-by: janmasrovira <janmasrovira@gmail.com>	2023-03-14 16:24:07 +01:00
janmasrovira	098c256da8	Allow shadowing local variables with let function definitions (#1847 ) Co-authored-by: Paul Cadman <git@paulcadman.dev>	2023-02-22 10:26:54 +01:00
Paul Cadman	c93013229a	Add compilation of complex pattern matching to case (#1824 ) This PR adds the `match-to-case` Core transformation. This transforms pattern matching nodes to a sequence of case and let nodes. ## High level description Each branch of the match is compiled to a lambda. In the combined match Each branch of the match is compiled to a lambda. These lambdas are combined in nested lets and each lambda is called in turn as each branch gets checked. The lambda corresponding to the first branch gets called first, if the pattern match in the branch fails, the lambda corresponding to the next branch is called and so on. If no branches match then a lambda is called which returns a fail node. Conceptually: <table> <tr> <td> Core </td> <td> Transformed </td> </tr> <tr> <td> ``` match v1 .. vn { b1 b2 ... bk } ``` </td> <td> ``` λ let c0 := λ FAIL in let ck := λ {...} in ... let c1 := λ {...} in c1 v1 ... vn ``` </td> </tr> </table> The patterns on each branch are compiled to either let bindings (pattern binders) or case expressions (constructor patterns). Auxillary bindings are added in the case of nested constructor patterns. The default branch in each case expression has a call to the lambda corresponding to the next branch of the match. This is because the default branch is reached if the pattern match fails. <table> <tr> <td> Pattern match </td> <td> Transformed </td> </tr> <tr> <td> ``` suc (suc n) ↦ n ``` </td> <td> ``` case ?$0 of { suc arg_8 := case ?$0 of { suc n := let n := ?$0 in n$0; _ := ?$2 ?$1 }; _ := ?$1 ?$0 } ``` </td> </tr> </table> The body of each branch is wrapped in let bindings so that the indicies of bound variables in the body point to the correct variables in the compiled expression. This is necessary because the auxiliary bindings added for nested constructor patterns will cause the original indicies to be offset. Finally, the free variables in the match branch body need to be shifted by all the bindings we've added as part of the compilation. ## Examples ### Single wildcard <table> <tr> <td> Juvix </td> <td> Core </td> <td> Transformed Core </td> </tr> <tr> <td> ``` f : Nat -> Nat; f _ := 1; ``` </td> <td> ``` λ? match ?$0 with { _ω309 ↦ ? 1 } ``` </td> <td> ``` λ? let ? := λ? fail "Non-exhaustive patterns" in let ? := λ? let _ω309 := ?$0 in let _ω309 := ?$0 in 1 in ?$0 ?$2 ``` </td> </tr> </table> ### Single binder <table> <tr> <td> Juvix </td> <td> Core </td> <td> Transformed Core </td> </tr> <tr> <td> ``` f : Nat -> Nat; f n := n; ``` </td> <td> ``` λ? match ?$0 with { n ↦ n$0 } ``` </td> <td> ``` λ? let ? := λ? fail "Non-exhaustive patterns" in let ? := λ? let n := ?$0 in let n := ?$0 in n$0 in ?$0 ?$2 ``` </td> </tr> </table> ### Single Constructor <table> <tr> <td> Juvix </td> <td> Core </td> <td> Transformed Core </td> </tr> <tr> <td> ``` f : Nat -> Nat; f (suc n) := n; ``` </td> <td> ``` λ? match ?$0 with { suc n ↦ n$0 } ``` </td> <td> ``` λ? let ? := λ? fail "Non-exhaustive patterns" in let ? := λ? case ?$0 of { suc n := let n := ?$0 in let n := ?$0 in n$0; _ := ?$1 ?$0 } in ?$0 ?$2 ``` </td> </tr> </table> ### Nested Constructor <table> <tr> <td> Juvix </td> <td> Core </td> <td> Transformed Core </td> </tr> <tr> <td> ``` f : Nat -> Nat; f (suc (suc n)) := n; ``` </td> <td> ``` λ? match ?$0 with { suc (suc n) ↦ n$0 } ``` </td> <td> ``` λ? let ? := λ? fail "Non-exhaustive patterns" in let ? := λ? case ?$0 of { suc arg_8 := case ?$0 of { suc n := let n := ?$0 in let n := ?$0 in n$0; _ := ?$2 ?$1 }; _ := ?$1 ?$0 } in ?$0 ?$2 ``` </td> </tr> </table> ### Multiple Branches <table> <tr> <td> Juvix </td> <td> Core </td> <td> Transformed Core </td> </tr> <tr> <td> ``` f : Nat -> Nat; f (suc n) := n; f zero := 0; ``` </td> <td> ``` λ? match ?$0 with { suc n ↦ n$0; zero ↦ ? 0 } ``` </td> <td> ``` λ? let ? := λ? fail "Non-exhaustive patterns" in let ? := λ? case ?$0 of { zero := ? 0; _ := ?$1 ?$0 } in let ? := λ? case ?$0 of { suc n := let n := ?$0 in let n := ?$0 in n$0; _ := ?$1 ?$0 } in ?$0 ?$3 ``` </td> </tr> </table> ### Nested case with captured variable <table> <tr> <td> Juvix </td> <td> Core </td> <td> Transformed Core </td> </tr> <tr> <td> ``` f : Nat -> Nat -> Nat; f n m := case m \| suc k := n + k; ``` </td> <td> ``` f = λ? λ? match ?$1, ?$0 with { n, m ↦ match m$0 with { suc k ↦ + n$2 k$0 } } ``` </td> <td> ``` λ? λ? let ? := λ? λ? fail "Non-exhaustive patterns" in let ? := λ? λ? let n := ?$1 in let m := ?$1 in let n := ?$1 in let m := ?$1 in let ? := λ? fail "Non-exhaustive patterns" in let ? := λ? case ?$0 of { suc k := let k := ?$0 in let k := ?$0 in + n$6 k$0; _ := ?$1 ?$0 } in ?$0 m$2 in ?$0 ?$3 ?$2 ``` </td> </tr> </table> ## Testing The `tests/Compilation/positive` tests are run up to the Core evaluator with `match-to-case` and `nat-to-int` transformations on Core turned on. --------- Co-authored-by: Lukasz Czajka <lukasz@heliax.dev>	2023-02-15 11:30:12 +01:00
Łukasz Czajka	45aa415b71	Short syntax for sequences of function and datatype parameters (#1809 )	2023-02-06 19:01:54 +01:00
janmasrovira	929a8658ac	Special syntax for case (#1800 ) - Closes #1716 --------- Co-authored-by: Paul Cadman <git@paulcadman.dev>	2023-02-06 14:53:35 +01:00
janmasrovira	3c33916034	Remove braces from let expressions (#1790 )	2023-02-01 19:22:43 +01:00
janmasrovira	d8ba7ca36f	Pipes for lambda clauses (#1781 ) - Closes #1639	2023-01-30 12:06:18 +01:00
Łukasz Czajka	acea6615a4	Lazy boolean operators (#1743 ) Closes #1701	2023-01-25 18:57:47 +01:00
Łukasz Czajka	ecac5e07c7	Translate 'let' to Core (#1740 ) Closes #1351	2023-01-19 12:56:37 +01:00
Łukasz Czajka	6499100d67	Add test for div and mod (#1741 )	2023-01-19 12:52:51 +01:00
janmasrovira	0193a33d4c	Fix inference loop (#1726 )	2023-01-17 13:28:38 +01:00
janmasrovira	f7205915a5	Typecheck let expressions (#1712 )	2023-01-17 09:41:07 +01:00
Łukasz Czajka	186f4f66ef	Tests for the new compilation pipeline (#1703 ) Adds Juvix tests for the compilation pipeline - these are converted from the JuvixCore tests (those that make sense). Currently, only the translation from Juvix to JuvixCore is checked for the tests that can be type-checked. Ultimately, the entire compilation pipeline down to native code / WebAssembly should be checked on these tests. Closes #1689	2023-01-12 11:22:32 +01:00

1 2 3

115 Commits