enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-20 17:01:36 +03:00

Author	SHA1	Message	Date
Kaz Wesley	d2ae17ffc9	Disallow a confusing constructor syntax (#11856 ) Previously, a constructor or type definition with a single argument defined inline could use spaces within the argument's definition without parentheses, as in the draft spec. This syntax was found confusing and is no longer allowed. No usages occurred in our `.enso` files. Fixes #10812.	2024-12-16 15:54:19 +00:00
Kaz Wesley	8771855993	Fix failure to parse an invalid syntax case (#11754 ) Fix failure to parse a syntax case involving unexpected usage of annotated expressions (fixes #11691). The root cause of the bug was usage of the `matches!` macro in a situation where a `match` statement should have been used to enforce exhaustiveness. - I have corrected the bug, and reviewed all usages of the `matches!` macro. I didn't find any other incorrect usages of `matches!`, but I found and replaced some potentially fragile uses. I also simplified some unnecessary uses of `matches!`, replacing them with expressions that are more-obviously correct. - Stricter parsing: It is now a syntax error for an inline annotation to occur in the RHS of an application (e.g. `fn @Tail_Call recur`)--the precedence in this case may be surprising, so parentheses should be used around the annotated expression. `parse_all_enso_files.sh`: This PR does not affect any AST in the .enso corpus.	2024-12-05 20:16:40 +00:00
Kaz Wesley	7eca04a522	`:` operator is left-associative (#11671 ) Adjust operator parsing to allow chained conversions, like `3.14 : Integer : Text`. Change the precedence and associativity of the `:` operator, when used as a binary operator in an expression: - It is now left-associative - It now has lower precedence than `->` (previously they were equal) # Important Notes One previously-reasonable syntax has changed interpretation: `x->x:Type` is no longer a valid way to write a casting function, and would likely result in a type error. There was 1 instance of this syntax in our .enso sources.	2024-11-27 04:45:39 +00:00
Kaz Wesley	2b3bd2cc90	Move documentation into documentable types (#11441 ) Move documentation into documentable types (implements #11302). # Important Notes GUI: - Distinguish expression and statement - `Ast.Ast` is still present, as the base class for AST objects. Most references to `Ast.Ast` are now references to `Ast.Expression`. Operations on blocks use `Ast.Statement`. - `Ast.parse` has been replaced with: `Ast.parseExpression`, `Ast.parseStatement`, and `Ast.parseBlock` - `syncToCode` is internally context-aware; it parses the provided code appropriately depending on whether its AST is an expression, a statement, or the top level of a module. - Remove `wrappingExpression` / `innerExpression` APIs: Wrapper types have been eliminated; modifier lines are now fields inside parent types. - Simplify AST printing: - Fully implemented autospacing in `concreteChildren` implementations; the type returned by `concreteChildren` now ensures that spacing has been fully resolved. - Eliminate `printBlock` / `printDocs`: `concreteChildren` is now aware of indentation context, and responsible for indentation of its child lines. - The `Pattern` type is now parameterized to identify the AST type it constructs. The `Pattern.parseExpression` function helps create a `Pattern<Expression>`. - Refactor `performCollapse` for testability. - e2e tests: Improve table viz test: It still doesn't pass on my Mac, but these changes are necessary if not sufficient. Compiler (TreeToIr): - An expression in statement context is now found in an `ExpressionStatement` wrapper. - Documentation for a `Function` is now found inside the function node. - Deduplicate some polyglot-function logic.	2024-11-04 15:33:53 +00:00
Kaz Wesley	33904912ee	Move annotations into fields of Function and ConstructorDefinition (#11374 ) Move annotations into fields of Function and ConstructorDefinition. # Important Notes New syntax: Constructor argument-definition lines - Each argument in a type-constructor definition may be specified on its own (indented) line. Relaxed syntax: Unparenthesized arguments to annotations - A generic annotation now uses the rest of the line as its argument expression; the expression no longer needs to be parenthesized.	2024-10-23 20:35:06 +00:00
Kaz Wesley	4e4a1e1df2	Move TypeSignature into field of Function (#11364 ) Move type-signature lines into `Function` field. Also implements #11293. Stacked on #11346.	2024-10-22 19:50:14 +00:00
Kaz Wesley	d278ad636c	Replace `private` modifier node with field on supporting types (#11346 ) Eliminate `private` modifier-node: `private` is a field in supporting types, or a single-token node in the case of `private` declarations. # Important Notes - Rust parser tests: Switch to a builder-style API for defining expected `Function` ASTs to allow further changes to `Function` fields without rewriting all the tests again. - TreeToIr: Fix discarded module-level `diag`; add a test that covers module diagnostics. - Syntax: Disallow `private` methods in function blocks. (Previously this was enforced in the compiler.)	2024-10-22 16:26:00 +00:00
Kaz Wesley	4d4a2990a0	Distinguish assignment/thunk by statement context (#11324 ) Align `Assignment`/`Function` distinction in AST with compiler's implemented semantics: - The ambiguous case `funcOrVar = expression` is now parsed as a `Function` when in a `Type` definition or in the top level of a module. I.e. it is an `Assignment` in contexts where the RHS is evaluated immediately when the binding is evaluated, and a `Function` in contexts where the RHS is evaluated each time the bound name is evaluated. - `Assignment` statements now may only occur in function bodies. Correcting this distinction lays the groundwork for #11302. Other changes: - Fixed incorrect source code locations for negative literals and negated expressions. # Important Notes New APIs: - The parser now exposes a `parse_block` entry point, which allows parsing input lines as if in the body of a function. The previous entry point has been renamed to `parse_module`.	2024-10-18 17:54:55 +00:00
Pavel Marek	fb821303ab	Inline doc comment is a compiler error (#11333 ) Inline documentation comment is a syntax error --------- Co-authored-by: Kaz Wesley <kaz@lambdaverse.org>	2024-10-16 06:24:18 -07:00
Paweł Grabarz	7653280e33	Remove all usages of unstable rust features (#10813 ) Removed all `#![feature]` flags, except for `#![feature(test)]`. Once parser benchmarks are ported to something that is compatible with stable rust, we will be able to switch to it.	2024-08-22 14:09:17 +00:00
Paweł Grabarz	f14b79f8cf	Rust bump, reduce dependencies (#10803 ) Updated rust version, removed some unnecessary or problematic dependencies. Ported some changes from bazel branch.	2024-08-13 23:16:55 +00:00
Kaz Wesley	e47bdd3e17	Implement full new-lambda syntax (#10756 ) Implement full `ArgumentDefinition` syntax for new-lambda arguments, e.g. `\a=1 (b:Integer = 23)-> a + b`; add backend support for new lambdas. Emit an error when any syntactic operator is used outside of its associated syntax (fixes #10473). Phase out complex arguments for old-lambdas: It is now a syntax error to specify default arguments for an old-lambda. This capability had no usage in real code; affected tests have been updated to test new lambdas. For now, old lambdas can continue to be used with simple arguments; if default arguments are desired, a new-style lambda can be used.	2024-08-06 17:02:32 +00:00
Kaz Wesley	aafdef1aeb	Improve parser contextualization (#10734 )	2024-08-05 15:46:58 +00:00
Kaz Wesley	8b48637691	Provide syntax warnings to Java (#10645 ) Translate syntax warnings and attach to IR when translating operator applications. We should ensure that all Trees are checked for warnings and every warning is attached to some IR. That would require a bit of refactoring: In TreeToIr, we could define helpers wrapping every IR constructor and accepting a `Tree` parameter. The `Tree` could be used to populate the `IdentifiedLocation` when constructing the IR type, and then to attach all warnings after constructing the IR object. # Important Notes - Update JNI dependency. - Introduces a `cargo bench` runner for parser.	2024-07-24 17:54:23 +00:00
Kaz Wesley	e5b85bf16e	Space-precedence does not apply to value-level operators (#10597 ) In a sequence of value-level operators, whitespace does not affect relative precedence. Functional operators still follow the space-precedence rules. The "functional" operators are: `>> << \|> \|>> <\| <<\| : .`, application, and any operator containing `<-` or `->`. All other operators are considered value-level operators. Asymmetric whitespace can still be used to form operator sections of value-level operators, e.g. `+2 * 3` is still equivalent to `x -> (x+2) * 3`. Precedence of application is unchanged, so `f x+y` is still equivalent to `f (x + y)` and `f x+y * z` is still equivalent to `(f (x + y)) * z`. Any attempt to use spacing to override value-level operator precedence will be caught by the new enso linter. Mixed spacing (for clarity) in value-operator expressions is allowed, as long as it is consistent with the precedences of the operators. Closes #10366. # Important Notes Precedence warnings: - The parser emits a warning if the whitespace in an expression is inconsistent with its effective precedence. - A new enso linter can be run with `./run libraries lint`. It parses all `.enso` files in `distribution/lib` and `test`, and reports any errors or warnings. It can also be run on individual files: `cargo run --release --bin check_syntax -- file1 file2...` (the result may be easier to read than the `./run` output). - The linter is also run as part of `./run lint`, so it is checked in CI. Additional language change: - The exponentiation operator (`^`) now has higher precedence than the multiplication class (`*`, `/`, `%`). This change did not affect any current enso files. Library changes: - The libraries have been updated. The new warnings were used to identify all affected code; the changes themselves have not been programmatically verified (in many cases their equivalence relies on the commutativity of string concatenation).	2024-07-24 10:55:44 +00:00
Kaz Wesley	4cff789b69	Refactor precedence for whitespace changes (#10569 ) Single-phase whitespace-aware precedence resolution. #### Performance ![newplot(4)](https://github.com/user-attachments/assets/9822b0dc-17c3-4d2d-adf7-eb8b1c240522) Since this is a major refactor of the core of the parser, I benchmarked it; it's about 3% faster. # Important Notes - Move operator-identifier recognition to lexer. - Move compound-token assembly out of precedence resolver	2024-07-18 15:48:20 +00:00
Pavel Marek	270f708e4f	Implement private methods (#10060 ) Add support for private methods. Most of the changes are in parser and compiler. The runtime checking of private functions was already present since #9692 # Important Notes - Only top-level methods can be declared `private`. - private method cannot be called from different project - private method cannot be accessed from polyglot code (private method does not exist for polyglot code)	2024-05-31 08:00:20 +00:00
Pavel Marek	660c5e7a9d	Atom constructors can be private (#9692 ) Closes #8836. Atom constructors can be declared as private (project-private). project-private constructors can be called only from the same project. See the encapsulation.md docs for more info. --------- Co-authored-by: Jaroslav Tulach <jaroslav.tulach@enso.org> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> Co-authored-by: Hubert Plociniczak <hubert.plociniczak@gmail.com> Co-authored-by: Kaz Wesley <kaz@lambdaverse.org>	2024-04-29 14:43:18 +02:00
Kaz Wesley	0de490be24	Text literals: Accept unpaired-surrogate escape codes. (#9731 ) * Text literals: Accept unpaired-surrogate escape codes. Unpaired surrogates are not allowed by Unicode, but they occur in practice because many systems accept them; for example, they may be present in filenames on Windows (which are otherwise constrained to UTF-16). Programs written in Enso should be able to work with them, if only because they represent edge cases that should be tested when converting encodings and at other system boundaries. - Generalize the representation of interpreted-text-escapes in the lexer, so that we are not tied to the strict Unicode of Rust's `str`. - Move some doc-comment code from the parser to test utilities. - Simplify token serialization.	2024-04-18 09:21:05 -04:00
Kaz Wesley	2254dfe9fa	Reduce parser dependencies (#9671 ) * Reduce parser dependencies - `enso-parser-syntax-tree-visitor` is now only used when building tests and debug tools. - Remove `enso-logging` crate and its macros. - The main bin for `enso-parser` has been moved to a `check_syntax` tool in `enso-parser-debug`.	2024-04-11 00:27:19 -04:00
Michał Wawrzyniec Urbańczyk	90bbee352e	Bump Rust Toolchain (#9517 ) This PR updates the Rust toolchain to recent nightly. Most of the changes are related to fixing newly added warnings and adjusting the feature flags. Also the formatter changed its behavior slightly, causing some whitespace changes. Other points: * Changed debug level of the `buildscript` profile to `lint-tables-only` — this should improve the build times and space usage somewhat. * Moved lint configuration to the workspace `Cargo.toml` definition. Adjusted the formatter appropriately. * Removed auto-generated IntelliJ run configurations, as they are not useful anymore. * Added a few trivial stdlib nightly functions that were removed to our codebase. * Bumped many dependencies but still not all: * `clap` bump encountered https://github.com/clap-rs/clap/issues/5407 — for now the warnings were silenced by the lint config. * `octocrab` — our fork has diverged too far from the original, needs more refactoring. * `derivative` — is unmaintained and has no updated version, despite introducing warnings in the generated code. There is no direct replacement.	2024-03-24 23:45:55 +00:00
Kaz Wesley	a1c0d9ac08	Autoscope syntax (#9372 ) Add autoscope syntax (`..Ident`). # Important Notes - Also rename previous `Tree.Autoscope` to `SuspendedDefaultArguments`.	2024-03-12 19:31:16 +00:00
Jaroslav Tulach	ad2f5b031e	Chained if_then_else application change (#8671 ) * Test describing the current behavior of chained if then else application * Chained block should behave just like Group around if_then_else * Finishing line on BlockStart fixes if_then_else_chained_block * Only finish the line when there was not start of a macro segment * Fix tests * Refine else-body with macro patterns. * Update test syntax to maintain original semantics * Few additional tests --------- Co-authored-by: Kaz Wesley <kaz@lambdaverse.org>	2024-02-22 09:17:25 -05:00
Kaz Wesley	ce6c770fd7	Parse inline function signatures (#8470 ) Implements #6166. # Important Notes - More consistent handling of `default` arguments. `default` is a valid identifier, and only has special meaning when it isn't bound in scope. Since distinguishing the builtin `default` from an identifier called `default` cannot be done until alias analysis has been performed, `default` is now represented in the AST as a regular identifier. - `TreeToIr`: Remove `insideTypeAscription`. It was only used for bug-for-bug compatibility with the old parser during the transition.	2023-12-12 14:48:44 +00:00
Michał Wawrzyniec Urbańczyk	8021109d8c	Build script fixes & improvements (#8285 )	2023-11-23 21:19:31 +01:00
Kaz Wesley	ce042569b0	line:col positions in parser (#8203 ) Add `line:column` information to source code references produced by the parser. This information will be used by GUI2 as part of the solution to #8134. # Important Notes - `parse_all_enso_files.sh` has been used to ensure this doesn't affect tree structures. - `parse_all_enso_files.sh` now checks emitted locations for consistency, and has been used to verify that all line:col references match the values found by an independent scan of the source up to the given UTF8 position.	2023-11-08 16:53:39 +00:00
Kaz Wesley	2edd2bd7ff	Ensure all spans have document offsets (#8039 ) - Validate spans during existing lexer and parser unit tests, and in `enso_parser_debug`. - Fix lost span info causing failures of updated tests. # Important Notes - [x] Output of `parse_all_enso_files.sh` is unchanged since before #7881 (modulo libs changes since then). - When the parser encounters an input with the first line indented, it now creates a sub-block for lines at that indent level, and emits a syntax error (every indented block must have a parent). - When the parser encounters a number with a base but no digits (e.g. `0x`), it now emits a `Number` with `None` in the digits field rather than a 0-length digits token.	2023-10-19 12:36:42 +00:00
Michał Wawrzyniec Urbańczyk	e693147c17	CI improvements / IDE2 releasing (#8059 )	2023-10-16 14:18:17 +02:00
Kaz Wesley	9fd1ab9092	Parser TS bindings (#7881 ) Generate TS bindings and lazy deserialization for the parser types. # Important Notes - The new API is imported into `ffi.ts`, but not yet used. - I have tested the generated code in isolation, but cannot commit tests as we are not currently able to load WASM modules when running in `vitest`.	2023-10-11 13:04:38 +00:00
Pavel Marek	c22928ecc2	Implement private modules (#7840 ) Adds the ability to declare a module as private. Modifies the parser to add the `private` keyword as a reserved keyword. All the checks for private modules are implemented as an independent Compiler pass. No checks are done at runtime. # Important Notes - Introduces new keyword - `private` - a reserved keyword. - Modules that have `private` keyword as the first statement are declared as private (Project private) - Public module cannot have private submodules and vice versa. - This would require runtime access checks - See #7088 for the specification.	2023-10-04 10:33:10 +00:00
Kaz Wesley	e3de68858c	Eager macro resolution. (#7711 ) Resolve macros eagerly. Improves performance; allows parser to handle arbitrarily-long lines (fixes #7691). # Important Notes - A new utility, `lib/rust/parser/debug/tools/parse_all_enso_files.sh`, supports comparing ASTs parsed with different versions of the parser. This tool has been used to verify that this refactor doesn't change the result of parsing any standard library or test file.	2023-09-19 14:26:42 +00:00
Kaz Wesley	7a934cadcc	Parser: Fix handling of syntax errors such as `x =- 2` (#7399 ) * Parser: Correctly handle #7335 syntax errors.	2023-07-27 08:25:45 -07:00
Kaz Wesley	75fda33837	Fix precedence of `-1.x` (#5830 ) Fixes #5826. # Important Notes - Change frontend representation of negation. - Fix a precedence issue: The `.` operators in -1.x and -1.2 must have different precedences. - Remove a no-longer-needed special case from backend translation. - Add tests for this case after all translations.	2023-03-17 18:53:34 +00:00
Kaz Wesley	e171fba301	New documentation parser (#5917 ) Implement new Enso documentation parser; remove old Scala Enso parser. Performance: Total time parsing documentation is now ~2ms. # Important Notes - Doc parsing is now done only in the frontend. - Some engine tests had never been switched to the new parser. We should investigate tests that don't pass after the switch: #5894. - The option to run the old searcher has been removed, as it is obsolete and was already broken before this (see #5909). - Some interfaces used only by the old searcher have been removed.	2023-03-15 15:43:51 +00:00
Kaz Wesley	d1af25793a	Port graph editor to new AST (#4113 ) Use the Rust parser rather than the Scala parser to parse Enso code in the IDE. Implements: - https://www.pivotaltracker.com/story/show/182975925 - https://www.pivotaltracker.com/story/show/182988419 - https://www.pivotaltracker.com/story/show/182970096 - https://www.pivotaltracker.com/story/show/182973659 - https://www.pivotaltracker.com/story/show/182974161 - https://www.pivotaltracker.com/story/show/182974205 There is additional functionality needed before the transition is fully-completed, however I think it's time for this to see review and testing, so I've opened separate issues. In rough order of urgency (these issues are also linked from the corresponding disabled tests): - #5573 - #5571 - #5572 - #5574 # Important Notes The implementation is based partly on translation, and partly on new analysis. Method- and operator-related shapes are translated to the old `Ast` variants, so that all the analysis applied to them doesn't need to be ported at this time. Everything else (mostly "macros" in the old AST) is implemented with new analysis.	2023-02-10 18:05:40 +00:00
Kaz Wesley	4f70bcc5ff	Fix unhandled macro match failure case (#5056 ) (#5599 ) Fix an unhandled case in macro resolution	2023-02-09 23:00:29 +00:00
Michael Mauderer	349cc210e0	Bump rustc to nightly-2023-01-12 (#4053 ) Bump rustc nightly-2022-08-30 and fix new errors and lints. https://www.pivotaltracker.com/story/show/184229094	2023-02-02 23:05:25 +00:00
Kaz Wesley	1097c41297	Fix lexing bug encountered when doc-comments contain space-only lines (#4070 ) See: https://www.pivotaltracker.com/story/show/184249780	2023-01-26 12:53:20 +00:00
Kaz Wesley	591cacb79a	Reject `@` as binary operator (#4021 ) `@` should not be legal to use as a binary operator. I accepted it in the parser because it occurred in the .enso sources, but it was actually used to create a syntax error to test error recovery. See: https://www.pivotaltracker.com/story/show/184054024	2023-01-19 20:31:14 +00:00
Kaz Wesley	e15583fe65	Parser: Support annotations in type defs (#4036 ) Support application of the new type of annotation to method bindings in type definitions.	2023-01-12 16:51:44 +00:00
Kaz Wesley	0a782a2ed1	AST support for new annotation syntax (#4018 ) * rename Annotated -> AnnotatedBuiltin * update Panic.catch test * new annotations * Revert "update Panic.catch test" This reverts commit `ccda46e3a7`.	2023-01-04 09:50:38 -08:00
Kaz Wesley	d24019aa57	Implement SKIP/FREEZE in parser/TreeToIr (#3942 ) See: https://www.pivotaltracker.com/story/show/183919788 # Important Notes `SKIP` would be simpler if implemented in the parser, but there is some work needed before the Rust AST and Java IR are able to represent the results of macro-expansion: https://www.pivotaltracker.com/story/show/184004555	2022-12-20 17:32:59 +00:00
Kaz Wesley	336bbf505c	Parser: Newline normalization in text literals (#3903 ) - Newlines in text literals are now normalized to `\n` when producing IR representation. - Re-enabled tests that were dependent on the old behavior.	2022-11-27 09:40:44 +00:00
Kaz Wesley	580ed74726	Parser: identify lines earlier (#3900 ) Unify line-finding by doing it at the beginning of parsing. See: https://www.pivotaltracker.com/story/show/183797744	2022-11-22 19:59:33 +00:00
Kaz Wesley	8f3bfe8ce2	Parser: Fix @Ignored tests (#3879 ) - 3 ignored cases were old-parser bugs (one of which was quite interesting) - 1 was a new-lexer bug, fixed here See: https://www.pivotaltracker.com/story/show/183797809	2022-11-18 20:58:41 +00:00
Kaz Wesley	a1db36b57c	Support mixed constructors/bindings in types (#3870 ) Libraries: Revert changes that were necessitated by a new rule we have decided not to introduce. Parser: - Support mixed constructors/bindings in types. - Disallow zero-length hex sequences in character escapes: `\x`, `\u`, `\u{}`, `\U`, `\U{}` are no longer legal synonyms for `\0` (matches old parser behavior).	2022-11-14 20:24:07 +00:00
Kaz Wesley	e8f3ad3979	Ensure parses of invalid inputs represent all tokens (#3860 ) Ensure all tokens from the input are represented in trees resulting from invalid inputs--tests now cover every reachable code line that creates an `Invalid` node. (Also implemented stricter validation, mainly of `import`/`export` statements.) See: https://www.pivotaltracker.com/story/show/183405907	2022-11-09 02:57:40 +00:00
Kaz Wesley	330612119a	Parse the standard library (#3830 ) Fix bugs in `TreeToIr` (rewrite) and parser. Implement more undocumented features in parser. Emulate some old parser bugs and quirks for compatibility. Changes in libs: - Fix some bugs. - Clean up some odd syntaxes that the old parser translates idiosyncratically. - Constructors are now required to precede methods. # Important Notes Out of 221 files: - 215 match the old parser - 6 contain complex types the old parser is known not to handle correctly So, compared to the old parser, the new parser parses 103% of files correctly.	2022-10-31 16:19:12 +00:00
Kaz Wesley	feb8eb4f83	fix span bug in doc comments (#3808 )	2022-10-18 20:37:36 +00:00
Kaz Wesley	28daf14f75	Parse case-by-type, add an old-lambda syntax rule (#3802 ) - Special precedence rules for case-of so that `:` operator works without parens or nospace-grouping. - Support an old-lambda syntax: `x->x-> x`. According to the usual rules, the first nospace group would be parsed as an operator section. The expression now parses as a lambda that contains a lambda. - Match old parser treatment of # in doc comments. - Tweak precedence so (a : B = c) works. - Documented constructors.	2022-10-17 22:46:52 +00:00
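
The fix in #11754 above turned on the difference between the `matches!` macro and an exhaustive `match`. A minimal Rust sketch of that difference follows. The `Token` enum and both helper functions are hypothetical names invented for illustration; they are not taken from the enso parser, and the actual bug may have looked different.

```rust
// Hypothetical token kinds, for illustration only; not the enso parser's types.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Token {
    Ident,
    Number,
    Annotation,
}

// Fragile: `matches!` expands to a `match` with a wildcard arm, so any
// variant added to `Token` later silently falls into the `false` case.
fn is_operand_fragile(token: Token) -> bool {
    matches!(token, Token::Ident | Token::Number)
}

// Exhaustive: every variant is listed, so adding a variant to `Token`
// is a compile error here until the new case is handled explicitly.
fn is_operand_exhaustive(token: Token) -> bool {
    match token {
        Token::Ident | Token::Number => true,
        Token::Annotation => false,
    }
}

fn main() {
    // Both functions agree today; they differ only in how they react
    // to future changes of the enum.
    for token in [Token::Ident, Token::Number, Token::Annotation] {
        println!(
            "{:?}: fragile={} exhaustive={}",
            token,
            is_operand_fragile(token),
            is_operand_exhaustive(token)
        );
    }
}
```

The two functions return the same results for the current variants. The exhaustive form trades a little verbosity for a compiler check whenever the enum grows, which is presumably the property the commit message refers to as enforcing exhaustiveness.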

53 Commits