enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-24 08:12:06 +03:00

Author	SHA1	Message	Date
Jaroslav Tulach	5ce173316b	More improvements that work with both parsers (#3868 )	2022-11-12 02:34:14 +01:00
Marcin Kostrzewa	23e04f905f	Another attempt at M1 compilation (#3859 )	2022-11-09 15:26:25 +00:00
Kaz Wesley	e8f3ad3979	Ensure parses of invalid inputs represent all tokens (#3860 ) Ensure all tokens from the input are represented in trees resulting from invalid inputs--tests now cover every reachable code line that creates an `Invalid` node. (Also implemented stricter validation, mainly of `import`/`export` statements.) See: https://www.pivotaltracker.com/story/show/183405907	2022-11-09 02:57:40 +00:00
Jaroslav Tulach	c2633bc137	Metadata, in context and imports (#3856 ) Another set of improvements extracted from #3611. This time it includes a fix to the Rust part of the parser. # Important Notes After digging into metadata parsing I realized the positions used to query the BTree data structure are wrong. This PR tries to address that by re-arranging the order of serialized fields and passing `startCode` and `endCode` locations in. Originally I though I need changes on the Rust side to support `in` operator. Turned out I can do that just with changes on the Java side. Qualified names in imports were missing UUIDs. Fixed now.	2022-11-07 19:05:19 +00:00
Wojciech Daniło	48ce68cda1	Wip/wdanilo/text shape system single scene 183406745 (#3776 )	2022-11-03 08:35:06 +01:00
Jaroslav Tulach	85f71cbfc8	Including enso_parser library in the engine distribution (#3842 ) Make sure `libenso_parser.so`, `.dll` or `.dylib` are packaged and included when `sbt buildEngineDistribution`. # Important Notes There was [a discussion](https://discord.com/channels/401396655599124480/1036562819644141598) about proper location of the library. It was concluded that _"there's no functional difference between a dylib and a jar."_ and as such the library is placed in `component` folder. Currently the old parser is still used for parsing. This PR just integrates the build system changes and makes us ready for smooth flipping of the parser in the future as part of #3611.	2022-11-02 17:13:53 +00:00
Kaz Wesley	330612119a	Parse the standard library (#3830 ) Fix bugs in `TreeToIr` (rewrite) and parser. Implement more undocumented features in parser. Emulate some old parser bugs and quirks for compatibility. Changes in libs: - Fix some bugs. - Clean up some odd syntaxes that the old parser translates idiosyncratically. - Constructors are now required to precede methods. # Important Notes Out of 221 files: - 215 match the old parser - 6 contain complex types the old parser is known not to handle correctly So, compared to the old parser, the new parser parses 103% of files correctly.	2022-10-31 16:19:12 +00:00
Jaroslav Tulach	21fe0b8865	Process documentation in type definitions (#3806 ) Another part of #3611 ready for integration into `develop` branch. # Important Notes Test `org.enso.compiler.EnsoCompilerTest.testTestGroup` is ignored as it has problems with source offsets - identifiers don't have the appropriate names due to `Tree.codeRepr()` being _off_.	2022-10-19 09:19:42 +00:00
Kaz Wesley	feb8eb4f83	fix span bug in doc comments (#3808 )	2022-10-18 20:37:36 +00:00
Kaz Wesley	28daf14f75	Parse case-by-type, add an old-lambda syntax rule (#3802 ) - Special precedence rules for case-of so that `:` operator works without parens or nospace-grouping. - Support an old-lambda syntax: `x->x-> x`. According to the usual rules, the first nospace group would be parsed as an operator section. The expression now parses as a lambda that contains a lambda. - Match old parser treatment of # in doc comments. - Tweak precedence so (a : B = c) works. - Documented constructors.	2022-10-17 22:46:52 +00:00
Kaz Wesley	2740406f93	Lex doc comments and attach text to AST (#3795 ) - New `Documented` node attaches documentation, lexed as a raw text literal, to a statement. - Handle a case of lambdas with body blocks.	2022-10-15 06:13:32 +00:00
Kaz Wesley	0e412044f6	Macro contexts (#3792 ) - Implement macro-contexts-lite (`from` is now only a keyword at the beginning of a line) - Support special nospace-group handling for old lambdas (so expressions like this work: `x-> y-> x + y`) - Fix a text-escape incompatibility # Important Notes - There is now an `OperatorFunction`, which is like a `Function` but has an operator for a name, and likewise an `OperatorTypeSignature`.	2022-10-13 22:47:02 +00:00
Kaz Wesley	5668cbcc24	Qualified defs (#3785 ) Allow qualified names in LHS of type signatures and method definitions.	2022-10-12 17:40:16 +00:00
Michał Wawrzyniec Urbańczyk	ad69eeb4ad	Build script merge (#3743 ) Merged the build script into main repository. Some related cleanups.	2022-10-10 23:38:48 +02:00
Kaz Wesley	2fab9ee1e9	Implement annotations (#3780 ) - `->` lambda operator isn't bound by nospace groups; see new test case. - Implemented annotations.	2022-10-10 07:09:01 +00:00
Kaz Wesley	44a031f9f0	Parser: Full constructor syntax for type definitions; Field syntax; Complex operator sections; Template functions; Text improvements; Operator methods; eliminate Unsupported; better ArgumentDefinitions (#3716 ) I believe all parse failures remaining after these changes are because the new parser is intentionally stricter about some things. I'll be reviewing those failures and opening a bug to change the library/tests code. Implements: - https://www.pivotaltracker.com/story/show/182941610: full type def syntax - https://www.pivotaltracker.com/story/show/182497490: field syntax - https://www.pivotaltracker.com/story/show/182497395: complex operator sections - https://www.pivotaltracker.com/story/show/182497236: template functions - `codeRepr` without leading whitespace - text literals: interpret escape sequences in lexer - the multiline text-literal left-trim algorithm - type operator-methods - the `<=` operator is no longer treated as a modifier - https://www.pivotaltracker.com/story/show/183315038: eliminate Unsupported - use ArgumentDefinition for type constructor arguments - more detailed ArgumentDefinition type	2022-10-05 04:45:31 +00:00
Wojciech Daniło	61546a7ade	Wip/wdanilo/widgets 182746060 (#3678 )	2022-10-04 04:51:27 +02:00
Jaroslav Tulach	9134f9b2d7	EnsoCompilerTest to verify compatibility of parsers (#3723 ) Adding new _compatibility test_ `EnsoCompilerTest` to verify the new Rust based parser can produce the same `IR` as the original `AST` based one. The simplest way to execute the test from an empty repository is: ```bash enso$ sbt bootstrap enso$ sbt "testOnly *EnsoCompilerTest" ``` There are [GitHub Actions run](https://github.com/enso-org/enso/actions/runs/3087664644/jobs/4993266212#step:9:5187) on Linux as well as [run on Windows](https://github.com/enso-org/enso/actions/runs/3087664644/jobs/4993266370#step:9:5254) that show `EnsoCompilerTest` is being executed by the CI (good, as that means `.so` was properly built and linked to the JVM running the test). The [linux](https://github.com/enso-org/enso/actions/runs/3087664644/jobs/4993266212#step:9:5187) as well as [windows](https://github.com/enso-org/enso/actions/runs/3087664644/jobs/4993266370#step:9:5254) runs also demonstrate that failures in the `EnsoCompilerTest` suite fail the CI. # Important Notes Right now [there are five test failures](https://github.com/enso-org/enso/actions/runs/3087664644/jobs/4993266212#step:9:5187) - waiting for @kazcw to make sure `codeRepr()` doesn't contain spaces. However, as this PR is more about the infrastructure, I am disabling the currently failing tests in [`031169b`](`031169bd05`)	2022-09-20 15:50:27 +00:00
Kaz Wesley	d8f274158a	Parser: Named and default arguments; Text interpolation; Escape sequences (#3709 ) * named and default arguments * text interpolation and escapes * work around a limitation of Java	2022-09-14 22:32:28 -07:00
Kaz Wesley	605bd08e8d	Parser: Utf16, recursive spans, `toString`, lambdas, case expressions, operator precedence, array and tuple literals, numeric literals (#3706 ) Implements: - https://www.pivotaltracker.com/story/show/182807114 - Utf16 - https://www.pivotaltracker.com/story/show/182931097 - recursive span info - https://www.pivotaltracker.com/story/show/182940917 - readable `toString` - https://www.pivotaltracker.com/story/show/182497196 - lambdas - https://www.pivotaltracker.com/story/show/182497518 - case expressions - https://www.pivotaltracker.com/story/show/182497344 - operator precedence and associativity - https://www.pivotaltracker.com/story/show/182497111 - array and tuple literals - https://www.pivotaltracker.com/story/show/182496909 - numeric literals	2022-09-14 18:09:58 +00:00
Kaz Wesley	1e3b9a3624	Parse text literals (#3681 ) Parse text literals. See: https://www.pivotaltracker.com/story/show/182496940 # Important Notes - The left-trimming algorithm (https://github.com/enso-org/design/blob/wip/wd/enso-spec/epics/enso-spec-1.0/04.%20Expressions.md#inline-and-block-text-literals) requires two passes over the sequence of text segments. This implementation performs one pass while parsing (identifying the correct amount of trim). The other pass (applying the trim) can be done when building the value of the quoted string: Trim the amount of whitespace identified by the `trim` field off of the whitespace of each `TextSection` (the value will not exceed the amount of whitespace found in the tokens' offsets, except for tokens with 0 offset, in which case no trimming is necessary/possible).	2022-09-03 06:38:06 +00:00
Kaz Wesley	c3f758e0dc	Parser: Parse UUIDs; implement comments in AST; implement type annotations and signatures; fix field names (#3653 ) Implements: - UUIDs: https://www.pivotaltracker.com/story/show/182931137 - Comments: https://www.pivotaltracker.com/story/show/182981779 - Type annotations and signatures: https://www.pivotaltracker.com/story/show/182497454 - Fix getter names (https://github.com/enso-org/enso/pull/3627#discussion_r940887460). # Important Notes - I can't fully test UUIDs; I have tested that the data obtained in Rust matches my understanding of how the format is supposed to work. What remains to be tested is that the data in Java matches the way the old parser handles the format. So @JaroslavTulach, let me know if you see any cases where I'm not returning the same values. - This implementation of type annotations and signatures accepts any expression in type context. It would probably be nice to narrow this down at some point, but for now I have no design info on what specifically should be allowed in type expressions; this implementation should be at least an incremental improvement.	2022-09-03 03:15:27 +00:00
Wojciech Daniło	4b96b4887c	Better fonts support. (#3616 )	2022-08-27 00:25:34 +02:00
Kaz Wesley	60b1dce79e	Parser: hide internal APIs in generated Java (#3605 ) Now that there's a public `org.enso.syntax2.Parser` interface (after #3599), make APIs that don't need to be exposed package-private.	2022-08-09 23:32:49 +02:00
Kaz Wesley	db7593b3fd	Parser: Unary minus (#3626 ) Implement unary minus. https://www.pivotaltracker.com/story/show/182497332 # Important Notes - This one had a lot of edge cases, so it has a lot of tests.	2022-08-09 20:31:23 +00:00
Ilya Bogdanov	9f8829650a	Layouting algorithm for the Favorites section (#3625 ) [ci no changelog needed] [Task link](https://www.pivotaltracker.com/story/show/181431035) This PR implements an algorithm that arranges component groups of the Favorites section into three columns, with the more important groups being accessible by fewer keystrokes. [The algorithm description](https://github.com/enso-org/design/blob/main/epics/component-browser/design.md#layouting-algorithm). https://user-images.githubusercontent.com/6566674/181642100-62769419-66e0-4a82-8dd8-be10662745d6.mp4	2022-08-05 09:39:51 +00:00
Kaz Wesley	796b1b5b82	Parser: implement import (#3627 ) Based on usage; I believe this handles every case in current `.enso` files. # Important Notes - `import` is a built-in macro, so an import statement parses as a `MultiSegmentApp`. - Every `import` syntax will have a segment whose leading keyword is `import`; however `import` macros can be identified more efficiently by looking at only the first keyword. A `MultiSegmentApp` is an import if and only if its first keyword is in the set { "polyglot", "from", "import" }.	2022-08-02 15:09:20 +00:00
Kaz Wesley	c525b201b9	Parser: don't panic for any standard library files (#3609 )	2022-07-28 19:17:33 +02:00
Kaz Wesley	c670718e3c	JNI bindings for enso-parser (#3599 ) Provide a JNI dynamic-library interface to `enso_parser`. # Important Notes - The library can be built with: `cargo build -p enso-parser-jni`. - A new `org.enso.syntax2.Parser` API is implemented on top of the JNI interface provided by `enso-parser-jni`. - We are using the `jni` crate, since apparently Java cannot just call C-ABI functions. The crate is not well-maintained. I came across an obviously-unsound `safe` function, and found it was reported over a year ago, with a PR to fix: jni-rs/jni-rs#303. However our needs are simple. We can't trust any safety guarantees they imply, but I think we are unlikely to encounter any logic bugs using the basic bindings.	2022-07-25 14:24:21 +00:00
Kaz Wesley	3b99e18f94	Code blocks (#3585 )	2022-07-20 16:53:20 +02:00
Kaz Wesley	100eeda673	Simple assignments and function definitions (#3572 ) implement simple variable assignments and function definitions. This implements: - https://www.pivotaltracker.com/story/show/182497122 - https://www.pivotaltracker.com/story/show/182497144 (the code blocks are not created yet, but the function declaration is recognized.) # Important Notes - Introduced S-expression-based tests, and pretty-printing-roundtrip testing. - Started writing tests for TypeDef based on the examples in the issue. None of them parse successfully. - Fixed Number tokenizing. - Moved most contents of parser's `main.rs` to `lib.rs` (fixes a warning).	2022-07-07 22:31:00 +00:00
Kaz Wesley	bc66078251	Parser: Transpile Rust AST types to Java types (#3555 ) Implement generation of Java AST types from the Rust AST type definitions, with support for deserializing in Java syntax trees created in Rust. ### New Libraries #### `enso-reflect` Implements a `#[derive(Reflect)]` macro to enable runtime analysis of datatypes. Macro interface includes helper attributes; the Rust types and the `reflect` attributes applied to them fully determine the Java types ultimately produced (by `enso-metamodel`). This is the most important API, as it is used in the subject crates (`enso-parser`, and dependencies with types used in the AST). [Module docs](https://github.com/enso-org/enso/blob/wip/kw/parser/ast-transpiler/lib/rust/reflect/macros/src/lib.rs). #### `enso-metamodel` Provides data models for data models in Rust/Java/Meta (a highly-abstracted language-independent model--I have referred to it before as the "generic representation", but that was an overloaded term). The high-level interface consists of operations on data models, and between them. For example, the only operations needed by [the binary that drives datatype transpilation](https://github.com/enso-org/enso/blob/wip/kw/parser/ast-transpiler/lib/rust/parser/generate-java/src/main.rs) are: `rust::to_meta`, `java::from_meta`, `java::transform::optional_to_null`, `java::to_syntax`. The low-level interface consists of direct usage of the datatypes; this is used by [the module that implements some serialization overrides](https://github.com/enso-org/enso/blob/wip/kw/parser/ast-transpiler/lib/rust/parser/generate-java/src/serialization.rs) (so that the Java interface to `Code` references can produce `String`s on demand based on serialized offset/length pairs). The serialization override mechanism is based on customizing, not replacing, the generated deserialization methods, so as to be as robust as possible to changes in the Rust source or in the transpilation process. ### Important Notes - Rust/Java serialization is exhaustively tested for structural compatibility. A function [`metamodel::meta::serialization::testcases`](https://github.com/enso-org/enso/blob/wip/kw/parser/ast-transpiler/lib/rust/metamodel/src/meta/serialization.rs) uses `reflect`-derived data to generate serialized representations of ASTs to use as test cases. Its should-accept cases cover every type a tree can contain; it also produces a representative set of should-reject cases. A Rust `#[test]` confirms that these cases are accepted/rejected as expected, and generated Java tests (see Binaries below) check the generated Java deserialization code against the same test cases. - Deserializing `Code` is untested. The mechanism is in place (in Rust, we serialize only the offset/length of the `Cow`; in Java, during deserialization we obtain a context object holding a buffer for all string data; the accessor generated in Java uses the buffer and the offset/length to return `String`s), but it will be easier to test once we have implemented actually parsing something and instantiating the `Cow`s with source code. - `#[tagged_enum]` [now supports](https://github.com/enso-org/enso/blob/wip/kw/parser/ast-transpiler/lib/rust/shapely/macros/src/tagged_enum.rs#L36-L51) control over what is done with container-level attributes; they can be applied to the container and variants (default), only to the container, or only to variants. - Generation of `sealed` classes is supported, but currently disabled by `TARGET_VERSION` in `metamodel::java::syntax` so that tests don't require Java 15 to run. (The same logic is run either way; there is a shallow difference in output.) ### Binaries The `enso-parser-generate-java` crate defines several binaries: - `enso-parser-generate-java`: Performs the transpilation; after integration, this will be invoked by the build script. - `java-tests`: Generates the Java code that tests format deserialization; after integration this command will be invoked by the build script, and its Java output compiled and run during testing. - `graph-rust`/`graph-meta`/`graph-java`: Produce GraphViz representations of data models in different typesystems; these are for developing and understanding model transformations. Until integration, a script regenerates the Java and runs the format tests: `./tools/parser_generate_java.sh`. The generated code can be browsed in `target/generated_java`.	2022-07-07 04:46:42 +02:00
Wojciech Daniło	7c0330290f	Parser improvements (#3549 )	2022-07-01 05:42:29 +02:00
Michał Wawrzyniec Urbańczyk	14a01c4635	New IDE build script (#3466 )	2022-05-23 04:16:04 +02:00
Wojciech Daniło	9e219d698c	Initial parser implementation in Rust (#3341 )	2022-05-17 05:13:20 +02:00
Wojciech Danilo	55dce7af65	Fixing build.	2022-03-10 06:21:57 +01:00
Wojciech Danilo	2accb722a8	Applying review	2022-03-10 05:57:59 +01:00
Wojciech Danilo	73f33d1c7f	Linting codebase	2022-03-10 05:32:33 +01:00
Michael Mauderer	32cfb0333a	Bump Rust toolchain to `nightly-2022-01-20`. (#3255 )	2022-02-16 12:58:02 +00:00
Adam Obuchowicz	c68ac5c0d6	Integration Test Framework (#3257 )	2022-02-11 13:19:02 +01:00
Adam Obuchowicz	c4d22102cf	Switch to 2021 edition (#3173 )	2021-12-01 16:06:57 +01:00
Adam Obuchowicz	9ab4f45e72	Refactored `enso-data` crate and text utilities. (#3166 )	2021-11-25 11:45:42 +01:00
Adam Obuchowicz	753f787a6f	Refactored config crate + moving crates to lib/rust directory. (#3155 )	2021-11-12 13:56:23 +01:00
Adam Obuchowicz	942464cbaf	Refactoring: merge utils into prelude; merge workspaces. (#3151 )	2021-11-10 14:36:08 +01:00
Michał Wawrzyniec Urbańczyk	848cef7dca	Use libraries merged from `rust-lib` directly (#3137 )	2021-11-05 16:55:55 +01:00
Ara Adkins	48bcebc723	Update to GraalVM 21.1.0 (#1738 )	2021-05-14 13:08:39 +01:00
Dmitry Bushev	e2b583c64a	Return Correct Name for the Module in AtomConstructor (#1375 ) return the module name if it's the atom constructor of the module.	2021-01-11 14:29:50 +03:00
Ara Adkins	4cc36e8c81	Preparation for more parser work (#1363 )	2020-12-18 14:25:30 +00:00
Josef	5345bdcb3e	Stub Out the Parser Interface (#1065 )	2020-08-14 11:10:52 +02:00

49 Commits