enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-24 19:03:27 +03:00

Author	SHA1	Message	Date
Pavel Marek	a67297aebf	Add graalpy packages to the component directory (#8351 ) Adds these JAR modules to the `component` directory inside Engine distribution: - `graal-language-23.1.0` - `org.bouncycastle.` - these need to be added for graalpy language # Important Notes - Remove `org.bouncycastle.` packages from `runtime.jar` fat jar. - Make sure that the `./run` script preinstalls GraalPy standalone distribution before starting engine tests - Note that using `python -m venv` is only possible from standalone distribution, we cannot distribute `graalpython-launcher`. - Make sure that installation of `numpy` and its polyglot execution example works. - Convert `Text` to `TruffleString` before passing to GraalPy - `8ee9a2816f`	2023-12-04 11:50:59 +00:00
Jaroslav Tulach	81f06456bf	400x faster with linear hashing of the hash map entries (#8425 ) Fixes #5233 by removing `EconomicMap` & co. and using plain old good _linear hashing_. Fixes #8090 by introducing `StorageEntry.removed()` rather than copying the builder on each removal.	2023-12-01 06:43:13 +00:00
James Dunkerley	dd28517d9e	Separating Table tests from the Tests project. (#8397 ) Moving a couple of tests to Table_Tests so Tests no longer depends on Tests.	2023-11-27 16:46:35 +00:00
Jaroslav Tulach	c6eb61a055	Filter for test/Benchmarks (#8391 ) With herein proposed change one can pass an optional filter to `enso --run test/Benchmarks` to execute only groups and specs that contain given string in its name.	2023-11-27 15:27:12 +00:00
Jaroslav Tulach	893965ed5c	3% speedup with LazyMap and MetadataStorage (#8359 )	2023-11-27 10:28:12 +01:00
Radosław Waśko	c6b6384fe6	Improve performance of anti-join (#8338 ) - Closes #8217	2023-11-24 02:44:57 +00:00
Jaroslav Tulach	4464a15035	Benchmark to measure startup time (#8378 ) While trying to speed `MetadataStorage` up - #8324 - I felt the need to have an independent (on my computer) measurement of startup time. Here is a benchmark that measures how long execution of two simple hello world programs take. # Important Notes There are two benchmarks: - `empty_startup` measures the time needed to boot without using any `Standard` library - basically _an overhead of the JVM and engine_ - `hello_world_startup` measures the use of `IO.println` - it shall take longer than `empty_startup` and show the overhead we have while processing the standard library	2023-11-23 21:31:01 +00:00
James Dunkerley	347b5a7cf5	Linting and Groups update (#8357 ) - Fix issues from the linter. - Rename the constructors for `Blank_Selector`. - Update various GROUP tags.	2023-11-21 18:12:27 +00:00
James Dunkerley	ecaca12df1	Integrating Enso Cloud with the libraries (part 1...) (#8006 ) - Add a `File_For_Read` type. Used for `File_Format` to read files. - Added `Enso_User` representing the current user in `Enso_Cloud`. - Will be later able to list known users. - Added `Enso_Secret` representing a value defined in `Enso_Cloud`. - Value not used within Enso only accessed within polyglot Java. - Integrated into `Username_And_Password` and can be used within JDBC connections. - Integrated into HTTP Headers so a secret can be used as a value. - New `URI_With_Query` with the same API as `URI`. Supporting secrets in the value. - Will be integrated with AWS credentials. - Added `Enso_File` representing a file or a folder in the cloud. - Support the same API as `File` (like the `S3_File`). - Will support `enso://` URI style access.	2023-11-20 23:21:14 +00:00
Jaroslav Tulach	1138dfe147	Specify expression to get more advanced results on_return callback (#8331 )	2023-11-20 18:47:11 +01:00
Pavel Marek	5a7ad6bfe4	Upgrade enso to GraalVM for jdk 21 (#7991 ) Upgrade to GraalVM JDK 21. ``` > java -version openjdk version "21" 2023-09-19 OpenJDK Runtime Environment GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15) OpenJDK 64-Bit Server VM GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15, mixed mode, sharing) ``` With SDKMan, download with `sdk install java 21-graalce`. # Important Notes - After this PR, one can theoretically run enso with any JRE with version at least 21. - Removed `sbt bootstrap` hack and all the other build time related hacks related to the handling of GraalVM distribution. - `project-manager` remains backward compatible - it can open older engines with runtimes. New engines now do no longer require a separate runtime to be downloaded. - sbt does not support compilation of `module-info.java` files in mixed projects - https://github.com/sbt/sbt/issues/3368 - Which means that we can have `module-info.java` files only for Java-only projects. - Anyway, we need just a single `module-info.class` in the resulting `runtime.jar` fat jar. - `runtime.jar` is assembled in `runtime-with-instruments` with a custom merge strategy (`sbt-assembly` plugin). Caching is disabled for custom merge strategies, which means that re-assembly of `runtime.jar` will be more frequent. - Engine distribution contains multiple JAR archives (modules) in `component` directory, along with `runner/runner.jar` that is hidden inside a nested directory. - The new entry point to the engine runner is [EngineRunnerBootLoader](https://github.com/enso-org/enso/pull/7991/files#diff-9ab172d0566c18456472aeb95c4345f47e2db3965e77e29c11694d3a9333a2aa) that contains a custom ClassLoader - to make sure that everything that does not have to be loaded from a module is loaded from `runner.jar`, which is not a module. 
- The new command line for launching the engine runner is in [distribution/bin/enso](https://github.com/enso-org/enso/pull/7991/files#diff-0b66983403b2c329febc7381cd23d45871d4d555ce98dd040d4d1e879c8f3725) - [Newest version of Frgaal](https://repo1.maven.org/maven2/org/frgaal/compiler/20.0.1/) (20.0.1) does not recognize `--source 21` option, only `--source 20`.	2023-11-17 18:02:36 +00:00
GregoryTravis	ea3d778456	Allow the creation of a constant column on an in-memory table with no rows. (#8218 )	2023-11-09 14:40:51 +00:00
GregoryTravis	6be94a854b	Implement truncate Date_Time for database backend (#8235 ) Also adds some checks for column names generated for floor, ceil, truncate, round.	2023-11-08 23:23:59 +00:00
Radosław Waśko	1b8b30a68d	Improve performance of `Join_Condition.Between` by sorting on one dimension (#8212 ) - Closes #5303 - Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`) - Adds benchmarks for join - Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks. - We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the number of `Between` conditions used in the query and `k` is the result set size). - Follow up ticket for consideration later: #8216 - Closes #8215 - After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task. - Closes #8204 - Unrelated, but I ran into this here: adds type checks to other arguments of `set`. - Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments) led to a hard-to-understand `Method if_then_else of type Column could not be found.`; now it fails with a type error: `expected Text got Column`.	2023-11-08 12:59:55 +00:00
Radosław Waśko	2ce1567384	Limit `max_rows` that are downloaded in `Table.read` by default, and warn if more rows are available (#8159 ) - Sets the default limit for `Table.read` in Database to be max 1000 rows. - The limit for in-memory compatible API still defaults to `Nothing`. - Adds a warning if there are more rows than limit. - Enables a few unrelated asserts.	2023-11-06 16:41:47 +00:00
Radosław Waśko	237aae33c7	Simplify internal logic of `Table.order_by`, avoid unnecessary warning (#8221 ) - Fixes #8213	2023-11-06 11:00:01 +00:00
GregoryTravis	3c371adbef	Implement Table.format similar to Table.parse allowing to format columns in bulk (#8150 ) * doc * one test * date tests * empty and nothing * ints floats * bools * all columns * regex and index * locales * bad formats * all with one format * docs * examples, not impl db * docs, more errors * cleanup * changelog * check list * reorder * clue * review * review * review * review * review * review * specify time zone	2023-11-02 09:36:36 -04:00
Cassandra-Clark	b5d6628c57	Change filter_blank_rows when_any parameter to have a more user-friendly type (#7935 ) Added Blank_Selector constructor and applied to remove_blank_columns, select_blank_columns, filter_blank_rows for #7931 . Changed when_any to when for readability.	2023-11-01 16:51:15 +00:00
GregoryTravis	d467683ed1	Constant columns (in expressions and Column_Operations) should have clearer names (#8188 ) Previously, constant columns were given generated names with UUIDs in them, which are long and provide no information. Instead, we now use the constant value itself to form the name. Since these new generated names are less unique, we must explicitly make them unique, in cases where the caller did not explicitly set a name.	2023-11-01 14:41:03 +00:00
GregoryTravis	1480f50207	Overhaul the random number and item generation code (#8127 ) Rewrite most of Random.enso.	2023-10-31 15:25:37 +00:00
Radosław Waśko	79011bd550	Implement `Table.lookup_and_replace` in Database (#8146 ) - Closes #7981 - Adds a `RUNTIME_ERROR` operation into the DB dialect, that may be used to 'crash' a query if a condition is met - used to validate if `lookup_and_replace` invariants are still satisfied when the query is materialized. - Removes old `Table_Helpers.is_table` and `same_backend` checks, in favour of the new way of checking this that relies on `Table.from` conversions, and is much simpler to use and also more robust.	2023-10-31 15:19:55 +00:00
Pavel Marek	b084e097c9	Add special handling for Dataflow_Error passed to Runtime.assert (#8168 ) Add special handling for `Dataflow_Error` passed as action to `Runtime.assert`. It caused an infinite recursion.	2023-10-27 17:22:15 +00:00
Radosław Waśko	c1259cb4d2	Compare performance of Panic / Java Exception / Dataflow error (#8130 ) After a discussion, I was really curious that our panics are supposed to be almost free - and while trusting that statement, it was really hard to believe - so I wanted to see for myself - knowing that an experiment is the most robust source of this kind of information - testing that in practice. So I wrote a benchmark comparing various ways of reporting errors, also testing them both at 'shallow' and 'deep' stack traces (adding 200 additional frames) - to see how stack depth affects them, if at all. The panics are indeed blazing fast! Kudos to the engine team. However, it seems that our dataflow errors are relatively slow (and we tend to use them _more_ than panics and want to be using them more and more). This uncovers a possible optimization opportunity. Can we make them as fast as panics?? Analysis of the benchmark results in comment below.	2023-10-24 12:03:44 +00:00
Jaroslav Tulach	e283c78977	Properly convert defaulted arguments before on_call back (#8143 )	2023-10-24 13:39:54 +02:00
Radosław Waśko	0c278391fe	Test and improve handling of `Date_Time with_timezone=False` in Postgres (#8114 ) - Fixes #8049 - Adds tests for handling of Date_Time upload/download in Postgres. - Adds tests for edge cases of handling of Decimal and Binary types in Postgres.	2023-10-21 21:35:13 +00:00
Radosław Waśko	8172896065	Support `Previous_Value` in `fill_nothing` and `fill_missing` (#8105 ) - Adds `Previous_Value` to `fill_nothing` and `fill_empty`, as requested by #7192.	2023-10-20 13:18:53 +00:00
James Dunkerley	ad95dc9815	Fixes for Table viz (#8102 ) - Fixes issue with `to_display_text` on `Date_Time`. Now a reversible format. - Adjusted the auto width code on table so it works. - Manually sized columns should remain fixed size. - Enable Excel range style selection in table. - Removed page support as not implemented yet. # Important Notes Will need to apply to Vue based viz at some point as well.	2023-10-19 08:22:23 +00:00
GregoryTravis	7383db0e04	Restructuring XML into Table form (#8083 ) # Important Notes Adds `.to Table` support, as well as XML support for `expand_column`.	2023-10-19 07:02:48 +00:00
Radosław Waśko	28fc183f92	Review places where we can use `Column_Ref` (#8101 ) Closes #8046	2023-10-18 19:03:50 +00:00
Radosław Waśko	93a31fcc8b	Add benchmarks related to `add_row_number` performance investigation (#8091 ) - Follow-up of #8055 - Adds a benchmark comparing performance of Enso Map and Java HashMap in two scenarios - _only incremental_ updates (like `Vector.distinct`) and _replacing_ updates (like keeping a counter for each key). These benchmarks can be used as a metric for #8090	2023-10-18 17:21:59 +00:00
Radosław Waśko	e9fa12763e	Improve performance of `add_row_number` (#8076 ) Fixes #8055	2023-10-17 00:42:35 +00:00
Radosław Waśko	08b717eb54	Refactor Table problem handling to a more robust and hopefully cleaner approach (#7879 ) Closes #7514	2023-10-16 15:09:08 +00:00
GregoryTravis	f18d1323e1	Add Table.expand_to_rows to allow flattening vector and array values in table (#8042 ) # Important Notes Also includes a fix for a reallocation bug in `InferredBuilder`.	2023-10-13 20:54:06 +00:00
James Dunkerley	b7d7910a88	Use US locale for Date/Time parsing by default. (#8053 ) Fixes issue with parsing long format month names.	2023-10-13 17:47:14 +00:00
James Dunkerley	fac9e7a420	Expand capabilities of `Table.set` and better dropdown support, (#8005 ) - Adds the ability to use numbers, date/time and Boolean values as constants in `set`. - `Table.set` can take a `Column_Operation`, allowing for deriving of a new column based on other columns. - Added `Column_Ref` type to refer to a column in `filter`.	2023-10-13 16:03:28 +00:00
Jaroslav Tulach	31624b5314	Helper method to turn iterator to a Vector (#8035 )	2023-10-12 17:50:51 +02:00
James Dunkerley	0dcfc3e9bf	Minor improvements from last couple of Book Clubs (#8034 ) - Added some ALIASes. - Added `sheet` to `Excel_Workbook` to give familiar API to read sheet. - Added conversion from range to vector allowing easy use with Zip. - Add `Map.from_keys_and_values`.	2023-10-12 14:29:59 +00:00
Radosław Waśko	cd84ac16ce	Restructure `Table.from_objects` to use conversions (#8020 ) Closes #7957	2023-10-11 22:25:18 +00:00
Jaroslav Tulach	7ae0971d11	Runtime checks of signatures with polyglot java classes (#8016 )	2023-10-11 15:05:36 +02:00
GregoryTravis	a2c8940218	Add tests for Table.from_objects and Table.expand_column. (#8010 )	2023-10-11 11:50:51 +00:00
somebody1234	826127d8ff	Eliminate line feeds from `XML.outer_xml` on Windows (#8013 ) - Closes #7999 # Important Notes None	2023-10-10 23:21:34 +00:00
Radosław Waśko	6f78570115	Fix a DROP table bug, add SQL debug logging (#8007 ) - Fixes a bug where creating a temporary table could accidentally issue a `DROP` statement of the table name that the user provided, risking destruction of user data. - Fortunately, the bad scenario was almost impossible, because the `DROP` statement was only issued _if_ we previously checked that the mentioned table _does not exist_ - dropping a nonexistent table does not do any harm. - It could have been dangerous in a very unlikely scenario that a table was created just between the _existence check_ and the _drop_. - After the fix the existence check and any modifications are done within a transaction to avoid interference from concurrent modifications, and the DROP is correctly applied to a temporary Enso table instead of the original one. - Replaced a temporary log with proper simple logging of SQL statements into a file, if an Environment variable is set. - Used that feature to test that no unexpected statements occur.	2023-10-10 13:16:06 +00:00
Radosław Waśko	6e0bd86753	Implement `Table.lookup_and_replace` for in-memory (#7979 ) - Closes #7749 implementing the in-memory logic. - Additional complications have surfaced regarding the Database logic, so it has been split off into a separate ticket: #7981	2023-10-10 10:42:06 +00:00
Jaroslav Tulach	a234e82ee9	Instrumenter to observe behavior of nodes with UUID (#7833 ) Exposing instrumentation capabilities to Enso. Fixes #7683.	2023-10-10 02:36:59 +00:00
GregoryTravis	9ba7be20af	Basic XML support (#7947 ) This PR includes * Reading XML from a file, stream, or string * Reading XML via Data.fetch * Accessing the root element, element children, and attributes * Accessing tag text contents * Get tags by name * Inner / Outer XML string	2023-10-06 17:52:19 +00:00
Pavel Marek	c525698e5c	Enable Enso asserts on CI (#7929 ) --------- Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2023-10-04 15:52:44 +02:00
Jaroslav Tulach	421e3f22a4	Avoid unnecessary conversions (#7952 ) Fixes #7854 by checking all elements of a union type for direct match and only then, if direct match isn't available, applying conversions.	2023-10-04 12:31:06 +02:00
Radosław Waśko	0cd446432f	Fix inconsistency when building a Mixed column, fixes to Union (#7919 ) - Fixes #7352 by remembering original value types in type inference mode to be able to reconstruct them for Mixed. - Added more benchmarks for comparing performance of constructing columns. - Fixes missing implementations that caused `Table.union` crashing on some type pairs. - Ensures that `Loss_Of_Integer_Precision` warning is not swallowed when numeric columns are unioned to create a `Float` column. - Adds test for all of the above cases. - Allow to output benchmark results to a CSV by setting an environment variable - useful for quickly comparing benchmarks, e.g. in Enso.	2023-10-03 20:33:34 +02:00
Radosław Waśko	08cd449a99	Fix `NumberParser` to avoid `thousandSeparator==decimalPoint` and prefer US decimal format (#7946 ) Closes #7930	2023-10-03 20:07:54 +02:00
Radosław Waśko	3222e5af62	Avoid exponential growth of column names (#7934 ) - Fixes #7933 - Avoids duplicating `]` as `]]` in generated column names - now column names grow linearly.	2023-10-02 16:05:28 +00:00
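
Commit 81f06456bf above mentions replacing `EconomicMap` with linear hashing and marking removed entries instead of copying the builder on each removal. The general technique can be sketched as below. This is only an illustration of open addressing with linear probing and removal markers ("tombstones"); the class `LinearProbeMap` and all of its members are hypothetical names, and nothing here is taken from Enso's actual implementation.

```java
import java.util.Objects;

/** Hypothetical sketch of a linear-probing hash map; not Enso's code. */
final class LinearProbeMap<K, V> {
  // Sentinel stored in a key slot whose entry was removed (a "tombstone").
  private static final Object REMOVED = new Object();

  private Object[] keys = new Object[8];   // capacity is always a power of two
  private Object[] values = new Object[8];
  private int size;                        // live entries
  private int used;                        // live entries plus tombstones

  int size() { return size; }

  @SuppressWarnings("unchecked")
  V get(K key) {
    int i = find(key);
    return i < 0 ? null : (V) values[i];
  }

  void put(K key, V value) {
    Objects.requireNonNull(key);
    // Keep at least half of the slots empty so every probe sequence terminates.
    if ((used + 1) * 2 > keys.length) rehash();
    int mask = keys.length - 1;
    int firstTombstone = -1;
    for (int i = index(key, mask); ; i = (i + 1) & mask) {
      Object k = keys[i];
      if (k == null) {
        // Key is absent: prefer reusing the first tombstone seen on the way.
        int target = firstTombstone >= 0 ? firstTombstone : i;
        if (target == i) used++;
        keys[target] = key;
        values[target] = value;
        size++;
        return;
      }
      if (k == REMOVED) {
        if (firstTombstone < 0) firstTombstone = i;
      } else if (k.equals(key)) {
        values[i] = value;                 // overwrite existing entry
        return;
      }
    }
  }

  /** Marks the slot as removed in place; no copying of the table. */
  boolean remove(K key) {
    int i = find(key);
    if (i < 0) return false;
    keys[i] = REMOVED;
    values[i] = null;
    size--;
    return true;
  }

  // Probes linearly from the hashed slot until the key or an empty slot is found.
  private int find(Object key) {
    int mask = keys.length - 1;
    for (int i = index(key, mask); ; i = (i + 1) & mask) {
      Object k = keys[i];
      if (k == null) return -1;
      if (k != REMOVED && k.equals(key)) return i;
    }
  }

  private static int index(Object key, int mask) {
    int h = key.hashCode();
    return (h ^ (h >>> 16)) & mask;
  }

  // Rebuilds the table without tombstones, growing it if the live entries need room.
  @SuppressWarnings("unchecked")
  private void rehash() {
    Object[] oldKeys = keys;
    Object[] oldValues = values;
    int cap = oldKeys.length;
    while ((size + 1) * 2 > cap) cap *= 2;
    keys = new Object[cap];
    values = new Object[cap];
    size = 0;
    used = 0;
    for (int i = 0; i < oldKeys.length; i++) {
      if (oldKeys[i] != null && oldKeys[i] != REMOVED) {
        put((K) oldKeys[i], (V) oldValues[i]);
      }
    }
  }
}
```

In this sketch a removal only overwrites one slot, so its cost does not depend on the table size; the tombstones are cleared in bulk the next time the table is rehashed.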


647 Commits