enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-11-27 06:32:30 +03:00

Author	SHA1	Message	Date
Jaroslav Tulach	7f0cb88fa1	Consistent simple and qualified type name (#8448 ) Fixes #8255 by unifying `get_qualified_type_name` and `get_simple_type_name` implementations.	2023-12-06 04:30:24 +00:00
Radosław Waśko	c6b6384fe6	Improve performance of anti-join (#8338 ) - Closes #8217	2023-11-24 02:44:57 +00:00
James Dunkerley	f60836d9e1	Apply ICONs (#8360 ) - Amend a couple of missed groups. - Add the first pass of some ICONs. The linter tool has been updated to support rewriting the ICON as well.	2023-11-22 15:24:16 +00:00
James Dunkerley	347b5a7cf5	Linting and Groups update (#8357 ) - Fix issues from the linter. - Rename the constructors for `Blank_Selector`. - Update various GROUP tags.	2023-11-21 18:12:27 +00:00
James Dunkerley	ecaca12df1	Integrating Enso Cloud with the libraries (part 1...) (#8006 ) - Add a `File_For_Read` type. Used for `File_Format` to read files. - Added `Enso_User` representing the current user in `Enso_Cloud`. - Will be later able to list known users. - Added `Enso_Secret` representing a value defined in `Enso_Cloud`. - Value not used within Enso only accessed within polyglot Java. - Integrated into `Username_And_Password` and can be used within JDBC connections. - Integrated into HTTP Headers so a secret can be used as a value. - New `URI_With_Query` with the same API as `URI`. Supporting secrets in the value. - Will be integrated with AWS credentials. - Added `Enso_File` representing a file or a folder in the cloud. - Support the same API as `File` (like the `S3_File`). - Will support `enso://` URI style access.	2023-11-20 23:21:14 +00:00
Jaroslav Tulach	ba19813511	Speeding up "hello world" example by 16%	2023-11-19 16:38:31 +01:00
Pavel Marek	5a7ad6bfe4	Upgrade enso to GraalVM for jdk 21 (#7991 ) Upgrade to GraalVM JDK 21. ``` > java -version openjdk version "21" 2023-09-19 OpenJDK Runtime Environment GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15) OpenJDK 64-Bit Server VM GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15, mixed mode, sharing) ``` With SDKMan, download with `sdk install java 21-graalce`. # Important Notes - After this PR, one can theoretically run enso with any JRE with version at least 21. - Removed `sbt bootstrap` hack and all the other build time related hacks related to the handling of GraalVM distribution. - `project-manager` remains backward compatible - it can open older engines with runtimes. New engines now do no longer require a separate runtime to be downloaded. - sbt does not support compilation of `module-info.java` files in mixed projects - https://github.com/sbt/sbt/issues/3368 - Which means that we can have `module-info.java` files only for Java-only projects. - Anyway, we need just a single `module-info.class` in the resulting `runtime.jar` fat jar. - `runtime.jar` is assembled in `runtime-with-instruments` with a custom merge strategy (`sbt-assembly` plugin). Caching is disabled for custom merge strategies, which means that re-assembly of `runtime.jar` will be more frequent. - Engine distribution contains multiple JAR archives (modules) in `component` directory, along with `runner/runner.jar` that is hidden inside a nested directory. - The new entry point to the engine runner is [EngineRunnerBootLoader](https://github.com/enso-org/enso/pull/7991/files#diff-9ab172d0566c18456472aeb95c4345f47e2db3965e77e29c11694d3a9333a2aa) that contains a custom ClassLoader - to make sure that everything that does not have to be loaded from a module is loaded from `runner.jar`, which is not a module. - The new command line for launching the engine runner is in [distribution/bin/enso](https://github.com/enso-org/enso/pull/7991/files#diff-0b66983403b2c329febc7381cd23d45871d4d555ce98dd040d4d1e879c8f3725) - [Newest version of Frgaal](https://repo1.maven.org/maven2/org/frgaal/compiler/20.0.1/) (20.0.1) does not recognize `--source 21` option, only `--source 20`.	2023-11-17 18:02:36 +00:00
GregoryTravis	ea3d778456	Allow the creation of a constant column on an in-memory table with no rows. (#8218 )	2023-11-09 14:40:51 +00:00
GregoryTravis	6be94a854b	Implement truncate Date_Time for database backend (#8235 ) Also adds some checks for column names generated for floor, ceil, truncate, round.	2023-11-08 23:23:59 +00:00
Radosław Waśko	1b8b30a68d	Improve performance of `Join_Condition.Between` by sorting on one dimension (#8212 ) - Closes #5303 - Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`) - Adds benchmarks for join - Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks. - We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the amount of `Between` conditions used in the query and `k` is the result set size). - Follow up ticket for consideration later: #8216 - Closes #8215 - After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task. - Closes #8204 - Unrelated, but I ran into this here: adds type checks to other arguments of `set`. - Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments), lead to a hard to understand `Method `if_then_else` of type Column could not be found.`, instead now it would file with type error 'expected Text got Column`.	2023-11-08 12:59:55 +00:00
Radosław Waśko	2ce1567384	Limit `max_rows` that are downloaded in `Table.read` by default, and warn if more rows are available (#8159 ) - Sets the default limit for `Table.read` in Database to be max 1000 rows. - The limit for in-memory compatible API still defaults to `Nothing`. - Adds a warning if there are more rows than limit. - Enables a few unrelated asserts.	2023-11-06 16:41:47 +00:00
James Dunkerley	a850ecb787	Change widget for Order By. (#8226 ) Simplifies the Order By drop and always adds as a `Sort_Column`. If user wants to do descending is a two step process but feels more natural.	2023-11-06 16:27:36 +00:00
Radosław Waśko	237aae33c7	Simplify internal logic of `Table.order_by`, avoid unnecessary warning (#8221 ) - Fixes #8213	2023-11-06 11:00:01 +00:00
GregoryTravis	3c371adbef	Implement Table.format similar to Table.parse allowing to format columns in bulk (#8150 ) * doc * one test * date tests * empty and nothing * ints floats * bools * all columns * regex and index * locales * bad formats * all with one format * docs * examples, not impl db * docs, more errors * cleanup * changelog * check list * reorder * clue * review * review * review * review * review * review * specify time zone	2023-11-02 09:36:36 -04:00
Cassandra-Clark	b5d6628c57	Change filter_blank_rows when_any parameter to have a more user-friendly type (#7935 ) Added Blank_Selector constructor and applied to remove_blank_columns, select_blank_columns, filter_blank_rows for #7931 . Changed when_any to when for readability.	2023-11-01 16:51:15 +00:00
GregoryTravis	d467683ed1	Constant columns (in expressions and Column_Operations) should have clearer names (#8188 ) Previously, constant columns were given generated names with UUIDs in them, which are long and provide no information. Instead, we now use the constant value itself to form the name. Since these new generated names are less unique, we must explicitly make them unique, in cases where the caller did not explicilty set a name.	2023-11-01 14:41:03 +00:00
Radosław Waśko	79011bd550	Implement `Table.lookup_and_replace` in Database (#8146 ) - Closes #7981 - Adds a `RUNTIME_ERROR` operation into the DB dialect, that may be used to 'crash' a query if a condition is met - used to validate if `lookup_and_replace` invariants are still satisfied when the query is materialized. - Removes old `Table_Helpers.is_table` and `same_backend` checks, in favour of the new way of checking this that relies on `Table.from` conversions, and is much simpler to use and also more robust.	2023-10-31 15:19:55 +00:00
Radosław Waśko	0c278391fe	Test and improve handling of `Date_Time with_timezone=False` in Postgres (#8114 ) - Fixes #8049 - Adds tests for handling of Date_Time upload/download in Postgres. - Adds tests for edge cases of handling of Decimal and Binary types in Postgres.	2023-10-21 21:35:13 +00:00
Radosław Waśko	8172896065	Support `Previous_Value` in `fill_nothing` and `fill_missing` (#8105 ) - Adds `Previous_Value` to `fill_nothing` and `fill_empty`, as requested by #7192.	2023-10-20 13:18:53 +00:00
GregoryTravis	7383db0e04	Restructuring XML into Table form (#8083 ) # Important Notes Adds `.to Table` support, as well as XML support for `expand_column`.	2023-10-19 07:02:48 +00:00
Radosław Waśko	28fc183f92	Review places where we can use `Column_Ref` (#8101 ) Closes #8046	2023-10-18 19:03:50 +00:00
Radosław Waśko	e9fa12763e	Improve performance of `add_row_number` (#8076 ) Fixes #8055	2023-10-17 00:42:35 +00:00
Radosław Waśko	08b717eb54	Refactor Table problem handling to a more robust and hopefully cleaner approach (#7879 ) Closes #7514	2023-10-16 15:09:08 +00:00
GregoryTravis	f18d1323e1	Add Table.expand_to_rows to allow flattening vector and array values in table (#8042 ) # Important Notes Also includes a fix for a reallocation bug in `InferredBuilder`.	2023-10-13 20:54:06 +00:00
James Dunkerley	fac9e7a420	Expand capabilities of `Table.set` and better dropdown support, (#8005 ) - Adds the ability to use numbers, date/time and Boolean values as constants in `set`. - `Table.set` can take a `Column_Operation`, allowing for deriving of a new column based on other columns. - Added `Column_Ref` type to refer to a column in `filter`.	2023-10-13 16:03:28 +00:00
James Dunkerley	0dcfc3e9bf	Minor improvements from last couple of Book Clubs (#8034 ) - Added some ALIASes. - Added `sheet` to `Excel_Workbook` to give familiar API to read sheet. - Added conversion from range to vector allowing easy use with Zip. - Add `Map.from_keys_and_values.	2023-10-12 14:29:59 +00:00
Radosław Waśko	cd84ac16ce	Restructure `Table.from_objects` to use conversions (#8020 ) Closes #7957	2023-10-11 22:25:18 +00:00
Radosław Waśko	6e0bd86753	Implement `Table.lookup_and_replace` for in-memory (#7979 ) - Closes #7749 implementing the in-memory logic. - Additional complications have surfaced regarding the Database logic, so it has been split off into a separate ticket: #7981	2023-10-10 10:42:06 +00:00
Radosław Waśko	0cd446432f	Fix inconsistency when building a Mixed column, fixes to Union (#7919 ) - Fixes #7352 by remembering original value types in type inference mode to be able to reconstruct them for Mixed. - Added more benchmarks for comparing performance of constructing columns. - Fixes missing implementations that caused `Table.union` crashing on some type pairs. - Ensures that `Loss_Of_Integer_Precision` warning is not swallowed when numeric columns are unioned to create a `Float` column. - Adds test for all of the above cases. - Allow to output benchmark results to a CSV by setting an environment variable - useful for quickly comparing benchmarks, e.g. in Enso.	2023-10-03 20:33:34 +02:00
Radosław Waśko	08cd449a99	Fix `NumberParser` to avoid `thousandSeparator==decimalPoint` and prefer US decimal format (#7946 ) Closes #7930	2023-10-03 20:07:54 +02:00
Radosław Waśko	3222e5af62	Avoid exponential growth of column names (#7934 ) - Fixes #7933 - Avoids duplicating `]` as `]]` in generated column names - now column names grow linearly.	2023-10-02 16:05:28 +00:00
James Dunkerley	fb50eb7595	Using conversions in a few places (#7859 ) - Shuffle a few `from`s into correct places: - `Day_Of_Week.from` removing `Day_Of_Week_From` module. - Adding short cut for `http` and `https` in `Data.read` so it calls onto `Data.fetch` giving a single entry point. - Moved `URI` extensions from `Standard.Base.Data` module into `Standard.Base.Network.Extensions`. - Added `post` extension for `URI`. - Added `contains_key` to `JS_Object`. - Restored `into` in `JS_Object`: - Follows old logic populating a constructor. - Will use conversion from `JS_Object` if present. - Added automatic deserialization of `Date`, `Time_Of_Day` and `Date_Time` from JSON. - Uses conversion from `JS_Object`. - Added conversion from `Text` to a `HTTP_Method` and type checking where `HTTP_Method` used in public APIs. - Added support for `Date`, `Time_Of_Day` and `Date_Time` in `Table.from_objects`. - Added `expand_column` to `Table` to expand `JS_Object` to values. - Add type checking for `Table` in `right` arguments (allowing `Column`s to be used). - Use type checking in `Table.set` to allow for conversion to a `Column`. - Remove some unused imports. - Fix for bug in S3 edge case.	2023-10-02 14:54:22 +00:00
Radosław Waśko	8d926166ea	Follow up improvements to `Date_Time_Formatter` (#7875 ) - Closes #7872 - Also closes #7866	2023-09-28 09:38:00 +00:00
Radosław Waśko	c690559ec4	Implement `auto_value_type` operation (#7908 ) Closes #6113	2023-09-27 15:45:34 +00:00
Hubert Plociniczak	18b2491a41	Always log to console and file (#7825 ) * Always log verbose to a file The change adds an option by default to always log to a file with verbose log level. The implementation is a bit tricky because in the most common use-case we have to always log in verbose mode to a socket and only later apply the desired log levels. Previously socket appender would respect the desired log level already before forwarding the log. If by default we log to a file, verbose mode is simply ignored and does not override user settings. To test run `project-manager` with `ENSO_LOGSERVER_APPENDER=console` env variable. That will output to the console with the default `INFO` level and `TRACE` log level for the file. * add docs * changelog * Address some PR requests 1. Log INFO level to CONSOLE by default 2. Change runner's default log level from ERROR to WARN Took a while to figure out why the correct log level wasn't being passed to the language server, therefore ignoring the (desired) verbose logs from the log file. * linter * 3rd party uses log4j for logging Getting rid of the warning by adding a log4j over slf4j bridge: ``` ERROR StatusLogger Log4j2 could not find a logging implementation. Please add log4j-core to the classpath. Using SimpleLogger to log to the console... ``` * legal review update * Make sure tests use test resources Having `application.conf` in `src/main/resources` and `test/resources` does not guarantee that in Tests we will pick up the latter. Instead, by default it seems to do some kind of merge of different configurations, which is far from desired. * Ensure native launcher test log to console only Logging to console and (temporary) files is problematic for Windows. The CI also revealed a problem with the native configuration because it was not possible to modify the launcher via env variables as everything was initialized during build time. * Adapt to method changes * Potentially deal with Windows failures	2023-09-26 11:32:04 +02:00
Radosław Waśko	12c4f2981d	More robust Date/Time format patterns parsing (#7826 ) - Closes #7461 by introducing a `Date_Time_Formatter` type and making parsing date time formats more robust and safer. - The default ('simple') set of patterns is slightly simplified and made case insensitive (except for `M/m` and `H/h`) to avoid the `YYYY` vs `yyyy` issues and make it less error prone. - The `YYYY` now has the same meaning as `yyyy` in simple mode. The old meaning (week-based year) is moved to a _separate mode_, triggered by `Date_Time_Formatter.from_iso_week_date_pattern`. - Full Java syntax, as well as custom-built Java `DateTimeFormatter` can also be used by `Date_Time_Formatter.from_java`. - Text-based constants (e.g. `ISO_ZONED_DATE_TIME`) have now become methods on `Date_Time_Formatter`, e.g. `Date_Time_Formatter.iso_zoned_date_time`).	2023-09-22 10:12:18 +00:00
James Dunkerley	74d1d0861c	S3 Read Access, Input Stream based reading (#7776 ) - Added a `FileSystemSPI` allowing protocol resolution to a target type. - Separated `Input_Stream` and `Output_Stream` from `File` to allow use in other spaces. - `File_Format` types `read_web` changed to be `read_stream` working with `InputStream`. - Added directory listing to `Auto_Detect` allowing for `Data.read` to list a folder. - Adjusted HTTP to return an `InputStream` not a `byte[]`: - `Response_Body` adjusted to wrap an `InputStream`. - Added ability to materialize to either and in-memory vector (<4KB) or a temporary file. - `Data.fetch` will materialize if not a recognized mime-type. - Added `HTTP_Error` to handle IO exceptions from the stream. - `Excel_Format` now supports mime-type and reading a stream. - `Excel_Workbook` can now get a `Excel_Section` using `read_section`. - Added S3 APIs: - `parse_uri`: splits an S3 URI into bucket and key. - `list_objects`: list the items in a S3 bucket with specified prefix. - `read_bucket`: list prefixes and keys with a delimiter in a S3 bucket with specified prefix. - `head`: either head_bucket (tests existance) or head_object API (reads object meta data). - `get_object`: gets an object from S3 returning as a `Response_Body`. - Added `S3_File` type acting like a `File`: - No support for writing in this PR. - ToDo: recursive listing, glob filtering, exists, size. - Fixed a few invalid type signature line. - Moved `create` methods for `Postgres_Connection` and `SQLite_Connection` into type instead of module. - Renamed `Column_Fetcher.Builder` to `Column_Fetcher_Builder`. - Fixed bug with `select_into` in Dry Run mode creating permanent tables. ToDo: Unit tests.	2023-09-20 15:09:11 +00:00
Hubert Plociniczak	1ee3d8f4f0	Rename Decimal to Float (#7807 ) Implements #6889.	2023-09-14 15:01:30 +00:00
Radosław Waśko	8b6e70b155	Support for BigInteger values in Table (#7715 ) - Fixes #7354 - And also closes #7712 - Refactors how we handle numeric ops - ensuring that the 'kernels' are placed all in one place and selected based on storage types.	2023-09-12 13:18:04 +00:00
James Dunkerley	f0ae9bf9c5	Fixes issue writing to a dry run Excel File (#7763 ) - Adds `size` to `File`. - If file is empty, then create a new Excel file. - Fixes dry run Excel write issue.	2023-09-08 08:52:00 +00:00
GregoryTravis	061876e640	Add simple parts of Table.take and Table.drop functions to Database table (#7615 ) Implements database Table and Column take/drop, except While and Sample. Additional features and optimizations are in https://github.com/enso-org/enso/issues/7614.	2023-08-31 18:52:02 +00:00
Radosław Waśko	255b424b72	Add `value_type` to `Column.from_vector` and `expected_value_type` to `Column.map` and `Column.zip` (#7637 ) - Closes #6111 - Aligns semantics of handling Mixed columns. - Now, if an operation like `iif` or `fill_nothing` is given a `Mixed` column, the result will also be `Mixed` regardless of the `inferred_precise_value_type`. - Enables a few old tests that were pending but could be enabled since the types work is advanced enough.	2023-08-31 13:20:49 +00:00
James Dunkerley	7d83b3d7b4	Add GROUP to functions (#7622 ) - Update list of groups to agreed list. - Lower case `ALIAS` names to be consistent with function names. - Add `GROUP` to methods. - All constructors and functions have doc comments. - Correct a few typos (e.g. `PRVIATE`). - Mark some more things as `PRIVATE`. - Use `ToDo:` and `Note:` consistently. - Order tags in doc comment. # Important Notes We don't have all the doc comments on types and will want to add them in future,	2023-08-23 13:20:38 +00:00
Radosław Waśko	2385f5b357	Add size-limited strings and varying bit-width integer Value_Types to in-memory backend and check for ArithmeticOverflow in LongStorage (#7557 ) - Closes #5159 - Now data downloaded from the database can keep the type much closer to the original type (like string length limits or smaller integer types). - Cast also exposes these types. - The integers are still all stored as 64-bit Java `long`s, we just check their bounds. Changing underlying storage for memory efficiency may come in the future: #6109 - Fixes #7565 - Fixes #7529 by checking for arithmetic overflow in in-memory integer arithmetic operations that could overflow. Adds a documentation note saying that the behaviour for Database backends is unspecified and depends on particular database.	2023-08-22 18:10:46 +00:00
GregoryTravis	c9d7c5cb2b	Convert in-memory Column.round to Java (#7521 )	2023-08-16 14:45:23 +00:00
James Dunkerley	296c95d414	Fix for empty column on replace and out of memory catching for join and tab (#7593 ) - Added a Panic.catch to catch heap memory error in joins and cross_tab. - Adjusted column replace so type is correct.	2023-08-15 17:06:51 +00:00
Radosław Waśko	8541a9e1ac	Improve generation of long operation in presence of column name length limit (#7556 ) I planned to do this as part of #7428, but I forgot. Making up for that now.	2023-08-14 16:58:36 +00:00
Radosław Waśko	b656b336c7	Report `Loss_Of_Integer_Precision` when an integer is not exactly representable as a float during conversion (#7509 ) Closes #7353 I introduce a new type `WithAggregatedProblems`, because `WithProblems` was too simple - it only allowed to hold a `List<Problem>` but `AggregatedProblems` is more than that. Ideally we shouldn't multiply entities like this too much. We should probably unify all to use `WithAggregatedProblems` - but after starting this, I realised it will likely just take too much effort to do for this little PR. So instead, I created a follow-up task for this: #7514	2023-08-08 12:30:44 +00:00
GregoryTravis	758b3b31b9	Avoid indexing the table twice for Cross Tab (#7417 ) Rewrites MultiValueIndex.makeCrossTabTable to build only a single index.	2023-08-04 21:14:18 +00:00
Radosław Waśko	bc9cde6543	Fix column naming edge cases - invalid and duplicated columns, case-insensitive name aliasing for case-insensitive backends (#7495 ) - Fixes #7412 - Also adds tests and fixes some more edge cases: - Ensures correct handling of existing Database tables whose column names may be invalid from Enso perspective, or clashing from Enso perspective (e.g. for most DBs `ś` and `s\u0301` are different names, but for Enso they are basically the same so this would cause issues - thus Enso now renames such columns when accessed (still using the correct column reference in the generated SQL under the hood).	2023-08-04 09:04:38 +00:00

1 2 3 4 5 ...

292 Commits