enso-org/enso

mirror of https://github.com/enso-org/enso.git synced 2024-12-26 06:02:03 +03:00

Author	SHA1	Message	Date
James Dunkerley	0b6db5797c	Refactor OrderMask to avoid memory copying (#8863 ) The goal of this PR is to refactor the design of OrderMask and avoid copying arrays or lists wherever possible. We have removed a few legacy functions which were not being used. On a poor man's benchmark it seems to be quicker (13s vs 16s), and memory usage should be lower.	2024-01-26 11:16:16 +00:00
GregoryTravis	5eb3f3bd1d	Implement relational NULL semantics for Nothing for in-memory Column operations (#8816 ) Updates in-memory table column operations to treat Nothing as a relational NULL. This PR does not include changes to Table.join.	2024-01-24 17:02:45 +00:00
James Dunkerley	d55c9c99b4	First few changes from the Churn workflow (#8782 ) - `up_to` should have the `step` as an optional argument. - `Date_Range` conversion can do a clever auto rename, so if a period use the name of it. - Add `to Table` for `Range`, `Date_Range`.	2024-01-17 21:57:24 +00:00
AdRiley	ef7b11fb67	Add new aliases for cross_join, info and drop (#8773 ) Adds new aliases append -> cross_join metadata -> info skip -> drop	2024-01-16 16:08:54 +00:00
GregoryTravis	f2cb1f097e	Support on_problems=Problem_Behavior.Report_Warning and Map_Error wrapping in Vector.map (#8595 ) Implements `Warnings.get_all wrap_errors=True` which wraps warnings attached to values inside vectors with `Map_Error`, which includes the position of the value within the vector. See [the documentation](https://github.com/enso-org/enso/blob/develop/docs/semantics/wrapped-errors.md) for more details. `get_all wrap_errors=True` does not change the warnings that are attached to values -- it wraps them before returning them to the caller, but does not change the original warnings attached to the values. Wrapped warnings only appear attached to the vector itself. The values inside the vector do not have their warnings wrapped. Warning propagation is not changed at all; `Warnings.get_all` (with default `wrap_errors=False`) behaves as before. `get_all wrap_errors=True` is meant to be used primarily by the IDE, although it can be used anywhere this wrapping is desired.	2024-01-16 09:36:22 +00:00
AdRiley	b8e93b3cba	Add new text_left and text_right functions (#8691 ) Added text_left and text_right functions for in-memory and databases	2024-01-15 23:43:23 +00:00
AdRiley	1b3c9638ea	Make fill nothing return types tighter (#8734 ) This is the follow-up PR addressing the last couple of points from https://github.com/enso-org/enso/pull/8643 around the return type of fill_nothing. # Important Notes The biggest change is to the size we need for an empty string. This change makes it a variable-length string of length 1 and does it at a low enough level that it will affect the whole language. But I think that is correct.	2024-01-12 11:20:36 +00:00
Pavel Marek	428e83de36	Remove org.bouncycastle dependency (#8664 ) Remove `org.bouncycastle` dependency from `org.enso.runtime`.	2024-01-04 17:16:41 +01:00
AdRiley	bf8dd1888c	Give file read its own helper widget for delimiters. (#8627 ) Give file read its own helper widget for delimiters. Remove newline add none. The file read delimiter is similar but different to the split one and so should have its own set of options.	2024-01-04 11:59:42 +00:00
Radosław Waśko	a1207e029d	Unify `File_Format_Metadata` with `File_For_Read` (#8628 ) - Closes #8555 - Refactors the file format detection logic, compacting lots of repetitive logic for HTTP handling into helper functions. - Some updates to CODEOWNERS.	2024-01-04 03:57:05 +00:00
AdRiley	689c8f7c3c	Make split to rows of Nothing value equal Nothing. (#8640 ) Split to rows of Nothing value should equal Nothing. Add some additional test cases. And updated existing to help readability	2024-01-03 12:09:35 +00:00
AdRiley	ec51127635	Change null to Nothing (#8637 ) Change the generated column name for is_nothing to "[a] is Nothing" from "[a] is null" as Nothing is our customer facing term.	2023-12-28 18:02:23 +00:00
Jaroslav Tulach	07d58f2c02	DataflowError.withoutTrace shall not store a trace (#8608 )	2023-12-24 11:07:32 +01:00
AdRiley	cfe0cbe0c1	Add text_length to column for in-memory and database (#8606 ) Closes #8521 Adds text_length to Column	2023-12-21 11:31:13 +00:00
James Dunkerley	2e9bd86854	Small linting fixes. (#8592 )	2023-12-20 17:25:43 +00:00
Cassandra-Clark	232077f25e	Renamed lookup_and_replace to merge and renamed Table.replace to text… (#8564 )	2023-12-20 16:28:45 +00:00
Radosław Waśko	d4714af826	Add a few new `Filter_Condition`s (#8539 ) - Closes #8045	2023-12-16 15:12:23 +00:00
Radosław Waśko	9428d12a1e	Fixes `Date_Diff` widget (#8561 ) The widget for `Date_Diff` was using wrong old name and thus did not work properly on one of the arguments. Before it worked for second argument (`end`) but did not work for `input`: ![image](https://github.com/enso-org/enso/assets/1436948/ef7556db-9518-4854-b7b9-d423f1e6421b) ![image](https://github.com/enso-org/enso/assets/1436948/670a757b-fbed-4fe6-bdf6-13aa46d81aac) Afterwards it works there too: ![image](https://github.com/enso-org/enso/assets/1436948/5afb88fb-55a0-48bf-9300-8604c672bde3)	2023-12-16 15:04:35 +00:00
James Dunkerley	9e27b6487b	Minor fixes and tweak for Cloud APIs. (#8557 ) - Fix secret to at least be working again - Tweak to allow a MIMIC flow to work with value types (revisit in 2024).	2023-12-15 17:10:07 +00:00
Radosław Waśko	b5c995a7bf	Reworking Excel support to allow for reading of big files (#8403 ) - Closes #8111 by making sure that all Excel workbooks are read using a backing file (which should be more memory efficient). - If the workbook is being opened from an input stream, that stream is materialized to a `Temporary_File`. - Adds tests fetching Table formats from HTTP. - Extends `simple-httpbin` with ability to serve files for our tests. - Ensures that the `Infer` option on `Excel` format also works with streams, if content-type metadata is available (e.g. from HTTP headers). - Implements a `Temporary_File` facility that can be used to create a temporary file that is deleted once all references to the `Temporary_File` instance are GCed.	2023-12-15 00:02:15 +00:00
Radosław Waśko	7a05e679c3	Improve details attached to `No_Output_Columns` reported from various operations (#8528 ) - Closes #7635	2023-12-14 10:49:07 +00:00
GregoryTravis	1c815a3d45	Better Error Trapping in map (#8307 ) * tests * wip * wip * additional warnings * wip * wip * cleanup * nested wrapping * multiple nestings * wraps_error uses looks_for, test for should_fail_with * wip * stack trace line fix * use catch_primitive internally * fix warning mapping, dtf spec * just one wrapper checker, vector spec * missing ctor, back to non-primitive catch * back to c_p * put old map back * wip * unnest tests * Array.map on_problems * wip * Revert "wip" This reverts commit `c30d171457`. * better test names * warning logging * wip * wip * move logic into ALH * doc * constant * My_Error.Error * nested * doc * map_primtiive in warning mapper * composition * ref spec * Remove warnings prior to matching on the value If an expression has warnings and is matched we: 1) extract the warnings 2) execute the branch of a pattern that matches the value 3) attach extracted warnings to the result This caused warnings to reappear when doing the custom warnings manipulation. This is also consistent with how `CaseNode`'s `doWarning` specialization is defined. 
* fix 1 * do not auto unwrap in test error checkers * nested error matcher * in problems too * dtf * v * statistics * wip * Table_Spec, map_with_index_primitive * Column_Operations_Spec * disable warning wrapping and Report_Warning * unimpl test * Warnings_Spec * DCS * ACG JP * zip_primitive * join_helpers * Lookup_Helpers * Table * Data_Formatter * Value_Type_Helpers * revert check types changes * table_helpers * table tests * remove st * do not remove warnings from value * vec docs, tests for zip, mwi, flat_map * docs, fixes * remove nested_error_matcher * cleanup * benchmark * one error * alter * add bench to main * review * review * review * tail call * changelog * tail call was not a tail call * ws * bad import * Added missing import * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Array.enso Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * review, ref example * lazy benchmark data * extra paren * check outside of catch * review * vector too * actually lazy * disambiguate Map_Error * finish rename * move to extensions * combine Additional_Warnings error * rename to map_no_wrap * do not catch and rethrow * review * wip * remove _primitives entirely * remove unused should_fail_with function options * remove expected_warning as function in Problems --------- Co-authored-by: Hubert Plociniczak <hubert.plociniczak@gmail.com> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2023-12-13 09:38:09 -05:00
Jaroslav Tulach	7f0cb88fa1	Consistent simple and qualified type name (#8448 ) Fixes #8255 by unifying `get_qualified_type_name` and `get_simple_type_name` implementations.	2023-12-06 04:30:24 +00:00
Radosław Waśko	c6b6384fe6	Improve performance of anti-join (#8338 ) - Closes #8217	2023-11-24 02:44:57 +00:00
James Dunkerley	f60836d9e1	Apply ICONs (#8360 ) - Amend a couple of missed groups. - Add the first pass of some ICONs. The linter tool has been updated to support rewriting the ICON as well.	2023-11-22 15:24:16 +00:00
James Dunkerley	347b5a7cf5	Linting and Groups update (#8357 ) - Fix issues from the linter. - Rename the constructors for `Blank_Selector`. - Update various GROUP tags.	2023-11-21 18:12:27 +00:00
James Dunkerley	ecaca12df1	Integrating Enso Cloud with the libraries (part 1...) (#8006 ) - Add a `File_For_Read` type. Used for `File_Format` to read files. - Added `Enso_User` representing the current user in `Enso_Cloud`. - Will be later able to list known users. - Added `Enso_Secret` representing a value defined in `Enso_Cloud`. - Value not used within Enso only accessed within polyglot Java. - Integrated into `Username_And_Password` and can be used within JDBC connections. - Integrated into HTTP Headers so a secret can be used as a value. - New `URI_With_Query` with the same API as `URI`. Supporting secrets in the value. - Will be integrated with AWS credentials. - Added `Enso_File` representing a file or a folder in the cloud. - Support the same API as `File` (like the `S3_File`). - Will support `enso://` URI style access.	2023-11-20 23:21:14 +00:00
Jaroslav Tulach	ba19813511	Speeding up "hello world" example by 16%	2023-11-19 16:38:31 +01:00
Pavel Marek	5a7ad6bfe4	Upgrade enso to GraalVM for jdk 21 (#7991 ) Upgrade to GraalVM JDK 21. ``` > java -version openjdk version "21" 2023-09-19 OpenJDK Runtime Environment GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15) OpenJDK 64-Bit Server VM GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15, mixed mode, sharing) ``` With SDKMan, download with `sdk install java 21-graalce`. # Important Notes - After this PR, one can theoretically run enso with any JRE with version at least 21. - Removed `sbt bootstrap` hack and all the other build time related hacks related to the handling of GraalVM distribution. - `project-manager` remains backward compatible - it can open older engines with runtimes. New engines now do no longer require a separate runtime to be downloaded. - sbt does not support compilation of `module-info.java` files in mixed projects - https://github.com/sbt/sbt/issues/3368 - Which means that we can have `module-info.java` files only for Java-only projects. - Anyway, we need just a single `module-info.class` in the resulting `runtime.jar` fat jar. - `runtime.jar` is assembled in `runtime-with-instruments` with a custom merge strategy (`sbt-assembly` plugin). Caching is disabled for custom merge strategies, which means that re-assembly of `runtime.jar` will be more frequent. - Engine distribution contains multiple JAR archives (modules) in `component` directory, along with `runner/runner.jar` that is hidden inside a nested directory. - The new entry point to the engine runner is [EngineRunnerBootLoader](https://github.com/enso-org/enso/pull/7991/files#diff-9ab172d0566c18456472aeb95c4345f47e2db3965e77e29c11694d3a9333a2aa) that contains a custom ClassLoader - to make sure that everything that does not have to be loaded from a module is loaded from `runner.jar`, which is not a module. 
- The new command line for launching the engine runner is in [distribution/bin/enso](https://github.com/enso-org/enso/pull/7991/files#diff-0b66983403b2c329febc7381cd23d45871d4d555ce98dd040d4d1e879c8f3725) - [Newest version of Frgaal](https://repo1.maven.org/maven2/org/frgaal/compiler/20.0.1/) (20.0.1) does not recognize `--source 21` option, only `--source 20`.	2023-11-17 18:02:36 +00:00
GregoryTravis	ea3d778456	Allow the creation of a constant column on an in-memory table with no rows. (#8218 )	2023-11-09 14:40:51 +00:00
GregoryTravis	6be94a854b	Implement truncate Date_Time for database backend (#8235 ) Also adds some checks for column names generated for floor, ceil, truncate, round.	2023-11-08 23:23:59 +00:00
Radosław Waśko	1b8b30a68d	Improve performance of `Join_Condition.Between` by sorting on one dimension (#8212 ) - Closes #5303 - Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`) - Adds benchmarks for join - Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks. - We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the number of `Between` conditions used in the query and `k` is the result set size). - Follow up ticket for consideration later: #8216 - Closes #8215 - After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task. - Closes #8204 - Unrelated, but I ran into this here: adds type checks to other arguments of `set`. - Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments) led to a hard-to-understand "Method `if_then_else` of type Column could not be found."; now it fails instead with the type error "expected Text got Column".	2023-11-08 12:59:55 +00:00
Radosław Waśko	2ce1567384	Limit `max_rows` that are downloaded in `Table.read` by default, and warn if more rows are available (#8159 ) - Sets the default limit for `Table.read` in Database to be max 1000 rows. - The limit for in-memory compatible API still defaults to `Nothing`. - Adds a warning if there are more rows than limit. - Enables a few unrelated asserts.	2023-11-06 16:41:47 +00:00
James Dunkerley	a850ecb787	Change widget for Order By. (#8226 ) Simplifies the Order By dropdown and always adds the entry as a `Sort_Column`. If the user wants descending order it is a two-step process, but it feels more natural.	2023-11-06 16:27:36 +00:00
Radosław Waśko	237aae33c7	Simplify internal logic of `Table.order_by`, avoid unnecessary warning (#8221 ) - Fixes #8213	2023-11-06 11:00:01 +00:00
GregoryTravis	3c371adbef	Implement Table.format similar to Table.parse allowing to format columns in bulk (#8150 ) * doc * one test * date tests * empty and nothing * ints floats * bools * all columns * regex and index * locales * bad formats * all with one format * docs * examples, not impl db * docs, more errors * cleanup * changelog * check list * reorder * clue * review * review * review * review * review * review * specify time zone	2023-11-02 09:36:36 -04:00
Cassandra-Clark	b5d6628c57	Change filter_blank_rows when_any parameter to have a more user-friendly type (#7935 ) Added Blank_Selector constructor and applied to remove_blank_columns, select_blank_columns, filter_blank_rows for #7931 . Changed when_any to when for readability.	2023-11-01 16:51:15 +00:00
GregoryTravis	d467683ed1	Constant columns (in expressions and Column_Operations) should have clearer names (#8188 ) Previously, constant columns were given generated names with UUIDs in them, which are long and provide no information. Instead, we now use the constant value itself to form the name. Since these new generated names are less unique, we must explicitly make them unique, in cases where the caller did not explicitly set a name.	2023-11-01 14:41:03 +00:00
Radosław Waśko	79011bd550	Implement `Table.lookup_and_replace` in Database (#8146 ) - Closes #7981 - Adds a `RUNTIME_ERROR` operation into the DB dialect, that may be used to 'crash' a query if a condition is met - used to validate if `lookup_and_replace` invariants are still satisfied when the query is materialized. - Removes old `Table_Helpers.is_table` and `same_backend` checks, in favour of the new way of checking this that relies on `Table.from` conversions, and is much simpler to use and also more robust.	2023-10-31 15:19:55 +00:00
Radosław Waśko	0c278391fe	Test and improve handling of `Date_Time with_timezone=False` in Postgres (#8114 ) - Fixes #8049 - Adds tests for handling of Date_Time upload/download in Postgres. - Adds tests for edge cases of handling of Decimal and Binary types in Postgres.	2023-10-21 21:35:13 +00:00
Radosław Waśko	8172896065	Support `Previous_Value` in `fill_nothing` and `fill_missing` (#8105 ) - Adds `Previous_Value` to `fill_nothing` and `fill_empty`, as requested by #7192.	2023-10-20 13:18:53 +00:00
GregoryTravis	7383db0e04	Restructuring XML into Table form (#8083 ) # Important Notes Adds `.to Table` support, as well as XML support for `expand_column`.	2023-10-19 07:02:48 +00:00
Radosław Waśko	28fc183f92	Review places where we can use `Column_Ref` (#8101 ) Closes #8046	2023-10-18 19:03:50 +00:00
Radosław Waśko	e9fa12763e	Improve performance of `add_row_number` (#8076 ) Fixes #8055	2023-10-17 00:42:35 +00:00
Radosław Waśko	08b717eb54	Refactor Table problem handling to a more robust and hopefully cleaner approach (#7879 ) Closes #7514	2023-10-16 15:09:08 +00:00
GregoryTravis	f18d1323e1	Add Table.expand_to_rows to allow flattening vector and array values in table (#8042 ) # Important Notes Also includes a fix for a reallocation bug in `InferredBuilder`.	2023-10-13 20:54:06 +00:00
James Dunkerley	fac9e7a420	Expand capabilities of `Table.set` and better dropdown support, (#8005 ) - Adds the ability to use numbers, date/time and Boolean values as constants in `set`. - `Table.set` can take a `Column_Operation`, allowing for deriving of a new column based on other columns. - Added `Column_Ref` type to refer to a column in `filter`.	2023-10-13 16:03:28 +00:00
James Dunkerley	0dcfc3e9bf	Minor improvements from last couple of Book Clubs (#8034 ) - Added some ALIASes. - Added `sheet` to `Excel_Workbook` to give familiar API to read sheet. - Added conversion from range to vector allowing easy use with Zip. - Add `Map.from_keys_and_values`.	2023-10-12 14:29:59 +00:00
Radosław Waśko	cd84ac16ce	Restructure `Table.from_objects` to use conversions (#8020 ) Closes #7957	2023-10-11 22:25:18 +00:00
Radosław Waśko	6e0bd86753	Implement `Table.lookup_and_replace` for in-memory (#7979 ) - Closes #7749 implementing the in-memory logic. - Additional complications have surfaced regarding the Database logic, so it has been split off into a separate ticket: #7981	2023-10-10 10:42:06 +00:00

314 Commits