enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-26 16:52:16 +03:00

Author	SHA1	Message	Date
James Dunkerley	0c39f8ec04	Allow Filter_Condition to be inverted. (#8861 ) - Various linting fixes (doc comments and type annotations etc.). - Add an action to determine if a `Filter_Condition` is keep or remove. https://github.com/enso-org/enso/assets/4699705/69ba2bd3-8893-4237-acc4-eb01f534a209 - Remove `Not_In`, `Not_Contains` and `Not_Like` from `Filter_Condition`. - Ability to use an `Expression` as a `Column_Ref`. https://github.com/enso-org/enso/assets/4699705/16a2e030-f8f9-4f59-beca-2646f56fcb90	2024-02-07 14:36:14 +00:00
GregoryTravis	6554972b7d	Implement Table.replace for the in-memory backend (#8935 )	2024-02-06 20:57:50 +00:00
James Dunkerley	eeaddbc434	Add parser for line by line processing (#8719 ) - ✅Linting fixes and groups. - ✅Add `File.from that:Text` and use `File` conversions instead of taking both `File` and `Text` and calling `File.new`. - ✅Align Unix Epoc with the UTC timezone and add converting from long value to `Date_Time` using it. - ❌Add simple first logging API allowing writing to log messages from Enso. - ✅Fix minor style issue where a test type had a empty constructor. - ❌Added a `long` based array builder. - Added `File_By_Line` to read a file line by line. - Added "fast" JSON parser based off Jackson. - ✅Altered range `to_vector` to be a proxy Vector. - ✅Added `at` and `get` to `Database.Column`. - ✅Added `get` to `Table.Column`. - ✅Added ability to expand `Vector`, `Array` `Range`, `Date_Range` to columns. - ✅Altered so `expand_to_column` default column name will be the same as the input column (i.e. no `Value` suffix). - ✅Added ability to expand `Map`, `JS_Object` and `Jackson_Object` to rows with two columns coming out (and extra key column). - ✅ Fixed bug where couldn't use integer index to expand to rows.	2024-02-01 07:29:50 +00:00
GregoryTravis	5eb3f3bd1d	Implement relational NULL semantics for Nothing for in-memory Column operations (#8816 ) Updates in-memory table column operations to treat Nothing as a relational NULL. This PR does not include changes to Table.join.	2024-01-24 17:02:45 +00:00
Radosław Waśko	edfcfde11c	Tests and improvements for secrets in cloud subdirectories (#8791 ) - Closes #8723 - Adds some missing features that were needed to make this work: - `Enso_File.create_directory` and `Enso_File.delete`, and basic tests for it - Changes how `Enso_Secret.list` is obtained - using a different Cloud endpoint allows us to implement the desired logic, the default endpoint was giving us _all_ secrets which was not what we wanted here. - Implements `Enso_Secret.update` and tests for it # Important Notes Notes describing any problems with the current Cloud API: https://docs.google.com/document/d/1x8RUt3KkwyhlxGux7XUGfOdtFSAZV3fI9lSSqQ3XsXk/edit Apparently, everything that was needed to make this feature work has already been implemented, although a few features needed workarounds on Enso side to work properly.	2024-01-24 10:17:22 +00:00
AdRiley	ef7b11fb67	Add new aliases for cross_join, info and drop (#8773 ) Adds new aliases append -> cross_join metadata -> info skip -> drop	2024-01-16 16:08:54 +00:00
GregoryTravis	f2cb1f097e	Support on_problems=Problem_Behavior.Report_Warning and Map_Error wrapping in Vector.map (#8595 ) Implements `Warnings.get_all wrap_errors=True` which wraps warnings attached to values inside vectors with `Map_Error`, which includes the position of the value within the vector. See [the documentation](https://github.com/enso-org/enso/blob/develop/docs/semantics/wrapped-errors.md) for more details. `get_all wrap_errors=True` does not change the warnings that are attached to values -- it wraps them before returning them to the caller, but does not change the original warnings attached to the values. Wrapped warnings only appear attached to the vector itself. The values inside the vector do not have their warnings wrapped. Warning propagation is not changed at all; `Warnings.get_all` (with default `wrap_errors=False`) behaves as before. `get_all wrap_errors=True` is meant to be used primarily by the IDE, although it can be used anywhere this wrapping is desired.	2024-01-16 09:36:22 +00:00
AdRiley	b8e93b3cba	Add new text_left and text_right functions (#8691 ) Added text_left and text_right functions for in-memory and databases	2024-01-15 23:43:23 +00:00
Radosław Waśko	f34abeda0c	Add tests for `Enso_Secret`s, update to new cloud API (#8736 ) - Closes #8556	2024-01-15 16:12:08 +00:00
Pavel Marek	428e83de36	Remove org.bouncycastle dependency (#8664 ) Remove `org.bouncycastle` dependency from `org.enso.runtime`.	2024-01-04 17:16:41 +01:00
Radosław Waśko	a1207e029d	Unify `File_Format_Metadata` with `File_For_Read` (#8628 ) - Closes #8555 - Refactors the file format detection logic, compacting lots of repetitive logic for HTTP handling into helper functions. - Some updates to CODEOWNERS.	2024-01-04 03:57:05 +00:00
AdRiley	ec51127635	Change null to Nothing (#8637 ) Change the generated column name for is_nothing to "[a] is Nothing" from "[a] is null" as Nothing is our customer facing term.	2023-12-28 18:02:23 +00:00
Radosław Waśko	d41d48e8a0	Merge `URI_With_Query` into `URI`, extend API of `URI` (#8591 ) - Closes #8544 - Adds `reset_query_arguments` and `/` operators allowing to transform a URI. - Adding tests for handling of various edge cases.	2023-12-21 18:39:26 +00:00
AdRiley	cfe0cbe0c1	Add text_length to column for in-memory and database (#8606 ) Closes #8521 Adds text_length to Column	2023-12-21 11:31:13 +00:00
Cassandra-Clark	232077f25e	Renamed lookup_and_replace to merge and renamed Table.replace to text… (#8564 )	2023-12-20 16:28:45 +00:00
Radosław Waśko	d4714af826	Add a few new `Filter_Condition`s (#8539 ) - Closes #8045	2023-12-16 15:12:23 +00:00
Radosław Waśko	7a05e679c3	Improve details attached to `No_Output_Columns` reported from various operations (#8528 ) - Closes #7635	2023-12-14 10:49:07 +00:00
GregoryTravis	1c815a3d45	Better Error Trapping in map (#8307 ) * tests * wip * wip * additional warnings * wip * wip * cleanup * nested wrapping * multiple nestings * wraps_error uses looks_for, test for should_fail_with * wip * stack trace line fix * use catch_primitive internally * fix warning mapping, dtf spec * just one wrapper checker, vector spec * missing ctor, back to non-primitive catch * back to c_p * put old map back * wip * unnest tests * Array.map on_problems * wip * Revert "wip" This reverts commit `c30d171457`. * better test names * warning logging * wip * wip * move logic into ALH * doc * constant * My_Error.Error * nested * doc * map_primtiive in warning mapper * composition * ref spec * Remove warnings prior to matching on the value If an expression has warnings and is matched we: 1) extract the warnings 2) execute the branch of a pattern that matches the value 3) attach extracted warnings to the result This caused warnings to reappear when doing the custom warnings manipulation. This is also consistent with how `CaseNode`'s `doWarning` specialization is defined. * fix 1 * do not auto unwrap in test error checkers * nested error matcher * in problems too * dtf * v * statistics * wip * Table_Spec, map_with_index_primitive * Column_Operations_Spec * disable warning wrapping and Report_Warning * unimpl test * Warnings_Spec * DCS * ACG JP * zip_primitive * join_helpers * Lookup_Helpers * Table * Data_Formatter * Value_Type_Helpers * revert check types changes * table_helpers * table tests * remove st * do not remove warnings from value * vec docs, tests for zip, mwi, flat_map * docs, fixes * remove nested_error_matcher * cleanup * benchmark * one error * alter * add bench to main * review * review * review * tail call * changelog * tail call was not a tail call * ws * bad import * Added missing import * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Array.enso Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * review, ref example * lazy benchmark data * extra paren * check outside of catch * review * vector too * actually lazy * disambiguate Map_Error * finish rename * move to extensions * combine Additional_Warnings error * rename to map_no_wrap * do not catch and rethrow * review * wip * remove _primitives entirely * remove unused should_fail_with function options * remove expected_warning as function in Problems --------- Co-authored-by: Hubert Plociniczak <hubert.plociniczak@gmail.com> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2023-12-13 09:38:09 -05:00
James Dunkerley	f60836d9e1	Apply ICONs (#8360 ) - Amend a couple of missed groups. - Add the first pass of some ICONs. The linter tool has been updated to support rewriting the ICON as well.	2023-11-22 15:24:16 +00:00
James Dunkerley	347b5a7cf5	Linting and Groups update (#8357 ) - Fix issues from the linter. - Rename the constructors for `Blank_Selector`. - Update various GROUP tags.	2023-11-21 18:12:27 +00:00
James Dunkerley	ecaca12df1	Integrating Enso Cloud with the libraries (part 1...) (#8006 ) - Add a `File_For_Read` type. Used for `File_Format` to read files. - Added `Enso_User` representing the current user in `Enso_Cloud`. - Will be later able to list known users. - Added `Enso_Secret` representing a value defined in `Enso_Cloud`. - Value not used within Enso only accessed within polyglot Java. - Integrated into `Username_And_Password` and can be used within JDBC connections. - Integrated into HTTP Headers so a secret can be used as a value. - New `URI_With_Query` with the same API as `URI`. Supporting secrets in the value. - Will be integrated with AWS credentials. - Added `Enso_File` representing a file or a folder in the cloud. - Support the same API as `File` (like the `S3_File`). - Will support `enso://` URI style access.	2023-11-20 23:21:14 +00:00
GregoryTravis	ea3d778456	Allow the creation of a constant column on an in-memory table with no rows. (#8218 )	2023-11-09 14:40:51 +00:00
GregoryTravis	6be94a854b	Implement truncate Date_Time for database backend (#8235 ) Also adds some checks for column names generated for floor, ceil, truncate, round.	2023-11-08 23:23:59 +00:00
Radosław Waśko	1b8b30a68d	Improve performance of `Join_Condition.Between` by sorting on one dimension (#8212 ) - Closes #5303 - Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`) - Adds benchmarks for join - Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks. - We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the amount of `Between` conditions used in the query and `k` is the result set size). - Follow up ticket for consideration later: #8216 - Closes #8215 - After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task. - Closes #8204 - Unrelated, but I ran into this here: adds type checks to other arguments of `set`. - Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments), lead to a hard to understand `Method `if_then_else` of type Column could not be found.`, instead now it would file with type error 'expected Text got Column`.	2023-11-08 12:59:55 +00:00
Radosław Waśko	2ce1567384	Limit `max_rows` that are downloaded in `Table.read` by default, and warn if more rows are available (#8159 ) - Sets the default limit for `Table.read` in Database to be max 1000 rows. - The limit for in-memory compatible API still defaults to `Nothing`. - Adds a warning if there are more rows than limit. - Enables a few unrelated asserts.	2023-11-06 16:41:47 +00:00
GregoryTravis	3c371adbef	Implement Table.format similar to Table.parse allowing to format columns in bulk (#8150 ) * doc * one test * date tests * empty and nothing * ints floats * bools * all columns * regex and index * locales * bad formats * all with one format * docs * examples, not impl db * docs, more errors * cleanup * changelog * check list * reorder * clue * review * review * review * review * review * review * specify time zone	2023-11-02 09:36:36 -04:00
Cassandra-Clark	b5d6628c57	Change filter_blank_rows when_any parameter to have a more user-friendly type (#7935 ) Added Blank_Selector constructor and applied to remove_blank_columns, select_blank_columns, filter_blank_rows for #7931 . Changed when_any to when for readability.	2023-11-01 16:51:15 +00:00
GregoryTravis	d467683ed1	Constant columns (in expressions and Column_Operations) should have clearer names (#8188 ) Previously, constant columns were given generated names with UUIDs in them, which are long and provide no information. Instead, we now use the constant value itself to form the name. Since these new generated names are less unique, we must explicitly make them unique, in cases where the caller did not explicilty set a name.	2023-11-01 14:41:03 +00:00
GregoryTravis	1480f50207	Overhaul the random number and item generation code (#8127 ) Rewrite most of Random.enso.	2023-10-31 15:25:37 +00:00
Radosław Waśko	79011bd550	Implement `Table.lookup_and_replace` in Database (#8146 ) - Closes #7981 - Adds a `RUNTIME_ERROR` operation into the DB dialect, that may be used to 'crash' a query if a condition is met - used to validate if `lookup_and_replace` invariants are still satisfied when the query is materialized. - Removes old `Table_Helpers.is_table` and `same_backend` checks, in favour of the new way of checking this that relies on `Table.from` conversions, and is much simpler to use and also more robust.	2023-10-31 15:19:55 +00:00
Radosław Waśko	0c278391fe	Test and improve handling of `Date_Time with_timezone=False` in Postgres (#8114 ) - Fixes #8049 - Adds tests for handling of Date_Time upload/download in Postgres. - Adds tests for edge cases of handling of Decimal and Binary types in Postgres.	2023-10-21 21:35:13 +00:00
Radosław Waśko	8172896065	Support `Previous_Value` in `fill_nothing` and `fill_missing` (#8105 ) - Adds `Previous_Value` to `fill_nothing` and `fill_empty`, as requested by #7192.	2023-10-20 13:18:53 +00:00
Radosław Waśko	28fc183f92	Review places where we can use `Column_Ref` (#8101 ) Closes #8046	2023-10-18 19:03:50 +00:00
Radosław Waśko	08b717eb54	Refactor Table problem handling to a more robust and hopefully cleaner approach (#7879 ) Closes #7514	2023-10-16 15:09:08 +00:00
GregoryTravis	f18d1323e1	Add Table.expand_to_rows to allow flattening vector and array values in table (#8042 ) # Important Notes Also includes a fix for a reallocation bug in `InferredBuilder`.	2023-10-13 20:54:06 +00:00
James Dunkerley	fac9e7a420	Expand capabilities of `Table.set` and better dropdown support, (#8005 ) - Adds the ability to use numbers, date/time and Boolean values as constants in `set`. - `Table.set` can take a `Column_Operation`, allowing for deriving of a new column based on other columns. - Added `Column_Ref` type to refer to a column in `filter`.	2023-10-13 16:03:28 +00:00
James Dunkerley	0dcfc3e9bf	Minor improvements from last couple of Book Clubs (#8034 ) - Added some ALIASes. - Added `sheet` to `Excel_Workbook` to give familiar API to read sheet. - Added conversion from range to vector allowing easy use with Zip. - Add `Map.from_keys_and_values.	2023-10-12 14:29:59 +00:00
Radosław Waśko	cd84ac16ce	Restructure `Table.from_objects` to use conversions (#8020 ) Closes #7957	2023-10-11 22:25:18 +00:00
Radosław Waśko	6f78570115	Fix a DROP table bug, add SQL debug logging (#8007 ) - Fixes a bug where creating a temporary table could accidentally issue a `DROP` statement of the table name that the user provided, risking destruction of user data. - Fortunately, the bad scenario was almost impossible, because the `DROP` statement was only issued _if_ we previously checked that the mentioned table _does not exist_ - dropping a nonexistent table does not do any harm. - It could have been dangerous in a very unlikely scenario that a table was created just between the _existence check_ and the _drop_. - After the fix the existence check and any modifications are done within a transaction to avoid interference from concurrent modifications, and the DROP is correctly applied to a temporary Enso table instead of the original one. - Replaced a temporary log with proper simple logging of SQL statements into a file, if an Environment variable is set. - Used that feature to test that no unexpected statements occur.	2023-10-10 13:16:06 +00:00
Radosław Waśko	6e0bd86753	Implement `Table.lookup_and_replace` for in-memory (#7979 ) - Closes #7749 implementing the in-memory logic. - Additional complications have surfaced regarding the Database logic, so it has been split off into a separate ticket: #7981	2023-10-10 10:42:06 +00:00
Radosław Waśko	3222e5af62	Avoid exponential growth of column names (#7934 ) - Fixes #7933 - Avoids duplicating `]` as `]]` in generated column names - now column names grow linearly.	2023-10-02 16:05:28 +00:00
James Dunkerley	fb50eb7595	Using conversions in a few places (#7859 ) - Shuffle a few `from`s into correct places: - `Day_Of_Week.from` removing `Day_Of_Week_From` module. - Adding short cut for `http` and `https` in `Data.read` so it calls onto `Data.fetch` giving a single entry point. - Moved `URI` extensions from `Standard.Base.Data` module into `Standard.Base.Network.Extensions`. - Added `post` extension for `URI`. - Added `contains_key` to `JS_Object`. - Restored `into` in `JS_Object`: - Follows old logic populating a constructor. - Will use conversion from `JS_Object` if present. - Added automatic deserialization of `Date`, `Time_Of_Day` and `Date_Time` from JSON. - Uses conversion from `JS_Object`. - Added conversion from `Text` to a `HTTP_Method` and type checking where `HTTP_Method` used in public APIs. - Added support for `Date`, `Time_Of_Day` and `Date_Time` in `Table.from_objects`. - Added `expand_column` to `Table` to expand `JS_Object` to values. - Add type checking for `Table` in `right` arguments (allowing `Column`s to be used). - Use type checking in `Table.set` to allow for conversion to a `Column`. - Remove some unused imports. - Fix for bug in S3 edge case.	2023-10-02 14:54:22 +00:00
Radosław Waśko	c690559ec4	Implement `auto_value_type` operation (#7908 ) Closes #6113	2023-09-27 15:45:34 +00:00
Radosław Waśko	12c4f2981d	More robust Date/Time format patterns parsing (#7826 ) - Closes #7461 by introducing a `Date_Time_Formatter` type and making parsing date time formats more robust and safer. - The default ('simple') set of patterns is slightly simplified and made case insensitive (except for `M/m` and `H/h`) to avoid the `YYYY` vs `yyyy` issues and make it less error prone. - The `YYYY` now has the same meaning as `yyyy` in simple mode. The old meaning (week-based year) is moved to a _separate mode_, triggered by `Date_Time_Formatter.from_iso_week_date_pattern`. - Full Java syntax, as well as custom-built Java `DateTimeFormatter` can also be used by `Date_Time_Formatter.from_java`. - Text-based constants (e.g. `ISO_ZONED_DATE_TIME`) have now become methods on `Date_Time_Formatter`, e.g. `Date_Time_Formatter.iso_zoned_date_time`).	2023-09-22 10:12:18 +00:00
James Dunkerley	74d1d0861c	S3 Read Access, Input Stream based reading (#7776 ) - Added a `FileSystemSPI` allowing protocol resolution to a target type. - Separated `Input_Stream` and `Output_Stream` from `File` to allow use in other spaces. - `File_Format` types `read_web` changed to be `read_stream` working with `InputStream`. - Added directory listing to `Auto_Detect` allowing for `Data.read` to list a folder. - Adjusted HTTP to return an `InputStream` not a `byte[]`: - `Response_Body` adjusted to wrap an `InputStream`. - Added ability to materialize to either and in-memory vector (<4KB) or a temporary file. - `Data.fetch` will materialize if not a recognized mime-type. - Added `HTTP_Error` to handle IO exceptions from the stream. - `Excel_Format` now supports mime-type and reading a stream. - `Excel_Workbook` can now get a `Excel_Section` using `read_section`. - Added S3 APIs: - `parse_uri`: splits an S3 URI into bucket and key. - `list_objects`: list the items in a S3 bucket with specified prefix. - `read_bucket`: list prefixes and keys with a delimiter in a S3 bucket with specified prefix. - `head`: either head_bucket (tests existance) or head_object API (reads object meta data). - `get_object`: gets an object from S3 returning as a `Response_Body`. - Added `S3_File` type acting like a `File`: - No support for writing in this PR. - ToDo: recursive listing, glob filtering, exists, size. - Fixed a few invalid type signature line. - Moved `create` methods for `Postgres_Connection` and `SQLite_Connection` into type instead of module. - Renamed `Column_Fetcher.Builder` to `Column_Fetcher_Builder`. - Fixed bug with `select_into` in Dry Run mode creating permanent tables. ToDo: Unit tests.	2023-09-20 15:09:11 +00:00
Hubert Plociniczak	1ee3d8f4f0	Rename Decimal to Float (#7807 ) Implements #6889.	2023-09-14 15:01:30 +00:00
Radosław Waśko	8b6e70b155	Support for BigInteger values in Table (#7715 ) - Fixes #7354 - And also closes #7712 - Refactors how we handle numeric ops - ensuring that the 'kernels' are placed all in one place and selected based on storage types.	2023-09-12 13:18:04 +00:00
Radosław Waśko	7d424bf8a2	Implement `Table.delete_rows`. (#7709 ) - Closes #7238 - Aligns `update_database_table` to a more consistent and clearer API - `update_rows`. - Adds a `truncate_table` helper function, to pair up with `drop_table`. Both are `PRIVATE` for now. - Adds tests for NULLs in keys in `update_rows` and `delete_rows`. - The behaviour is sometimes unexpected, so instead these fail with `Null_Values_In_Key_Columns`. - Adds a workaround for https://github.com/oracle/graal/issues/7359 - Adds a workaround for a related bug where a stack frame has no name (its `rootNode.getName() == null`). - I could not track down this bug to provide a neat repro.	2023-09-07 11:07:53 +00:00
GregoryTravis	061876e640	Add simple parts of Table.take and Table.drop functions to Database table (#7615 ) Implements database Table and Column take/drop, except While and Sample. Additional features and optimizations are in https://github.com/enso-org/enso/issues/7614.	2023-08-31 18:52:02 +00:00
Radosław Waśko	255b424b72	Add `value_type` to `Column.from_vector` and `expected_value_type` to `Column.map` and `Column.zip` (#7637 ) - Closes #6111 - Aligns semantics of handling Mixed columns. - Now, if an operation like `iif` or `fill_nothing` is given a `Mixed` column, the result will also be `Mixed` regardless of the `inferred_precise_value_type`. - Enables a few old tests that were pending but could be enabled since the types work is advanced enough.	2023-08-31 13:20:49 +00:00

1 2 3 4 5 ...

267 Commits