enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-26 16:11:44 +03:00

Author	SHA1	Message	Date
GregoryTravis	5eb3f3bd1d	Implement relational NULL semantics for Nothing for in-memory Column operations (#8816 ) Updates in-memory table column operations to treat Nothing as a relational NULL. This PR does not include changes to Table.join.	2024-01-24 17:02:45 +00:00
Radosław Waśko	edfcfde11c	Tests and improvements for secrets in cloud subdirectories (#8791 ) - Closes #8723 - Adds some missing features that were needed to make this work: - `Enso_File.create_directory` and `Enso_File.delete`, and basic tests for it - Changes how `Enso_Secret.list` is obtained - using a different Cloud endpoint allows us to implement the desired logic, the default endpoint was giving us _all_ secrets which was not what we wanted here. - Implements `Enso_Secret.update` and tests for it # Important Notes Notes describing any problems with the current Cloud API: https://docs.google.com/document/d/1x8RUt3KkwyhlxGux7XUGfOdtFSAZV3fI9lSSqQ3XsXk/edit Apparently, everything that was needed to make this feature work has already been implemented, although a few features needed workarounds on Enso side to work properly.	2024-01-24 10:17:22 +00:00
AdRiley	ef7b11fb67	Add new aliases for cross_join, info and drop (#8773 ) Adds new aliases: append -> cross_join, metadata -> info, skip -> drop.	2024-01-16 16:08:54 +00:00
GregoryTravis	f2cb1f097e	Support on_problems=Problem_Behavior.Report_Warning and Map_Error wrapping in Vector.map (#8595 ) Implements `Warnings.get_all wrap_errors=True` which wraps warnings attached to values inside vectors with `Map_Error`, which includes the position of the value within the vector. See [the documentation](https://github.com/enso-org/enso/blob/develop/docs/semantics/wrapped-errors.md) for more details. `get_all wrap_errors=True` does not change the warnings that are attached to values -- it wraps them before returning them to the caller, but does not change the original warnings attached to the values. Wrapped warnings only appear attached to the vector itself. The values inside the vector do not have their warnings wrapped. Warning propagation is not changed at all; `Warnings.get_all` (with default `wrap_errors=False`) behaves as before. `get_all wrap_errors=True` is meant to be used primarily by the IDE, although it can be used anywhere this wrapping is desired.	2024-01-16 09:36:22 +00:00
AdRiley	b8e93b3cba	Add new text_left and text_right functions (#8691 ) Added text_left and text_right functions for in-memory and databases	2024-01-15 23:43:23 +00:00
Radosław Waśko	f34abeda0c	Add tests for `Enso_Secret`s, update to new cloud API (#8736 ) - Closes #8556	2024-01-15 16:12:08 +00:00
Pavel Marek	428e83de36	Remove org.bouncycastle dependency (#8664 ) Remove `org.bouncycastle` dependency from `org.enso.runtime`.	2024-01-04 17:16:41 +01:00
Radosław Waśko	a1207e029d	Unify `File_Format_Metadata` with `File_For_Read` (#8628 ) - Closes #8555 - Refactors the file format detection logic, compacting lots of repetitive logic for HTTP handling into helper functions. - Some updates to CODEOWNERS.	2024-01-04 03:57:05 +00:00
AdRiley	ec51127635	Change null to Nothing (#8637 ) Change the generated column name for is_nothing to "[a] is Nothing" from "[a] is null" as Nothing is our customer facing term.	2023-12-28 18:02:23 +00:00
Radosław Waśko	d41d48e8a0	Merge `URI_With_Query` into `URI`, extend API of `URI` (#8591 ) - Closes #8544 - Adds `reset_query_arguments` and `/` operators allowing to transform a URI. - Adding tests for handling of various edge cases.	2023-12-21 18:39:26 +00:00
AdRiley	cfe0cbe0c1	Add text_length to column for in-memory and database (#8606 ) Closes #8521 Adds text_length to Column	2023-12-21 11:31:13 +00:00
Cassandra-Clark	232077f25e	Renamed lookup_and_replace to merge and renamed Table.replace to text… (#8564 )	2023-12-20 16:28:45 +00:00
Radosław Waśko	d4714af826	Add a few new `Filter_Condition`s (#8539 ) - Closes #8045	2023-12-16 15:12:23 +00:00
Radosław Waśko	7a05e679c3	Improve details attached to `No_Output_Columns` reported from various operations (#8528 ) - Closes #7635	2023-12-14 10:49:07 +00:00
GregoryTravis	1c815a3d45	Better Error Trapping in map (#8307 ) * tests * wip * wip * additional warnings * wip * wip * cleanup * nested wrapping * multiple nestings * wraps_error uses looks_for, test for should_fail_with * wip * stack trace line fix * use catch_primitive internally * fix warning mapping, dtf spec * just one wrapper checker, vector spec * missing ctor, back to non-primitive catch * back to c_p * put old map back * wip * unnest tests * Array.map on_problems * wip * Revert "wip" This reverts commit `c30d171457`. * better test names * warning logging * wip * wip * move logic into ALH * doc * constant * My_Error.Error * nested * doc * map_primtiive in warning mapper * composition * ref spec * Remove warnings prior to matching on the value If an expression has warnings and is matched we: 1) extract the warnings 2) execute the branch of a pattern that matches the value 3) attach extracted warnings to the result This caused warnings to reappear when doing the custom warnings manipulation. This is also consistent with how `CaseNode`'s `doWarning` specialization is defined. 
* fix 1 * do not auto unwrap in test error checkers * nested error matcher * in problems too * dtf * v * statistics * wip * Table_Spec, map_with_index_primitive * Column_Operations_Spec * disable warning wrapping and Report_Warning * unimpl test * Warnings_Spec * DCS * ACG JP * zip_primitive * join_helpers * Lookup_Helpers * Table * Data_Formatter * Value_Type_Helpers * revert check types changes * table_helpers * table tests * remove st * do not remove warnings from value * vec docs, tests for zip, mwi, flat_map * docs, fixes * remove nested_error_matcher * cleanup * benchmark * one error * alter * add bench to main * review * review * review * tail call * changelog * tail call was not a tail call * ws * bad import * Added missing import * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Array.enso Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * review, ref example * lazy benchmark data * extra paren * check outside of catch * review * vector too * actually lazy * disambiguate Map_Error * finish rename * move to extensions * combine Additional_Warnings error * rename to map_no_wrap * do not catch and rethrow * review * wip * remove _primitives entirely * remove unused should_fail_with function options * remove expected_warning as function in Problems --------- Co-authored-by: Hubert Plociniczak <hubert.plociniczak@gmail.com> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2023-12-13 09:38:09 -05:00
James Dunkerley	f60836d9e1	Apply ICONs (#8360 ) - Amend a couple of missed groups. - Add the first pass of some ICONs. The linter tool has been updated to support rewriting the ICON as well.	2023-11-22 15:24:16 +00:00
James Dunkerley	347b5a7cf5	Linting and Groups update (#8357 ) - Fix issues from the linter. - Rename the constructors for `Blank_Selector`. - Update various GROUP tags.	2023-11-21 18:12:27 +00:00
James Dunkerley	ecaca12df1	Integrating Enso Cloud with the libraries (part 1...) (#8006 ) - Add a `File_For_Read` type. Used for `File_Format` to read files. - Added `Enso_User` representing the current user in `Enso_Cloud`. - Will be later able to list known users. - Added `Enso_Secret` representing a value defined in `Enso_Cloud`. - Value not used within Enso only accessed within polyglot Java. - Integrated into `Username_And_Password` and can be used within JDBC connections. - Integrated into HTTP Headers so a secret can be used as a value. - New `URI_With_Query` with the same API as `URI`. Supporting secrets in the value. - Will be integrated with AWS credentials. - Added `Enso_File` representing a file or a folder in the cloud. - Support the same API as `File` (like the `S3_File`). - Will support `enso://` URI style access.	2023-11-20 23:21:14 +00:00
GregoryTravis	ea3d778456	Allow the creation of a constant column on an in-memory table with no rows. (#8218 )	2023-11-09 14:40:51 +00:00
GregoryTravis	6be94a854b	Implement truncate Date_Time for database backend (#8235 ) Also adds some checks for column names generated for floor, ceil, truncate, round.	2023-11-08 23:23:59 +00:00
Radosław Waśko	1b8b30a68d	Improve performance of `Join_Condition.Between` by sorting on one dimension (#8212 ) - Closes #5303 - Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`) - Adds benchmarks for join - Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks. - We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the number of `Between` conditions used in the query and `k` is the result set size). - Follow up ticket for consideration later: #8216 - Closes #8215 - After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task. - Closes #8204 - Unrelated, but I ran into this here: adds type checks to other arguments of `set`. - Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments) led to a hard-to-understand error, `Method if_then_else of type Column could not be found.`; now it fails with the type error `expected Text got Column`.	2023-11-08 12:59:55 +00:00
Radosław Waśko	2ce1567384	Limit `max_rows` that are downloaded in `Table.read` by default, and warn if more rows are available (#8159 ) - Sets the default limit for `Table.read` in Database to be max 1000 rows. - The limit for in-memory compatible API still defaults to `Nothing`. - Adds a warning if there are more rows than limit. - Enables a few unrelated asserts.	2023-11-06 16:41:47 +00:00
GregoryTravis	3c371adbef	Implement Table.format similar to Table.parse allowing to format columns in bulk (#8150 ) * doc * one test * date tests * empty and nothing * ints floats * bools * all columns * regex and index * locales * bad formats * all with one format * docs * examples, not impl db * docs, more errors * cleanup * changelog * check list * reorder * clue * review * review * review * review * review * review * specify time zone	2023-11-02 09:36:36 -04:00
Cassandra-Clark	b5d6628c57	Change filter_blank_rows when_any parameter to have a more user-friendly type (#7935 ) Added Blank_Selector constructor and applied to remove_blank_columns, select_blank_columns, filter_blank_rows for #7931 . Changed when_any to when for readability.	2023-11-01 16:51:15 +00:00
GregoryTravis	d467683ed1	Constant columns (in expressions and Column_Operations) should have clearer names (#8188 ) Previously, constant columns were given generated names with UUIDs in them, which are long and provide no information. Instead, we now use the constant value itself to form the name. Since these new generated names are less unique, we must explicitly make them unique, in cases where the caller did not explicitly set a name.	2023-11-01 14:41:03 +00:00
GregoryTravis	1480f50207	Overhaul the random number and item generation code (#8127 ) Rewrite most of Random.enso.	2023-10-31 15:25:37 +00:00
Radosław Waśko	79011bd550	Implement `Table.lookup_and_replace` in Database (#8146 ) - Closes #7981 - Adds a `RUNTIME_ERROR` operation into the DB dialect, that may be used to 'crash' a query if a condition is met - used to validate if `lookup_and_replace` invariants are still satisfied when the query is materialized. - Removes old `Table_Helpers.is_table` and `same_backend` checks, in favour of the new way of checking this that relies on `Table.from` conversions, and is much simpler to use and also more robust.	2023-10-31 15:19:55 +00:00
Radosław Waśko	0c278391fe	Test and improve handling of `Date_Time with_timezone=False` in Postgres (#8114 ) - Fixes #8049 - Adds tests for handling of Date_Time upload/download in Postgres. - Adds tests for edge cases of handling of Decimal and Binary types in Postgres.	2023-10-21 21:35:13 +00:00
Radosław Waśko	8172896065	Support `Previous_Value` in `fill_nothing` and `fill_missing` (#8105 ) - Adds `Previous_Value` to `fill_nothing` and `fill_empty`, as requested by #7192.	2023-10-20 13:18:53 +00:00
Radosław Waśko	28fc183f92	Review places where we can use `Column_Ref` (#8101 ) Closes #8046	2023-10-18 19:03:50 +00:00
Radosław Waśko	08b717eb54	Refactor Table problem handling to a more robust and hopefully cleaner approach (#7879 ) Closes #7514	2023-10-16 15:09:08 +00:00
GregoryTravis	f18d1323e1	Add Table.expand_to_rows to allow flattening vector and array values in table (#8042 ) # Important Notes Also includes a fix for a reallocation bug in `InferredBuilder`.	2023-10-13 20:54:06 +00:00
James Dunkerley	fac9e7a420	Expand capabilities of `Table.set` and better dropdown support, (#8005 ) - Adds the ability to use numbers, date/time and Boolean values as constants in `set`. - `Table.set` can take a `Column_Operation`, allowing for deriving of a new column based on other columns. - Added `Column_Ref` type to refer to a column in `filter`.	2023-10-13 16:03:28 +00:00
James Dunkerley	0dcfc3e9bf	Minor improvements from last couple of Book Clubs (#8034 ) - Added some ALIASes. - Added `sheet` to `Excel_Workbook` to give familiar API to read sheet. - Added conversion from range to vector allowing easy use with Zip. - Add `Map.from_keys_and_values`.	2023-10-12 14:29:59 +00:00
Radosław Waśko	cd84ac16ce	Restructure `Table.from_objects` to use conversions (#8020 ) Closes #7957	2023-10-11 22:25:18 +00:00
Radosław Waśko	6f78570115	Fix a DROP table bug, add SQL debug logging (#8007 ) - Fixes a bug where creating a temporary table could accidentally issue a `DROP` statement of the table name that the user provided, risking destruction of user data. - Fortunately, the bad scenario was almost impossible, because the `DROP` statement was only issued _if_ we previously checked that the mentioned table _does not exist_ - dropping a nonexistent table does not do any harm. - It could have been dangerous in a very unlikely scenario that a table was created just between the _existence check_ and the _drop_. - After the fix the existence check and any modifications are done within a transaction to avoid interference from concurrent modifications, and the DROP is correctly applied to a temporary Enso table instead of the original one. - Replaced a temporary log with proper simple logging of SQL statements into a file, if an Environment variable is set. - Used that feature to test that no unexpected statements occur.	2023-10-10 13:16:06 +00:00
Radosław Waśko	6e0bd86753	Implement `Table.lookup_and_replace` for in-memory (#7979 ) - Closes #7749 implementing the in-memory logic. - Additional complications have surfaced regarding the Database logic, so it has been split off into a separate ticket: #7981	2023-10-10 10:42:06 +00:00
Radosław Waśko	3222e5af62	Avoid exponential growth of column names (#7934 ) - Fixes #7933 - Avoids duplicating `]` as `]]` in generated column names - now column names grow linearly.	2023-10-02 16:05:28 +00:00
James Dunkerley	fb50eb7595	Using conversions in a few places (#7859 ) - Shuffle a few `from`s into correct places: - `Day_Of_Week.from` removing `Day_Of_Week_From` module. - Adding short cut for `http` and `https` in `Data.read` so it calls onto `Data.fetch` giving a single entry point. - Moved `URI` extensions from `Standard.Base.Data` module into `Standard.Base.Network.Extensions`. - Added `post` extension for `URI`. - Added `contains_key` to `JS_Object`. - Restored `into` in `JS_Object`: - Follows old logic populating a constructor. - Will use conversion from `JS_Object` if present. - Added automatic deserialization of `Date`, `Time_Of_Day` and `Date_Time` from JSON. - Uses conversion from `JS_Object`. - Added conversion from `Text` to a `HTTP_Method` and type checking where `HTTP_Method` used in public APIs. - Added support for `Date`, `Time_Of_Day` and `Date_Time` in `Table.from_objects`. - Added `expand_column` to `Table` to expand `JS_Object` to values. - Add type checking for `Table` in `right` arguments (allowing `Column`s to be used). - Use type checking in `Table.set` to allow for conversion to a `Column`. - Remove some unused imports. - Fix for bug in S3 edge case.	2023-10-02 14:54:22 +00:00
Radosław Waśko	c690559ec4	Implement `auto_value_type` operation (#7908 ) Closes #6113	2023-09-27 15:45:34 +00:00
Radosław Waśko	12c4f2981d	More robust Date/Time format patterns parsing (#7826 ) - Closes #7461 by introducing a `Date_Time_Formatter` type and making parsing date time formats more robust and safer. - The default ('simple') set of patterns is slightly simplified and made case insensitive (except for `M/m` and `H/h`) to avoid the `YYYY` vs `yyyy` issues and make it less error prone. - The `YYYY` now has the same meaning as `yyyy` in simple mode. The old meaning (week-based year) is moved to a _separate mode_, triggered by `Date_Time_Formatter.from_iso_week_date_pattern`. - Full Java syntax, as well as custom-built Java `DateTimeFormatter` can also be used by `Date_Time_Formatter.from_java`. - Text-based constants (e.g. `ISO_ZONED_DATE_TIME`) have now become methods on `Date_Time_Formatter`, e.g. `Date_Time_Formatter.iso_zoned_date_time`.	2023-09-22 10:12:18 +00:00
James Dunkerley	74d1d0861c	S3 Read Access, Input Stream based reading (#7776 ) - Added a `FileSystemSPI` allowing protocol resolution to a target type. - Separated `Input_Stream` and `Output_Stream` from `File` to allow use in other spaces. - `File_Format` types `read_web` changed to be `read_stream` working with `InputStream`. - Added directory listing to `Auto_Detect` allowing for `Data.read` to list a folder. - Adjusted HTTP to return an `InputStream` not a `byte[]`: - `Response_Body` adjusted to wrap an `InputStream`. - Added ability to materialize to either an in-memory vector (<4KB) or a temporary file. - `Data.fetch` will materialize if not a recognized mime-type. - Added `HTTP_Error` to handle IO exceptions from the stream. - `Excel_Format` now supports mime-type and reading a stream. - `Excel_Workbook` can now get a `Excel_Section` using `read_section`. - Added S3 APIs: - `parse_uri`: splits an S3 URI into bucket and key. - `list_objects`: list the items in a S3 bucket with specified prefix. - `read_bucket`: list prefixes and keys with a delimiter in a S3 bucket with specified prefix. - `head`: either head_bucket (tests existence) or head_object API (reads object meta data). - `get_object`: gets an object from S3 returning as a `Response_Body`. - Added `S3_File` type acting like a `File`: - No support for writing in this PR. - ToDo: recursive listing, glob filtering, exists, size. - Fixed a few invalid type signature lines. - Moved `create` methods for `Postgres_Connection` and `SQLite_Connection` into type instead of module. - Renamed `Column_Fetcher.Builder` to `Column_Fetcher_Builder`. - Fixed bug with `select_into` in Dry Run mode creating permanent tables. ToDo: Unit tests.	2023-09-20 15:09:11 +00:00
Hubert Plociniczak	1ee3d8f4f0	Rename Decimal to Float (#7807 ) Implements #6889.	2023-09-14 15:01:30 +00:00
Radosław Waśko	8b6e70b155	Support for BigInteger values in Table (#7715 ) - Fixes #7354 - And also closes #7712 - Refactors how we handle numeric ops - ensuring that the 'kernels' are placed all in one place and selected based on storage types.	2023-09-12 13:18:04 +00:00
Radosław Waśko	7d424bf8a2	Implement `Table.delete_rows`. (#7709 ) - Closes #7238 - Aligns `update_database_table` to a more consistent and clearer API - `update_rows`. - Adds a `truncate_table` helper function, to pair up with `drop_table`. Both are `PRIVATE` for now. - Adds tests for NULLs in keys in `update_rows` and `delete_rows`. - The behaviour is sometimes unexpected, so instead these fail with `Null_Values_In_Key_Columns`. - Adds a workaround for https://github.com/oracle/graal/issues/7359 - Adds a workaround for a related bug where a stack frame has no name (its `rootNode.getName() == null`). - I could not track down this bug to provide a neat repro.	2023-09-07 11:07:53 +00:00
GregoryTravis	061876e640	Add simple parts of Table.take and Table.drop functions to Database table (#7615 ) Implements database Table and Column take/drop, except While and Sample. Additional features and optimizations are in https://github.com/enso-org/enso/issues/7614.	2023-08-31 18:52:02 +00:00
Radosław Waśko	255b424b72	Add `value_type` to `Column.from_vector` and `expected_value_type` to `Column.map` and `Column.zip` (#7637 ) - Closes #6111 - Aligns semantics of handling Mixed columns. - Now, if an operation like `iif` or `fill_nothing` is given a `Mixed` column, the result will also be `Mixed` regardless of the `inferred_precise_value_type`. - Enables a few old tests that were pending but could be enabled since the types work is advanced enough.	2023-08-31 13:20:49 +00:00
James Dunkerley	7d83b3d7b4	Add GROUP to functions (#7622 ) - Update list of groups to agreed list. - Lower case `ALIAS` names to be consistent with function names. - Add `GROUP` to methods. - All constructors and functions have doc comments. - Correct a few typos (e.g. `PRVIATE`). - Mark some more things as `PRIVATE`. - Use `ToDo:` and `Note:` consistently. - Order tags in doc comment. # Important Notes We don't have all the doc comments on types and will want to add them in future.	2023-08-23 13:20:38 +00:00
Radosław Waśko	2385f5b357	Add size-limited strings and varying bit-width integer Value_Types to in-memory backend and check for ArithmeticOverflow in LongStorage (#7557 ) - Closes #5159 - Now data downloaded from the database can keep the type much closer to the original type (like string length limits or smaller integer types). - Cast also exposes these types. - The integers are still all stored as 64-bit Java `long`s, we just check their bounds. Changing underlying storage for memory efficiency may come in the future: #6109 - Fixes #7565 - Fixes #7529 by checking for arithmetic overflow in in-memory integer arithmetic operations that could overflow. Adds a documentation note saying that the behaviour for Database backends is unspecified and depends on particular database.	2023-08-22 18:10:46 +00:00
GregoryTravis	c9d7c5cb2b	Convert in-memory Column.round to Java (#7521 )	2023-08-16 14:45:23 +00:00
Jaroslav Tulach	aa0413e5a2	Use only Type instances as keys for State (#7585 )	2023-08-16 15:54:17 +02:00
Radosław Waśko	bc9cde6543	Fix column naming edge cases - invalid and duplicated columns, case-insensitive name aliasing for case-insensitive backends (#7495 ) - Fixes #7412 - Also adds tests and fixes some more edge cases: - Ensures correct handling of existing Database tables whose column names may be invalid from Enso perspective, or clashing from Enso perspective (e.g. for most DBs `ś` and `s\u0301` are different names, but for Enso they are basically the same so this would cause issues) - thus Enso now renames such columns when accessed (still using the correct column reference in the generated SQL under the hood).	2023-08-04 09:04:38 +00:00
Radosław Waśko	c61c741476	Respect database backend naming limitations when generating table/column names and validate user-provided names to avoid silent name clashes; process JDBC warnings reported from backends (#7428 ) - Closes #5951 - Ensures any SQL warnings reported by the database through the JDBC driver are processed and forwarded to the user. - These warnings show issues like the implicit name truncation that this PR is also solving. It's good to make sure they are visible as they can help avoid and understand unexpected problems. They should not show up in most standard workflows. - Adds simple history to our REPL.	2023-08-03 09:44:27 +00:00
James Dunkerley	7345f0fd9a	Speed up statistics (#7390 ) - Allow `parse_to_columns` to take a `Regex` object. - Add `pattern` to the `Regex` object. - Add `column_names` to the `Row` object. - Improve statistics performance. - Add benchmarks for stats. \| Benchmark \| Reference \| New \| Improvement \| \| --- \| --- \| --- \| --- \| \| Max (by reduce) \| 16.4ms \| 16.3ms \| - \| \| Max (stats) \| 703ms \| 224ms \| 68% \| \| Sum (by reduce) \| 38ms \| 38ms \| - \| \| Sum (stats) \| 753ms \| 420ms \| 44% \| \| Variance (stats) \| 745ms \| 553ms \| 26% \| Also tried using a Ref approach for stats but it was slower (`7e13c45224`).	2023-07-26 10:01:18 +00:00
GregoryTravis	1f6fcf189b	Implement replace on the Database Column (#7275 ) Implements `replace` for database text columns, for text, regex, and column patterns.	2023-07-25 18:09:50 +00:00
James Dunkerley	88f32d9b2a	Various small tickets... (#7367 ) - Added `Text.length` into Text class so CB lists the built in. - Added `File.starts_with` and tests for the built in method. - Add `to_js_object` and `to_display_text` to `Regex`. ![image](https://github.com/enso-org/enso/assets/4699705/3b197c94-9c49-4bc5-a2cc-ce53b917942e) - Add `to_js_object` and `to_display_text` to `Match`. ![image](https://github.com/enso-org/enso/assets/4699705/962ec4f2-324d-4f10-8ec0-932b093c6729) - Remove the `bit_shift_l` alias from the built-ins. - Add test and Enso wrapper for `Text.is_normalized`.	2023-07-23 09:04:11 +00:00
James Dunkerley	fd0bdc86dd	Fix issue with rename_columns and revert order of parameter change on select_columns. (#7321 ) The Regex change introduced some issues. Added a test for missed case in `rename_columns` where using vector of pairs. Reverted parameter order change for `select_columns`.	2023-07-18 13:30:23 +00:00
James Dunkerley	aaa235fbad	Add drop down for replace, remove Column_Selector (#7295 ) - Add dropdowns for `replace` functions. - Retire `Column_Selector` type. - Add `select_blank_columns` and `remove_blank_columns` functions to table types. - Allow Regex to be used to pick columns.	2023-07-14 17:30:52 +00:00
Radosław Waśko	866283c0a8	Improve error message on `Filter_Condition` missing arguments in `Table.filter` (#7290 ) In #7148 I improved the error message when a `Filter_Condition` constructor without arguments is provided to `Vector.filter` and its friends. This PR applies the same check to the `Table.filter`. This is useful, because when we select a Filter_Condition from a widget, initially it does not have all its arguments applied. This used to lead to confusing errors being reported to the user, now, a much clearer error is shown: ![image](https://github.com/enso-org/enso/assets/1436948/19140a7b-d6fc-4292-81d3-dc6d61135cb9)	2023-07-14 08:00:13 +00:00
Radosław Waśko	620cc361ce	Add `date_diff`, `date_add` and `date_part` to scalar Enso date-time values. (#7273 ) Followup of #7221, adding `date_diff`, `date_add` and `date_part` to scalar Enso date-time values.	2023-07-13 15:17:21 +00:00
Radosław Waśko	ca68dd94da	Adding new Date/Time operations (`-`, `date_add`, `date_diff`, `date_part`) (#7221 ) - Adds `Column.date_diff` for computing date/time difference as integer multiple of some unit. - Adds `Column.date_add` for shifting date/time by a unit. - Adds `Column.date_part` for extracting various parts of the date/time value as integer. - Adds widgets for the 3 methods above whose content depends on the column value type. - Adds shorthands: `Column.hour`, `Column.minute` and `Column.second` to extract these date parts. - Extends `Time_Period` with support for milli-, micro- and nano- seconds; and adapts functions taking `Time_Period` to support these wherever possible.	2023-07-13 12:56:54 +00:00
James Dunkerley	0adab6c68c	Round on a column was always adding a warning (#7246 ) - Only warn if outside allowed range. - Added `is_infinite` to In-Memory column. - Allow integer value type for `is_nan` and `is_infinite`.	2023-07-10 17:35:23 +00:00
GregoryTravis	345d6b9cb1	Add cross_join support to Database Table (#7234 )	2023-07-10 16:29:37 +00:00
James Dunkerley	1fb60df61b	Fixes from the live demo. (#7243 ) - Removed defaults from `cross_tab`. It caused an out-of-heap space error when it attempted to build a 205k x 205k table. Now has a hard limit of 10,000 columns - we can increase this once we have more concrete test data. ![image](https://github.com/enso-org/enso/assets/4699705/bc38d41c-56dc-41bd-8a7c-fa89ecfa7f79) - Adjusted the dropdowns on `Aggregate_Column` for `columns` and `order_by` to be dropdowns as nested Vector editors are not supported. ![image](https://github.com/enso-org/enso/assets/4699705/f4a7c7cc-6a21-462c-a39e-65fbab82c367) - Altered `Aggregate_Column` so `new_name` now `new_name:Text=""` and not taking `Nothing` anymore. Makes it appear correctly in IDE. ![image](https://github.com/enso-org/enso/assets/4699705/196a49ba-4274-44bb-b876-0372c8f62746) - Added dropdowns for `fill_empty`, `fill_nothing` and `replace` on `Table`. ![image](https://github.com/enso-org/enso/assets/4699705/9ee5cec2-82d5-4452-b650-67015ac9fee5) - Added `replace` to Database table throwing `Unsupport_Database_Operation`.	2023-07-09 18:03:05 +00:00
GregoryTravis	bd26e95fd6	Add Table.replace; Change Text.replace to take a Text\|Pattern, and remove the use_regex param. (#7223 )	2023-07-06 16:13:11 +00:00
James Dunkerley	7749286c69	Tidy up the imports using script (#7220 ) Ordering the imports to test a script.	2023-07-06 14:22:50 +00:00
GregoryTravis	6eb46afb40	Do not rename column on fill_nothing and add version to the Table allowing filling multiple (include fill_empty as well). (#7166 ) Updated Column.fill_nothing and .fill_empty, and added the same to Table. (Both in-memory and db.)	2023-07-05 17:20:23 +00:00
James Dunkerley	4fbe7e3830	Remove `Array.new` and `Array.copy` and move Vector functions to builtins. (#7147 ) - Removed Array methods: `new`, `copy` and `new_[1234]`. - New builtins for `Vector.insert`, `Vector.remove` and `Vector.flatten`. - Replaced `Vector_Builder` use of `Array.copy` to a `Vector.Builder` approach.	2023-07-03 12:41:41 +00:00
Radosław Waśko	4ccf3566ce	Implement `add_row_number` for Database backends, fix primary key inference for SQLite (#7174 ) Closes #6921 and also closes #7037	2023-07-03 11:51:42 +00:00
GregoryTravis	550d146493	Add round, ceil, floor, truncate to the In-Database Column type (#6988 )	2023-06-30 16:47:40 +00:00
Paweł Grabarz	cb9d4c4607	move method icon definition to documentation tag (#7123 )	2023-06-29 14:48:55 +00:00
James Dunkerley	56688ec1e7	Minor fixes. (#7122 ) Mostly stuff to tidy up the static methods in the CB. - Remove default pattern from `parse_to_table` (caused IDE to freeze). - Rename any `_` arguments to what they are. - Merge `Date.now` into `Date.today` - Merge the Interval constructors into a single constructor. - Hide various methods.	2023-06-27 18:18:15 +00:00
Radosław Waśko	2bac9cc844	Execution Context integration for Database write operations (#7072 ) Closes #6887	2023-06-27 15:51:21 +00:00
James Dunkerley	937651f696	Code Clean Up, Fix Weird Namespace, S3 List Objects and Read Object (#7114 ) Mostly a tidy up as part of looking over the function catalogue for groups. Sorted some whitespaces issues.	2023-06-24 23:18:58 +00:00
James Dunkerley	1859ccbab5	Improving widgets and other minor tweaks. (#7052 ) - Removed `module` argument from `enso_project` (new `Project_Description.new` API). - Removed the custom option from date and time parse/format dropdowns. - The `format` dropdown uses the value to create the dropdown. (Screenshot below) - Removed `StorageType` coalescing rules and replaced them with simpler logic in `ObjectStorage`. - Update signature for `add_row_number` and add aliases.	2023-06-19 19:03:36 +00:00
James Dunkerley	760fb71798	First part of AWS S3 API, various small fixes. (#6973 ) - Add type detection for `Mixed` columns when calling column functions. - Excel uses column name for missing headers. - Add aliases for parse functions on text. - Adjust `Date`, `Time_Of_Day` and `Date_Time` parse functions to not take `Nothing` anymore and provide dropdowns. - Removed built-in parses. - All support Locale. - Add support for missing day or year for parsing a Date. - All will trim values automatically. - Added ability to list AWS profiles. - Added ability to list S3 buckets. - Workaround for Table.aggregate so default item added works.	2023-06-15 16:20:13 +00:00
Dmitry Bushev	35ef358121	Update sqlite jdbc dependency (#7031 ) # Important Notes Update `sqlite-jdbc`	2023-06-15 11:00:18 +00:00
Radosław Waśko	dad57e6c7d	Implement remaining `Update_Action`s for `update_database_table`. (#7035 ) Closes #6498	2023-06-15 08:48:22 +00:00
Dmitry Bushev	48f0c6f5e8	Scala 2.13.11 and libraries update (#7010 ) Update Scala and libraries.	2023-06-14 13:15:57 +00:00
Radosław Waśko	d9ed63fb89	Implement `Insert` update action for `update_database_table`. (#6990 ) This adds the spec for all update actions, but implements the common input validation framework and `Insert`. Tests for remaining actions are marked as pending - these will be implemented in a subsequent PR.	2023-06-14 00:14:32 +00:00
Radosław Waśko	b513839418	Refactor `create_database_table` into `Connection.create_table` and `select_into_database_table`, implement `Set`. (#6925 ) First part for #6498 - refactoring of the upload infrastructure, in preparation for `update_database_table`. Implemented a `Set` data structure which was long needed. The APIs are added and an initial implementation is created, but it is not complete - but it has grown significantly already so the remaining implementation will be done as a separate PR. Adds some basic ability for a function to ensure that it is only executed from within a transaction.	2023-06-06 10:36:05 +00:00
James Dunkerley	db96bd2e2c	Small fixes from book club. (#6933 ) - Add the missing dropdowns for `Locale` and `Encoding`. - Correct a few mismatched type signatures. - Adjust `order_by` calls with a single `Sort_Column` to call in a Vector. - Adjust parameter names for `transpose`. - Fix for the table viz: escape HTML and `suppressFieldDotNotation`. - Use `Filter_Condition.Equal True` for the default filter. - Adjust `Data.fetch` to return the response on success when parse fails. Rename `parse` to `try_auto_parse`. - Add various aliases for methods. - Add tests for `Table.set` when using a `Vector`, `Range` or `Date_Range`. - Add check for mismatched length on `Table.set`. ![image](https://github.com/enso-org/enso/assets/4699705/23ea0ba3-2b05-4af8-afd9-f35b55446c24) ![image](https://github.com/enso-org/enso/assets/4699705/8b0253e6-e9e8-490a-9607-0da51ab5a215)	2023-06-05 13:57:30 +00:00
Radosław Waśko	cfb2f2916e	Merge `Column_Indexes_Out_Of_Range` into `Missing_Input_Columns`. (#6901 ) Implements #6869	2023-06-02 12:09:20 +00:00
Radosław Waśko	d44b1250b7	Implement `Table.add_row_number` (#6890 ) Closes #5227 # Important Notes - This lays first steps towards #6292 - we get pure Enso variants of MultiValueKey. - Another part refactors `LongStorage` into `AbstractLongStorage` allowing it to provide alternative implementations of the underlying storage, in our case `LongRangeStorage` generating the values ad-hoc and `LongConstantStorage` - currently unused but in the future it can be adapted to support constant columns (once we implement similar facilities for other types).	2023-06-02 10:13:13 +00:00
James Dunkerley	343b5fb085	Execution control for Table.write and various widget tweaks... (#6835 ) - Adds execution control to `Table.write`. - Refactored the `Text.write` to make part reusable. - Tidied up some legacy mess in tests. - Add easier flow to go from `Text` to an `URI` to fetching data. - Add decode functions to `Response` and `Response_Body`. - Fix issue with 0 length regex matches (using same as Python and .Net approach). - Add various ALIAS entries to make function discovery easier. - Sort a lot of drop down and vector editors out (including switch to fully qualified names).	2023-06-01 22:10:03 +00:00
GregoryTravis	0337180384	Add rounding functions to the Column type (#6817 )	2023-06-01 20:06:23 +00:00
Jaroslav Tulach	c6cb937c0a	Runtime check of ascribed types (#6790 )	2023-05-30 11:07:26 +02:00
Hubert Plociniczak	6eb4737330	Append warnings extracted before tail call execution (#6849 ) Throwing `TailCallException` meant that exceptions that were extracted from the expression before the call was made could not be appended. This change catches the `TailCallException`, adds warnings to it and propagates it further, thus ensuring that we don't lose the information. Closes #6765. # Important Notes Removed workarounds introduced in stdlib.	2023-05-29 12:44:15 +00:00
Radosław Waśko	c3e771c75c	Allow casting a Mixed column into a concrete type (#6777 ) Follow-up of #6711 Closes #6838	2023-05-26 13:25:53 +00:00
Jaroslav Tulach	b33ae479dd	Various test improvements to increase coverage and speed things up (#6820 )	2023-05-25 10:18:46 +02:00
James Dunkerley	62fecfa474	Widgets, Vector as Column, Viz Fixes and Rename Columns (#6768 ) - Fix couple of bugs in Table viz: rounding of bottom div, missing character, not including row count as an option. - Add better JSON format for `Row`, add support for visualization in the Table viz both for `Vector Row` or `Row`. - Fix some type signature errors. - Move `Column_Format` to `Standard.Table.Internal`. - Move `format_widget` to `File_Format.default_widget` and sort the signature of `Widget` methods. - Added utility to make `Single_Choice` widgets. - Added dropdown for delimiter on split methods. - Removed `default_widget` from `Problem_Behavior` and `Filter_Condition`. - Altered signature and widgets for table functions. - Added `to_column` extension to allow easy conversion of Range and Vector to Column. - Added `compute`, `compute_bulk`, `running` to Column to allow statistic computation. - Added drop down for `Table.write` format parameter. - Added drop down for `Table.rename_columns`. - Added support for Vector of pairs for renaming columns. - Added check when making a map from Vector if not 2 items. ![image](https://github.com/enso-org/enso/assets/4699705/beed257c-efe3-44a3-9e3a-041354701735)	2023-05-19 23:24:47 +00:00
Radosław Waśko	a9a464af37	Implement simple variants of `parse` for the Database backend (#6731 ) Implements the simplest `parse` scenarios for the Database backend. Before #6711 these could have been done by `cast`, but in #6711 the APIs were unified to only allow casting to the same set of types in both in-memory and Database. Converting Text to other types is supposed to be done by `parse` and not `cast`, so the ability to use `cast` for rudimentary parsing is removed in the Database backend to make it consistent with in-memory. But now it is lacking any, even simplest, Text->Int/Text->Date support. To alleviate that, the simple scenarios for `parse` are implemented (no support for format customization yet, will boil down to a cast under the hood).	2023-05-19 22:11:23 +00:00
Radosław Waśko	447786a304	Implement `cast` for Table and Column (#6711 ) Closes #6112	2023-05-19 10:00:20 +00:00
Jaroslav Tulach	41bb52c901	Verify ascribed types of parameters really exist (#6584 ) Verify ascribed types `(a : Xyz)` are checked for existence.	2023-05-14 07:23:18 +00:00
Radosław Waśko	f5071a17fd	Improving widgets for take/drop (#6641 ) Related to #6410 # Important Notes - Updated some `Meta` methods (needed for error handling): - `Meta.Type` now has `name` and `qualified_name`. - `Meta.Constructor` has `declaring_type` allowing to get the type that this constructor is associated with.	2023-05-12 15:33:15 +00:00
Radosław Waśko	d8b926922a	Improve `Non_Unique_Primary_Key` error, split file format detection into read/write, improve SQLite format detection (#6604 ) Closes #6437 Related to #6410 - Add example duplicate row to `Non_Unique_Primary_Key`. - Ensure `File.read` fails if the file does not exist, always. - Ensure SQLite fails if file is empty or nonexistent or malformed. - Split file format detection into read and write modes, so that the read mode can depend on actual file _contents_.	2023-05-09 17:15:44 +00:00
GregoryTravis	4ba8409def	Add format to the in-memory Column (#6538 ) Add format to the in-memory Column # Important Notes Also updates .format in date types. Some rearrangement of date formatting builtins / Java libraries.	2023-05-09 08:47:40 +00:00
James Dunkerley	bc0db18a6e	Small changes from Book Club issues (#6533 ) - Add dropdown to tokenize and split `column`. - Remove the custom `Join_Kind` dropdown. - Adjust split and tokenize names to start numbering from 1, not 0. - Add JS_Object serialization for Period. - Add `days_until` and `until` to `Date`. - Add `Date_Period.Day` and create `next` and `previous` on `Date`. - Use simple names with `File_Format` dropdown. - Avoid using `Main.enso` based imports in `Standard.Base.Data.Map` and `Standard.Base.Data.Text.Helpers`. - Remove an incorrect import from `Standard.Database.Data.Table`. From #6587: A few small changes, lots of lines because this affected lots of tests: - `Table.join` now defaults to `Join_Kind.Left_Outer`, to avoid losing rows in the left table unexpectedly. If the user really wants to have an Inner join, they can switch to it. - `Table.join` now defaults to joining columns by name not by index - it looks in the right table for a column with the same name as the first column in left table. - Missing Input Column errors now specify which table they refer to in the join. - The unique name suffix in column renaming / default column names when loading from file is now a space instead of underscore.	2023-05-06 10:10:24 +00:00
GregoryTravis	f7282b7cff	"at_least_one" flag for tokenize_to_rows (#6539 ) Add "at_least_one" flag for tokenize_to_rows.	2023-05-06 06:22:09 +00:00
Radosław Waśko	0be572af6a	Only `Join_Kind.Inner` removes the common-named columns (#6564 ) As requested by @jdunkerley	2023-05-05 08:12:07 +00:00
Radosław Waśko	41a8257e8d	Separating Redshift connector from `Database` library into a new `AWS` library (#6550 ) Related to #5777	2023-05-04 17:36:51 +00:00
Radosław Waśko	5eb9c3a843	Implement `create_database_table` for Database Table (#6467 ) Closes #6327	2023-05-02 14:01:44 +00:00
Radosław Waśko	cdd0065800	Change defaults for `Connection.tables` and ensure that `Connection.query` recognizes all available tables (#6443 ) Closes #6398	2023-04-29 00:13:29 +00:00
Radosław Waśko	462016a428	Create database table from memory (#6429 )	2023-04-27 20:06:17 +00:00
James Dunkerley	0c7c3bdeaf	Fix for the massive number of warnings when renaming with invalid names. (#6450 ) * Rename makeUnique overloads to avoid issue when Nothing is passed. Suspend warnings when building the output table to avoid mass warning duplication. * Add test for mixed invalid names. Adjust so a single warning attached. * PR comments.	2023-04-27 14:51:59 +01:00
James Dunkerley	0e51131809	Table Visualization and display text changes. (#6382 ) - Missing tests from number parsing. - Fix type signature on some warning methods. - Fix warnings on `Standard.Database.Data.Table.parse_values`. - Added test for `Nothing` and empty string on `use_first_row_as_names`. - New API for `Number.format` taking a simple format string and `Locale`. - Add ellipsis to truncated `Text.to_display_text`. - Adjusted built-in `to_display_text` for numbers to not include type (but also to display BigInteger as value). - Remove `Noise.Generator` interface type. - Json: Added `to_display_text` to `JS_Object`. - Time: Added `to_display_text` for `Date`, `Time_Of_Day`, `Date_Time`, `Duration` and `Period`. - Text: Added `to_display_text` to `Locale`, `Case_Sensitivity`, `Encoding`, `Text_Sub_Range`, `Span`, `Utf_16_Span`. - System: Added `to_display_text` to `File`, `File_Permissions`, `Process_Result` and `Exit_Code`. - Network: Added `to_display_text` to `URI`, `HTTP_Status_Code` and `Header`. - Added `to_display_text` to `Maybe`, `Regression`, `Pair`, `Range`, `Filter_Condition`. - Added support for `to_js_object` and `to_display_text` to `Random_Number_Generator`. - Verified all error types have `to_display_text`. - Removed `BigInt`, `Date`, `Date_Time` and `Time_Of_Day` JS based rendering as using `to_display_text` now. - Added support for rendering nested structures in the table viz.	2023-04-26 18:15:48 +00:00
GregoryTravis	afd804d529	5127 Add Table.parse_to_columns to parse a single column to a set of columns. (#6383 ) Implement Table.parse_to_columns	2023-04-24 15:21:38 +00:00
Radosław Waśko	f3873f9768	Infer SQLite types locally (#6381 ) Closes #6208	2023-04-24 10:55:12 +00:00
Radosław Waśko	a43d524336	Add typechecks to Aggregate and Cross Tab (#6380 ) Follow up of #6298 as it grew too much. Adds the needed typechecks to aggregate operations. Ensures that the DB operations report `Floating_Point_Equality` warning consistently with in-memory.	2023-04-24 08:55:54 +00:00
Radosław Waśko	8db2ad51a1	Adding typechecks to Column Operations (#6298 ) Closes #6106	2023-04-21 12:20:12 +00:00
James Dunkerley	0350762386	Add `replace`, `trim` to Column. Better number parsing. (#6253 ) - Add `replace` with same syntax as on `Text` to an in-memory `Column`. - Add `trim` with same syntax as on `Text` to an in-memory `Column`. - Add `trim` to in-database `Column`. - Added `is_supported` to dialects and exposed the dialect consistently on the `Connection`. - Add `write_table` support to `JSON_File` allowing `Table.write` to write JSON. - Updated the parsing for integers and decimals: - Support for currency symbols. - Support for brackets for negative numbers. - Automatic detection of decimal points and thousand separators. - Tighter rules for scientific and thousand separated numbers. - Remove `replace_text` from `Table`. - Remove `write_json` from `Table`.	2023-04-20 16:04:59 +00:00
GregoryTravis	4dcf5faddd	Add split and tokenize to the Table. (#6233 ) Implement split and tokenize for tables.	2023-04-14 16:03:02 +00:00
Radosław Waśko	0f4f8a0542	Full-joins in SQLite (#6215 ) Closes #5254 In #6189 the SQLite version was bumped to a newer release which has builtin support for Full and Right joins, so a workaround is no longer needed.	2023-04-06 16:49:14 +00:00
Radosław Waśko	f5db35af07	Adjust `{Table\|Column}.parse` to use `Value_Type` (#6213 ) Closes #5660	2023-04-06 10:58:55 +00:00
Radosław Waśko	83b10a2088	Implement `Table.union` for Database backend (#6204 ) Closes #5235	2023-04-06 08:40:34 +00:00
Dmitry Bushev	df4491de50	Batch insert suggestions (#6189 ) close #6080 Changelog - add: implement `SuggestionsRepo.insertAll` as a batch SQL insert - update: `search/getSuggestionsDatabase` returns empty suggestions. Currently, the method is only used at startup and returns the empty response anyway because the libs are not loaded at that point. - update: serialize only global (defined in the module scope) suggestions during the distribution building. There's no sense in storing the local library suggestions. - update: sqlite dependency - remove: unused methods from `SuggestionsRepo` - remove: Arguments table # Important Notes Speeds up libraries loading by ~1 second. ![2023-04-03-173423_2086x324_scrot](https://user-images.githubusercontent.com/357683/229597470-19dcc010-2a34-43e1-87be-60af99afd275.png) ![2023-04-03-173514_2083x321_scrot](https://user-images.githubusercontent.com/357683/229597476-bf5b3c33-6321-4ac9-a0ca-2fb57d257857.png)	2023-04-06 07:47:23 +00:00
Jaroslav Tulach	741b394b0d	Suspended atom fields are evaluated only once (#6151 ) Implements #6134. # Important Notes One can define lazy atom fields as: ```haskell type Lazy Value ~x ~y ``` the evaluation of the `x` and `y` fields is then delayed until they are needed. The evaluation happens once. Then the computed value is kept in the atom for further use.	2023-04-05 23:46:08 +00:00
James Dunkerley	a05cee2571	Add PRIVATE so function hidden from Component Browser and other tidying... (#6207 ) Review the set of public functions. Mark many as PRIVATE so not in CB. Some other small tidying.	2023-04-05 17:54:06 +00:00
Radosław Waśko	6ddcb553e5	Date/time support for Postgres. Year/month/day operations on Columns. (#6153 ) Closes #6115	2023-03-31 18:37:04 +00:00
Radosław Waśko	6f86115498	Proper implementation of Value Types in Table (#6073 ) This is the first part of the #5158 umbrella task. It closes #5158, follow-up tasks are listed as a comment in the issue. - Updates all prototype methods dealing with `Value_Type` with a proper implementation. - Adds a more precise mapping from in-memory storage to `Value_Type`. - Adds a dialect-dependent mapping between `SQL_Type` and `Value_Type`. - Removes obsolete methods and constants on `SQL_Type` that were not portable. - Ensures that in the Database backend, operation results are computed based on what the Database is meaning to return (by asking the Database about expected types of each operation). - But also ensures that the result types are sane. - While SQLite does not officially support a BOOLEAN affinity, we add a set of type overrides to our operations to ensure that Boolean operations will return Boolean values and will not be changed to integers as SQLite would suggest. - Some methods in SQLite fallback to a NUMERIC affinity unnecessarily, so stuff like `max(text, text)` will keep the `text` type instead of falling back to numeric as SQLite would suggest. - Adds ability to use custom fetch / builder logic for various types, so that we can support vendor specific types (for example, Postgres dates). # Important Notes - There are some TODOs left in the code. I'm still aligning follow-up tasks - once done I will try to add references to relevant tasks in them.	2023-03-31 16:16:18 +00:00
James Dunkerley	58f2c7643f	Use new Enso Hash Codes and Comparable (#6060 ) Enables `distinct`, `aggregate` and `cross_tab` to use the Enso hashing and equality operations. Also, I rewired the way the ObjectComparators are obtained in polyglot code to be more consistent. Add Comparator for `Day_Of_Week`, `Header`, `SQL_Type`, `Image` and `Matrix`. Also, removed the custom `==` from these types as needed. (Closes #5626)	2023-03-24 15:02:25 +00:00
James Dunkerley	dd009fd1af	Tidy up the public module level statics (#6032 ) Tidies up a lot of PUBLIC module statics - marked some as PRIVATE, made some methods of types.	2023-03-22 18:02:37 +00:00
James Dunkerley	546cb0c4ab	Deal with warnings attached to value when making widgets. (#5994 ) - Fixes InvokeCallableNode to support warnings. - Strips warnings from annotations in `get_widget_json`. - Remove `get_full_annotations_json`. - Fix warnings on Dialect.	2023-03-22 10:50:19 +00:00
Hubert Plociniczak	8c6fd60aaf	Detect conflicts between exported types and FQNs (#5986 ) Exporting types named the same as the module where they are defined in `Main` modules of library components may lead to accidental name conflicts. This became apparent when trying to access `Problem_Behavior` module via a fully qualified name and the compiler rejected it. This is due to the fact that `Main` module exported `Error` type defined in `Standard.Base.Error` module, thus making it impossible to access any other submodules of `Standard.Base.Error` via a fully qualified name. This change adds a warning to FullyQualifiedNames pass that detects any such future problems. While only `Error` module was affected, it was widely used in the stdlib, hence the number of changes. Closes #5902. # Important Notes I left out the potential conflict in micro-distribution, thus ensuring we actually detect and report the warning.	2023-03-21 21:09:41 +00:00
James Dunkerley	7c9b9ead8e	Fix up some type signatures... (#5979 ) Align any type signatures with a mismatch in count between types and arguments.	2023-03-17 11:53:23 +00:00
Kaz Wesley	e171fba301	New documentation parser (#5917 ) Implement new Enso documentation parser; remove old Scala Enso parser. Performance: Total time parsing documentation is now ~2ms. # Important Notes - Doc parsing is now done only in the frontend. - Some engine tests had never been switched to the new parser. We should investigate tests that don't pass after the switch: #5894. - The option to run the old searcher has been removed, as it is obsolete and was already broken before this (see #5909). - Some interfaces used only by the old searcher have been removed.	2023-03-15 15:43:51 +00:00
Radosław Waśko	952beba8d1	Fix `cross_tab` column naming edge cases, add `fill_empty` (#5863 ) Closes #5151 and adds some additional tests for `cross_tab` that verify duplicated and invalid names. I decided that for empty or `Nothing` names, instead of replacing them with `Column` and implicitly losing connection with the value that was in the column, we should just error on such values. To make handling of these easier, `fill_empty` was added allowing to easily replace the empty values with something else. Also, `{is,fill}_missing` was renamed to `{is,fill}_nothing` to align with `Filter_Condition.Is_Nothing`.	2023-03-11 11:58:54 +00:00
Radosław Waśko	91ef8acf35	Review generated Column names (#5850 ) Closes #5583 and closes #5157	2023-03-10 19:07:58 +00:00
Jaroslav Tulach	8bbdd1af5b	Meta.is_a consistent with case-type-of check (#5853 ) Removing special handling of `AtomConstructor` in `Meta.is_a` check. # Important Notes A lot of tests are about to fail. Many of them indirectly call `Meta.is_a` with a constructor rather than type.	2023-03-10 07:41:04 +00:00
James Dunkerley	299bfd6b7d	Fixes from the Demo on 2nd March (#5823 ) - Fix issue with Geo Map viz. - Handle invalid format strings better in `Data_Formatter`. - New constants for the ISO format strings (and a special ENSO_ZONED_DATE_TIME) - Consistent Date Time format for parsing in all places. - Avoid throwing exception in datetime parsing. - Support for milliseconds (well nanoseconds) in Date_Time and Time_Of_Day. - `Column.map` stays within Enso. - Allow `Aggregate_Column.Group_By` in `cross_tab` group_by parameter.	2023-03-07 20:58:00 +00:00
Pavel Marek	b6e2319fcc	Comparators support partial ordering (#5778 )	2023-03-07 04:16:38 +00:00
James Dunkerley	01fc34c18a	Improving Expression Support for In Database (#5790 ) - Adjust Excel Workbook write behaviour. - Support Nothing / Null constants. - Deduce the type of arithmetic operations and `iif`. - Allow Date_Time constants, treating as local timezone. - Removed the `to_column_name` and `ensure_sane_name` code.	2023-03-03 12:03:05 +00:00
Radosław Waśko	b764b0b7b7	Improve error handling of Connection.query (#5693 ) Closes #5252	2023-02-24 17:15:10 +00:00
Radosław Waśko	793eafc866	Improve Table.parse_values API (#5692 ) Closes #5111	2023-02-24 13:35:01 +00:00
James Dunkerley	652b8d5db3	Update `rename_columns` to new API design, add `first_row`, `second_row` and `last_row` functions to the table. (#5719 ) - Updates the `rename_columns` API. - Add `first_row`, `second_row` and `last_row` to the Table types. - New option for reading only last row of ResultSet.	2023-02-23 19:42:45 +00:00
Radosław Waśko	3027c6f3a2	Ensure entries containing newlines are quoted when writing Delimited Files (#5652 ) Fixes #5638	2023-02-17 00:57:48 +00:00
James Dunkerley	1bc27501e6	Remove `Column` type from Aggregate_Column, simplify Column_Selector, some new `File_Format`s (#5646 ) - Updated `Widget.Vector_Editor` ready for use by IDE team. - Added `get` to `Row` to make API more aligned. - Added `first_column`, `second_column` and `last_column` to `Table` APIs. - Adjusted `Column_Selector` and associated methods to have simpler API. - Removed `Column` from `Aggregate_Column` constructors. - Added new `Excel_Workbook` type and added to `Excel_Section`. - Added new `SQLiteFormatSPI` and `SQLite_Format`. - Added new `ImageFormatSPI` and `Image_Format`.	2023-02-16 15:15:49 +00:00
Radosław Waśko	a02eab451e	Implement basic warnings for column arithmetic, review warnings on expressions and `filter` (#5605 ) Closes #5109 # Important Notes - Currently the tests pass for the in-memory parts of Common_Table_Operations, but still some stuff not working on DB backends - in progress.	2023-02-14 09:33:04 +00:00
Pavel Marek	1f8511dab2	Add Comparator conversion for all types (#4067 ) Add `Comparator` type class emulation for all types. Migrate all the types in stdlib to this new `Comparator` API. The main documentation is in `Ordering.enso`. Fixes these pivotals: - https://www.pivotaltracker.com/story/show/183945328 - https://www.pivotaltracker.com/story/show/183958734 - https://www.pivotaltracker.com/story/show/184380208 # Important Notes - The new Comparator API forces users to specify both `equals` and `hash` methods on their custom comparators. - All the `compare_to` overrides were replaced by definition of a custom _ordered_ comparator. - All the call sites of `x.compare_to y` method were replaced with `Ordering.compare x y`. - `Ordering.compare` is essentially a shortcut for `Comparable.from x . compare x y`. - The default comparator for `Any` is `Default_Unordered_Comparator`, which just forwards to the builtin `EqualsNode` and `HashCodeNode` nodes. - For `x`, one can get its hash with `Comparable.from x . hash x`. - This makes `hash` as _hidden_ as possible. There are no other public methods to get a hash code of an object. - Comparing `x` and `y` can be done either by `Ordering.compare x y` or `Comparable.from x . compare x y` instead of `x.compare_to y`.	2023-02-10 09:22:11 +00:00
Radosław Waśko	4f90946d1e	Rework Invalid Aggregations (#5579 ) Closes #5108	2023-02-08 18:39:09 +00:00
James Dunkerley	c39410b4d1	Adjusting annotations to use Widget types properly. (#5564 ) Closes #5038 - Use the proper widget structure. - Provide new method `get_widget_json` with whole structure, but keep `get_full_annotations_json` in old form. - Start to get to some reusable functions. - Added widget to JS_Object field selections.	2023-02-07 07:33:51 +00:00
Radosław Waśko	3c72ab08c4	Review `Missing_Input_Column` and `Column_Index_Out_Of_Range` warnings (#4118 ) Implements https://www.pivotaltracker.com/story/show/184226383	2023-02-06 19:52:25 +00:00
James Dunkerley	0790ce494f	New set function, parse a column (#4097 ) - New `set` function design - takes a `Column` and works with that more easily and supports control of `Set_Mode`. - New simple `parse` API on `Column`. - Separated expression support for `filter` to new `filter_by_expression` on `Table`. - New `compute` function allowing creation of a column from an expression. - Added case sensitivity argument to `Column` based on `starts_with`, `ends_with` and `contains`. - Added case sensitivity argument to `Filter_Condition` for `Starts_With`, `Ends_With`, `Contains` and `Not_Contains`. - Fixed the issue in JS Table visualisation where JavaScript date was incorrectly set. - Some dynamic dropdown expressions - experimenting with ways to use them. - Fixed issue with `.pretty` that wasn't escaping `\`. - Changed default Postgres DB to `postgres`. - Fixed SQLite support for starts_with, ends_with and contains to be consistent (using GLOB not LIKE).	2023-01-31 20:48:16 +00:00
Radosław Waśko	c965ad3455	Review `Table.order_by` (#4104 )	2023-01-31 18:29:02 +00:00
Radosław Waśko	b9dbfd036f	First steps of the Problem Handling refactor to the new design (#4086 ) Implements: - https://www.pivotaltracker.com/story/show/184226137 - https://www.pivotaltracker.com/story/show/184226434 - https://www.pivotaltracker.com/story/show/184226462	2023-01-30 16:48:06 +00:00
James Dunkerley	359d4f56df	Initial Widget Dynamic Metadata (#4082 ) Adjusts the old mock to use new attributes. Expanded examples for dynamic widgets.	2023-01-25 14:16:10 +00:00
James Dunkerley	60f0e96047	Update starts_with, ends_with and contains to new API design (#4078 ) - Updated `Text.starts_with`, `Text.ends_with` and `Text.contains` to new simpler API. - Added a `Case_Sensitivity.Default` and adjusted `Table.distinct` to use it by default. - Fixed a bug with `Data.fetch` on an HTTP error. - Improved SQLite Case Sensitivity control in distinct to use collations.	2023-01-25 11:47:27 +00:00
Radosław Waśko	778d28fba3	Table with no columns is not valid, No_Output_Columns is always an error (#4073 ) Implements https://www.pivotaltracker.com/story/show/184226020	2023-01-25 02:40:23 +00:00
Radosław Waśko	d2e57edc8b	Add Table.cross_join and Table.zip to In-Memory Table (#4063 ) Implements https://www.pivotaltracker.com/story/show/184239059	2023-01-23 13:19:52 +00:00
Pavel Marek	fcc2163ae3	All Enso objects are hashable (#3878 ) * Hash codes prototype * Remove Any.hash_code * Improve caching of hashcode in atoms * [WIP] Add Hash_Map type * Implement Any.hash_code builtin for primitives and vectors * Add some values to ValuesGenerator * Fix example docs on Time_Zone.new * [WIP] QuickFix for HashCodeTest before PR #3956 is merged * Fix hash code contract in HashCodeTest * Add times and dates values to HashCodeTest * Fix docs * Remove hashCodeForMetaInterop specialization * Introduce snapshotting of HashMapBuilder * Add unit tests for EnsoHashMap * Remove duplicate test in Map_Spec.enso * Hash_Map.to_vector caches result * Hash_Map_Spec is a copy of Map_Spec * Implement some methods in Hash_Map * Add equalsHashMaps specialization to EqualsAnyNode * get and insert operations are able to work with polyglot values * Implement rest of Hash_Map API * Add test that inserts elements with keys with same hash code * EnsoHashMap.toDisplayString use builder storage directly * Add separate specialization for host objects in EqualsAnyNode * Fix specialization for host objects in EqualsAnyNode * Add polyglot hash map tests * EconomicMap keeps reference to EqualsNode and HashCodeNode. Rather than passing these nodes to `get` and `insert` methods. * HashMapTest run in polyglot context * Fix containsKey index handling in snapshots * Remove snapshots field from EnsoHashMapBuilder * Prepare polyglot hash map handling. - Hash_Map builtin methods are separate nodes * Some bug fixes * Remove ForeignMapWrapper. We would have to wrap foreign maps in assignments for this to be efficient. * Improve performance of Hash_Map.get_builtin Also, if_nothing parameter is suspended * Remove to_flat_vector. Interop API requires nested vector (our previous to_vector implementation). Seems that I have misunderstood the docs the first time I read it. 
- to_vector does not sort the vector by keys by default * Fix polyglot hash maps method dispatch * Add tests that effectively test hash code implementation. Via hash map that behaves like a hash set. * Remove Hashcode_Spec * Add some polyglot tests * Add Text.== tests for NFD normalization * Fix NFD normalization bug in Text.java * Improve performance of EqualsAnyNode.equalsTexts specialization * Properly compute hash code for Atom and cache it * Fix Text specialization in HashCodeAnyNode * Add Hash_Map_Spec as part of all tests * Remove HashMapTest.java Providing all the infrastructure for all the needed Truffle nodes is no longer manageable. * Remove rest of identityHashCode message implementations * Replace old Map with Hash_Map * Add some docs * Add TruffleBoundaries * Formatting * Fix some tests to accept unsorted vector from Map.to_vector * Delete Map.first and Map.last methods * Add specialization for big integer hash * Introduce proper HashCodeTest and EqualsTest. - Use jUnit theories. - Call nodes directly * Fix some specializations for primitives in HashCodeAnyNode * Fix host object specialization * Remove Any.hash_code * Fix import in Map.enso * Update changelog * Reformat * Add truffle boundary to BigInteger.hashCode * Fix performance of HashCodeTest - initialize DataPoints just once * Fix MetaIsATest * Fix ValuesGenerator.textual - Java's char is not Text * Fix indent in Map_Spec.enso * Add maps to datapoints in HashCodeTest * Add specialization for maps in HashCodeAnyNode * Add multiLevelAtoms to ValuesGenerator * Provide a workaround for non-linear key inserts * Fix specializations for double and BigInteger * Cosmetics * Add truffle boundaries * Add allowInlining=true to some truffle boundaries. Increases performance a lot. * Increase the size of vectors, and warmup time for Vector.Distinct benchmark * Various small performance fixes. 
* Fix Geo_Spec tests to accept unsorted Map.to_vector * Implement Map.remove * Fix Visualization tests to accept unsorted Map.to_vector * Treat java.util.Properties as Map * Add truffle boundaries * Invoke polyglot methods on java.util.Properties * Ignore python tests if python lang is missing	2023-01-19 10:33:25 +01:00


364 Commits