enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-23 19:21:54 +03:00

Author	SHA1	Message	Date
James Dunkerley	0e2a91cfe1	Remove `countMissing` from Storage and replace with a new `CountNothing` operation. (#9137 ) Removing another small piece of logic from the storages to it's own operation.	2024-02-22 19:32:46 +00:00
Radosław Waśko	d817df94a5	Move `Enso_Cloud` from `Data` to top-level in `Standard.Base` (#9140 ) - 4 weeks ago we discussed with @jdunkerley we may want to move this module. - `Data` is a submodule mostly for various data-structures etc., the cloud stuff did not fit that well in there.	2024-02-22 14:42:35 +00:00
Jaroslav Tulach	ad2f5b031e	Chained if_then_else application change (#8671 ) * Test describing the current behavior of chained if then else application * Chained block should behave just like Group around if_then_else * Finishing line on BlockStart fixes if_then_else_chained_block * Only finish the line when there was not start of a macro segment * Fix tests * Refine else-body with macro patterns. * Update test syntax to maintain original semantics * Few additional tests --------- Co-authored-by: Kaz Wesley <kaz@lambdaverse.org>	2024-02-22 09:17:25 -05:00
Pavel Marek	21e1284086	Enso tests can be run with filter from cmdline (#9065 ) Simplify the `Test.Suite.run_with_filter` to accept a single filter parameter that searches for all the groups and specs that matches that filter. This filter can be a simple text provided from the command line. # Important Notes - Pending groups are now printed at the end of the run - `Test.Suite.run_with_filter` is simplified to accept a single filter parameter that is either `Text` or `Nothing`. See the docs. - Passing a filter from the command line is therefore straightforward, it is treated as a regex. - For convenience, I have left all the `main` methods in all the test sources. I have just refactored them to accept the `filter` argument from the command line. - For example, to run only a single spec from `Vector_Spec.enso`, invoke `enso --run test/Base_Tests/src/Data/Vector_Spec.enso "should allow vector creation with a programmatic constructor"` - Majority of the PR is a regex replace of `^main =` for `main filter=Nothing =` and of `suite.run_with_filter` for `suite.run_with_filter filter`. - Fixed some internal engine bugs: - `AtomWithHole` allows to specify only one hole - https://github.com/enso-org/enso/pull/9065/files#diff-0f7bb7e85cf86a965de133aa7e6b5958ceb889bd1921c01e00d3a9ceb19626ef - NaN keys in hash maps are handled in polyglot maps as well - c5257f6c2b78f893214ff67300893b593ea05e21..db4b3c0e9828ee79208d52e02586b24bb845b0d6	2024-02-22 12:31:44 +00:00
James Dunkerley	e50ded8b24	Rename `new_name` to `as` on Aggregate_Column. (#9135 ) Small tweak renaming a property. ![image](https://github.com/enso-org/enso/assets/4699705/21f4ca29-2219-412b-a650-c900360f9dea)	2024-02-22 10:34:00 +00:00
Radosław Waśko	d845e709a1	Refactor S3 path handling (#9092 ) - Closes #9021	2024-02-22 09:15:34 +00:00
James Dunkerley	fa6fccb99e	Refactoring `Table.set` for GUI2 and other GUI fixes (#9128 ) - Added `regex` function to `Standard.Base` as a way to easily and cleanly make regular expressions. - Added `expr` and `Expression.Value` to distinguish expressions from text values. - Fixed issues with `Table.join` widget so dropdown exists. - Needed fully qualified name. - Added default empty text values for right column to provided text input boxes. - Deprecate `Table.filter_by_expression` and allow `Table.filter` to take an `Expression`. - Added `Simple_Expression` and deprecated `Column_Operation`. Changes the order so takes a column then a calculation. - Rename `column` to `value` and `new_name` to `as` on `Table.set`. - Rename `name_column` to `names` on `Table.cross_tab`. - Removed `Column_Ref.Expression` in favour of using `Expression.Value`.	2024-02-21 23:51:41 +00:00
Jaroslav Tulach	2a42388905	Including missing Polyglot_Spec in all the tests (#9126 )	2024-02-21 16:16:00 +01:00
AdRiley	4cbdc9dfb6	Remove table aliases (#9098 ) Now Table and DB_Table have different names we can remove a lot of aliases. This closes #8981	2024-02-20 01:03:57 +00:00
Radosław Waśko	167a91dab1	Updating Enso Cloud integration - cheaper `Enso_Secret.create`, introducing `Enso_File.creation_time` and `last_modified_time` (#9085 ) - Closes #9047	2024-02-19 18:53:20 +00:00
AdRiley	27ec236c19	rename database Table to DB_Table (#9067 ) Rename database.Table to database.DB_Table to avoid name collisions as part of https://github.com/enso-org/enso/issues/8981	2024-02-19 17:31:58 +00:00
James Dunkerley	ee66b9fb1d	Refactoring the Unary operations so uncoupled from Storage. (#9090 ) In order to allow clever masking, slicing, filtering and arrow backing stores... - Adding ColumnStorage interface with the base API a storage will need. - Refactored each of the unary operations to a new `UnaryOperation` interface which makes them responsible for deciding if they can be executed.	2024-02-19 17:11:52 +00:00
Jaroslav Tulach	a664dd9d56	Equality with conversions (#9070 )	2024-02-19 17:18:56 +01:00
Radosław Waśko	cf71a05c3b	Refactor common context handling pattern (#9087 )	2024-02-19 11:59:42 +00:00
Radosław Waśko	642d5a691e	Implement `copy_to` and `move_to` for `S3_File` (#9054 ) - Closes #8833 - Tests for copying between S3 and `Enso_File` will only be added once we implement Enso_File writing.	2024-02-16 10:42:28 +00:00
James Dunkerley	f2d2f73e89	Starting to refactor Storage and Operations (#9076 ) Cleaning up some of the structures in Storage before working on UnaryOperations. - Removed some legacy code: `countMask`, `Index` and `DefaultIndex`. - Renamed `mask` to `applyFilter` on `Column` and `Storage`. - Renamed `Table.mask` to `Table.filter`.	2024-02-15 18:21:07 +00:00
AdRiley	f4955815b2	Remove Column Aliases (#9056 )	2024-02-14 19:06:29 +00:00
James Dunkerley	08584b0423	Improving experience with `format` and `parse`. (#9045 ) - Add format dropdown to `Number.format`. ![image](https://github.com/enso-org/enso/assets/4699705/8aa74910-c6ad-4480-a7f2-04dacd9686e8) - Support case insensitive month names and abbreviations in dates. https://github.com/enso-org/enso/assets/4699705/4dbd8755-e1c2-4207-a8a1-65b427ca4fab - Improve locale dropdown for `parse_date` and `parse_date_time`. ![image](https://github.com/enso-org/enso/assets/4699705/5d605a2d-1248-46be-bc74-34a4afecf609) - Added dropdown to `Table.parse` and amended so now doesn't accept `Nothing` (using empty string instead). ![image](https://github.com/enso-org/enso/assets/4699705/340dd093-77db-4685-a34b-45ce09e3c3b3) - Added dropdown to `Table.format` and amended so now doesn't accept `Nothing` (using empty string instead). - Altered `Column.parse` to not accept `Nothing` and added drop down for format. - Altered `Column.format` to not accept `Nothing` and added drop down for format conditional on type. - Improved the locale date/time format drop to have the suggested formats too.	2024-02-14 16:30:37 +00:00
Radosław Waśko	d45f0fe4df	Check type of `self` in static dispatch (#8867 ) - Fixes #8805 - also btw. fixes #8706	2024-02-14 15:50:13 +00:00
AdRiley	5c7947ce60	Rename database column to DB_Column (#9046 ) Rename database column to DB_Column to avoid name collisions as part of https://github.com/enso-org/enso/issues/8981	2024-02-14 09:11:52 +00:00
GregoryTravis	3bdd1a0dce	Re-fetch Warnings in Warning.getWarnings() to correctly wrap them (#9011 )	2024-02-13 15:12:32 +00:00
James Dunkerley	8c197f325b	Update the `aggregate` API to take a separate `group_by` (#9027 ) Separate out the `Group_By` from the column definition in `aggregate`. ![image](https://github.com/enso-org/enso/assets/4699705/6b4f03bc-1c4a-4582-b38a-ba528ae94167) Supports the old API with a warning attached about deprecation: ![image](https://github.com/enso-org/enso/assets/4699705/0cc42ff7-6047-41a5-bb99-c717d06d0d93) Widgets have been updated with `Group_By` removed from the dropdown.	2024-02-13 10:23:59 +00:00
Radosław Waśko	eb59b475f6	Write support for S3 (#8921 ) - Closes #8809	2024-02-12 19:04:13 +00:00
James Dunkerley	f257952680	Changing Widgets to work better with GUI2 (#9017 ) - Rename `Column.replace` to `Column.text_replace` (consistency with Table and allow for adding `Column.replace`). - Changed Regex insert to be `.to_regex` rather than `Regex.compile`. https://github.com/enso-org/enso/assets/4699705/3694e876-5b46-49f9-8e8c-352fa0684777 - Adjusted `Table.rename_columns` to insert a `Pair.new` for each rename. ![image](https://github.com/enso-org/enso/assets/4699705/8240f302-498c-46d6-9b77-b1a0f5bb585e) - Added a `<Regular Expression>` option to the `rename_columns`. https://github.com/enso-org/enso/assets/4699705/d3c695ad-93f2-419d-b1c1-1f607ca825bd - Added constant values to `Table.fill_nothing` and made it value type dependent on `Column.fill_nothing`. https://github.com/enso-org/enso/assets/4699705/26e2e975-e972-46f0-ae1a-6b09dcd39fe6 ![image](https://github.com/enso-org/enso/assets/4699705/51393d09-c06b-4eb1-9471-1856430fbfdc) ![image](https://github.com/enso-org/enso/assets/4699705/064fa42b-6187-4816-b3f0-0b7f0f00f6a9) - Removed `use_regex` option from `Sort_Column.By_Name` (in favour of Regex type). - Improved widgets for `order_by`. Drop down now on the column name and allow using the Index method with specified numeric range. ![image](https://github.com/enso-org/enso/assets/4699705/549b9c4c-5753-4518-801c-ecd06db5c2e7)	2024-02-12 15:12:19 +00:00
Michał Wawrzyniec Urbańczyk	357f2aadd4	CI: Tweaks for backend tests (#9001 ) * Use glob pattern to discover stdlib tests (rather than a hardcoded list). * Don't fail CI check immediately after failing Scala test. * Remove meta test suite tests. # Important Notes The meta test suite tests are removed following the discussion with @radeusgd. In short, these were failing anyway and were supposed to be rewritten (probably using a different technology, like JUnit). The current code will be a useful reference but it doesn't have to be kept on a repository head. The relevant information and references shall be added to the task.	2024-02-12 13:05:41 +00:00
AdRiley	48a61d7b78	Change empty_error type to array. Add tests (#9019 ) * Change error type tp array. Add tests * Fix tests * Put this back as it was	2024-02-12 09:15:10 +00:00
AdRiley	9339672e0e	Remove _new and actually run the new tests (#9006 ) Merge conflict on develop meant this one got left with a new_test.	2024-02-09 14:19:02 +00:00
AdRiley	1dc8c1cb24	Merge empty_error (#9000 ) We don't need 3 copies of empty_error. This PR makes them one.	2024-02-09 14:18:14 +00:00
Pavel Marek	26aa27c9ad	Remove obsolete Test_New import (#9013 ) A quickfix after merge conflict introduced by #8991.	2024-02-09 12:27:26 +01:00
Pavel Marek	f3f0697d56	Merge Test_New into Test (#8991 ) Merges the temporary `Test_New` library into `Test`. This is the last PR in the series of PRs that refactor all the stdlib tests to the builder API.	2024-02-08 11:25:13 +00:00
AdRiley	e3f6ff1772	Add to_xml component (#8979 ) Adds new to_xml component	2024-02-07 20:54:48 +00:00
GregoryTravis	62cfa8a01e	Disable tests that fail because of locale-specific Postgres Unicode collation differences (#8869 ) Disables two Order_By tests that fail when the Postgres Unicode collation locale is not en_GB.UTF8. Further research would be needed to figure out exactly how to handle locale-specific collation.	2024-02-07 16:47:55 +00:00
James Dunkerley	0c39f8ec04	Allow Filter_Condition to be inverted. (#8861 ) - Various linting fixes (doc comments and type annotations etc.). - Add an action to determine if a `Filter_Condition` is keep or remove. https://github.com/enso-org/enso/assets/4699705/69ba2bd3-8893-4237-acc4-eb01f534a209 - Remove `Not_In`, `Not_Contains` and `Not_Like` from `Filter_Condition`. - Ability to use an `Expression` as a `Column_Ref`. https://github.com/enso-org/enso/assets/4699705/16a2e030-f8f9-4f59-beca-2646f56fcb90	2024-02-07 14:36:14 +00:00
Pavel Marek	83fffd9c05	Refactor stdlib tests to the builder API (#8968 ) Follow-up of #8890 Refactor the rest of the tests to the builder API (`Test_New`): - `Image_Tests` - `Geo_Tests` - `Google_Api_Test` - `Examples_Test` - `AWS_Tests` - `Meta_Test_Suite_Tests` - `Visualization_Tests` # Important Notes - Unrelated: Fix NPE in `File.new "/" . name`	2024-02-07 13:22:17 +00:00
GregoryTravis	6554972b7d	Implement Table.replace for the in-memory backend (#8935 )	2024-02-06 20:57:50 +00:00
Pavel Marek	7bba021dfd	Fix teardown in SQLite File spec (#8918 ) Fixes teardown of SQLite spec. There used to be only `connection.close`, but we also have to call `connection.drop_table` for every created table. This causes problems only in `[SQLite File]` tests. These are backed by a sqlite file and some tables are persistent in that table. It is possible, that before tests are run, this file is non-empty and contains garbage from previous runs. ### Important Notes Should fix https://github.com/enso-org/enso/actions/runs/7724599547/job/21057063900#step:10:7243 that was triggered when `Table_Tests` were run on a non-clean runner. --------- Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2024-02-05 12:25:55 +01:00
Jaroslav Tulach	2028d6b2c1	Better `runEngineDistribution --run test/Base_Tests "filter"` (#8940 )	2024-02-04 11:39:36 +01:00
James Dunkerley	eeaddbc434	Add parser for line by line processing (#8719 ) - ✅Linting fixes and groups. - ✅Add `File.from that:Text` and use `File` conversions instead of taking both `File` and `Text` and calling `File.new`. - ✅Align Unix Epoc with the UTC timezone and add converting from long value to `Date_Time` using it. - ❌Add simple first logging API allowing writing to log messages from Enso. - ✅Fix minor style issue where a test type had a empty constructor. - ❌Added a `long` based array builder. - Added `File_By_Line` to read a file line by line. - Added "fast" JSON parser based off Jackson. - ✅Altered range `to_vector` to be a proxy Vector. - ✅Added `at` and `get` to `Database.Column`. - ✅Added `get` to `Table.Column`. - ✅Added ability to expand `Vector`, `Array` `Range`, `Date_Range` to columns. - ✅Altered so `expand_to_column` default column name will be the same as the input column (i.e. no `Value` suffix). - ✅Added ability to expand `Map`, `JS_Object` and `Jackson_Object` to rows with two columns coming out (and extra key column). - ✅ Fixed bug where couldn't use integer index to expand to rows.	2024-02-01 07:29:50 +00:00
Pavel Marek	bb8ff8f89e	Refactor Base_Tests/src/Data to Test_New (#8890 ) Refactor `Base_Tests` to `Test_New` testing framework. Mostly automatic text replacements. # Important Notes List of changes that were not done automatically (not via automatic text replacement): - Fix indexes in Instrumentor_Spec - `f590c4a398` - If group or spec is pending, its block is not evaluated - `8d797f1a4a` - Spec_Result is not private - `5767535af2` Tests marked as pending: - #8913 - #8910	2024-01-31 14:25:16 +00:00
GregoryTravis	7436848e90	Implement relational NULL/Nothing for join for in-memory tables (#8849 ) Implements relational NULL for join, for all `Join_Kind`s.	2024-01-29 16:19:07 +00:00
GregoryTravis	644c9af579	Attach a warning when Nothing is used in a Filter_Condition (#8865 ) Attach a warning when Nothing is used as a value in a comparison or `is_in` Filter_Condition.	2024-01-27 08:45:45 +00:00
Jaroslav Tulach	9a37357247	Binary operator resolution based on `that` value (#8779 )	2024-01-27 08:38:47 +01:00
Radosław Waśko	ca4f98c78e	Adding tests and missing methods for `Enso_File`. (#8815 ) - Closes #8808 - adds tests for various scenarios. - Implements `size` using HEAD. - Updates existing functions to changes in Cloud API. - Adds stubs for `*_time` methods, `parent`, `path`. - [x] TODO: resolve the `Enso_File.current_working_directory` from an environment variable. - ~~TODO: recursive directory deletion?~~ left for later # Important Notes - Currently, the Cloud API does not offer an easy way to extract metadata for a file, in particular to get the parent folder from the file `id`. - We should be able to get the parent, and stuff like creation/modified time. - We need a way to resolve paths to asset ids, for `path` to work as well as `current_working_directory`. - What is the environment variable that will be used to feed the `current_working_directory` property?	2024-01-26 19:04:42 +00:00
Pavel Marek	d0fdeca6df	Refactor Table_Tests to the builder API (#8622 ) Refactor `test/Table_Test` to the builder API. The builder API is in a new library called `Test_New` that is alongside the old `Test` library. There will be follow-up PRs that will migrate the rest of the tests. Meanwhile, let's keep these two libraries, and merge them after the last PR. # Important Notes - For a brief introduction into the new API, see Prototype 1 section in https://github.com/enso-org/enso/pull/8622#issuecomment-1889706168 - When executing all the tests, the behavior should be the same as with the old library. With the only exception that if `ENSO_TEST_ANSI_COLORS` env var is set, the output is more colorful than it used to be.	2024-01-26 12:08:24 +00:00
James Dunkerley	0b6db5797c	Refactor OrderMask to avoid memory copying (#8863 ) Goal of this PR is to refactor the design of OrderMask and avoid copying arrays or lists wherever possible. We have removed a few legacy functions which were not being used. On a poor mans benchmark seems to be quicker (13s vs 16s) and memory usage should be lower.	2024-01-26 11:16:16 +00:00
AdRiley	1c6898b19d	Add Join Large Small benchmarks (#8845 ) Adds new benchmark joining a large table to a small table in preparation for a coming optimisation that will only index the smaller of the 2 tables in #8342	2024-01-24 19:13:09 +00:00
GregoryTravis	5eb3f3bd1d	Implement relational NULL semantics for Nothing for in-memory Column operations (#8816 ) Updates in-memory table column operations to treat Nothing as a relational NULL. This PR does not include changes to Table.join.	2024-01-24 17:02:45 +00:00
AdRiley	23d6fcdd9c	Make max of Random methods inclusive rather than exclusive (#8768 ) * Make max inclusive rather than exclusive * Update names to min max. Remove end_inclusive * Indicies should remain exclusive * Make the tests pass	2024-01-24 16:58:33 +00:00
Radosław Waśko	edfcfde11c	Tests and improvements for secrets in cloud subdirectories (#8791 ) - Closes #8723 - Adds some missing features that were needed to make this work: - `Enso_File.create_directory` and `Enso_File.delete`, and basic tests for it - Changes how `Enso_Secret.list` is obtained - using a different Cloud endpoint allows us to implement the desired logic, the default endpoint was giving us _all_ secrets which was not what we wanted here. - Implements `Enso_Secret.update` and tests for it # Important Notes Notes describing any problems with the current Cloud API: https://docs.google.com/document/d/1x8RUt3KkwyhlxGux7XUGfOdtFSAZV3fI9lSSqQ3XsXk/edit Apparently, everything that was needed to make this feature work has already been implemented, although a few features needed workarounds on Enso side to work properly.	2024-01-24 10:17:22 +00:00
GregoryTravis	659243316a	Mark a JDBC test as pending if cloud not set up (#8828 ) Marking the test pending=cloud_setup.pending.	2024-01-22 20:05:22 +00:00
Radosław Waśko	368e4867b4	Allow secrets in `AWS_Credential` (#8774 ) - Closes #8722	2024-01-19 19:00:56 +00:00
Radosław Waśko	14be36c401	Allow secrets in `Header.authorization_*` (#8761 ) - Closes #8739	2024-01-18 12:49:47 +00:00
James Dunkerley	d55c9c99b4	First few changes from the Churn workflow (#8782 ) - `up_to` should have the `step` as an optional argument. - `Date_Range` conversion can do a clever auto rename, so if a period use the name of it. - Add `to Table` for `Range`, `Date_Range`.	2024-01-17 21:57:24 +00:00
Radosław Waśko	71427c3319	Rename `Tests` to `Base_Tests` to be more descriptive (#8781 )	2024-01-17 16:19:19 +01:00
Radosław Waśko	583345d8f2	Allow `Test` asserts to be run outside of the test suite, still raising sensible Panics (#8778 ) - Fixes #5962 by defaulting to no Clue if run outside of tests, ensuring that the assertions still throw a sensible panic.	2024-01-16 17:17:20 +00:00
GregoryTravis	f2cb1f097e	Support on_problems=Problem_Behavior.Report_Warning and Map_Error wrapping in Vector.map (#8595 ) Implements `Warnings.get_all wrap_errors=True` which wraps warnings attached to values inside vectors with `Map_Error`, which includes the position of the value within the vector. See [the documentation](https://github.com/enso-org/enso/blob/develop/docs/semantics/wrapped-errors.md) for more details. `get_all wrap_errors=True` does not change the warnings that are attached to values -- it wraps them before returning them to the caller, but does not change the original warnings attached to the values. Wrapped warnings only appear attached to the vector itself. The values inside the vector do not have their warnings wrapped. Warning propagation is not changed at all; `Warnings.get_all` (with default `wrap_errors=False`) behaves as before. `get_all wrap_errors=True` is meant to be used primarily by the IDE, although it can be used anywhere this wrapping is desired.	2024-01-16 09:36:22 +00:00
AdRiley	b8e93b3cba	Add new text_left and text_right functions (#8691 ) Added text_left and text_right functions for in-memory and databases	2024-01-15 23:43:23 +00:00
Radosław Waśko	5b70ff25f7	Remove `set_user_info` from URI (#8738 ) I have added this in #8591, but I have realised it may not be a good idea to have it, so I am removing that particular change.	2024-01-15 17:35:17 +00:00
Radosław Waśko	f34abeda0c	Add tests for `Enso_Secret`s, update to new cloud API (#8736 ) - Closes #8556	2024-01-15 16:12:08 +00:00
Pavel Marek	6ae35abc46	Fix `Runtime.assert` (#8742 )	2024-01-12 18:47:40 +01:00
AdRiley	1b3c9638ea	Make fill nothing return types tighter (#8734 ) This is the follow up PR addressing the last couple of points from https://github.com/enso-org/enso/pull/8643 around what the return type from fill_nothing. # Important Notes The biggest change is changing what we size we need for an empty string. This change says a variable length string of length 1 and does it at a low enough level that it will effect the whole language. But I think that is correct.	2024-01-12 11:20:36 +00:00
Jaroslav Tulach	0e6952710a	Executing (parts of) Truffle TCK with Enso values (#8685 )	2024-01-12 07:21:16 +01:00
AdRiley	f31ecc7c87	Make fill_nothing take an empty string (#8643 ) * Add new test for required behaviour * Handle case where strArg is an empty string * More tests around fixed width field. Remove unneeded duplicate logic * javafmtAll * Further simplification * SQLite doesn't have full type system * SQLite doesn't have full type system	2024-01-10 11:59:10 +00:00
GregoryTravis	e6ae366917	Ignore User-Agent in HTTP tests (#8630 ) `User-Agent` depends on the precise JDK version, so we ignore it when checking HTTP responses. Clsoes #8629.	2024-01-04 15:53:17 +00:00
AdRiley	bf8dd1888c	Give file read its own helper widget for delimiters. (#8627 ) Give file read its own helper widget for delimiters. Remove newline add none. The file read delimiter is similar but different to the split one and so should have its own set of options.	2024-01-04 11:59:42 +00:00
Radosław Waśko	a1207e029d	Unify `File_Format_Metadata` with `File_For_Read` (#8628 ) - Closes #8555 - Refactors the file format detection logic, compacting lots of repetitive logic for HTTP handling into helper functions. - Some updates to CODEOWNERS.	2024-01-04 03:57:05 +00:00
AdRiley	689c8f7c3c	Make split to rows of Nothing value equal Nothing. (#8640 ) Split to rows of Nothing value should equal Nothing. Add some additional test cases. And updated existing to help readability	2024-01-03 12:09:35 +00:00
James Dunkerley	ffa06c9476	Sort handling of Nothing within Column \|\| and && (#8656 ) Follows the database logic: ![image](https://github.com/enso-org/enso/assets/4699705/328a0e36-5508-4c63-a60b-ac9a280cd93a) Results: ![image](https://github.com/enso-org/enso/assets/4699705/77d6bf82-21f8-4aed-b4c5-45e429798189)	2024-01-03 10:40:40 +00:00
AdRiley	ec51127635	Change null to Nothing (#8637 ) Change the generated column name for is_nothing to "[a] is Nothing" from "[a] is null" as Nothing is our customer facing term.	2023-12-28 18:02:23 +00:00
Jaroslav Tulach	07d58f2c02	DataflowError.withoutTrace shall not store a trace (#8608 )	2023-12-24 11:07:32 +01:00
Radosław Waśko	b3de42eb23	Handle `Nothing` values in `Filter_Condition.to_predicate` (#8600 ) - Fixes #8549 - Ensures that a `Type_Error` is thrown instead of a `No_Such_Method` error on type mismatches. - I think this is more readable.	2023-12-21 19:17:55 +00:00
Radosław Waśko	d41d48e8a0	Merge `URI_With_Query` into `URI`, extend API of `URI` (#8591 ) - Closes #8544 - Adds `reset_query_arguments` and `/` operators allowing to transform a URI. - Adding tests for handling of various edge cases.	2023-12-21 18:39:26 +00:00
AdRiley	cfe0cbe0c1	Add text_length to column for in-memory and database (#8606 ) Closes #8521 Adds text_length to Column	2023-12-21 11:31:13 +00:00
Radosław Waśko	dfdb547616	Better context info in `Type_Error` raised from return type checks (#8566 ) - Followup to #8502 that adds better error messages	2023-12-20 18:22:47 +00:00
Radosław Waśko	d56b800c11	Remove the Apache dependency from `std-base` (#8571 ) - After [suggestion](https://github.com/enso-org/enso/pull/8497#discussion_r1429543815) from @JaroslavTulach I have tried reimplementing the URL encoding using just `URLEncode` builtin util. I will see if this does not complicate other followup improvements, but most likely all should work so we should be able to get rid of the unnecessary bloat.	2023-12-20 18:01:08 +00:00
Cassandra-Clark	232077f25e	Renamed lookup_and_replace to merge and renamed Table.replace to text… (#8564 )	2023-12-20 16:28:45 +00:00
Radosław Waśko	724f8d2a56	Add tests for Enso Cloud auth + simple API mock for `Enso_User` (#8511 ) - Closes #8354 - Extends `simple-httpbin` with a simple mock of the Cloud API (currently it checks the token and serves the `/users` endpoint). - Renames `simple-httpbin` to `http-test-helper`.	2023-12-19 17:41:09 +00:00
Radosław Waśko	f0c2a5fa7f	Opt-in return type checks (#8502 ) - Closes #8240	2023-12-19 15:32:30 +00:00
Radosław Waśko	d4714af826	Add a few new `Filter_Condition`s (#8539 ) - Closes #8045	2023-12-16 15:12:23 +00:00
Radosław Waśko	940b8f7d51	Improving tests and edge cases for URI and HTTP (#8497 ) - Closes #8352 - ~~Proposed fix for #8493~~ - The temporary fix is deemed not viable. I will try to figure out a workaround and leave fixing #8493 to the engine team.	2023-12-15 17:58:45 +00:00
Pavel Marek	4b65e44ef3	EpbLanguage re-uses other TruffleContext support to run tests with assertions enabled (#7882 )	2023-12-15 13:31:32 +01:00
Radosław Waśko	b5c995a7bf	Reworking Excel support to allow for reading of big files (#8403 ) - Closes #8111 by making sure that all Excel workbooks are read using a backing file (which should be more memory efficient). - If the workbook is being opened from an input stream, that stream is materialized to a `Temporary_File`. - Adds tests fetching Table formats from HTTP. - Extends `simple-httpbin` with ability to serve files for our tests. - Ensures that the `Infer` option on `Excel` format also works with streams, if content-type metadata is available (e.g. from HTTP headers). - Implements a `Temporary_File` facility that can be used to create a temporary file that is deleted once all references to the `Temporary_File` instance are GCed.	2023-12-15 00:02:15 +00:00
Radosław Waśko	7a05e679c3	Improve details attached to `No_Output_Columns` reported from various operations (#8528 ) - Closes #7635	2023-12-14 10:49:07 +00:00
Hubert Plociniczak	a978d70a9e	Minor improvement to Startup.enso error (#8536 ) Unable to parse the current error message due to newlines. # Important Notes For example https://github.com/enso-org/enso/actions/runs/7174245013/job/19535275294#step:10:16133	2023-12-14 09:38:58 +00:00
GregoryTravis	1c815a3d45	Better Error Trapping in map (#8307 ) * tests * wip * wip * additional warnings * wip * wip * cleanup * nested wrapping * multiple nestings * wraps_error uses looks_for, test for should_fail_with * wip * stack trace line fix * use catch_primitive internally * fix warning mapping, dtf spec * just one wrapper checker, vector spec * missing ctor, back to non-primitive catch * back to c_p * put old map back * wip * unnest tests * Array.map on_problems * wip * Revert "wip" This reverts commit `c30d171457`. * better test names * warning logging * wip * wip * move logic into ALH * doc * constant * My_Error.Error * nested * doc * map_primtiive in warning mapper * composition * ref spec * Remove warnings prior to matching on the value If an expression has warnings and is matched we: 1) extract the warnings 2) execute the branch of a pattern that matches the value 3) attach extracted warnings to the result This caused warnings to reappear when doing the custom warnings manipulation. This is also consistent with how `CaseNode`'s `doWarning` specialization is defined. * fix 1 * do not auto unwrap in test error checkers * nested error matcher * in problems too * dtf * v * statistics * wip * Table_Spec, map_with_index_primitive * Column_Operations_Spec * disable warning wrapping and Report_Warning * unimpl test * Warnings_Spec * DCS * ACG JP * zip_primitive * join_helpers * Lookup_Helpers * Table * Data_Formatter * Value_Type_Helpers * revert check types changes * table_helpers * table tests * remove st * do not remove warnings from value * vec docs, tests for zip, mwi, flat_map * docs, fixes * remove nested_error_matcher * cleanup * benchmark * one error * alter * add bench to main * review * review * review * tail call * changelog * tail call was not a tail call * ws * bad import * Added missing import * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Array.enso Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * review, ref example * lazy benchmark data * extra paren * check outside of catch * review * vector too * actually lazy * disambiguate Map_Error * finish rename * move to extensions * combine Additional_Warnings error * rename to map_no_wrap * do not catch and rethrow * review * wip * remove _primitives entirely * remove unused should_fail_with function options * remove expected_warning as function in Problems --------- Co-authored-by: Hubert Plociniczak <hubert.plociniczak@gmail.com> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2023-12-13 09:38:09 -05:00
somebody1234	3419c77bc6	Fix typo in `Match.enso` (#8515 ) Fix issue causing visualizations to break: ![image](https://github.com/enso-org/enso/assets/4046547/d63239e2-3fde-424a-ba6c-3e163c6ce983) # Important Notes None	2023-12-12 17:59:44 +00:00
Jaroslav Tulach	80f94a21e1	Compare long and double and BigInteger properly (#8510 )	2023-12-12 06:41:43 +01:00
Jaroslav Tulach	e4b2b56a40	Turning Sieve benchmarks into Enso benchmarks (#8475 )	2023-12-08 10:27:52 +01:00
Jaroslav Tulach	7f0cb88fa1	Consistent simple and qualified type name (#8448 ) Fixes #8255 by unifying `get_qualified_type_name` and `get_simple_type_name` implementations.	2023-12-06 04:30:24 +00:00
Pavel Marek	a67297aebf	Add graalpy packages to the component directory (#8351 ) Adds these JAR modules to the `component` directory inside Engine distribution: - `graal-language-23.1.0` - `org.bouncycastle.` - these need to be added for graalpy language # Important Notes - Remove `org.bouncycastle.` packages from `runtime.jar` fat jar. - Make sure that the `./run` script preinstalls GraalPy standalone distribution before starting engine tests - Note that using `python -m venv` is only possible from standalone distribution, we cannot distribute `graalpython-launcher`. - Make sure that installation of `numpy` and its polyglot execution example works. - Convert `Text` to `TruffleString` before passing to GraalPy - `8ee9a2816f`	2023-12-04 11:50:59 +00:00
Jaroslav Tulach	81f06456bf	400x faster with linear hashing of the hash map entries (#8425 ) Fixes #5233 by removing `EconomicMap` & co. and using plain old good _linear hashing_. Fixes #8090 by introducing `StorageEntry.removed()` rather than copying the builder on each removal.	2023-12-01 06:43:13 +00:00
James Dunkerley	dd28517d9e	Separating Table tests from the Tests project. (#8397 ) Moving a couple of tests to Table_Tests so Tests no longer depends on Tests.	2023-11-27 16:46:35 +00:00
Jaroslav Tulach	c6eb61a055	Filter for test/Benchmarks (#8391 ) With herein proposed change one can pass an optional filter to `enso --run test/Benchmarks` to execute only groups and specs that contain given string in its name.	2023-11-27 15:27:12 +00:00
Jaroslav Tulach	893965ed5c	3% speedup with LazyMap and MetadataStorage (#8359 )	2023-11-27 10:28:12 +01:00
Radosław Waśko	c6b6384fe6	Improve performance of anti-join (#8338 ) - Closes #8217	2023-11-24 02:44:57 +00:00
Jaroslav Tulach	4464a15035	Benchmark to measure startup time (#8378 ) While trying to speed `MetadataStorage` up - #8324 - I felt the need to have an independent (on my computer) measurement of startup time. Here is a benchmark that measures how long execution of two simple hello world programs take. # Important Notes There are two benchmarks: - `empty_startup` measures the time needed to boot without using any `Standard` library - basically _an overhead of the JVM and engine_ - `hello_world_startup` measures the use of `IO.println` - it shall take longer than `empty_startup` and show the overhead we have while processing the standard library	2023-11-23 21:31:01 +00:00
James Dunkerley	347b5a7cf5	Linting and Groups update (#8357 ) - Fix issues from the linter. - Rename the constructors for `Blank_Selector`. - Update various GROUP tags.	2023-11-21 18:12:27 +00:00
James Dunkerley	ecaca12df1	Integrating Enso Cloud with the libraries (part 1...) (#8006 ) - Add a `File_For_Read` type. Used for `File_Format` to read files. - Added `Enso_User` representing the current user in `Enso_Cloud`. - Will be later able to list known users. - Added `Enso_Secret` representing a value defined in `Enso_Cloud`. - Value not used within Enso only accessed within polyglot Java. - Integrated into `Username_And_Password` and can be used within JDBC connections. - Integrated into HTTP Headers so a secret can be used as a value. - New `URI_With_Query` with the same API as `URI`. Supporting secrets in the value. - Will be integrated with AWS credentials. - Added `Enso_File` representing a file or a folder in the cloud. - Support the same API as `File` (like the `S3_File`). - Will support `enso://` URI style access.	2023-11-20 23:21:14 +00:00
Jaroslav Tulach	1138dfe147	Specify expression to get more advanced results on_return callback (#8331 )	2023-11-20 18:47:11 +01:00
Pavel Marek	5a7ad6bfe4	Upgrade enso to GraalVM for jdk 21 (#7991 ) Upgrade to GraalVM JDK 21. ``` > java -version openjdk version "21" 2023-09-19 OpenJDK Runtime Environment GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15) OpenJDK 64-Bit Server VM GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15, mixed mode, sharing) ``` With SDKMan, download with `sdk install java 21-graalce`. # Important Notes - After this PR, one can theoretically run enso with any JRE with version at least 21. - Removed `sbt bootstrap` hack and all the other build time related hacks related to the handling of GraalVM distribution. - `project-manager` remains backward compatible - it can open older engines with runtimes. New engines now do no longer require a separate runtime to be downloaded. - sbt does not support compilation of `module-info.java` files in mixed projects - https://github.com/sbt/sbt/issues/3368 - Which means that we can have `module-info.java` files only for Java-only projects. - Anyway, we need just a single `module-info.class` in the resulting `runtime.jar` fat jar. - `runtime.jar` is assembled in `runtime-with-instruments` with a custom merge strategy (`sbt-assembly` plugin). Caching is disabled for custom merge strategies, which means that re-assembly of `runtime.jar` will be more frequent. - Engine distribution contains multiple JAR archives (modules) in `component` directory, along with `runner/runner.jar` that is hidden inside a nested directory. - The new entry point to the engine runner is [EngineRunnerBootLoader](https://github.com/enso-org/enso/pull/7991/files#diff-9ab172d0566c18456472aeb95c4345f47e2db3965e77e29c11694d3a9333a2aa) that contains a custom ClassLoader - to make sure that everything that does not have to be loaded from a module is loaded from `runner.jar`, which is not a module. - The new command line for launching the engine runner is in [distribution/bin/enso](https://github.com/enso-org/enso/pull/7991/files#diff-0b66983403b2c329febc7381cd23d45871d4d555ce98dd040d4d1e879c8f3725) - [Newest version of Frgaal](https://repo1.maven.org/maven2/org/frgaal/compiler/20.0.1/) (20.0.1) does not recognize `--source 21` option, only `--source 20`.	2023-11-17 18:02:36 +00:00
GregoryTravis	ea3d778456	Allow the creation of a constant column on an in-memory table with no rows. (#8218 )	2023-11-09 14:40:51 +00:00
GregoryTravis	6be94a854b	Implement truncate Date_Time for database backend (#8235 ) Also adds some checks for column names generated for floor, ceil, truncate, round.	2023-11-08 23:23:59 +00:00
Radosław Waśko	1b8b30a68d	Improve performance of `Join_Condition.Between` by sorting on one dimension (#8212 ) - Closes #5303 - Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`) - Adds benchmarks for join - Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks. - We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the amount of `Between` conditions used in the query and `k` is the result set size). - Follow up ticket for consideration later: #8216 - Closes #8215 - After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task. - Closes #8204 - Unrelated, but I ran into this here: adds type checks to other arguments of `set`. - Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments), lead to a hard to understand `Method `if_then_else` of type Column could not be found.`, instead now it would file with type error 'expected Text got Column`.	2023-11-08 12:59:55 +00:00
Radosław Waśko	2ce1567384	Limit `max_rows` that are downloaded in `Table.read` by default, and warn if more rows are available (#8159 ) - Sets the default limit for `Table.read` in Database to be max 1000 rows. - The limit for in-memory compatible API still defaults to `Nothing`. - Adds a warning if there are more rows than limit. - Enables a few unrelated asserts.	2023-11-06 16:41:47 +00:00
Radosław Waśko	237aae33c7	Simplify internal logic of `Table.order_by`, avoid unnecessary warning (#8221 ) - Fixes #8213	2023-11-06 11:00:01 +00:00
GregoryTravis	3c371adbef	Implement Table.format similar to Table.parse allowing to format columns in bulk (#8150 ) * doc * one test * date tests * empty and nothing * ints floats * bools * all columns * regex and index * locales * bad formats * all with one format * docs * examples, not impl db * docs, more errors * cleanup * changelog * check list * reorder * clue * review * review * review * review * review * review * specify time zone	2023-11-02 09:36:36 -04:00
Cassandra-Clark	b5d6628c57	Change filter_blank_rows when_any parameter to have a more user-friendly type (#7935 ) Added Blank_Selector constructor and applied to remove_blank_columns, select_blank_columns, filter_blank_rows for #7931 . Changed when_any to when for readability.	2023-11-01 16:51:15 +00:00
GregoryTravis	d467683ed1	Constant columns (in expressions and Column_Operations) should have clearer names (#8188 ) Previously, constant columns were given generated names with UUIDs in them, which are long and provide no information. Instead, we now use the constant value itself to form the name. Since these new generated names are less unique, we must explicitly make them unique, in cases where the caller did not explicilty set a name.	2023-11-01 14:41:03 +00:00
GregoryTravis	1480f50207	Overhaul the random number and item generation code (#8127 ) Rewrite most of Random.enso.	2023-10-31 15:25:37 +00:00
Radosław Waśko	79011bd550	Implement `Table.lookup_and_replace` in Database (#8146 ) - Closes #7981 - Adds a `RUNTIME_ERROR` operation into the DB dialect, that may be used to 'crash' a query if a condition is met - used to validate if `lookup_and_replace` invariants are still satisfied when the query is materialized. - Removes old `Table_Helpers.is_table` and `same_backend` checks, in favour of the new way of checking this that relies on `Table.from` conversions, and is much simpler to use and also more robust.	2023-10-31 15:19:55 +00:00
Pavel Marek	b084e097c9	Add special handling for Dataflow_Error passed to Runtime.assert (#8168 ) Add special handling for `Dataflow_Error` passed as action to `Runtime.assert`. It caused an infinite recursion.	2023-10-27 17:22:15 +00:00
Radosław Waśko	c1259cb4d2	Compare performance of Panic / Java Exception / Dataflow error (#8130 ) After a discussion, I was really curious that our panics are supposed to be almost free - and while trusting that statement, it was really hard to believe - so I wanted to see for myself - knowing that an experiment is the most robust source of this kind of information - testing that in practice. So I wrote a benchmark comparing various ways of reporting errors, also testing them both at 'shallow' and 'deep' stack traces (adding 200 additional frames) - to see how stack depth affects them, if at all. The panics are indeed blazing fast! Kudos to the engine team. However, it seems that our dataflow errors are relatively slow (and we tend to use them _more_ than panics and want to be using them more and more). This uncovers a possible optimization opportunity. Can we make them as fast as panics?? Analysis of the benchmark results in comment below.	2023-10-24 12:03:44 +00:00
Jaroslav Tulach	e283c78977	Properly convert defaulted arguments before on_call back (#8143 )	2023-10-24 13:39:54 +02:00
Radosław Waśko	0c278391fe	Test and improve handling of `Date_Time with_timezone=False` in Postgres (#8114 ) - Fixes #8049 - Adds tests for handling of Date_Time upload/download in Postgres. - Adds tests for edge cases of handling of Decimal and Binary types in Postgres.	2023-10-21 21:35:13 +00:00
Radosław Waśko	8172896065	Support `Previous_Value` in `fill_nothing` and `fill_missing` (#8105 ) - Adds `Previous_Value` to `fill_nothing` and `fill_empty`, as requested by #7192.	2023-10-20 13:18:53 +00:00
James Dunkerley	ad95dc9815	Fixes for Table viz (#8102 ) - Fixes issue with `to_display_text` on `Date_Time`. Now a reversible format. - Adjusted the auto width code on table so it works. - Manually sized columns should remain fixed size. - Enable Excel range style selection in table. - Removed page support as not implemented yet. # Important Notes Will need to apply to Vue based viz at some point as well.	2023-10-19 08:22:23 +00:00
GregoryTravis	7383db0e04	Restructuring XML into Table form (#8083 ) # Important Notes Adds `.to Table` support, as well as XML support for `expand_column`.	2023-10-19 07:02:48 +00:00
Radosław Waśko	28fc183f92	Review places where we can use `Column_Ref` (#8101 ) Closes #8046	2023-10-18 19:03:50 +00:00
Radosław Waśko	93a31fcc8b	Add benchmarks related to `add_row_number` performance investigation (#8091 ) - Follow-up of #8055 - Adds a benchmark comparing performance of Enso Map and Java HashMap in two scenarios - _only incremental_ updates (like `Vector.distinct`) and _replacing_ updates (like keeping a counter for each key). These benchmarks can be used as a metric for #8090	2023-10-18 17:21:59 +00:00
Radosław Waśko	e9fa12763e	Improve performance of `add_row_number` (#8076 ) Fixes #8055	2023-10-17 00:42:35 +00:00
Radosław Waśko	08b717eb54	Refactor Table problem handling to a more robust and hopefully cleaner approach (#7879 ) Closes #7514	2023-10-16 15:09:08 +00:00
GregoryTravis	f18d1323e1	Add Table.expand_to_rows to allow flattening vector and array values in table (#8042 ) # Important Notes Also includes a fix for a reallocation bug in `InferredBuilder`.	2023-10-13 20:54:06 +00:00
James Dunkerley	b7d7910a88	Use US locale for Date/Time parsing by default. (#8053 ) Fixes issue with parsing long format month names.	2023-10-13 17:47:14 +00:00
James Dunkerley	fac9e7a420	Expand capabilities of `Table.set` and better dropdown support, (#8005 ) - Adds the ability to use numbers, date/time and Boolean values as constants in `set`. - `Table.set` can take a `Column_Operation`, allowing for deriving of a new column based on other columns. - Added `Column_Ref` type to refer to a column in `filter`.	2023-10-13 16:03:28 +00:00
Jaroslav Tulach	31624b5314	Helper method to turn iterator to a Vector (#8035 )	2023-10-12 17:50:51 +02:00
James Dunkerley	0dcfc3e9bf	Minor improvements from last couple of Book Clubs (#8034 ) - Added some ALIASes. - Added `sheet` to `Excel_Workbook` to give familiar API to read sheet. - Added conversion from range to vector allowing easy use with Zip. - Add `Map.from_keys_and_values.	2023-10-12 14:29:59 +00:00
Radosław Waśko	cd84ac16ce	Restructure `Table.from_objects` to use conversions (#8020 ) Closes #7957	2023-10-11 22:25:18 +00:00
Jaroslav Tulach	7ae0971d11	Runtime checks of signatures with polyglot java classes (#8016 )	2023-10-11 15:05:36 +02:00
GregoryTravis	a2c8940218	Add tests for Table.from_objects and Table.expand_column. (#8010 )	2023-10-11 11:50:51 +00:00
somebody1234	826127d8ff	Eliminate line feeds from `XML.outer_xml` on Windows (#8013 ) - Closes #7999 # Important Notes None	2023-10-10 23:21:34 +00:00
Radosław Waśko	6f78570115	Fix a DROP table bug, add SQL debug logging (#8007 ) - Fixes a bug where creating a temporary table could accidentally issue a `DROP` statement of the table name that the user provided, risking destruction of user data. - Fortunately, the bad scenario was almost impossible, because the `DROP` statement was only issued _if_ we previously checked that the mentioned table _does not exist_ - dropping a nonexistent table does not do any harm. - It could have been dangerous in a very unlikely scenario that a table was created just between the _existence check_ and the _drop_. - After the fix the existence check and any modifications are done within a transaction to avoid interference from concurrent modifications, and the DROP is correctly applied to a temporary Enso table instead of the original one. - Replaced a temporary log with proper simple logging of SQL statements into a file, if an Environment variable is set. - Used that feature to test that no unexpected statements occur.	2023-10-10 13:16:06 +00:00
Radosław Waśko	6e0bd86753	Implement `Table.lookup_and_replace` for in-memory (#7979 ) - Closes #7749 implementing the in-memory logic. - Additional complications have surfaced regarding the Database logic, so it has been split off into a separate ticket: #7981	2023-10-10 10:42:06 +00:00
Jaroslav Tulach	a234e82ee9	Instrumenter to observe behavior of nodes with UUID (#7833 ) Exposing instrumentation capabilities to Enso. Fixes #7683.	2023-10-10 02:36:59 +00:00
GregoryTravis	9ba7be20af	Basic XML support (#7947 ) This PR includes * Reading XML from a file, stream, or string * Reading XML via Data.fetch * Accessing the root element, element children, and attributes * Accessing tag text contents * Get tags by name * Inner / Outer XML string	2023-10-06 17:52:19 +00:00
Pavel Marek	c525698e5c	Enable Enso asserts on CI (#7929 ) --------- Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2023-10-04 15:52:44 +02:00
Jaroslav Tulach	421e3f22a4	Avoid unnecessary conversions (#7952 ) Fixes #7854 by checking all elements of a union type for direct match and only then, if direct match isn't available, applying conversions.	2023-10-04 12:31:06 +02:00
Radosław Waśko	0cd446432f	Fix inconsistency when building a Mixed column, fixes to Union (#7919 ) - Fixes #7352 by remembering original value types in type inference mode to be able to reconstruct them for Mixed. - Added more benchmarks for comparing performance of constructing columns. - Fixes missing implementations that caused `Table.union` crashing on some type pairs. - Ensures that `Loss_Of_Integer_Precision` warning is not swallowed when numeric columns are unioned to create a `Float` column. - Adds test for all of the above cases. - Allow to output benchmark results to a CSV by setting an environment variable - useful for quickly comparing benchmarks, e.g. in Enso.	2023-10-03 20:33:34 +02:00
Radosław Waśko	08cd449a99	Fix `NumberParser` to avoid `thousandSeparator==decimalPoint` and prefer US decimal format (#7946 ) Closes #7930	2023-10-03 20:07:54 +02:00
Radosław Waśko	3222e5af62	Avoid exponential growth of column names (#7934 ) - Fixes #7933 - Avoids duplicating `]` as `]]` in generated column names - now column names grow linearly.	2023-10-02 16:05:28 +00:00
Jaroslav Tulach	6a59fa5e93	Meta.Type.find a type by FQN (#7885 )	2023-10-02 17:02:00 +02:00
James Dunkerley	fb50eb7595	Using conversions in a few places (#7859 ) - Shuffle a few `from`s into correct places: - `Day_Of_Week.from` removing `Day_Of_Week_From` module. - Adding short cut for `http` and `https` in `Data.read` so it calls onto `Data.fetch` giving a single entry point. - Moved `URI` extensions from `Standard.Base.Data` module into `Standard.Base.Network.Extensions`. - Added `post` extension for `URI`. - Added `contains_key` to `JS_Object`. - Restored `into` in `JS_Object`: - Follows old logic populating a constructor. - Will use conversion from `JS_Object` if present. - Added automatic deserialization of `Date`, `Time_Of_Day` and `Date_Time` from JSON. - Uses conversion from `JS_Object`. - Added conversion from `Text` to a `HTTP_Method` and type checking where `HTTP_Method` used in public APIs. - Added support for `Date`, `Time_Of_Day` and `Date_Time` in `Table.from_objects`. - Added `expand_column` to `Table` to expand `JS_Object` to values. - Add type checking for `Table` in `right` arguments (allowing `Column`s to be used). - Use type checking in `Table.set` to allow for conversion to a `Column`. - Remove some unused imports. - Fix for bug in S3 edge case.	2023-10-02 14:54:22 +00:00
Cassandra-Clark	d7258abbf5	Add Text.substring to allow for an easy short hand of Text.take (start.up_to end) (#7913 ) * Add Text.substring function and get_position helper function For #7876 adds a Text.substring function which supports negative indexes and returns a part of a string from 0-based index 'start' and continuing for 'length' * added substring and simplified get function For #7876 adds a Text.substring function which supports negative indexes and returns a part of a string from 0-based index 'start' and continuing for 'length'. Also simplified get function as it looped unnecessarily. * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Text/Extensions.enso punctuation corrections Co-authored-by: GregoryTravis <greg.m.travis@gmail.com> * Update Text_Spec.enso Added test for start index larger than string * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Text/Extensions.enso updated Arguments: section to use consistent style Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Text/Extensions.enso updated Index_Out_Of_Bounds error to reference cached length Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * Removed Slice label and added changelog entry * Re-added slice tag to substring Per conversation with James, added slice back to substring * Update CHANGELOG.md add link --------- Co-authored-by: GregoryTravis <greg.m.travis@gmail.com> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2023-09-29 10:57:57 -06:00
Jaroslav Tulach	ffa036411d	Make sure Integer can be treated as Float (#7909 )	2023-09-28 11:56:54 +02:00
Radosław Waśko	8d926166ea	Follow up improvements to `Date_Time_Formatter` (#7875 ) - Closes #7872 - Also closes #7866	2023-09-28 09:38:00 +00:00
Radosław Waśko	c690559ec4	Implement `auto_value_type` operation (#7908 ) Closes #6113	2023-09-27 15:45:34 +00:00
GregoryTravis	b03712390c	Improve HTTP tests (#7847 ) * simple-httpbin encodes response using the Content-encoding header value * Return sent body verbatim	2023-09-27 14:02:32 +00:00
Hubert Plociniczak	7d80ac12bb	Transform Range.iterate test to a benchmark (#7905 ) * Transform Range.iterate test to a benchmark a) it slows down regular unit testing b) it is not a unit test c) it behaves like a benchmark So it should be a benchmark. * missing Range import	2023-09-27 12:24:45 +02:00
Radosław Waśko	12c4f2981d	More robust Date/Time format patterns parsing (#7826 ) - Closes #7461 by introducing a `Date_Time_Formatter` type and making parsing date time formats more robust and safer. - The default ('simple') set of patterns is slightly simplified and made case insensitive (except for `M/m` and `H/h`) to avoid the `YYYY` vs `yyyy` issues and make it less error prone. - The `YYYY` now has the same meaning as `yyyy` in simple mode. The old meaning (week-based year) is moved to a _separate mode_, triggered by `Date_Time_Formatter.from_iso_week_date_pattern`. - Full Java syntax, as well as custom-built Java `DateTimeFormatter` can also be used by `Date_Time_Formatter.from_java`. - Text-based constants (e.g. `ISO_ZONED_DATE_TIME`) have now become methods on `Date_Time_Formatter`, e.g. `Date_Time_Formatter.iso_zoned_date_time`).	2023-09-22 10:12:18 +00:00
Jaroslav Tulach	5150c14afd	Avoid duplicated conversion & search target type scope (#7849 )	2023-09-20 17:23:09 +02:00
James Dunkerley	74d1d0861c	S3 Read Access, Input Stream based reading (#7776 ) - Added a `FileSystemSPI` allowing protocol resolution to a target type. - Separated `Input_Stream` and `Output_Stream` from `File` to allow use in other spaces. - `File_Format` types `read_web` changed to be `read_stream` working with `InputStream`. - Added directory listing to `Auto_Detect` allowing for `Data.read` to list a folder. - Adjusted HTTP to return an `InputStream` not a `byte[]`: - `Response_Body` adjusted to wrap an `InputStream`. - Added ability to materialize to either and in-memory vector (<4KB) or a temporary file. - `Data.fetch` will materialize if not a recognized mime-type. - Added `HTTP_Error` to handle IO exceptions from the stream. - `Excel_Format` now supports mime-type and reading a stream. - `Excel_Workbook` can now get a `Excel_Section` using `read_section`. - Added S3 APIs: - `parse_uri`: splits an S3 URI into bucket and key. - `list_objects`: list the items in a S3 bucket with specified prefix. - `read_bucket`: list prefixes and keys with a delimiter in a S3 bucket with specified prefix. - `head`: either head_bucket (tests existance) or head_object API (reads object meta data). - `get_object`: gets an object from S3 returning as a `Response_Body`. - Added `S3_File` type acting like a `File`: - No support for writing in this PR. - ToDo: recursive listing, glob filtering, exists, size. - Fixed a few invalid type signature line. - Moved `create` methods for `Postgres_Connection` and `SQLite_Connection` into type instead of module. - Renamed `Column_Fetcher.Builder` to `Column_Fetcher_Builder`. - Fixed bug with `select_into` in Dry Run mode creating permanent tables. ToDo: Unit tests.	2023-09-20 15:09:11 +00:00
Jaroslav Tulach	3b790606e1	Avoid StackOverflow when comparing unknown foreign objects (#7780 ) Closes #7677 by eliminating the _stackoverflow execption_. In general it seems _too adventurous_ to walk members of random foreign objects. There can be anything including cycles. Rather than trying to be too smart in these cases, let's just rely on `InteropLibrary.isIdentical` message. # Important Notes Calling `sort` on the `numpy` array no longer yields an error, but the array isn't sorted - that needs a fix on the Python side: https://github.com/oracle/graalpython/issues/354 - once it is in, the elements will be treated as numbers and the sorting happens automatically (without any changes in Enso code).	2023-09-19 15:39:04 +00:00
Jaroslav Tulach	6cbd111bad	Optional Espresso support with ENSO_JAVA=espresso env variable	2023-09-19 15:10:12 +02:00
GregoryTravis	b0c1f3b00e	New Data.post for sending a payload to a Web API (#7700 )	2023-09-19 11:26:29 +00:00
Jaroslav Tulach	af8a2c39d8	Throw UnsupportedMessageException when loosing BigInteger precision (#7817 )	2023-09-15 17:00:09 +02:00
Dmitry Bushev	bbf96f0d16	expressionUpdates do not contain Method Pointer of operators (#7659 ) close #7520 Changelog: - update: SectionsToBinOp compiler pass produces function application for left sections - refactor: simplify the registration of builtin methods	2023-09-15 08:01:21 +00:00
Hubert Plociniczak	1ee3d8f4f0	Rename Decimal to Float (#7807 ) Implements #6889.	2023-09-14 15:01:30 +00:00
Jaroslav Tulach	30a62b97bb	Runtime checking of ascribed expression types (#7796 )	2023-09-14 14:09:41 +02:00
Jaroslav Tulach	a5094225cf	Moving numpy check into test/Examples_Tests (#7808 )	2023-09-14 13:13:40 +02:00
Radosław Waśko	8b6e70b155	Support for BigInteger values in Table (#7715 ) - Fixes #7354 - And also closes #7712 - Refactors how we handle numeric ops - ensuring that the 'kernels' are placed all in one place and selected based on storage types.	2023-09-12 13:18:04 +00:00
Jaroslav Tulach	50124704c8	Support runtime checks of intersection types (#7769 )	2023-09-11 14:05:20 +02:00
Pavel Marek	e0ee8fdda7	Merge Small_Integer and Big_Integer types (#7636 ) Merge Small_Integer and Big_Integer types into a single Integer type	2023-09-08 16:04:39 +02:00
James Dunkerley	f0ae9bf9c5	Fixes issue writing to a dry run Excel File (#7763 ) - Adds `size` to `File`. - If file is empty, then create a new Excel file. - Fixes dry run Excel write issue.	2023-09-08 08:52:00 +00:00
Pavel Marek	687686773e	Fix `on` argument for Vector.sort (#7751 ) Fix treatment of the `on` argument of `Vector.sort`.	2023-09-07 17:32:08 +00:00
Jaroslav Tulach	4213bf9983	Display overriden non-default values (#7762 )	2023-09-07 13:31:51 +02:00
Radosław Waśko	7d424bf8a2	Implement `Table.delete_rows`. (#7709 ) - Closes #7238 - Aligns `update_database_table` to a more consistent and clearer API - `update_rows`. - Adds a `truncate_table` helper function, to pair up with `drop_table`. Both are `PRIVATE` for now. - Adds tests for NULLs in keys in `update_rows` and `delete_rows`. - The behaviour is sometimes unexpected, so instead these fail with `Null_Values_In_Key_Columns`. - Adds a workaround for https://github.com/oracle/graal/issues/7359 - Adds a workaround for a related bug where a stack frame has no name (its `rootNode.getName() == null`). - I could not track down this bug to provide a neat repro.	2023-09-07 11:07:53 +00:00
Radosław Waśko	87ce78615a	Change layout of local library search path in order to be able to move `Round_Spec.enso` back to `Tests` (#7634 ) - Closes #7633 - Moves `Round_Spec.enso` from published `Standard.Test` into our `test/Tests` project; the `Table_Tests` that depend on it, simply `import enso_dev.Tests`. - Changes the layout of the local libraries directory: - It used to be `root/<namespace>/<name>`. - Now it is `root/<dir>` - the namespace and name are now read from `package.yaml` instead. - Adds the parent directory of the current project to the default `ENSO_LIBRARY_PATH`. - It is treated as a secondary path, so the default `ENSO_HOME/lib` still takes precedence. - This allows projects to reference and load 'sibling' projects easily - the only requirement is for the project to enable `prefer-local-libraries: true` or add the other local project to its edition. The edition resolution logic is not changed.	2023-09-01 20:20:04 +00:00
Jaroslav Tulach	1437a671e1	Introducing generic Any.to type conversion method (#7704 )	2023-09-01 08:05:48 +02:00
GregoryTravis	54689510a4	fix dst test failure (#7649 ) Fixes a test failure that depends on the date on which Daylight Saving Time changes.	2023-08-31 20:05:29 +00:00
GregoryTravis	061876e640	Add simple parts of Table.take and Table.drop functions to Database table (#7615 ) Implements database Table and Column take/drop, except While and Sample. Additional features and optimizations are in https://github.com/enso-org/enso/issues/7614.	2023-08-31 18:52:02 +00:00
Radosław Waśko	255b424b72	Add `value_type` to `Column.from_vector` and `expected_value_type` to `Column.map` and `Column.zip` (#7637 ) - Closes #6111 - Aligns semantics of handling Mixed columns. - Now, if an operation like `iif` or `fill_nothing` is given a `Mixed` column, the result will also be `Mixed` regardless of the `inferred_precise_value_type`. - Enables a few old tests that were pending but could be enabled since the types work is advanced enough.	2023-08-31 13:20:49 +00:00
Jaroslav Tulach	198ab7f487	Support for Python libraries like numpy (#7678 )	2023-08-30 06:10:18 +02:00
Jaroslav Tulach	6461e20870	Special support for Python Date/Time/Zone interop (#7617 )	2023-08-25 10:27:16 +02:00
GregoryTravis	ddf18f212b	Handle writing to a relative file (#7638 ) Fixes bug in writing to a non-absolute file (with backup).	2023-08-24 21:01:37 +00:00
Jaroslav Tulach	20e18d22df	More descriptive function information (#7629 ) Fixes #7359 by printing more information about the function including partially applied arguments and over-saturated arguments.	2023-08-24 18:04:08 +00:00
James Dunkerley	7d83b3d7b4	Add GROUP to functions (#7622 ) - Update list of groups to agreed list. - Lower case `ALIAS` names to be consistent with function names. - Add `GROUP` to methods. - All constructors and functions have doc comments. - Correct a few typos (e.g. `PRVIATE`). - Mark some more things as `PRIVATE`. - Use `ToDo:` and `Note:` consistently. - Order tags in doc comment. # Important Notes We don't have all the doc comments on types and will want to add them in future,	2023-08-23 13:20:38 +00:00
Pavel Marek	c32bfad2b1	Add std-libs benchmarks GH workflow (#7597 ) Adds "Benchmark Standard Libraries" workflow that can be run locally with `sbt std-benchmarks/bench`.	2023-08-23 12:18:36 +02:00
Radosław Waśko	2385f5b357	Add size-limited strings and varying bit-width integer Value_Types to in-memory backend and check for ArithmeticOverflow in LongStorage (#7557 ) - Closes #5159 - Now data downloaded from the database can keep the type much closer to the original type (like string length limits or smaller integer types). - Cast also exposes these types. - The integers are still all stored as 64-bit Java `long`s, we just check their bounds. Changing underlying storage for memory efficiency may come in the future: #6109 - Fixes #7565 - Fixes #7529 by checking for arithmetic overflow in in-memory integer arithmetic operations that could overflow. Adds a documentation note saying that the behaviour for Database backends is unspecified and depends on particular database.	2023-08-22 18:10:46 +00:00
Jaroslav Tulach	a2946d2d27	Allow Java Enums in "case of" branches (#7607 ) Fixes #7551 by looking up static symbols of a `JavaClass` polyglot object.	2023-08-21 15:12:57 +00:00
Pavel Marek	a0086bb112	Ability to invoke all std benchmarks via jmh (#7519 ) All the Enso benchmarks in `test/Benchmarks` can be invoked via JMH	2023-08-17 14:48:43 +02:00
GregoryTravis	c9d7c5cb2b	Convert in-memory Column.round to Java (#7521 )	2023-08-16 14:45:23 +00:00
Jaroslav Tulach	aa0413e5a2	Use only Type instances as keys for State (#7585 )	2023-08-16 15:54:17 +02:00
James Dunkerley	296c95d414	Fix for empty column on replace and out of memory catching for join and tab (#7593 ) - Added a Panic.catch to catch heap memory error in joins and cross_tab. - Adjusted column replace so type is correct.	2023-08-15 17:06:51 +00:00
Jaroslav Tulach	7a272ec152	Encapsulating array-like data and operations into a single package (#7544 )	2023-08-15 13:00:47 +02:00
Radosław Waśko	8541a9e1ac	Improve generation of long operation in presence of column name length limit (#7556 ) I planned to do this as part of #7428, but I forgot. Making up for that now.	2023-08-14 16:58:36 +00:00
GregoryTravis	d3436fae70	Implement Number.round as a builtin (#7460 )	2023-08-14 15:43:39 +00:00
Jaroslav Tulach	585ede7741	Making CI green: Addressing 'UTC' did not equal 'Etc/UTC' failure on Mac (#7526 )	2023-08-10 10:38:05 +02:00
Radosław Waśko	b656b336c7	Report `Loss_Of_Integer_Precision` when an integer is not exactly representable as a float during conversion (#7509 ) Closes #7353 I introduce a new type `WithAggregatedProblems`, because `WithProblems` was too simple - it only allowed to hold a `List<Problem>` but `AggregatedProblems` is more than that. Ideally we shouldn't multiply entities like this too much. We should probably unify all to use `WithAggregatedProblems` - but after starting this, I realised it will likely just take too much effort to do for this little PR. So instead, I created a follow-up task for this: #7514	2023-08-08 12:30:44 +00:00
Pavel Marek	8e49255d92	Invoke all Enso benchmarks via JMH (#7101 ) # Important Notes #### The Plot - there used to be two kinds of benchmarks: in Java and in Enso - those in Java got quite a good treatment - there even are results updated daily: https://enso-org.github.io/engine-benchmark-results/ - the benchmarks written in Enso used to be 2nd class citizen #### The Revelation This PR has the potential to fix it all! - It designs new [Bench API](`88fd6fb988`) ready for non-batch execution - It allows for _single benchmark in a dedicated JVM_ execution - It provides a simple way to wrap such an Enso benchmark as a Java benchmark - thus the results of Enso and Java benchmarks are [now unified](https://github.com/enso-org/enso/pull/7101#discussion_r1257504440) Long live _single benchmarking infrastructure for Java and Enso_!	2023-08-07 12:39:01 +00:00
GregoryTravis	758b3b31b9	Avoid indexing the table twice for Cross Tab (#7417 ) Rewrites MultiValueIndex.makeCrossTabTable to build only a single index.	2023-08-04 21:14:18 +00:00
Radosław Waśko	bc9cde6543	Fix column naming edge cases - invalid and duplicated columns, case-insensitive name aliasing for case-insensitive backends (#7495 ) - Fixes #7412 - Also adds tests and fixes some more edge cases: - Ensures correct handling of existing Database tables whose column names may be invalid from Enso perspective, or clashing from Enso perspective (e.g. for most DBs `ś` and `s\u0301` are different names, but for Enso they are basically the same so this would cause issues - thus Enso now renames such columns when accessed (still using the correct column reference in the generated SQL under the hood).	2023-08-04 09:04:38 +00:00
GregoryTravis	037a687401	Expose Unicode normalization methods on Texts (#7425 ) Exposes Text_Utils.normalize().	2023-08-03 18:07:00 +00:00
Jaroslav Tulach	9e9cf0b2ed	Adopting GraalVM's support for BigInteger (#7420 ) Fixes #7213 by reacting to new `isBigInteger` and `asBigInteger` messages. Adjusts tests to require `BigInteger` when appropriate. `EnsoBigInteger` no longer extends `Number`. Enso tests to come.	2023-08-03 10:49:16 +00:00
Radosław Waśko	c61c741476	Respect database backend naming limitations when generating table/column names and validate user-provided names to avoid silent name clashes; process JDBC warnings reported from backends (#7428 ) - Closes #5951 - Ensures any SQL warnings reported by the database through the JDBC driver are processed and forwarded to the user. - These warnings show issues like the implicit name truncation that this PR is also solving. It's good to make sure they are visible as they can help avoid and understand unexpected problems. They should not show up in most standard workflows. - Adds simple history to our REPL.	2023-08-03 09:44:27 +00:00
Jaroslav Tulach	c4d90ae6ff	Verify Python date can be used where Enso date (#7396 ) Verifies #7387 by running existing suites with various Python objects.	2023-07-28 14:17:42 +00:00
GregoryTravis	628a51d8e2	Convert Number.round to Java (#7360 )	2023-07-26 12:03:09 +00:00
James Dunkerley	7345f0fd9a	Speed up statistics (#7390 ) - Allow `parse_to_columns` to take a `Regex` object. - Add `pattern` to the `Regex` object. - Add `column_names` to the `Row` object. - Improve statistics performance. - Add benchmarks for stats. \| Benchmark \| Reference \| New \| Improvement \| \| --- \| --- \| --- \| --- \| \| Max (by reduce) \| 16.4ms \| 16.3ms \| - \| \| Max (stats) \| 703ms \| 224ms \| 68% \| \| Sum (by reduce) \| 38ms \| 38ms \| - \| \| Sum (stats) \| 753ms \| 420ms \| 44% \| \| Variance (stats) \| 745ms \| 553s \| 26% \| Also tried using a Ref approach for stats but as slower (`7e13c45224`).	2023-07-26 10:01:18 +00:00
Radosław Waśko	4b5a2e2176	Fixing operations on Mixed types (#7368 ) - Fixes #7231 - Cleans up vectorized operations to distinguish unary and binary operations. - Introduces MixedStorage which may pretend to be a more specialized storage on demand. - Ensures that operations request a more specialized storage on right-hand side to ensure compatibility with reported inferred storage type. - Ensures that a dataflow error returned by an Enso callback in Java is propagated as a polyglot exception and can be caught back in Enso - Tests for comparison of Mixed storages with each other and other types - Started using `Set` for `Filter_Condition.Is_In` for better performance. - ~~Migrated `Column.map` and `Column.zip` to use the Java-to-Enso callbacks.~~ - This does not forward warnings. IMO we should not be losing them. We can switch and add a ticket to fix the warnings, but that would be a regression (current implementation handles them correctly). Instead, we should first gain some ability to work with warnings in polyglot. I created a ticket to get this figured out #7371 - ~~Trying to avoid conversions when calling Enso functions from Java.~~ - Needs extra care as dataflow errors may not be handled right then. So only works for simple functions that should not error. - Not sure how much it really helps. [Benchmarks](https://github.com/enso-org/enso/pull/7270#issuecomment-1635618393) suggested it could improve the performance quite significantly, but the practical solution is not exactly the same as the one measured, so we may have to measure and tune it to get the best results. - Created #7378 to track this.	2023-07-25 23:25:17 +00:00
GregoryTravis	1f6fcf189b	Implement replace on the Database Column (#7275 ) Implements `replace` for database text columns, for text, regex, and column patterns.	2023-07-25 18:09:50 +00:00
Jaroslav Tulach	34c63e3b11	Ascribed types & atom_with_hole (#7348 )	2023-07-24 07:02:58 +02:00
James Dunkerley	88f32d9b2a	Various small tickets... (#7367 ) - Added `Text.length` into Text class so CB lists the built in. - Added `File.starts_with` and tests for the built in method. - Add `to_js_object` and `to_display_text` to `Regex`. ![image](https://github.com/enso-org/enso/assets/4699705/3b197c94-9c49-4bc5-a2cc-ce53b917942e) - Add `to_js_object` and `to_display_text` to `Match`. ![image](https://github.com/enso-org/enso/assets/4699705/962ec4f2-324d-4f10-8ec0-932b093c6729) - Remove the `bit_shift_l` alias from the built-ins. - Add test and Enso wrapper for `Text.is_normalized`.	2023-07-23 09:04:11 +00:00

... 2 3 4 5 6 ...

886 Commits