enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-02 11:34:35 +03:00

Author	SHA1	Message	Date
Radosław Waśko	bc9cde6543	Fix column naming edge cases - invalid and duplicated columns, case-insensitive name aliasing for case-insensitive backends (#7495 ) - Fixes #7412 - Also adds tests and fixes some more edge cases: - Ensures correct handling of existing Database tables whose column names may be invalid or clashing from Enso's perspective (e.g. for most DBs `ś` and `s\u0301` are different names, but for Enso they are basically the same, so this would cause issues). Enso now renames such columns when accessed, still using the correct column reference in the generated SQL under the hood.	2023-08-04 09:04:38 +00:00
Radosław Waśko	c61c741476	Respect database backend naming limitations when generating table/column names and validate user-provided names to avoid silent name clashes; process JDBC warnings reported from backends (#7428 ) - Closes #5951 - Ensures any SQL warnings reported by the database through the JDBC driver are processed and forwarded to the user. - These warnings show issues like the implicit name truncation that this PR is also solving. It's good to make sure they are visible as they can help avoid and understand unexpected problems. They should not show up in most standard workflows. - Adds simple history to our REPL.	2023-08-03 09:44:27 +00:00
Radosław Waśko	4b5a2e2176	Fixing operations on Mixed types (#7368 ) - Fixes #7231 - Cleans up vectorized operations to distinguish unary and binary operations. - Introduces MixedStorage which may pretend to be a more specialized storage on demand. - Ensures that operations request a more specialized storage on right-hand side to ensure compatibility with reported inferred storage type. - Ensures that a dataflow error returned by an Enso callback in Java is propagated as a polyglot exception and can be caught back in Enso - Tests for comparison of Mixed storages with each other and other types - Started using `Set` for `Filter_Condition.Is_In` for better performance. - ~~Migrated `Column.map` and `Column.zip` to use the Java-to-Enso callbacks.~~ - This does not forward warnings. IMO we should not be losing them. We can switch and add a ticket to fix the warnings, but that would be a regression (current implementation handles them correctly). Instead, we should first gain some ability to work with warnings in polyglot. I created a ticket to get this figured out #7371 - ~~Trying to avoid conversions when calling Enso functions from Java.~~ - Needs extra care as dataflow errors may not be handled right then. So only works for simple functions that should not error. - Not sure how much it really helps. [Benchmarks](https://github.com/enso-org/enso/pull/7270#issuecomment-1635618393) suggested it could improve the performance quite significantly, but the practical solution is not exactly the same as the one measured, so we may have to measure and tune it to get the best results. - Created #7378 to track this.	2023-07-25 23:25:17 +00:00
GregoryTravis	1f6fcf189b	Implement replace on the Database Column (#7275 ) Implements `replace` for database text columns, for text, regex, and column patterns.	2023-07-25 18:09:50 +00:00
GregoryTravis	8a829384ed	Add zone to Date_Times (#7325 )	2023-07-19 11:04:50 +00:00
GregoryTravis	2fb5c3710b	Add Fallback to Prim_Text_Helper.compile_regex; accept Regex in Text.parse_to_table (#7297 ) This PR does three related things: - Fails more gracefully when a non-string is passed to compile_regex - Don't pass a non-string to compile_regex - Allow a Regex param to parse_to_table	2023-07-18 19:55:56 +00:00
James Dunkerley	fd0bdc86dd	Fix issue with rename_columns and revert order of parameter change on select_columns. (#7321 ) The Regex change introduced some issues. Added a test for missed case in `rename_columns` where using vector of pairs. Reverted parameter order change for `select_columns`.	2023-07-18 13:30:23 +00:00
James Dunkerley	aaa235fbad	Add drop down for replace, remove Column_Selector (#7295 ) - Add dropdowns for `replace` functions. - Retire `Column_Selector` type. - Add `select_blank_columns` and `remove_blank_columns` functions to table types. - Allow Regex to be used to pick columns.	2023-07-14 17:30:52 +00:00
Radosław Waśko	866283c0a8	Improve error message on `Filter_Condition` missing arguments in `Table.filter` (#7290 ) In #7148 I improved the error message when a `Filter_Condition` constructor without arguments is provided to `Vector.filter` and its friends. This PR applies the same check to the `Table.filter`. This is useful, because when we select a Filter_Condition from a widget, initially it does not have all its arguments applied. This used to lead to confusing errors being reported to the user, now, a much clearer error is shown: ![image](https://github.com/enso-org/enso/assets/1436948/19140a7b-d6fc-4292-81d3-dc6d61135cb9)	2023-07-14 08:00:13 +00:00
Radosław Waśko	620cc361ce	Add `date_diff`, `date_add` and `date_part` to scalar Enso date-time values. (#7273 ) Followup of #7221, adding `date_diff`, `date_add` and `date_part` to scalar Enso date-time values.	2023-07-13 15:17:21 +00:00
Radosław Waśko	ca68dd94da	Adding new Date/Time operations (`-`, `date_add`, `date_diff`, `date_part`) (#7221 ) - Adds `Column.date_diff` for computing date/time difference as integer multiple of some unit. - Adds `Column.date_add` for shifting date/time by a unit. - Adds `Column.date_part` for extracting various parts of the date/time value as integer. - Adds widgets for the 3 methods above whose content depends on the column value type. - Adds shorthands: `Column.hour`, `Column.minute` and `Column.second` to extract these date parts. - Extends `Time_Period` with support for milli-, micro- and nano- seconds; and adapts functions taking `Time_Period` to support these wherever possible.	2023-07-13 12:56:54 +00:00
James Dunkerley	0adab6c68c	Round on a column was always adding a warning (#7246 ) - Only warn if outside allowed range. - Added `is_infinite` to In-Memory column. - Allow integer value type for `is_nan` and `is_infinite`.	2023-07-10 17:35:23 +00:00
GregoryTravis	345d6b9cb1	Add cross_join support to Database Table (#7234 )	2023-07-10 16:29:37 +00:00
James Dunkerley	1fb60df61b	Fixes from the live demo. (#7243 ) - Removed defaults from `cross_tab`. It caused an out-of-heap space error when it attempted to build a 205k x 205k table. Now has a hard limit of 10,000 columns - we can increase this once we have more concrete test data. ![image](https://github.com/enso-org/enso/assets/4699705/bc38d41c-56dc-41bd-8a7c-fa89ecfa7f79) - Adjusted the dropdowns on `Aggregate_Column` for `columns` and `order_by` to be dropdowns as nested Vector editors are not supported. ![image](https://github.com/enso-org/enso/assets/4699705/f4a7c7cc-6a21-462c-a39e-65fbab82c367) - Altered `Aggregate_Column` so `new_name` now `new_name:Text=""` and not taking `Nothing` anymore. Makes it appear correctly in IDE. ![image](https://github.com/enso-org/enso/assets/4699705/196a49ba-4274-44bb-b876-0372c8f62746) - Added dropdowns for `fill_empty`, `fill_nothing` and `replace` on `Table`. ![image](https://github.com/enso-org/enso/assets/4699705/9ee5cec2-82d5-4452-b650-67015ac9fee5) - Added `replace` to Database table throwing `Unsupport_Database_Operation`.	2023-07-09 18:03:05 +00:00
GregoryTravis	bd26e95fd6	Add Table.replace; Change Text.replace to take a Text\|Pattern, and remove the use_regex param. (#7223 )	2023-07-06 16:13:11 +00:00
James Dunkerley	7749286c69	Tidy up the imports using script (#7220 ) Ordering the imports to test a script.	2023-07-06 14:22:50 +00:00
GregoryTravis	6eb46afb40	Do not rename column on fill_nothing and add version to the Table allowing filling multiple (include fill_empty as well). (#7166 ) Updated Column.fill_nothing and .fill_empty, and added the same to Table. (Both in-memory and db.)	2023-07-05 17:20:23 +00:00
Radosław Waśko	78545b4402	Add safepoints to standard libraries Java polyglot helpers (#7183 ) Closes #7129	2023-07-05 14:12:13 +00:00
Radosław Waśko	2d73277238	Fix a bug that somehow went under CI (#7204 )	2023-07-05 08:54:27 +00:00
James Dunkerley	4fbe7e3830	Remove `Array.new` and `Array.copy` and move Vector functions to builtins. (#7147 ) - Removed Array methods: `new`, `copy` and `new_[1234]`. - New builtins for `Vector.insert`, `Vector.remove` and `Vector.flatten`. - Replaced `Vector_Builder` use of `Array.copy` to a `Vector.Builder` approach.	2023-07-03 12:41:41 +00:00
Radosław Waśko	4ccf3566ce	Implement `add_row_number` for Database backends, fix primary key inference for SQLite (#7174 ) Closes #6921 and also closes #7037	2023-07-03 11:51:42 +00:00
GregoryTravis	c866aa7fb5	parse_to_columns should generate at least one row for a non-match (#7171 )	2023-06-30 18:10:33 +00:00
GregoryTravis	550d146493	Add round, ceil, floor, truncate to the In-Database Column type (#6988 )	2023-06-30 16:47:40 +00:00
James Dunkerley	56688ec1e7	Minor fixes. (#7122 ) Mostly stuff to tidy up the static methods in the CB. - Remove default pattern from `parse_to_table` (caused IDE to freeze). - Rename any `_` arguments to what they are. - Merge `Date.now` into `Date.today` - Merge the Interval constructors into a single constructor. - Hide various methods.	2023-06-27 18:18:15 +00:00
Radosław Waśko	2bac9cc844	Execution Context integration for Database write operations (#7072 ) Closes #6887	2023-06-27 15:51:21 +00:00
James Dunkerley	937651f696	Code Clean Up, Fix Weird Namespace, S3 List Objects and Read Object (#7114 ) Mostly a tidy up as part of looking over the function catalogue for groups. Sorted some whitespaces issues.	2023-06-24 23:18:58 +00:00
James Dunkerley	760fb71798	First part of AWS S3 API, various small fixes. (#6973 ) - Add type detection for `Mixed` columns when calling column functions. - Excel uses column name for missing headers. - Add aliases for parse functions on text. - Adjust `Date`, `Time_Of_Day` and `Date_Time` parse functions to not take `Nothing` anymore and provide dropdowns. - Removed built-in parses. - All support Locale. - Add support for missing day or year for parsing a Date. - All will trim values automatically. - Added ability to list AWS profiles. - Added ability to list S3 buckets. - Workaround for Table.aggregate so default item added works.	2023-06-15 16:20:13 +00:00
Radosław Waśko	dad57e6c7d	Implement remaining `Update_Action`s for `update_database_table`. (#7035 ) Closes #6498	2023-06-15 08:48:22 +00:00
Pavel Marek	67821bf8df	Add compiler pass that discovers ambiguous imports (#6868 ) Add a new compiler pass that analyses duplicated and ambiguous symbols from imports	2023-06-14 12:18:57 +02:00
Radosław Waśko	d9ed63fb89	Implement `Insert` update action for `update_database_table`. (#6990 ) This adds the spec for all update actions, but implements the common input validation framework and `Insert`. Tests for remaining actions are marked as pending - these will be implemented in a subsequent PR.	2023-06-14 00:14:32 +00:00
James Dunkerley	578ba59f1d	Use US Locale for Date and Time parsing and formatting (#6967 ) Sorts out parsing and printing long form names of months and weekdays.	2023-06-06 21:44:25 +00:00
GregoryTravis	912fbce97b	Reimplement Column.truncate, .ceil, and .floor as vectorized Java ops (#6941 ) Reimplement these in Java. Benchmarks: Before: Column.truncate floats average: 124.4ms Column.ceil floats average: 121.47ms Column.floor floats average: 120.18ms Column.truncate ints average: 124.78ms Column.ceil ints average: 120.41ms Column.floor ints average: 102.35ms After (boxed): Column.truncate floats average: 3.75ms Column.ceil floats average: 2.25ms Column.floor floats average: 1.89ms Column.truncate ints average: 2ms Column.ceil ints average: 1.77ms Column.floor ints average: 1.74ms After (unboxed): Column.truncate floats average: 3.32ms Column.ceil floats average: 2.15ms Column.floor floats average: 1.69ms Column.truncate ints average: 1.74ms Column.ceil ints average: 1.61ms Column.floor ints average: 1.99ms	2023-06-06 18:07:12 +00:00
Radosław Waśko	b513839418	Refactor `create_database_table` into `Connection.create_table` and `select_into_database_table`, implement `Set`. (#6925 ) First part for #6498 - refactoring of the upload infrastructure, in preparation for `update_database_table`. Implemented a `Set` data structure which was long needed. The APIs are added and an initial implementation is created, but it is not complete - but it has grown significantly already so the remaining implementation will be done as a separate PR. Adds some basic ability for a function to ensure that it is only executed from within a transaction.	2023-06-06 10:36:05 +00:00
James Dunkerley	db96bd2e2c	Small fixes from book club. (#6933 ) - Add the missing dropdowns for `Locale` and `Encoding`. - Correct a few mismatched type signatures. - Adjust `order_by` calls with a single `Sort_Column` to call in a Vector. - Adjust parameter names for `transpose`. - Fix for the table viz: escape HTML and `suppressFieldDotNotation`. - Use `Filter_Condition.Equal True` for the default filter. - Adjust `Data.fetch` to return the response on success when parse fails. Rename `parse` to `try_auto_parse`. - Add various aliases for methods. - Add tests for `Table.set` when using a `Vector`, `Range` or `Date_Range`. - Add check for mismatched length on `Table.set`. ![image](https://github.com/enso-org/enso/assets/4699705/23ea0ba3-2b05-4af8-afd9-f35b55446c24) ![image](https://github.com/enso-org/enso/assets/4699705/8b0253e6-e9e8-490a-9607-0da51ab5a215)	2023-06-05 13:57:30 +00:00
Radosław Waśko	cfb2f2916e	Merge `Column_Indexes_Out_Of_Range` into `Missing_Input_Columns`. (#6901 ) Implements #6869	2023-06-02 12:09:20 +00:00
Radosław Waśko	d44b1250b7	Implement `Table.add_row_number` (#6890 ) Closes #5227 # Important Notes - This lays first steps towards #6292 - we get pure Enso variants of MultiValueKey. - Another part refactors `LongStorage` into `AbstractLongStorage` allowing it to provide alternative implementations of the underlying storage, in our case `LongRangeStorage` generating the values ad-hoc and `LongConstantStorage` - currently unused but in the future it can be adapted to support constant columns (once we implement similar facilities for other types).	2023-06-02 10:13:13 +00:00
James Dunkerley	343b5fb085	Execution control for Table.write and various widget tweaks... (#6835 ) - Adds execution control to `Table.write`. - Refactored the `Text.write` to make part reusable. - Tidied up some legacy mess in tests. - Add easier flow to go from `Text` to a `URI` to fetching data. - Add decode functions to `Response` and `Response_Body`. - Fix issue with 0 length regex matches (using same as Python and .Net approach). - Add various ALIAS entries to make function discovery easier. - Sort a lot of drop down and vector editors out (including switch to fully qualified names).	2023-06-01 22:10:03 +00:00
GregoryTravis	0337180384	Add rounding functions to the Column type (#6817 )	2023-06-01 20:06:23 +00:00
Radosław Waśko	c3e771c75c	Allow casting a Mixed column into a concrete type (#6777 ) Follow-up of #6711 Closes #6838	2023-05-26 13:25:53 +00:00
Pavel Marek	bd70ed6830	Update GraalVM to 22.3.1 JDK17 (#6750 ) Upgrade GraalVM to 22.3.2 based on JDK17. --------- Co-authored-by: somebody1234 <ehern.lee@gmail.com>	2023-05-24 10:39:08 +02:00
Pavel Marek	2b1e5cd017	Add a compiler pass to analyze non-existing imported symbols (#6726 ) Add diagnosis for unresolved symbols in `from ... import sym1, sym2, ...` statements. - Adds a new compiler pass, `ImportSymbolAnalysis`, that checks these statements and iterates through the symbols and checks if all the symbols can be resolved. - Works with `BindingsMap` metadata. - Add `ImportExportTest` that creates various modules with various imports/exports and checks their generated `BindingMap`. --------- Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by: Jaroslav Tulach <jaroslav.tulach@enso.org>	2023-05-22 10:41:15 +02:00
James Dunkerley	62fecfa474	Widgets, Vector as Column, Viz Fixes and Rename Columns (#6768 ) - Fix couple of bugs in Table viz: rounding of bottom div, missing character, not including row count as an option. - Add better JSON format for `Row`, add support for visualization in the Table viz both for `Vector Row` or `Row`. - Fix some type signature errors. - Move `Column_Format` to `Standard.Table.Internal`. - Move `format_widget` to `File_Format.default_widget` and sort the signature of `Widget` methods. - Added utility to make `Single_Choice` widgets. - Added dropdown for delimiter on split methods. - Removed `default_widget` from `Problem_Behavior` and `Filter_Condition`. - Altered signature and widgets for table functions. - Added `to_column` extension to allow easy conversion of Range and Vector to Column. - Added `compute`, `compute_bulk`, `running` to Column to allow statistic computation. - Added drop down for `Table.write` format parameter. - Added drop down for `Table.rename_columns`. - Added support for Vector of pairs for renaming columns. - Added check when making a map from Vector if not 2 items. ![image](https://github.com/enso-org/enso/assets/4699705/beed257c-efe3-44a3-9e3a-041354701735)	2023-05-19 23:24:47 +00:00
Radosław Waśko	a9a464af37	Implement simple variants of `parse` for the Database backend (#6731 ) Implements the simplest `parse` scenarios for the Database backend. Before #6711 these could have been done by `cast`, but in #6711 the APIs were unified to only allow casting to the same set of types in both in-memory and Database. Converting Text to other types is supposed to be done by `parse` and not `cast`, so the ability to use `cast` for rudimentary parsing is removed in the Database backend to make it consistent with in-memory. But now it is lacking any, even simplest, Text->Int/Text->Date support. To alleviate that, the simple scenarios for `parse` are implemented (no support for format customization yet, will boil down to a cast under the hood).	2023-05-19 22:11:23 +00:00
Radosław Waśko	447786a304	Implement `cast` for Table and Column (#6711 ) Closes #6112	2023-05-19 10:00:20 +00:00
Radosław Waśko	f5071a17fd	Improving widgets for take/drop (#6641 ) Related to #6410 # Important Notes - Updated some `Meta` methods (needed for error handling): - `Meta.Type` now has `name` and `qualified_name`. - `Meta.Constructor` has `declaring_type` allowing to get the type that this constructor is associated with.	2023-05-12 15:33:15 +00:00
Radosław Waśko	d8b926922a	Improve `Non_Unique_Primary_Key` error, split file format detection into read/write, improve SQLite format detection (#6604 ) Closes #6437 Related to #6410 - Add example duplicate row to `Non_Unique_Primary_Key`. - Ensure `File.read` fails if the file does not exist, always. - Ensure SQLite fails if file is empty or nonexistent or malformed. - Split file format detection into read and write modes, so that the read mode can depend on actual file _contents_.	2023-05-09 17:15:44 +00:00
GregoryTravis	d6f7cea923	tokenize_to_columns or parse_to_columns results in a single column we shouldn't add the 1 (#6607 ) When generating new column names in tokenize/split_to_columns, if there's only one new column, use the deleted input column name and don't add disambiguating integers after it.	2023-05-09 15:56:33 +00:00
GregoryTravis	4ba8409def	Add format to the in-memory Column (#6538 ) Add format to the in-memory Column # Important Notes Also updates .format in date types. Some rearrangement of date formatting builtins / Java libraries.	2023-05-09 08:47:40 +00:00
James Dunkerley	bc0db18a6e	Small changes from Book Club issues (#6533 ) - Add dropdown to tokenize and split `column`. - Remove the custom `Join_Kind` dropdown. - Adjust split and tokenize names to start numbering from 1, not 0. - Add JS_Object serialization for Period. - Add `days_until` and `until` to `Date`. - Add `Date_Period.Day` and create `next` and `previous` on `Date`. - Use simple names with `File_Format` dropdown. - Avoid using `Main.enso` based imports in `Standard.Base.Data.Map` and `Standard.Base.Data.Text.Helpers`. - Remove an incorrect import from `Standard.Database.Data.Table`. From #6587: A few small changes, lots of lines because this affected lots of tests: - `Table.join` now defaults to `Join_Kind.Left_Outer`, to avoid losing rows in the left table unexpectedly. If the user really wants to have an Inner join, they can switch to it. - `Table.join` now defaults to joining columns by name not by index - it looks in the right table for a column with the same name as the first column in left table. - Missing Input Column errors now specify which table they refer to in the join. - The unique name suffix in column renaming / default column names when loading from file is now a space instead of underscore.	2023-05-06 10:10:24 +00:00
GregoryTravis	f7282b7cff	"at_least_one" flag for tokenize_to_rows (#6539 ) Add "at_least_one" flag for tokenize_to_rows.	2023-05-06 06:22:09 +00:00
Radosław Waśko	0be572af6a	Only `Join_Kind.Inner` removes the common-named columns (#6564 ) As requested by @jdunkerley	2023-05-05 08:12:07 +00:00
Radosław Waśko	41a8257e8d	Separating Redshift connector from `Database` library into a new `AWS` library (#6550 ) Related to #5777	2023-05-04 17:36:51 +00:00
James Dunkerley	bb8f910be9	Continued Execution Context work and some little fixes (#6506 ) - Add `with_disabled` shortcut for `Context`. - Protect `Image.write` behind `Context.Output`. - Correct text on the `Forbidden_Operation` error message. - Remove context overrides from tests. - Add `File` operations tests with `Context.Output` disabled. - Add tests for `Text.write` operations with `Context.Output` disabled. - Use a better method to make `File_Format` dropdown widget. - Fix bug in `Invalid_Format.to_display_text`.	2023-05-03 08:01:16 +00:00
Radosław Waśko	5eb9c3a843	Implement `create_database_table` for Database Table (#6467 ) Closes #6327	2023-05-02 14:01:44 +00:00
James Dunkerley	6b0c682b08	Add Execution Context control to Text.write (#6459 ) - Adjusted `Context.is_enabled` to support default argument (moved built in so can have defaults). - Made `environment` case-insensitive. - Bug fix for play button. - Short hand to execute within an enabled context. - Forbid file writing if the Output context is disabled with a `Forbidden_Operation` error. - Add temporary file support via `File.create_temporary_file` which is deleted on exit of JVM. - Execution Context first pass in `Text.write`. - Added dry run warning. - Writes to a temporary file if disabled. - Created a `DryRunFileManager` which will create and manage the temporary files. - Added `format` dropdown to `File.read` and `Data.read`. - Renamed `JSON_File` to `JSON_Format` to be consistent. (still to unit test).	2023-04-29 08:39:18 +00:00
Radosław Waśko	cdd0065800	Change defaults for `Connection.tables` and ensure that `Connection.query` recognizes all available tables (#6443 ) Closes #6398	2023-04-29 00:13:29 +00:00
Hubert Plociniczak	0dc9f3742c	Unbreak CI after outdated merge (#6466 )	2023-04-28 14:35:11 +02:00
Hubert Plociniczak	c6790f1e9c	Report only unique warnings (#6372 ) This change makes sure that reported warnings are unique, based on the value of internal clock tick and ignoring differences in reassignments. Before: ![Screenshot from 2023-04-20 15-42-55](https://user-images.githubusercontent.com/292128/233415710-925c1045-37c7-49f5-9bc3-bfbfd30270a3.png) After: ![Screenshot from 2023-04-20 15-27-27](https://user-images.githubusercontent.com/292128/233415807-8cb67bc2-ac37-4db7-924e-ae7619074b5b.png) On the positive side, no further changes, like in LS, have to be done. Closes #6257.	2023-04-28 07:16:00 +00:00
Radosław Waśko	462016a428	Create database table from memory (#6429 )	2023-04-27 20:06:17 +00:00
James Dunkerley	0c7c3bdeaf	Fix for the massive number of warnings when renaming with invalid names. (#6450 ) * Rename makeUnique overloads to avoid issue when Nothing is passed. Suspend warnings when building the output table to avoid mass warning duplication. * Add test for mixed invalid names. Adjust so a single warning attached. * PR comments.	2023-04-27 14:51:59 +01:00
James Dunkerley	0e51131809	Table Visualization and display text changes. (#6382 ) - Missing tests from number parsing. - Fix type signature on some warning methods. - Fix warnings on `Standard.Database.Data.Table.parse_values`. - Added test for `Nothing` and empty string on `use_first_row_as_names`. - New API for `Number.format` taking a simple format string and `Locale`. - Add ellipsis to truncated `Text.to_display_text`. - Adjusted built-in `to_display_text` for numbers to not include type (but also to display BigInteger as value). - Remove `Noise.Generator` interface type. - Json: Added `to_display_text` to `JS_Object`. - Time: Added `to_display_text` for `Date`, `Time_Of_Day`, `Date_Time`, `Duration` and `Period`. - Text: Added `to_display_text` to `Locale`, `Case_Sensitivity`, `Encoding`, `Text_Sub_Range`, `Span`, `Utf_16_Span`. - System: Added `to_display_text` to `File`, `File_Permissions`, `Process_Result` and `Exit_Code`. - Network: Added `to_display_text` to `URI`, `HTTP_Status_Code` and `Header`. - Added `to_display_text` to `Maybe`, `Regression`, `Pair`, `Range`, `Filter_Condition`. - Added support for `to_js_object` and `to_display_text` to `Random_Number_Generator`. - Verified all error types have `to_display_text`. - Removed `BigInt`, `Date`, `Date_Time` and `Time_Of_Day` JS based rendering as using `to_display_text` now. - Added support for rendering nested structures in the table viz.	2023-04-26 18:15:48 +00:00
GregoryTravis	dcfbf841b3	Replace Table should_equal with should_equal_verbose (#6405 ) Table.should_equal_verbose shows the contents of the tables on failure; let's make this the standard comparison.	2023-04-26 14:01:49 +00:00
GregoryTravis	afd804d529	5127 Add Table.parse_to_columns to parse a single column to a set of columns. (#6383 ) Implement Table.parse_to_columns	2023-04-24 15:21:38 +00:00
Radosław Waśko	f3873f9768	Infer SQLite types locally (#6381 ) Closes #6208	2023-04-24 10:55:12 +00:00
Radosław Waśko	a43d524336	Add typechecks to Aggregate and Cross Tab (#6380 ) Follow up of #6298 as it grew too much. Adds the needed typechecks to aggregate operations. Ensures that the DB operations report `Floating_Point_Equality` warning consistently with in-memory.	2023-04-24 08:55:54 +00:00
GregoryTravis	22f820feb7	Add Table.parse_text_to_table to convert Text to a Table. (#6294 )	2023-04-21 17:43:19 +00:00
Radosław Waśko	8db2ad51a1	Adding typechecks to Column Operations (#6298 ) Closes #6106	2023-04-21 12:20:12 +00:00
James Dunkerley	0350762386	Add `replace`, `trim` to Column. Better number parsing. (#6253 ) - Add `replace` with same syntax as on `Text` to an in-memory `Column`. - Add `trim` with same syntax as on `Text` to an in-memory `Column`. - Add `trim` to in-database `Column`. - Added `is_supported` to dialects and exposed the dialect consistently on the `Connection`. - Add `write_table` support to `JSON_File` allowing `Table.write` to write JSON. - Updated the parsing for integers and decimals: - Support for currency symbols. - Support for brackets for negative numbers. - Automatic detection of decimal points and thousand separators. - Tighter rules for scientific and thousand separated numbers. - Remove `replace_text` from `Table`. - Remove `write_json` from `Table`.	2023-04-20 16:04:59 +00:00
Pavel Marek	b42e910280	sort handles incomparable values (#5998 ) * Update type ascriptions in some operators in Any * Add @GenerateUncached to AnyToTextNode. Will be used in another node with @GenerateUncached. * Add tests for "sort handles incomparable types" * Vector.sort handles incomparable types * Implement sort handling for different comparators * Comparison operators in Any do not throw Type_Error * Fix some issues in Ordering_Spec * Remove the remaining comparison operator overrides for numbers. * Consolidate all sorting functionality into a single builtin node. * Fix warnings attachment in sort * PrimitiveValuesComparator handles other types than primitives * Fix byFunc calling * on function can be called from the builtin * Fix build of native image * Update changelog * Add VectorSortTest * Builtin method should not throw DataflowError. If yes, the message is discarded (a bug?) * TypeOfNode may not return only Type * UnresolvedSymbol is not supported as `on` argument to Vector.sort_builtin * Fix docs * Fix bigint spec in LessThanNode * Small fixes * Small fixes * Nothings and Nans are sorted at the end of default comparator group. But not at the whole end of the resulting vector. * Fix checking of `by` parameter - now accepts functions with default arguments. * Fix changelog formatting * Fix imports in DebuggingEnsoTest * Remove Array.sort_builtin * Add comparison operators to micro-distribution * Remove Array.sort_builtin * Replace Incomparable_Values by Type_Error in some tests * Add on_incomparable argument to Vector.sort_builtin * Fix after merge - Array.sort delegates to Vector.sort * Add more tests for problem_behavior on Vector.sort * SortVectorNode throws only Incomparable_Values. * Delete Collections helper class * Add test for expected failure for custom incomparable values * Cosmetics. * Fix test expecting different comparators warning * isNothing is checked via interop * Remove TruffleLogger from SortVectorNode * Small review refactorings * Revert "Remove the remaining comparison operator overrides for numbers." This reverts commit `0df66b1080`. * Improve bench_download.py tool's `--compare` functionality. - Output table is sorted by benchmark labels. - Do not fail when there are different benchmark labels in both runs. * Wrap potential interop values with `HostValueToEnsoNode` * Use alter function in Vector_Spec * Update docs * Invalid comparison throws Incomparable_Values rather than Type_Error * Number comparison builtin methods return Nothing in case of incomparables	2023-04-16 16:40:12 +02:00
GregoryTravis	4dcf5faddd	Add split and tokenize to the Table. (#6233 ) Implement split and tokenize for tables.	2023-04-14 16:03:02 +00:00
Radosław Waśko	0f4f8a0542	Full-joins in SQLite (#6215 ) Closes #5254 In #6189 the SQLite version was bumped to a newer release which has builtin support for Full and Right joins, so a workaround is no longer needed.	2023-04-06 16:49:14 +00:00
Radosław Waśko	f5db35af07	Adjust `{Table\|Column}.parse` to use `Value_Type` (#6213 ) Closes #5660	2023-04-06 10:58:55 +00:00
Radosław Waśko	83b10a2088	Implement `Table.union` for Database backend (#6204 ) Closes #5235	2023-04-06 08:40:34 +00:00
James Dunkerley	f26bcf6ab6	Small issues from working with Ned (#6160 ) - `Process.run` now returns a `Process_Result` allowing the easy capture of stdout and stderr. - Joining a column with a column name does not warn if adding just the prefix. - Stop the table viz from changing case and adding spaces to the headers.	2023-04-03 13:01:42 +00:00
Radosław Waśko	6ddcb553e5	Date/time support for Postgres. Year/month/day operations on Columns. (#6153 ) Closes #6115	2023-03-31 18:37:04 +00:00
Radosław Waśko	6f86115498	Proper implementation of Value Types in Table (#6073 ) This is the first part of the #5158 umbrella task. It closes #5158, follow-up tasks are listed as a comment in the issue. - Updates all prototype methods dealing with `Value_Type` with a proper implementation. - Adds a more precise mapping from in-memory storage to `Value_Type`. - Adds a dialect-dependent mapping between `SQL_Type` and `Value_Type`. - Removes obsolete methods and constants on `SQL_Type` that were not portable. - Ensures that in the Database backend, operation results are computed based on what the Database is meaning to return (by asking the Database about expected types of each operation). - But also ensures that the result types are sane. - While SQLite does not officially support a BOOLEAN affinity, we add a set of type overrides to our operations to ensure that Boolean operations will return Boolean values and will not be changed to integers as SQLite would suggest. - Some methods in SQLite fallback to a NUMERIC affinity unnecessarily, so stuff like `max(text, text)` will keep the `text` type instead of falling back to numeric as SQLite would suggest. - Adds ability to use custom fetch / builder logic for various types, so that we can support vendor specific types (for example, Postgres dates). # Important Notes - There are some TODOs left in the code. I'm still aligning follow-up tasks - once done I will try to add references to relevant tasks in them.	2023-03-31 16:16:18 +00:00
GregoryTravis	6b9cbeacb2	Implement Regular Expression replace and update `Text.replace` to the new API (#5959 ) Re-implement replace on top of Truffle regex.	2023-03-28 06:13:12 +00:00
James Dunkerley	58f2c7643f	Use new Enso Hash Codes and Comparable (#6060 ) Enables `distinct`, `aggregate` and `cross_tab` to use the Enso hashing and equality operations. Also, I rewired the way the ObjectComparators are obtained in polyglot code to be more consistent. Add Comparator for `Day_Of_Week`, `Header`, `SQL_Type`, `Image` and `Matrix`. Also, removed the custom `==` from these types as needed. (Closes #5626)	2023-03-24 15:02:25 +00:00
Hubert Plociniczak	8c6fd60aaf	Detect conflicts between exported types and FQNs (#5986 ) Exporting types named the same as the module where they are defined in `Main` modules of library components may lead to accidental name conflicts. This became apparent when trying to access `Problem_Behavior` module via a fully qualified name and the compiler rejected it. This is due to the fact that `Main` module exported `Error` type defined in `Standard.Base.Error` module, thus making it impossible to access any other submodules of `Standard.Base.Error` via a fully qualified name. This change adds a warning to FullyQualifiedNames pass that detects any such future problems. While only `Error` module was affected, it was widely used in the stdlib, hence the number of changes. Closes #5902. # Important Notes I left out the potential conflict in micro-distribution, thus ensuring we actually detect and report the warning.	2023-03-21 21:09:41 +00:00
Radosław Waśko	952beba8d1	Fix `cross_tab` column naming edge cases, add `fill_empty` (#5863 ) Closes #5151 and adds some additional tests for `cross_tab` that verify duplicated and invalid names. I decided that for empty or `Nothing` names, instead of replacing them with `Column` and implicitly losing connection with the value that was in the column, we should just error on such values. To make handling of these easier, `fill_empty` was added allowing to easily replace the empty values with something else. Also, `{is,fill}_missing` was renamed to `{is,fill}_nothing` to align with `Filter_Condition.Is_Nothing`.	2023-03-11 11:58:54 +00:00
Pavel Marek	5f7a4a5a39	Merge ordered and unordered comparators (#5845 ) Merge _ordered_ and _unordered_ comparators into a single one. # Important Notes Comparator is now required to have only `compare` method: ``` type Comparator compare : T -> T -> (Ordering|Nothing) hash : T -> Integer ```	2023-03-11 05:43:22 +00:00
Radosław Waśko	91ef8acf35	Review generated Column names (#5850 ) Closes #5583 and closes #5157	2023-03-10 19:07:58 +00:00
Radosław Waśko	62e57f5557	Test some Mismatched Quote edge cases in Delimited reader (#5810 ) Follow-up to #5113 - I added some more edge case tests as we discussed with @jdunkerley. When debugging some quoting issues, I also realised the current `Mismatched_Quote` error did not provide enough information, so I amended it to at least include some context indicating which was the 'offending' cell.	2023-03-10 15:47:57 +00:00
Jaroslav Tulach	8bbdd1af5b	Meta.is_a consistent with case-type-of check (#5853 ) Removing special handling of `AtomConstructor` in `Meta.is_a` check. # Important Notes A lot of tests are about to fail. Many of them indirectly call `Meta.is_a` with a constructor rather than type.	2023-03-10 07:41:04 +00:00
James Dunkerley	299bfd6b7d	Fixes from the Demo on 2nd March (#5823 ) - Fix issue with Geo Map viz. - Handle invalid format strings better in `Data_Formatter`. - New constants for the ISO format strings (and a special ENSO_ZONED_DATE_TIME) - Consistent Date Time format for parsing in all places. - Avoid throwing exception in datetime parsing. - Support for milliseconds (well nanoseconds) in Date_Time and Time_Of_Day. - `Column.map` stays within Enso. - Allow `Aggregate_Column.Group_By` in `cross_tab` group_by parameter.	2023-03-07 20:58:00 +00:00
Pavel Marek	b6e2319fcc	Comparators support partial ordering (#5778 )	2023-03-07 04:16:38 +00:00
Radosław Waśko	da760aa27d	Review Text/Table.write problem behavior (#5816 ) Closes #5114 Added tests for various problems scenarios when writing files. And ensured that those tests are passing by fixing a few edge cases.	2023-03-07 02:25:13 +00:00
Radosław Waśko	2d29456ed1	Review File/Data read and read_text warnings (#5799 ) Closes #5113 Fixes a bug where read-only files would be overwritten if File.write was used in backup mode, and added tests to avoid such regression. To implement it, introduced a `is_writable` property on `File`.	2023-03-06 03:43:38 +00:00
James Dunkerley	01fc34c18a	Improving Expression Support for In Database (#5790 ) - Adjust Excel Workbook write behaviour. - Support Nothing / Null constants. - Deduce the type of arithmetic operations and `iif`. - Allow Date_Time constants, treating as local timezone. - Removed the `to_column_name` and `ensure_sane_name` code.	2023-03-03 12:03:05 +00:00
Radosław Waśko	b764b0b7b7	Improve error handling of Connection.query (#5693 ) Closes #5252	2023-02-24 17:15:10 +00:00
Radosław Waśko	793eafc866	Improve Table.parse_values API (#5692 ) Closes #5111	2023-02-24 13:35:01 +00:00
James Dunkerley	652b8d5db3	Update `rename_columns` to new API design, add `first_row`, `second_row` and `last_row` functions to the table. (#5719 ) - Updates the `rename_columns` API. - Add `first_row`, `second_row` and `last_row` to the Table types. - New option for reading only last row of ResultSet.	2023-02-23 19:42:45 +00:00
Radosław Waśko	3027c6f3a2	Ensure entries containing newlines are quoted when writing Delimited Files (#5652 ) Fixes #5638	2023-02-17 00:57:48 +00:00
James Dunkerley	1bc27501e6	Remove `Column` type from Aggregate_Column, simplify Column_Selector, some new `File_Format`s (#5646 ) - Updated `Widget.Vector_Editor` ready for use by IDE team. - Added `get` to `Row` to make API more aligned. - Added `first_column`, `second_column` and `last_column` to `Table` APIs. - Adjusted `Column_Selector` and associated methods to have simpler API. - Removed `Column` from `Aggregate_Column` constructors. - Added new `Excel_Workbook` type and added to `Excel_Section`. - Added new `SQLiteFormatSPI` and `SQLite_Format`. - Added new `ImageFormatSPI` and `Image_Format`.	2023-02-16 15:15:49 +00:00
Radosław Waśko	a02eab451e	Implement basic warnings for column arithmetic, review warnings on expressions and `filter` (#5605 ) Closes #5109 # Important Notes - Currently the tests pass for the in-memory parts of Common_Table_Operations, but still some stuff not working on DB backends - in progress.	2023-02-14 09:33:04 +00:00
Pavel Marek	1f8511dab2	Add Comparator conversion for all types (#4067 ) Add `Comparator` type class emulation for all types. Migrate all the types in stdlib to this new `Comparator` API. The main documentation is in `Ordering.enso`. Fixes these pivotals: - https://www.pivotaltracker.com/story/show/183945328 - https://www.pivotaltracker.com/story/show/183958734 - https://www.pivotaltracker.com/story/show/184380208 # Important Notes - The new Comparator API forces users to specify both `equals` and `hash` methods on their custom comparators. - All the `compare_to` overrides were replaced by definition of a custom _ordered_ comparator. - All the call sites of `x.compare_to y` method were replaced with `Ordering.compare x y`. - `Ordering.compare` is essentially a shortcut for `Comparable.from x . compare x y`. - The default comparator for `Any` is `Default_Unordered_Comparator`, which just forwards to the builtin `EqualsNode` and `HashCodeNode` nodes. - For `x`, one can get its hash with `Comparable.from x . hash x`. - This makes `hash` as _hidden_ as possible. There are no other public methods to get a hash code of an object. - Comparing `x` and `y` can be done either by `Ordering.compare x y` or `Comparable.from x . compare x y` instead of `x.compare_to y`.	2023-02-10 09:22:11 +00:00
James Dunkerley	1c821e22cf	Some fixes from the Anagrams experiment. (#5592 ) - Fixes the display of Date, Time_Of_Day and Date_Time so it doesn't wrap. - Adjust serialization of large integer values for JS and display within table. - Workaround for issue with using `.lines` in the Table (new bug filed). - Disabled warning on no specified `separator` on `Concatenate`. Does not include fix for aggregation on integer values outside of `long` range.	2023-02-08 22:17:00 +00:00
Radosław Waśko	4f90946d1e	Rework Invalid Aggregations (#5579 ) Closes #5108	2023-02-08 18:39:09 +00:00
Radosław Waśko	3c72ab08c4	Review `Missing_Input_Column` and `Column_Index_Out_Of_Range` warnings (#4118 ) Implements https://www.pivotaltracker.com/story/show/184226383	2023-02-06 19:52:25 +00:00
James Dunkerley	0790ce494f	New set function, parse a column (#4097 ) - New `set` function design - takes a `Column` and works with that more easily and supports control of `Set_Mode`. - New simple `parse` API on `Column`. - Separated expression support for `filter` to new `filter_by_expression` on `Table`. - New `compute` function allowing creation of a column from an expression. - Added case sensitivity argument to `Column` based on `starts_with`, `ends_with` and `contains`. - Added case sensitivity argument to `Filter_Condition` for `Starts_With`, `Ends_With`, `Contains` and `Not_Contains`. - Fixed the issue in JS Table visualisation where JavaScript date was incorrectly set. - Some dynamic dropdown expressions - experimenting with ways to use them. - Fixed issue with `.pretty` that wasn't escaping `\`. - Changed default Postgres DB to `postgres`. - Fixed SQLite support for starts_with, ends_with and contains to be consistent (using GLOB not LIKE).	2023-01-31 20:48:16 +00:00
Radosław Waśko	c965ad3455	Review `Table.order_by` (#4104 )	2023-01-31 18:29:02 +00:00
Radosław Waśko	b9dbfd036f	First steps of the Problem Handling refactor to the new design (#4086 ) Implements: - https://www.pivotaltracker.com/story/show/184226137 - https://www.pivotaltracker.com/story/show/184226434 - https://www.pivotaltracker.com/story/show/184226462	2023-01-30 16:48:06 +00:00
Radosław Waśko	778d28fba3	Table with no columns is not valid, No_Output_Columns is always an error (#4073 ) Implements https://www.pivotaltracker.com/story/show/184226020	2023-01-25 02:40:23 +00:00
Radosław Waśko	d2e57edc8b	Add Table.cross_join and Table.zip to In-Memory Table (#4063 ) Implements https://www.pivotaltracker.com/story/show/184239059	2023-01-23 13:19:52 +00:00
Pavel Marek	fcc2163ae3	All Enso objects are hasheable (#3878 ) * Hash codes prototype * Remove Any.hash_code * Improve caching of hashcode in atoms * [WIP] Add Hash_Map type * Implement Any.hash_code builtin for primitives and vectors * Add some values to ValuesGenerator * Fix example docs on Time_Zone.new * [WIP] QuickFix for HashCodeTest before PR #3956 is merged * Fix hash code contract in HashCodeTest * Add times and dates values to HashCodeTest * Fix docs * Remove hashCodeForMetaInterop specialization * Introduce snapshoting of HashMapBuilder * Add unit tests for EnsoHashMap * Remove duplicate test in Map_Spec.enso * Hash_Map.to_vector caches result * Hash_Map_Spec is a copy of Map_Spec * Implement some methods in Hash_Map * Add equalsHashMaps specialization to EqualsAnyNode * get and insert operations are able to work with polyglot values * Implement rest of Hash_Map API * Add test that inserts elements with keys with same hash code * EnsoHashMap.toDisplayString use builder storage directly * Add separate specialization for host objects in EqualsAnyNode * Fix specialization for host objects in EqualsAnyNode * Add polyglot hash map tests * EconomicMap keeps reference to EqualsNode and HashCodeNode. Rather than passing these nodes to `get` and `insert` methods. * HashMapTest run in polyglot context * Fix containsKey index handling in snapshots * Remove snapshots field from EnsoHashMapBuilder * Prepare polyglot hash map handling. - Hash_Map builtin methods are separate nodes * Some bug fixes * Remove ForeignMapWrapper. We would have to wrap foreign maps in assignments for this to be efficient. * Improve performance of Hash_Map.get_builtin Also, if_nothing parameter is suspended * Remove to_flat_vector. Interop API requires nested vector (our previous to_vector implementation). Seems that I have misunderstood the docs the first time I read it. 
- to_vector does not sort the vector by keys by default * Fix polyglot hash maps method dispatch * Add tests that effectively test hash code implementation. Via hash map that behaves like a hash set. * Remove Hashcode_Spec * Add some polyglot tests * Add Text.== tests for NFD normalization * Fix NFD normalization bug in Text.java * Improve performance of EqualsAnyNode.equalsTexts specialization * Properly compute hash code for Atom and cache it * Fix Text specialization in HashCodeAnyNode * Add Hash_Map_Spec as part of all tests * Remove HashMapTest.java Providing all the infrastructure for all the needed Truffle nodes is no longer manageable. * Remove rest of identityHashCode message implementations * Replace old Map with Hash_Map * Add some docs * Add TruffleBoundaries * Formatting * Fix some tests to accept unsorted vector from Map.to_vector * Delete Map.first and Map.last methods * Add specialization for big integer hash * Introduce proper HashCodeTest and EqualsTest. - Use jUnit theories. - Call nodes directly * Fix some specializations for primitives in HashCodeAnyNode * Fix host object specialization * Remove Any.hash_code * Fix import in Map.enso * Update changelog * Reformat * Add truffle boundary to BigInteger.hashCode * Fix performance of HashCodeTest - initialize DataPoints just once * Fix MetaIsATest * Fix ValuesGenerator.textual - Java's char is not Text * Fix indent in Map_Spec.enso * Add maps to datapoints in HashCodeTest * Add specialization for maps in HashCodeAnyNode * Add multiLevelAtoms to ValuesGenerator * Provide a workaround for non-linear key inserts * Fix specializations for double and BigInteger * Cosmetics * Add truffle boundaries * Add allowInlining=true to some truffle boundaries. Increases performance a lot. * Increase the size of vectors, and warmup time for Vector.Distinct benchmark * Various small performance fixes. 
* Fix Geo_Spec tests to accept unsorted Map.to_vector * Implement Map.remove * Fix Visualization tests to accept unsorted Map.to_vector * Treat java.util.Properties as Map * Add truffle boundaries * Invoke polyglot methods on java.util.Properties * Ignore python tests if python lang is missing	2023-01-19 10:33:25 +01:00
James Dunkerley	48e5ed9eea	Some little bits from Book Club week 1 (#4058 ) - Add `get` to Table. - Correct `Count Nothing` examples. - Add `join` to File. - Add `File_Format.all` listing all installed formats. - Add some more ALIAS entries.	2023-01-18 11:46:13 +00:00
Radosław Waśko	8853053020	Division in Columns within InDB is integer based if both columns are integers (#4057 ) Fixes https://www.pivotaltracker.com/story/show/184073099 # Important Notes - Since now the only operator on columns for division, `/`, returns floats, it may be worth creating an additional `div` operator exposing integer division. But that will be done as a separate task aligning column operator APIs.	2023-01-17 20:29:25 +00:00
Radosław Waśko	082e0bfd0d	Add `Table.union` to the In-Memory Table. (#4052 ) Implements https://www.pivotaltracker.com/story/show/183854144	2023-01-17 00:34:57 +00:00
James Dunkerley	c4c35c92b7	Align Vector API with design, add some extra functions from AoC (#4026 ) Vector - Adjusted `Vector.sort` to be `Vector.sort order on by`. - Adjusted other sort to use `order` for direction argument. - Added `insert`, `remove`, `index_of` and `last_index_of` to `Vector`. - Added `start` and `if_missing` arguments to `find` on `Vector`, and adjusted the default to a `Not_Found` error. - Added type checking to `+` on `Vector`. - Altered `first`, `second` and `last` to error with `Index_Out_Of_Bounds` on `Vector`. - Removed `sum`, `exists`, `head`, `init`, `tail`, `rest`, `append`, `prepend` from `Vector`. Pair - Added `last`, `any`, `all`, `contains`, `find`, `index_of`, `last_index_of`, `reverse`, `each`, `fold` and `reduce` to `Pair`. - Added `get` to `Pair`. Range - Added `first`, `second`, `index_of`, `last_index_of`, `reverse` and `reduce` to `Range`. - Added `at` and `get` to `Range`. - Added `start` and `if_missing` arguments to `find` on `Range`. - Simplified `last` and `length` of `Range`. - Removed `exists` from `Range`. List - Added `second`, `find`, `index_of`, `last_index_of`, `reverse` and `reduce` to `List`. - Added `at` and `get` to `List`. - Removed `exists` from `List`. - Made `all` short-circuit if any fail on `List`. - Altered `is_empty` to not compute the length of `List`. - Altered `first`, `tail`, `head`, `init` and `last` to error with `Index_Out_Of_Bounds` on `List`. Others - Added `first`, `second`, `last`, `get` to `Text`. - Added wrapper methods to the Random_Number_Generator so you can get random values more easily. - Adjusted `Aggregate_Column` to operate on the first column by default. - Added `contains_key` to `Map`. - Added ALIAS to `row_count` and `order_by`.	2023-01-12 13:32:24 +00:00
Radosław Waśko	0088096a58	Implement Distinct for the Database backends (#4027 ) Implements https://www.pivotaltracker.com/story/show/182307281	2023-01-11 22:46:54 +00:00
Radosław Waśko	8c661fdb74	Database Joins (#4007 ) Implements https://www.pivotaltracker.com/story/show/184032869 # Important Notes - Currently we get failures in Full joins on Postgres which show a more serious problem - amending equality to ensure that `[NULL = NULL] == True` breaks hash/merge based indexing - so such joins will be extremely inefficient. All our joins currently rely on this notion of equality, which means all of our DB joins will be extremely inefficient. - We need to find a solution that will support nulls and still work OK with indices. However, after exploring a few approaches (`COALESCE(a = b, a IS NULL AND b is NULL)`, `a IS NOT DISTINCT FROM b`, `(a = b) OR (a IS NULL AND b is NULL)`), none of which worked (they all result in `ERROR: FULL JOIN is only supported with merge-joinable or hash-joinable join conditions`), I'm less certain that it is possible. Alternatively, we may need to change the NULL semantics to align them with SQL - this seems likely to be the simpler solution, allowing us to generate simple, reliable SQL - the NULL=NULL solution will be cornering us into nasty workarounds very dependent on the particular backend.	2023-01-05 10:36:22 +00:00
Dmitry Bushev	74742d3267	Make To Text Conversion Identity for Text (#4009 ) First part of fixing `Text.to_text`. - add: `pretty` method for pretty printing. - update: make `Text.to_text` conversion identity for Text In the next iterations `to_text` will be gradually replaced with `to Text` conversion once the related issues with conversions are fixed.	2022-12-29 12:21:24 +00:00
James Dunkerley	579d3fc397	Adds Date, Time_Of_Day and Date_Time support to Excel IO (#3997 ) - Allow date time inputs from Excel. - Enables disabled test. - Fix for Map.==. - Allow nulls in crosstab name.	2022-12-20 16:12:00 +00:00
James Dunkerley	ace459ed53	Let JavaScript parse JSON and write JSON ... (#3987 ) Use JavaScript to parse and serialise to JSON. Parses to native Enso object. - `.to_json` now returns a `Text` of the JSON. - Json methods now `parse`, `stringify` and `from_pairs`. - New `JS_Object` representing a JavaScript Object. - `.to_js_object` allows for types to custom serialize. Returning a `JS_Object`. - Default JSON format for Atom now has a `type` and `constructor` property (or method to call for as needed to deserialise). - Removed `.into` support for now. - Added JSON File Format and SPI to allow `Data.read` to work. - Added `Data.fetch` API for easy Web download. - Default visualization for JS Object truncates, and made Vector default truncate children too. Fixes defect where types with no constructor crashed on `to_json` (e.g. `Matching_Mode.Last.to_json`). Adjusted default visualisation for Vector, so it doesn't serialise an array of arrays forever. Likewise, JS_Object default visualisation is truncated to a small subset. New convention: - `.get` returns `Nothing` if a key or index is not present. Takes an `other` argument allowing control of default. - `.at` errors if a key or index is not present. - `Nothing` gains a `get` method allowing for easy propagation.	2022-12-20 10:33:46 +00:00
Radosław Waśko	b9bf958f2c	Efficient joining for Equals and Equals_Ignore_Case using a hashmap (#3978 ) - Implemented https://www.pivotaltracker.com/story/show/183913276 - Refactored MultiValueIndex and MultiValueKeys to be more type-safe and more direct about using ordered or unordered maps. - Added performance tests ensuring we use an efficient algorithm for the joins (the tests will fail for a full O(N*M) scan). - Removed some duplicate code in the Table library. - Added optional coloring of test results in terminal to make failures easier to spot.	2022-12-14 22:56:20 +00:00
James Dunkerley	77fe69dfd9	JSON Improvements, small Table stuff, Statistic in Enso not Java and few other minor bits. (#3964 ) - Aligned `compare_to` so returns `Type_Error` if `that` is wrong type for `Text`, `Ordering` and `Duration`. - Add `empty_object`, `empty_array`, `get_or_else`, `at`, `field_names` and `length` to `Json`. - Fix `Json` serialisation of NaN and Infinity (to "null"). - Added `length`, `at` and `to_vector` to Pair (allowing it to be treated as a Vector). - Added `running_fold` to the `Vector` and `Range`. - Added `first` and `last` to the `Vector.Builder`. - Allow `order_by` to take a single `Sort_Column` or have a mix of `Text` and `Sort_Column.Name` in a `Vector`. - Allow `select_columns_helper` to take a `Text` value. Allows for a single field in group_by in cross_tab. - Added `Patch` and `Custom` to HTTP_Method. - Added running `Statistic` calculation and moved more of the logic from Java to Enso. Performance seems similar to pure Java version now.	2022-12-14 19:40:27 +00:00
Radosław Waśko	8e880e430b	Improve basic join implementation (#3958 ) Implements https://www.pivotaltracker.com/story/show/183913232 # Important Notes Added counts of succeeded/failed tests within a group and global summary, to easier see how many tests failed.	2022-12-09 00:55:07 +00:00
James Dunkerley	da0dc253cb	Fix order by Text (#3957 ) Mistake in the definition.	2022-12-07 19:16:32 +00:00
James Dunkerley	4cbd72a4eb	Some more tidying based on remaining tickets and PR comments. (#3946 ) - Moved `to_default_visualization_data` to `Standard.Visualization`. - Remove the use of `is_a` in favour of case statements. - Stop exporting Standard.Base.Error.Common. - Separate errors to own files. - Change constructors to be called `Error`. - Rename `Caught_Panic.Caught_Panic_Data` -> `Caught_Panic.Panic`. - Rename `Project_Description.Project_Description_Data` ->`Project_Description.Value` - Rename `Regex_Matcher.Regex_Matcher_Data` -> `Regex_Matcher.Value` (can't come up with anything better!). - Rename `Range.Value` -> `Range.Between`. - Rename `Interval.Value` -> `Interval.Between`. - Rename `Column.Column_Data` -> `Column.Value`. - Rename `Table.Table_Data` -> `Table.Value`. - Align all the Error types in Table. - Removed GEO Json bits from Table. - `Json.to_table` doesn't have the GEO bits anymore. - Added `Json.geo_json_to_table` to add the functions back in. # Important Notes No more exports from anywhere but Main! No more `_Data` constructors!	2022-12-06 18:35:18 +00:00
James Dunkerley	0ad70c6332	Tidy Standard.Base part 5 of n ... (hopefully the end...) (#3929 ) - Moved `Any`, `Error` and `Panic` to `Standard.Base`. - Separated `Json` and `Range` extensions into own modules. - Tidied `Case`, `Case_Sensitivity`, `Encoding`, `Matching`, `Regex_Matcher`, `Span`, `Text_Matcher`, `Text_Ordering` and `Text_Sub_Range` in `Standard.Base.Data.Text`. - Tidied `Standard.Base.Data.Text.Extensions` and stopped it re-exporting anything. - Tidied `Regex_Mode`. Renamed `Option` to `Regex_Option` and added type to export. - Tidied up `Regex` space. - Tidied up `Meta` space. - Remove `Matching` from export. - Moved `Standard.Base.Data.Boolean` to `Standard.Base.Boolean`. # Important Notes - Moved `to_json` and `to_default_visualization_data` from base types to extension methods.	2022-12-02 18:08:14 +00:00
Radosław Waśko	0e6fd34d22	Split Common_Table_Spec into smaller modules, tidy up Table tests (#3940 ) Implements https://www.pivotaltracker.com/story/show/183901722	2022-12-01 17:02:44 +00:00
Hubert Plociniczak	06bd69436b	Import modules' extension methods only with unqualified import statements (#3906 ) # Important Notes Note that one cannot ``` import Standard.Table as Table_Module ``` because of the 2-component name restriction that gets desugared to `Standard.Table.Main` and we have to write ``` import Standard.Table.Main as Table_Module ``` in a few places. Once we move `Json.to_table` extension this can be improved.	2022-12-01 10:13:34 +00:00
James Dunkerley	4518f8303d	Implementing transpose and cross_tab for the InMemory table. (#3919 ) - Adds transpose and cross_tab to the In-Memory table. - Cross Tab is built on top of aggregate and hence allows for expressions and has same error trapping as in aggregate. # Important Notes Only basic tests have been implemented. Error and warning tests will be added as a follow up task.	2022-11-30 01:19:25 +00:00
Radosław Waśko	85cbf7d9f9	Initial (naive) implementation for in memory join (#3918 ) Implements https://www.pivotaltracker.com/story/show/183854123 It features a naive full scan join and only allows equality conditions. More advanced conditions and better optimized algorithms will be implemented in a subsequent PR.	2022-11-29 19:37:31 +00:00
James Dunkerley	4e30b3036d	Tidy Standard.Base part 4 of n ... (#3898 ) - Export all for `Problem_Behavior` (allowing for Report_Warning, Report_Error and Ignore to be trivially used). - Renamed `Range.Range_Data` to `Range.Value` moved to using `up_to` wherever possible. - Reviewed `Function`, `IO`, `Polyglot`, `Random`, `Runtime`, `System`. - `File` now published as type. Some static methods moved to `Data` others into type. Removed `read_bytes` static. - New `Data` module for reading input data in one place (e.g. `Data.read_file`) will add `Data.connect` later. - Added `Random` module to the exports. - Move static methods into `Warning` type and exporting the type not the module. # Important Notes - Sorted a few imports into order (ordering by direct import in project, then by from import in project then polyglot and finally self imports).	2022-11-25 02:00:16 +00:00
James Dunkerley	93fee3a51f	Tidy Standard.Base part 3 of n ... (#3893 ) Here we go again... - Tidied up `Pair` and stopped exporting `Pair_Data`. Adjusted so type exported. - Tidy imports for `Json`, `Json.Internal`, `Locale`. - Tidy imports Ordering.*. Export `Sort_Direction` and `Case_Sensitivity` as types. - Move methods of `Statistics` into `Statistic`. Publishing the types not the module. - Added a `compute` to a `Rank_Method`. - Tidied the `Regression` module. - Move methods of `Date`, `Date_Time`, `Duration`, `Time_Of_Day` and `Time_Zone` into type. Publishing types not modules. - Added exporting `Period`, `Date_Period` and `Time_Period` as types. Static methods moved into types. # Important Notes - Move `compare_to_ignore_case`, `equals_ignore_case` and `to_case_insensitive_key` from Extensions into `Text`. - Hiding polyglot java imports from export all in `Main.enso`.	2022-11-21 15:30:18 +00:00
James Dunkerley	14dbe7287b	Tidy Standard.Base Part 1 of n... (#3884 ) * Tidy Bound and Interval. * Fix Interval tests. * Fix Interval tests. * Restructure Index_Sub_Range to new Type/Statics. * Adjust for Vector exported as a type and static methods on it. * Tidy Maybe. * Fix issue with Line_Ending_Style. * Revert Filter_Condition change. Fix benchmark test issue. Tidy imports on Index_Sub_Range. * Revert Filter_Condition change. Fix benchmark test issue. Tidy imports on Index_Sub_Range. * Can't export constructors unless exported from type in module. * Fix failing tests.	2022-11-18 08:57:41 +00:00
James Dunkerley	c868ed5efe	Some minor fixes (#3874 ) - Allow `Map` to store a `Nothing` key (fixes `Vector.distinct` with a `Nothing`). - Add `column_names` method to `Table` as a shorthand. - Return data flow error when comparing with Nothing (not a Panic or a Polyglot exception). - Allow milli and micro second for DateTime and Time Of Day # Important Notes - Added a load of tests for the various comparison operators to Numbers_Spec.	2022-11-17 07:11:18 +00:00
Jaroslav Tulach	5ce173316b	More improvements that work with both parsers (#3868 )	2022-11-12 02:34:14 +01:00
James Dunkerley	45276b243d	Expanding Derived Columns and Expression Syntax (#3782 ) - Added expression ANTLR4 grammar and sbt based build. - Added expression support to `set` and `filter` on the Database and InMemory `Table`. - Added expression support to `aggregate` on the Database and InMemory `Table`. - Removed old aggregate functions (`sum`, `max`, `min` and `mean`) from `Column` types. - Adjusted database `Column` `+` operator to do concatenation (`||`) when text types. - Added power operator `^` to both `Column` types. - Adjust `iif` to allow for columns to be passed for `when_true` and `when_false` parameters. - Added `is_present` to database `Column` type. - Added `coalesce`, `min` and `max` functions to both `Column` types performing row based operation. - Added support for `Date`, `Time_Of_Day` and `Date_Time` constants in database. - Added `read` method to InMemory `Column` returning `self` (or a slice). # Important Notes - Moved approximate type computation to `SQL_Type`. - Fixed issue in `LongNumericOp` where it was always casting to a double. - Removed `head` from InMemory Table (still has `first` method).	2022-11-08 15:57:59 +00:00
James Dunkerley	b5881efdf0	Allow integers for take and drop. (#3854 ) Allows passing an integer to take or drop as a shorthand.	2022-11-04 14:03:28 +00:00
Radosław Waśko	438a284346	Implement missing `Table.take/drop While` (#3840 ) Implements https://www.pivotaltracker.com/story/show/183677982	2022-11-01 12:11:50 +00:00
Kaz Wesley	330612119a	Parse the standard library (#3830 ) Fix bugs in `TreeToIr` (rewrite) and parser. Implement more undocumented features in parser. Emulate some old parser bugs and quirks for compatibility. Changes in libs: - Fix some bugs. - Clean up some odd syntaxes that the old parser translates idiosyncratically. - Constructors are now required to precede methods. # Important Notes Out of 221 files: - 215 match the old parser - 6 contain complex types the old parser is known not to handle correctly So, compared to the old parser, the new parser parses 103% of files correctly.	2022-10-31 16:19:12 +00:00
Radosław Waśko	f60e9e9d8e	Add a Visualization to the Table.Row type (#3837 ) Follow up to https://www.pivotaltracker.com/story/show/182307026 When working in the IDE I noticed that the default vis is bad, so this should make it better.	2022-10-31 14:20:13 +00:00
Radosław Waśko	bb29833da5	Create a Table `Row` Type and expose as a Vector on In-Memory Table with `.rows` property (#3827 ) Implements https://www.pivotaltracker.com/story/show/182307026	2022-10-26 11:21:33 +00:00
Radosław Waśko	2bc0611869	Add support for using Columns within `Is_In` (#3822 ) Implements https://www.pivotaltracker.com/story/show/183560222	2022-10-24 12:51:15 +00:00
James Dunkerley	f0f6deef2a	Load the File_Format types via a ServiceLoader (#3813 ) Moves the File.read method into the `File` type. Uses the ServiceLoader to find all types for the File_Format.	2022-10-24 09:55:18 +00:00
Radosław Waśko	cc76e7d36a	Add support for `Blank_Columns` to Table and Database (#3812 ) Implements https://www.pivotaltracker.com/story/show/183390281 and https://www.pivotaltracker.com/story/show/183390394	2022-10-20 09:11:08 +00:00
Radosław Waśko	17f73988e8	Update `drop_missing_rows` to `filter_blank_rows` API. (#3805 ) Implements https://www.pivotaltracker.com/story/show/183390042 and https://www.pivotaltracker.com/story/show/183390370	2022-10-18 15:58:50 +00:00
James Dunkerley	701c644d0e	Tidy up the remaining ones except Base... (#3797 ) - Removed `Dubious constructor export` from Examples, Geo, Google_Api, Image and Test. - Updated Google_Api project to meet newer code standards. - Restructured `Standard.Test`: - `Main.enso` now exports `Bench`, `Faker`, `Problems`, `Test`, `Test_Suite` - `Test.Suite` methods moved into a `Test_Suite` type. - Moved `Bench.measure` into `Bench` type. - Separated the reporting to a `Test_Reporter` module. - Moved `Faker` methods into `Faker` type. - Removed `Verbs` and `.should` method. - Added `should_start_with` and `should_contain` extensions to `Any`. - Restructured `Standard.Image`: - Merged Codecs methods into `Image`. - Export `Image`, `Read_Flag`, `Write_Flag` and `Matrix` as types from `Main.enso`. - Merged the internal methods into `Matrix` and `Image`. - Fixed `Day_Of_Week` to be exported as a type and sort the `from` method.	2022-10-17 11:27:27 +00:00
Radosław Waśko	82de8f88bd	Add support for `Is_In` and `Not_In` to `Filter_Condition` (#3790 ) Implements https://www.pivotaltracker.com/story/show/183389945	2022-10-15 11:29:59 +00:00
Paweł Grabarz	ce6267f098	Add `replace_text` method to In-Memory `Table` (#3793 ) Implements https://www.pivotaltracker.com/n/projects/2539304/stories/183415329	2022-10-14 17:42:29 +02:00
Radosław Waśko	592a8516a8	Add `Is_Empty`, `Not_Empty`, `Like` and `Not_Like` to `Filter_Condition` (#3775 ) Implements https://www.pivotaltracker.com/story/show/183389890	2022-10-10 23:11:04 +00:00
James Dunkerley	9301f2dcc5	Sort out statics in Database. (#3774 ) - Moved `Standard.Database.connect` into `Standard.Database.Database.connect`, so can now just `from Standard.Database import ...`. - Removed all `Dubious constructor export`s. - Switched to using `project` for internal imports. - Moved to using `Value` for private constructors and not re-exporting. - Export types not modules from `Standard.Database`. - Broke up `IR` into separate files (Context, Expression, From_Spec, Internal_Column, Join_Kind, Query). - No longer use `IR.` instead via specific types. - Broke up `SQL` into separate files (SQL_Type and SQL_Statement). Additionally; - Standard.Table: Moved `storage_types` into `Storage`. - Standard.Table: Switched to using `project` for internal imports. - Standard.Table.Excel: Renamed modules `Range` to `Excel_Range` and `Section` to `Excel_Section`. - `Standard.Visualisation`: Switched to using `project` for internal imports. - `Standard.Visualisation`: Moved to using `Value` for private constructors and not re-exporting. # Important Notes - Have not cleared up the `Errors` yet. - Have not switched to type pattern matching.	2022-10-07 11:32:00 +00:00
Radosław Waśko	7afaf8c6cc	Add filtering by Filter_Condition to `Vector`, `Range` and `List` (#3770 ) Implements https://www.pivotaltracker.com/story/show/183389901	2022-10-07 04:02:54 +00:00
James Dunkerley	185378f07c	Moving library statics to type for Table. (#3760 ) - Generally export types not modules from the `Standard.Table` import. - Moved `new`, `from_rows` from the `Standard.Table` library into the `Table` type. - Renamed `Standard.Table.Data.Storage.Type` to `Standard.Table.Data.Storage.Storage` - Removed the internal `from_columns` method. - Removed `join` and `concat` and merged into instance methods. - Removed `Table` and `Column` from the `Standard.Database` exports. - Removed `Standard.Table.Data.Column.Aggregate_Column` as not used any more.	2022-10-06 17:01:18 +00:00
Radosław Waśko	503d3eb5a3	Update Table API with new `filter` design (#3750 ) Implements https://www.pivotaltracker.com/story/show/183389855 # Important Notes Implements basic filter operations both In-Memory and for the Database backend, ensuring that existing tests can be adapted and keep working. Not all `Filter_Condition`s are implemented yet. Also implements significant part of https://www.pivotaltracker.com/story/show/183390314	2022-10-05 11:40:10 +00:00
James Dunkerley	eeec3bdf94	Adding a shorthand for the Table API until widgets available (#3753 ) Allows using `Vector ColumnName` for the various table functions as short hand. - `select_columns`, `remove_columns`,`reorder_columns`, `distinct` all map to an exact By_Name match. - `rename_columns` does a positional rename on the Vector passed. - `order_by` sorts ascending on each column passed in order. # Important Notes This may be reversed once widgets are available and working but this makes the APIs much more usable in current UI.	2022-10-04 11:37:14 +00:00
Hubert Plociniczak	ae66087bb9	By-type pattern matching (#3742 ) This change brings by-type pattern matching to Enso. One can pattern match on Enso types as well as on polyglot types. For example, ``` case x of _ : Integer -> ... _ : Text -> ... _ -> ... ``` as well as Java's types ``` case y of _ : ArrayList -> ... _ : List -> ... _ : AbstractList -> ... _ -> .. ``` It is no longer possible to match a value with a corresponding type constructor. For example ``` case Date.now of Date -> ... ``` will no longer match and one should match on the type (`_ : Date`) instead. ``` case Date of Date -> ... ``` is fine though, as requested in the ticket. The change required further changes to `type_of` logic which wasn't dealing well with polyglot values. Implements https://www.pivotaltracker.com/story/show/183188846 # Important Notes ~I discovered late in the game that nested patterns involving type patterns, such as `Const (f : Foo) tail -> ...` are not possible due to the old parser logic. I would prefer to add it in a separate PR because this one is already getting quite large.~ This is now supported!	2022-10-04 09:23:58 +00:00
Radosław Waśko	61a4120cfb	Fix date comparisons and test sorting of tables and vectors with dates (#3745 ) Implements https://www.pivotaltracker.com/story/show/183402892 # Important Notes - Fixes inconsistent `compare_to` vs `==` behaviour in date/time types and adds test for that. - Adds test for `Table.order_by` on dates and custom types. - Fixes an issue with `Table.order_by` for custom types. - Unifies how incomparable objects are reported by `Table.order_by` and `Vector.sort`. - Adds benchmarks comparing `Table.order_by` and `Vector.sort` performance.	2022-09-29 08:48:00 +00:00
Radosław Waśko	7da4d61484	Update Case Sensitivity arguments (#3741 ) Implements https://www.pivotaltracker.com/story/show/183314956	2022-09-27 20:15:52 +00:00
Marcin Kostrzewa	726bfeb883	Wip/mk/statics (#3740 ) Makes statics static. A type and its instances have different methods defined on them, as it should be. Constructors are now scoped in types, and can be imported/exported. # Important Notes The method of fixing stdlib chosen here is to just not. All the conses are exported to make all old code work. All such instances are marked with `TODO Dubious constructor export` so that it can be found and fixed.	2022-09-27 14:23:06 +00:00
Hubert Plociniczak	7a6ee0c200	Implement `type_of` (#3722 ) This change implements a simple `type_of` method that returns a type of a given value, including for polyglot objects. The change also allows for pattern matching on various time-related instances. It is a nice-to-have on its own, but it was primarily needed here to write some tests. For equality checks on types we currently can't use `==` due to a known _feature_ which essentially does wrong dispatching. This will be improved in the upcoming statics PR so we agreed that there is no point in duplicating that work and we can replace it later. Also, note that this PR changes `Meta.is_same_object`. Comparing types revealed that it was wrong when comparing polyglot wrappers over the same value.	2022-09-26 16:01:39 +00:00
James Dunkerley	6f54e80970	Adjust Database connection to use query/read to access data. (#3727 ) Adjust Database connection API to align with new [design](https://github.com/enso-org/design/blob/wip/jd/database-read/epics/basic-libraries/database-read/design.md#querying-tables). - `query` replaces the old `access_table` and is expanded to support raw SQL queries. - `read` replaces `execute_query` and matches the API of `query`. - `to_dataframe` is renamed to `read`. # Important Notes Added support for `++` to concatenate a Text without wrapping in a `SQL.Code`.	2022-09-23 07:35:08 +00:00
James Dunkerley	d6346e9d66	Renaming various constructors and moving types around for Database. (#3715 ) Repairing the constructor name following the types work. Some general tidying up as well. - Remove `Standard.Database.Data.Column.Aggregate_Column_Builder`. - Remove `Standard.Database.Data.Dialect.Dialect.Dialect_Data`. - Remove unused imports and update some type definitions. - Rename `Postgres.Postgres_Data` => `Postgres_Options.Postgres`. - Rename `Redshift.Redshift_Data` => `Redshift_Options.Redshift`. - Rename `SQLite.SQLite_Data` => `SQLite_Options.SQLite`. - Rename `Credentials.Credentials_Data` => `Credentials.Username_And_Password`. - Rename `Sql` to `SQL` across the board. - Merge `Standard.Database.Data.Internal` into `Standard.Database.Internal`. - Move dialects into `Internal` and merge the function in `Helpers` into `Base_Generator`.	2022-09-19 12:39:40 +00:00
Radosław Waśko	8fa8d12cc3	String functionality in std-table should use std-base (#3717 ) Implements https://www.pivotaltracker.com/story/show/181754646	2022-09-17 14:38:02 +00:00
Hubert Plociniczak	0e5df935d3	Don't rename imported Main module that only imports names (#3710 ) It turns out that for a two-part import we had special code that would a) add the Main submodule and b) add an explicit rename. b) is problematic because sometimes we only want to import specific names. E.g., ``` from Bar.Foo import Bar, Baz ``` would be translated to ``` from Bar.Foo.Main as Foo import Bar, Baz ``` and it should only be translated to ``` from Bar.Foo.Main import Bar, Baz ``` This change detects this scenario and does not add renames in that case. Fixes [183276486](https://www.pivotaltracker.com/story/show/183276486).	2022-09-16 13:01:06 +00:00
Radosław Waśko	5ed388930e	Additional tests for handling Dates in Table (#3707 ) Resolves https://www.pivotaltracker.com/story/show/183285801 @JaroslavTulach suggested the current implementation may not handle these correctly, which suggests that the logic is not completely trivial - so I added a test to ensure that it works as we'd expect. Fortunately, it did work - but it's good to keep the tests to avoid regressions.	2022-09-15 23:18:19 +00:00
James Dunkerley	0126f02e7b	Restructure File.read into the new design (#3701 ) Changes following Marcin's work. Should be back to very similar public API as before. - Add an "interface" type: `Standard.Base.System.File_Format.File_Format`. - All `File_Format` types now have a `can_read` method to decide if they can read a file. - Move `Standard.Table.IO.File_Format.Text.Text_Data` to `Standard.Base.System.File_Format.Plain_Text_Format.Plain_Text`. - Move `Standard.Table.IO.File_Format.Bytes` to `Standard.Base.System.File_Format.Bytes`. - Move `Standard.Table.IO.File_Format.Infer` to `Standard.Base.System.File_Format.Infer`. (doesn't belong here...) - Move `Standard.Table.IO.File_Format.Unsupported_File_Type` to `Standard.Base.Error.Common.Unsupported_File_Type`. - Add `Infer`, `File_Format`, `Bytes`, `Plain_Text`, `Plain_Text_Format` to `Standard.Base` exports. - Fold extension methods of `Standard.Base.Meta.Unresolved_Symbol` into type. - Move `Standard.Table.IO.File_Format.Auto` to `Standard.Table.IO.Auto_Detect.Auto_Detect`. - Added a `types` Vector of all the built in formats. - `Auto_Detect` asks each type if they `can_read` a file. - Broke up and moved `Standard.Table.IO.Excel` into `Standard.Table.Excel`: - Moved `Standard.Table.IO.File_Format.Excel.Excel_Data` to `Standard.Table.Excel.Excel_Format.Excel_Format.Excel`. - Renamed `Sheet` to `Worksheet`. - Internal types `Reader` and `Writer` providing the actual read and write methods. - Created `Standard.Table.Delimited` with similar structure to `Standard.Table.Excel`: - Moved `Standard.Table.IO.File_Format.Delimited.Delimited_Data` to `Standard.Table.Delimited.Delimited_Format.Delimited_Format.Delimited`. - Moved `Standard.Table.IO.Quote_Style` to `Standard.Table.Delimited.Quote_Style`. - Moved the `Reader` and `Writer` internal types into here. Renamed methods to have unique names. 
- Add `Aggregate_Column`, `Auto_Detect`, `Delimited`, `Delimited_Format`, `Excel`, `Excel_Format`, `Sheet_Names`, `Range_Names`, `Worksheet` and `Cell_Range` to `Standard.Table` exports.	2022-09-15 14:48:46 +00:00
James Dunkerley	2b425f8e08	Restructuring `Database.Connection` to allow for database specific types. (#3632 ) - Added `databases`, `database`, `set_database`. - Added `schemas`, `schema`, `set_schema`. - Added `table_types`, - Added `tables`. - Moved the vast majority of the connection work into a lower level `JDBC_Connection` object. - `Connection` represents the standard API for database connections and provides a base JDBC implementation. - `SQLite_Connection` has the `Connection` API but with custom `databases` and `schemas` methods for SQLite. - `Postgres_Connection` has the `Connection` API but with custom `set_database`, `databases`, `set_schema` and `schemas` methods for Postgres. - Updated `Redshift` - no public API change.	2022-09-07 17:32:28 +00:00
Radosław Waśko	551100af3b	Add `Table.distinct` function to In-Memory table (#3684 ) Implements https://www.pivotaltracker.com/story/show/182307143 # Important Notes - Modified standard library Java helpers dependencies so that `std-table` module depends on `std-base`, as a provided dependency. This is allowed, because `std-table` is used by the `Standard.Table` Enso module which depends on `Standard.Base` which ensures that the `std-base` is loaded onto the classpath, thus whenever `std-table` is loaded by `Standard.Table`, so is `std-base`. Thus we can rely on classes from `std-base` and its dependencies being _provided_ on the classpath. Thanks to that we can use utilities like `Text_Utils` also in `std-table`, avoiding code duplication. Additional advantage of that is that we don't need to specify ICU4J as a separate dependency for `std-table`, since it is 'taken' from `std-base` already - so we avoid including it in our build packages twice.	2022-09-07 12:28:41 +00:00
Radosław Waśko	eafba079d9	Make In Memory Table Aggregator types more specific where possible (#3679 ) Many aggregation types fell back to the general `Any` type where they could have used the type of input column - for example `First` of a column of integers is guaranteed to fit the `Integer` storage type, so it doesn't have to fall back to `Any`. This PR fixes that and adds a test that checks this.	2022-09-05 09:17:41 +00:00
Radosław Waśko	65140f48ca	Add storage support for Date, Time and DateTime to InMemory table (#3673 ) Implements https://www.pivotaltracker.com/story/show/183080911	2022-08-31 22:06:29 +00:00
Marcin Kostrzewa	4fc6dcced0	Get rid of free-floating atoms. Everything has a type now! (#3671 ) This is a step towards the new language spec. The `type` keyword now means something. So we now have ``` type Maybe a Some (from_some : a) None ``` as a thing one may write. Also `Some` and `None` are not standalone types now – only `Maybe` is. This is halfway to static methods – we still allow for things like `Number + Number` for backwards compatibility. It will disappear in the next PR. The concept of a type is now used for method dispatch – with great impact on interpreter code density. Some APIs in the STDLIB may require re-thinking. I take it this is going to be up to the libraries team – some choices are not as good with a semantically different language. I've strived to update stdlib with minimal changes – to make sure it still works as it did. It is worth mentioning the conflicting constructor name convention I've used: if `Foo` only has one constructor, previously named `Foo`, we now have: ``` type Foo Foo_Data f1 f2 f3 ``` This is now necessary, because we still don't have proper statics. When they arrive, this can be changed (quite easily, with SED) to use them, and figure out the actual convention then. I have also reworked large parts of the builtins system, because it did not work at all with the new concepts. It also exposes the type variants in SuggestionBuilder, that was the original tiny PR this was based on. PS I'm so sorry for the size of this. No idea how this could have been smaller. It's a breaking language change after all.	2022-08-30 22:54:53 +00:00
Radosław Waśko	e6e4692692	DataFormatter should infer datetime from values without seconds (#3668 ) Fixes https://www.pivotaltracker.com/story/show/183033133	2022-08-26 21:10:52 +00:00
Radosław Waśko	d7ebc4a338	Add `Table.take` and `Table.drop` functions to In-Memory table (#3647 ) Implements https://www.pivotaltracker.com/story/show/182307347	2022-08-26 19:41:36 +00:00
James Dunkerley	a20d43390e	Adding DateTime part functions (#3669 ) - Added `Zone`, `Date_Time` and `Time_Of_Day` to `Standard.Base`. - Renamed `Zone` to `Time_Zone`. - Added `century`. - Added `is_leap_year`. - Added `length_of_year`. - Added `length_of_month`. - Added `quarter`. - Added `day_of_year`. - Added `Day_Of_Week` type and `day_of_week` function. - Updated `week_of_year` to support ISO. # Important Notes - Had to pass locale to formatter for date/time tests to work on my PC. - Changed default of `week_of_year` to use ISO.	2022-08-26 15:47:58 +00:00
Hubert Plociniczak	d87a32d019	Builtin Date_Time, Time_Of_Day, Zone (#3658 ) * Builtin Date_Time, Time_Of_Day, Zone Improved polyglot support for Date_Time (formerly Time), Time_Of_Day and Zone. This follows the pattern introduced for Enso Date. Minor caveat - in tests for Date, had to bend a lot for JS Date to pass. This is because JS Date is not really only a Date, but also a Time and Timezone, previously we just didn't consider the latter. Also, JS Date does not deal well with setting timezones so the trick I used is to first call foreign function returning a polyglot JS Date, which is converted to ZonedDateTime and only then set the correct timezone. That way none of the existing tests had to be changed or special cased. Additionally, JS deals with milliseconds rather than nanoseconds so there is loss in precision, as noted in Time_Spec. * Add tests for Java's LocalTime * changelog * Make date formatters in table happy * PR review, add more tests for zone * More tests and fixed a bug in column reader Column reader didn't take into account timezone but that was a mistake since then it wouldn't map to Enso's Date_Time. Added tests that check it now. * remove redundant conversion * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Time.enso Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * First round of addressing PR review * don't leak java exceptions in Zone * Move Date_Time to top-level module * PR review Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> Co-authored-by: Jaroslav Tulach <jaroslav.tulach@enso.org>	2022-08-24 12:31:29 +02:00
James Dunkerley	684adcb7fb	Tidy up the default imports for Standard.Table (#3660 ) - Removed various unnecessary `Standard.Base` imports still left behind. - Added `Regex` to default `Standard.Base`. - Removed aliasing from the examples as no longer needed (case coercion no longer occurs). - Remove `import Standard.Table` from within the Table library (directly importing types). - Reviewed what was in `Standard.Database` - a few tweaks and removals. - Removed various un-needed aliasing following Hubert's import work.	2022-08-22 19:21:54 +00:00
Radosław Waśko	bcca7f10d9	Add key functions to Table to make it act as [Column] (#3644 ) Implements https://www.pivotaltracker.com/story/show/181370836	2022-08-18 12:33:02 +00:00
Radosław Waśko	fbf6c800f1	Short hand version for `order_by` (#3643 ) Implements https://www.pivotaltracker.com/story/show/182868310	2022-08-16 15:41:37 +00:00
Radosław Waśko	3dca738cf7	Add `Vector.take` and `Vector.drop` functions (#3629 ) Implements https://www.pivotaltracker.com/story/show/182307048	2022-08-10 16:02:02 +00:00
Radosław Waśko	0a2fea925c	Create `Index_Sub_Range` type and update `Text.take` and `Text.drop` (#3617 )	2022-08-03 11:41:34 +00:00
Hubert Plociniczak	d59714a29d	Support module imports using a qualified name (#3608 ) This change allows for importing modules using a qualified name and deals with any conflicts on the way. Given a module C defined at `A/B/C.enso` with ``` type C type C a ``` it is now possible to import it as ``` import project.A ... val x = A.B.C 10 ``` Given a module located at `A/B/C/D.enso`, we will generate intermediate, synthetic, modules that only import and export the successor module along the path. For example, the contents of a synthetic module B will look like ``` import <namespace>.<pkg-name>.A.B.C export <namespace>.<pkg-name>.A.B.C ``` If module B is defined already by the developer, the compiler will _inject_ the above statements to the IR. Also removed the last elements of some lowercase name resolution that managed to survive recent changes (`Meta.Enso_Project` would now be ambiguous with `enso_project` method). Finally, added a pass that detects shadowing of the synthetic module by the type defined along the path. We print a warning in such a situation. Related to https://www.pivotaltracker.com/n/projects/2539304 # Important Notes There was an additional request to fix the annoying problem with `from` imports that would always bring the module into the scope. The changes in stdlib demonstrate how it is now possible to avoid the workaround of ``` from X.Y.Z as Z_Module import A, B ``` (i.e. `as Z_Module` part is almost always unnecessary).	2022-07-29 14:19:07 +00:00
Hubert Plociniczak	f63e40df1b	Explicit self (#3569 ) This change modifies the current language by requiring explicit `self` parameter declaration for methods. Methods without `self` parameter in the first position should be treated as statics although that is not yet part of this PR. We add an implicit self to all methods. This obviously required updating the whole stdlib and its components, tests etc but the change is pretty straightforward in the diff. Notice that this change does not change method dispatch, which was removed in the last changes. This was done on purpose to simplify the implementation for now. We will likely still remove all those implicit selfs to bring true statics. Minor caveat - since `main` doesn't actually need self, already removed that which simplified a lot of code.	2022-07-27 17:45:36 +00:00
James Dunkerley	a54a7d5553	Tidying up what is in Standard.Base (#3603 ) - Added various of the types from the new APIs to the Standard.Base export. - Removed Syntax_Error types for Regex and Uri and used the common one.	2022-07-27 13:28:00 +00:00
Radosław Waśko	ee91656f30	Remove duplicate `Line_Ending_Style` and update defaults (#3597 ) Implements https://www.pivotaltracker.com/story/show/182749831	2022-07-27 09:43:51 +00:00
James Dunkerley	7090e1fb91	Docker file for testing Postgres SSL and updated Postgres Spec (#3607 ) Adds a Dockerfile and `CreatePostgresSSL.sh` script, which makes an Alpine based Postgres server with a self signed certificate. The script will drop the generated `rootCA.crt` into the `data/transient` folder. This can then be included in the test by setting the environment variable `ENSO_DATABASE_TEST_CA_CERT_FILE`. Test has been updated to check the various SSL connection modes.	2022-07-26 13:28:43 +00:00
James Dunkerley	be311457bd	Add Linear Regression support for Vectors. (#3601 ) Adds least squares regression APIs. Covers the basic 4 trend line types from Excel (doesn't cover Polynomial or Moving Average). Removes the old `Model` from the `Standard.Table`.	2022-07-22 08:41:17 +00:00
Radosław Waśko	16fd038c1a	Add support for `.pgpass` to PostgreSQL (#3593 ) Implements https://www.pivotaltracker.com/story/show/182582924	2022-07-21 13:32:37 +00:00
Jaroslav Tulach	4465d63dd8	Improved polyglot Date support (#3559 ) Significantly improves the polyglot Date support (as introduced by #3374). It enhances the `Date_Spec` to run it in four flavors: - with Enso Date (as of now) - with JavaScript Date - with JavaScript Date wrapped in (JavaScript) array - with Java LocalDate allocated directly The code is then improved by necessary modifications to make the `Date_Spec` pass. # Important Notes James has requested in [#181755990](https://www.pivotaltracker.com/n/projects/2539304/stories/181755990) - e.g. _Review and improve InMemory Table support for Dates, Times, DateTimes, BigIntegers_ the following program to work: ``` foreign js dateArr = """ return [1, new Date(), 7] main = IO.println <| (dateArr.at 1).week_of_year ``` the program works with the changes provided here and prints `27` as of today. @jdunkerley has provided tests for proper behavior of date in `Table` and `Column`. Those tests are working as of [`f16d07e`](`f16d07e640`). One just needs to accept `List<Value>` and then query `Value` for `isDate()` when needed. Last round of changes is related to exception handling. `8b686b12bd` makes sure `makePolyglotError` accepts only polyglot values. Then it wraps plain Java exceptions into `WrapPlainException` with `has_type` method - `60da5e70ed` - the remaining changes in the PR are only trying to get all tests working in the new setup. The support for `Time` isn't part of this PR yet.	2022-07-21 06:32:40 +00:00
James Dunkerley	5e4083978f	Type name case fixes: (#3590 ) - MacOS => Mac_OS - PostgreSQL => Postgres - SQLite => SQLite (align a few) - InMemory => In_Memory - PointData => Point_Data - Io_Error => IO_Error - Standard.Table.Io => Standard.Table.IO In Tests: - MyError => My_Error - NotFoo => Not_Foo	2022-07-19 14:09:09 +00:00
Radosław Waśko	fc110659db	Implement `should_succeed` (#3586 ) Implements https://www.pivotaltracker.com/story/show/182709976	2022-07-14 19:58:44 +00:00
Radosław Waśko	35ddd2a89e	Add new options to the Delimited format (#3581 ) Implements https://www.pivotaltracker.com/story/show/182662195 and https://www.pivotaltracker.com/story/show/182651884	2022-07-14 15:01:26 +00:00
James Dunkerley	9578dc1e43	Move `write_bytes` to be part of `Vector`. (#3583 ) Updates `write_bytes` API to be part of `Vector` and to conform to `write` APIs. # Important Notes Ensures doesn't touch the file if an invalid byte array.	2022-07-14 11:30:40 +00:00
James Dunkerley	e41936f436	Additional tests for Excel Append (#3580 ) Add some additional scenarios to Excel append tests: - Non-A1 start - Name duplication - Hitting another range # Important Notes Also fixed a warning in the Image library.	2022-07-13 13:02:39 +00:00
James Dunkerley	2527a7bdb2	Update SQLite, PostgreSQL and Redshift drivers (#3571 ) Updated the SQLite, PostgreSQL and Redshift drivers. # Important Notes Updated the API for Redshift and proved able to connect without the ini file workaround.	2022-07-11 18:39:16 +00:00
Radosław Waśko	df10e4ba7c	Add appending support for Delimited files (#3573 ) Implements https://www.pivotaltracker.com/story/show/182309839	2022-07-11 12:36:01 +00:00
Hubert Plociniczak	96e50648dd	Remove 'here' and make method name resolution case-sensitive (#3538 ) Modified UppercaseNames to now resolve methods without an explicit `here` to point to the current module. `here` was also often used instead of `self` which was allowed by the compiler. Therefore UppercaseNames pass is now GlobalNames and does some extra work - it translated method calls without an explicit target into proper applications. # Important Notes There was a long-standing bug in scopes usage when compiling standalone expressions. This resulted in AliasAnalysis generating incorrect graphs and manifested itself only in unit tests and when running `eval`, thus being a bit hard to locate. See `runExpression` for details. Additionally, method name resolution is now case-sensitive. Obsolete passes like UndefinedVariables and ModuleThisToHere were removed. All tests have been adapted.	2022-07-07 10:31:06 +00:00
James Dunkerley	16e6f2fa08	Adding Append support to Excel.Write (#3558 ) Adds support for appending to an existing Excel table. # Important Notes - Renamed `Column_Mapping` to `Column_Name_Mapping` - Changed new type name to `Map_Column` - Added last modified time and creation time to `File`.	2022-07-07 06:41:33 +00:00
Radosław Waśko	7c94fa6a77	Custom Encoding support when writing Delimited files (#3564 ) Implements https://www.pivotaltracker.com/story/show/182545847	2022-07-07 00:20:00 +00:00
James Dunkerley	5174cc6ece	Update `Database.connect` to match new API (#3542 ) Initial work restructuring the `Database.connect` API - New SQLite API with support for InMemory. - Updated PostgreSQL API with SSL and Client Certificate Support. - Updated Redshift API. # Important Notes Follow up tasks: - PostgreSQL SSL additional testing. - Driver version updating. - `.pgpass` support.	2022-07-04 20:26:44 +00:00
James Dunkerley	4ca2097488	Adding write support to `File_Format.Excel` (#3551 ) Support for writing tables to Excel. # Important Notes Has custom support for Error mode as will allow appending a new table in this mode to the file.	2022-07-04 18:32:16 +00:00
Radosław Waśko	972b34d1a9	Implement value formatting and writing new files in Delimited format. (#3528 ) Implements https://www.pivotaltracker.com/story/show/182309429 and https://www.pivotaltracker.com/story/show/182309573	2022-06-23 16:51:52 +00:00
James Dunkerley	7a2d304fa0	Update Excel reading API (#3523 ) - Remove `from_xls` and `from_xlsx`. - Add `headers` support to `File_Format.Excel`. - Altered default read for Excel to be the first sheet. - Altered behavior so that single cells grow down and right when reading sheet. - Altered `Excel_Range` so knows if single cell or 1x1 range address. # Important Notes - Renamed `Range` to `Cell_Range` to avoid name clash.	2022-06-21 13:39:32 +00:00
Hubert Plociniczak	22a371a9c6	Substitute this with self (#3524 ) A semi-manual s/this/self applied to the whole standard library. Related to https://www.pivotaltracker.com/story/show/182328601 In the compiler promoted to use constants instead of hardcoded `this`/`self` whenever possible. # Important Notes The PR does not require explicit `self` parameter declaration for methods as this part of the design is still under consideration.	2022-06-21 10:53:52 +00:00
James Dunkerley	a0c6fa9c96	Removing old functions and tidy up of Table types (#3519 ) - Removed `select` method. - Removed `group` method. - Removed `Aggregate_Table` type. - Removed `Order_Rule` type. - Removed `sort` method from Table. - Expanded comments on `order_by`. - Update comment on `aggregate` on Database. - Update Visualisation to use new APIs. - Updated Data Science examples to use new APIs. - Moved Examples test out of Tests to own test. # Important Notes Need to get Examples_Tests added to CI.	2022-06-14 13:37:20 +00:00
Radosław Waśko	e83c36d9d6	Add scaffolding for `Table.write` function (#3521 ) Implements https://www.pivotaltracker.com/story/show/182309559 This task implements common scaffolding for the `Table.write`, so that the particular implementations for Delimited and Excel file formats can be done in parallel.	2022-06-14 11:29:03 +00:00
Radosław Waśko	a04825a5ce	Add `Text.write` Function (#3518 ) Implements https://www.pivotaltracker.com/story/show/182309026	2022-06-13 09:11:46 +00:00
James Dunkerley	e97d27e1e0	Adjusting First and Last order_by to use Sort_Column_Selector (#3517 )	2022-06-10 09:59:03 +00:00

423 Commits