enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-12-01 13:43:18 +03:00

Author	SHA1	Message	Date
Radosław Waśko	a02eab451e	Implement basic warnings for column arithmetic, review warnings on expressions and `filter` (#5605 ) Closes #5109 # Important Notes - Currently the tests pass for the in-memory parts of Common_Table_Operations, but still some stuff not working on DB backends - in progress.	2023-02-14 09:33:04 +00:00
Pavel Marek	1f8511dab2	Add Comparator conversion for all types (#4067 ) Add `Comparator` type class emulation for all types. Migrate all the types in stdlib to this new `Comparator` API. The main documentation is in `Ordering.enso`. Fixes these pivotals: - https://www.pivotaltracker.com/story/show/183945328 - https://www.pivotaltracker.com/story/show/183958734 - https://www.pivotaltracker.com/story/show/184380208 # Important Notes - The new Comparator API forces users to specify both `equals` and `hash` methods on their custom comparators. - All the `compare_to` overrides were replaced by definition of a custom _ordered_ comparator. - All the call sites of `x.compare_to y` method were replaced with `Ordering.compare x y`. - `Ordering.compare` is essentially a shortcut for `Comparable.from x . compare x y`. - The default comparator for `Any` is `Default_Unordered_Comparator`, which just forwards to the builtin `EqualsNode` and `HashCodeNode` nodes. - For `x`, one can get its hash with `Comparable.from x . hash x`. - This makes `hash` as _hidden_ as possible. There are no other public methods to get a hash code of an object. - Comparing `x` and `y` can be done either by `Ordering.compare x y` or `Comparable.from x . compare x y` instead of `x.compare_to y`.	2023-02-10 09:22:11 +00:00
James Dunkerley	1c821e22cf	Some fixed form the Anagrams experiment. (#5592 ) - Fixes the display of Date, Time_Of_Day and Date_Time so doesn't wrap. - Adjust serialization of large integer values for JS and display within table. - Workaround for issue with using `.lines` in the Table (new bug filed). - Disabled warning on no specified `separator` on `Concatenate`. Does not include fix for aggregation on integer values outside of `long` range.	2023-02-08 22:17:00 +00:00
Radosław Waśko	4f90946d1e	Rework Invalid Aggregations (#5579 ) Closes #5108	2023-02-08 18:39:09 +00:00
James Dunkerley	c39410b4d1	Adjusting annotations to use Widget types properly. (#5564 ) Closes #5038 - Use the proper widget structure. - Provide new method `get_widget_json` with whole structure, but keep `get_full_annotations_json` in old form. - Start to get to some reusable functions. - Added widget to JS_Object field selections.	2023-02-07 07:33:51 +00:00
James Dunkerley	dcd6d740bb	Fix bug with Delimited_Format web reader. (#4112 ) - Mistake using `if_nothing` not `is_nothing`. - Cache the File_Format types to save calling over and over.	2023-02-02 07:11:49 +00:00
James Dunkerley	0790ce494f	New set function, parse a column (#4097 ) - New `set` function design - takes a `Column` and works with that more easily and supports control of `Set_Mode`. - New simple `parse` API on `Column`. - Separated expression support for `filter` to new `filter_by_expression` on `Table`. - New `compute` function allowing creation of a column from an expression. - Added case sensitivity argument to `Column` based on `starts_with`, `ends_with` and `contains`. - Added case sensitivity argument to `Filter_Condition` for `Starts_With`, `Ends_With`, `Contains` and `Not_Contains`. - Fixed the issue in JS Table visualisation where JavaScript date was incorrectly set. - Some dynamic dropdown expressions - experimenting with ways to use them. - Fixed issue with `.pretty` that wasn't escaping `\`. - Changed default Postgres DB to `postgres`. - Fixed SQLite support for starts_with, ends_with and contains to be consistent (using GLOB not LIKE).	2023-01-31 20:48:16 +00:00
Radosław Waśko	b9dbfd036f	First steps of the Problem Handling refactor to the new design (#4086 ) Implements: - https://www.pivotaltracker.com/story/show/184226137 - https://www.pivotaltracker.com/story/show/184226434 - https://www.pivotaltracker.com/story/show/184226462	2023-01-30 16:48:06 +00:00
James Dunkerley	359d4f56df	Initial Widget Dynamic Metadata (#4082 ) Adjusts the old mock to use new attributes. Expanded examples for dynamic widgets.	2023-01-25 14:16:10 +00:00
James Dunkerley	60f0e96047	Update starts_with, ends_with and contains to new API design (#4078 ) - Updated `Text.starts_with`, `Text.ends_with` and `Text.contains` to new simpler API. - Added a `Case_Sensitivity.Default` and adjusted `Table.distinct` to use it by default. - Fixed a bug with `Data.fetch` on an HTTP error. - Improved SQLite Case Sensitivity control in distinct to use collations.	2023-01-25 11:47:27 +00:00
Radosław Waśko	778d28fba3	Table with no columns is not valid, No_Output_Columns is always an error (#4073 ) Implements https://www.pivotaltracker.com/story/show/184226020	2023-01-25 02:40:23 +00:00
Dmitry Bushev	bf9508603f	Add Meta.get_annotation (#4049 ) - add: `GeneralAnnotation` IR node for `@name expression` annotations - update: compilation pipeline to process the annotation expressions - update: rewrite `OverloadsResolution` compiler pass so that it keeps the order of module definitions - add: `Meta.get_annotation` builtin function that returns the result of annotation expression - misc: improvements (private methods, lazy arguments, build.sbt cleanup)	2023-01-24 21:28:33 +03:00
Pavel Marek	fcc2163ae3	All Enso objects are hasheable (#3878 ) * Hash codes prototype * Remove Any.hash_code * Improve caching of hashcode in atoms * [WIP] Add Hash_Map type * Implement Any.hash_code builtin for primitives and vectors * Add some values to ValuesGenerator * Fix example docs on Time_Zone.new * [WIP] QuickFix for HashCodeTest before PR #3956 is merged * Fix hash code contract in HashCodeTest * Add times and dates values to HashCodeTest * Fix docs * Remove hashCodeForMetaInterop specialization * Introduce snapshoting of HashMapBuilder * Add unit tests for EnsoHashMap * Remove duplicate test in Map_Spec.enso * Hash_Map.to_vector caches result * Hash_Map_Spec is a copy of Map_Spec * Implement some methods in Hash_Map * Add equalsHashMaps specialization to EqualsAnyNode * get and insert operations are able to work with polyglot values * Implement rest of Hash_Map API * Add test that inserts elements with keys with same hash code * EnsoHashMap.toDisplayString use builder storage directly * Add separate specialization for host objects in EqualsAnyNode * Fix specialization for host objects in EqualsAnyNode * Add polyglot hash map tests * EconomicMap keeps reference to EqualsNode and HashCodeNode. Rather than passing these nodes to `get` and `insert` methods. * HashMapTest run in polyglot context * Fix containsKey index handling in snapshots * Remove snapshots field from EnsoHashMapBuilder * Prepare polyglot hash map handling. - Hash_Map builtin methods are separate nodes * Some bug fixes * Remove ForeignMapWrapper. We would have to wrap foreign maps in assignments for this to be efficient. * Improve performance of Hash_Map.get_builtin Also, if_nothing parameter is suspended * Remove to_flat_vector. Interop API requires nested vector (our previous to_vector implementation). Seems that I have misunderstood the docs the first time I read it. - to_vector does not sort the vector by keys by default * Fix polyglot hash maps method dispatch * Add tests that effectively test hash code implementation. Via hash map that behaves like a hash set. * Remove Hashcode_Spec * Add some polyglot tests * Add Text.== tests for NFD normalization * Fix NFD normalization bug in Text.java * Improve performance of EqualsAnyNode.equalsTexts specialization * Properly compute hash code for Atom and cache it * Fix Text specialization in HashCodeAnyNode * Add Hash_Map_Spec as part of all tests * Remove HashMapTest.java Providing all the infrastructure for all the needed Truffle nodes is no longer manageable. * Remove rest of identityHashCode message implementations * Replace old Map with Hash_Map * Add some docs * Add TruffleBoundaries * Formatting * Fix some tests to accept unsorted vector from Map.to_vector * Delete Map.first and Map.last methods * Add specialization for big integer hash * Introduce proper HashCodeTest and EqualsTest. - Use jUnit theories. - Call nodes directly * Fix some specializations for primitives in HashCodeAnyNode * Fix host object specialization * Remove Any.hash_code * Fix import in Map.enso * Update changelog * Reformat * Add truffle boundary to BigInteger.hashCode * Fix performance of HashCodeTest - initialize DataPoints just once * Fix MetaIsATest * Fix ValuesGenerator.textual - Java's char is not Text * Fix indent in Map_Spec.enso * Add maps to datapoints in HashCodeTest * Add specialization for maps in HashCodeAnyNode * Add multiLevelAtoms to ValuesGenerator * Provide a workaround for non-linear key inserts * Fix specializations for double and BigInteger * Cosmetics * Add truffle boundaries * Add allowInlining=true to some truffle boundaries. Increases performance a lot. * Increase the size of vectors, and warmup time for Vector.Distinct benchmark * Various small performance fixes. * Fix Geo_Spec tests to accept unsorted Map.to_vector * Implement Map.remove * FIx Visualization tests to accept unsorted Map.to_vector * Treat java.util.Properties as Map * Add truffle boundaries * Invoke polyglot methods on java.util.Properties * Ignore python tests if python lang is missing	2023-01-19 10:33:25 +01:00
James Dunkerley	48e5ed9eea	Some little bits from Book Club week 1 (#4058 ) - Add `get` to Table. - Correct `Count Nothing` examples. - Add `join` to File. - Add `File_Format.all` listing all installed formats. - Add some more ALIAS entries.	2023-01-18 11:46:13 +00:00
Radosław Waśko	082e0bfd0d	Add `Table.union` to the In-Memory Table. (#4052 ) Implements https://www.pivotaltracker.com/story/show/183854144	2023-01-17 00:34:57 +00:00
James Dunkerley	c4c35c92b7	Align Vector API with design, add some extra functions from AoC (#4026 ) Vector - Adjusted `Vector.sort` to be `Vector.sort order on by`. - Adjusted other sort to use `order` for direction argument. - Added `insert`, `remove`, `index_of` and `last_index_of` to `Vector`. - Added `start` and `if_missing` arguments to `find` on `Vector`, and adjusted default is `Not_Found` error. - Added type checking to `+` on `Vector`. - Altered `first`, `second` and `last` to error with `Index_Out_Of_Bounds` on `Vector`. - Removed `sum`, `exists`, `head`, `init`, `tail`, `rest`, `append`, `prepend` from `Vector`. Pair - Added `last`, `any`, `all`, `contains`, `find`, `index_of`, `last_index_of`, `reverse`, `each`, `fold` and `reduce` to `Pair`. - Added `get` to `Pair`. Range - Added `first`, `second`, `index_of`, `last_index_of`, `reverse` and `reduce` to `Range`. - Added `at` and `get` to `Range`. - Added `start` and `if_missing` arguments to `find` on `Range`. - Simplified `last` and `length` of `Range`. - Removed `exists` from `Range`. List - Added `second`, `find`, `index_of`, `last_index_of`, `reverse` and `reduce` to `Range`. - Added `at` and `get` to `List`. - Removed `exists` from `List`. - Made `all` short-circuit if any fail on `List`. - Altered `is_empty` to not compute the length of `List`. - Altered `first`, `tail`, `head`, `init` and `last` to error with `Index_Out_Of_Bounds` on `List`. Others - Added `first`, `second`, `last`, `get` to `Text`. - Added wrapper methods to the Random_Number_Generator so you can get random values more easily. - Adjusted `Aggregate_Column` to operate on the first column by default. - Added `contains_key` to `Map`. - Added ALIAS to `row_count` and `order_by`.	2023-01-12 13:32:24 +00:00
Jaroslav Tulach	fa22e467bf	Catch Panic and Error from Atom.to_text and yield detailed error message (#4034 ) Don't propagate errors from `toDisplayString` - construct an error message with `Atom.toString`. # Important Notes > currently a failure in to_text is swallowed by `toString` and we cannot detect that something went wrong during the serialization Not sure how satisfying the solution is, but the error swallowing happens in Truffle and there is little to do with it. We can just catch the error ourselves and produce some meaningful string.	2023-01-12 08:07:26 +00:00
Hubert Plociniczak	ae0889e843	Make ArrayOverBuffer behave like an Array/Array.sort no longer mutates the Array (#4022 ) Most of the problems with accessing `ArrayOverBuffer` have been resolved by using `CoerceArrayNode` (https://github.com/enso-org/enso/pull/3817). In `Array.sort` we still however specialized on Array which wasn't compatible with `ArrayOverBuffer`. Similarly sorting JS or Python arrays wouldn't work. Added a specialization to `Array.sort` to deal with that case. A generic specialization (with `hasArrayElements`) not only handles `ArrayOverBuffer` but also polyglot arrays coming from JS or Python. We could have an additional specialization for `ArrayOverBuffer` only (removed in the last commit) that returns `ArrayOverBuffer` rather than `Array` although that adds additional complexity which so far is unnecessary. Also fixed an example in `Array.enso` by providing a default argument.	2023-01-09 17:49:49 +00:00
Jaroslav Tulach	41b2aac39f	Removing Unsafe.set_atom_field (#4023 ) Introducing `Meta.atom_with_hole` to create an `Atom` _with a hole_ that is then _safely_ filled in later.	2023-01-09 13:39:14 +00:00
Hubert Plociniczak	3379ce51f2	Report failed name resolutions in type signatures (#4030 ) Compiler performed name resolution of literals in type signatures but would silently fail to report any problems. This meant that wrong names or forgotten imports would sneak in to stdlib. This change introduces 2 main changes: 1) failed name resolutions are appended in `TypeNames` pass 2) `GatherDiagnostics` pass also collects and reports failures from type signatures IR Updated stdlib so that it passes given the correct gatekeepers in place.	2023-01-09 10:35:36 +00:00
Radosław Waśko	8c661fdb74	Database Joins (#4007 ) Implements https://www.pivotaltracker.com/story/show/184032869 # Important Notes - Currently we get failures in Full joins on Postgres which show a more serious problem - amending equality to ensure that `[NULL = NULL] == True` breaks hash/merge based indexing - so such joins will be extremely inefficient. All our joins currently rely on this notion of equality which will mean all of our DB joins will be extremely inefficient. - We need to find a solution that will support nulls and still work OK with indices (but after exploring a few approaches: `COALESCE(a = b, a IS NULL AND b is NULL)`, `a IS NOT DISTINCT FROM b`, `(a = b) OR (a IS NULL AND b is NULL)`; all of which did not work (they all result in `ERROR: FULL JOIN is only supported with merge-joinable or hash-joinable join conditions`) I'm less certain that it is possible. Alternatively, we may need to change the NULL semantics to align it with SQL - this seems like likely the simpler solution, allowing us to generate simple, reliable SQL - the NULL=NULL solution will be cornering us into nasty workarounds very dependent on the particular backend.	2023-01-05 10:36:22 +00:00
Dmitry Bushev	1e5e2327ab	Improve performance of Text.compare_to (#4012 ) PR adds a flag to `Text` implementation tracking whether it is in a FCD normal form. Then this information can be used in the `Normalizer.compare` method. \| Benchmark name \| Old (ms) \| With flag (ms) \| --- \| --- \| --- \| Unicode very short \| 40.29 \| 40.04 \| Unicode medium \| 9.07 \| 1.99 \| Unicode big - random \| 115.39 \| 0.35 \| Unicode big - early difference \| 107.02 \| 0.54 \| Unicode big - late difference \| 749.81 \| 94.73 \| ASCII very short \| 28.13 \| 31.13 \| ASCII medium \| 4.58 \| 2.26 \| ASCII big - random \| 42.68 \| 0.26 \| ASCII big - early difference \| 30.91 \| 0.32 \| ASCII big - late difference \| 66.29 \| 42.72 Full benchmark output. [bench_old.txt](https://github.com/enso-org/enso/files/10325202/bench_old.txt) [bench_new.txt](https://github.com/enso-org/enso/files/10325201/bench_new.txt)	2023-01-02 17:09:03 +00:00
Pavel Marek	e6838bc90d	Convert Any.== to a builtin (#3956 ) `Any.==` is a builtin method. The semantics is the same as it used to be, except that we no longer assume `x == y` iff `Meta.is_same_object x y`, which used to be the case and caused failures in table tests. # Important Notes Measurements from `EqualsBenchmarks` shows that the performance of `Any.==` for recursive atoms increased by roughly 20%, and the performance for primitive types stays roughly the same.	2022-12-29 21:20:00 +00:00
Dmitry Bushev	74742d3267	Make To Text Conversion Identity for Text (#4009 ) First part of fixing `Text.to_text`. - add: `pretty` method for pretty printing. - update: make `Text.to_text` conversion identity for Text In the next iterations `to_text` will be gradually replaced with `to Text` conversion once the related issues with conversions are fixed.	2022-12-29 12:21:24 +00:00
Jaroslav Tulach	7252af6d62	Enso.getMetaObject, Type.isMetaInstance and Meta.is_a consolidation (#3949 ) Implements `getMetaObject` and related messages from Truffle interop for Enso values and types. Turns `Meta.is_a` into builtin and re-uses the same functionality. # Important Notes Adds `ValueGenerator` testing infrastructure to provide unified access to special Enso values and builtin types that can be reused by other tests, not just `MetaIsATest` and `MetaObjectTest`.	2022-12-22 08:00:06 +00:00
Radosław Waśko	c0c0abe4fe	Add benchmarks comparing ArrayProxy with elements generated ad-hoc with a regular Vector (#3831 )	2022-12-21 20:15:39 +00:00
James Dunkerley	579d3fc397	Adds Date, Time_Of_Day and Date_Time support to Excel IO (#3997 ) - Allow date time inputs from Excel. - Enables disabled test. - Fix for Map.==. - Allow nulls in crosstab name.	2022-12-20 16:12:00 +00:00
James Dunkerley	ace459ed53	Let JavaScript parse JSON and write JSON ... (#3987 ) Use JavaScript to parse and serialise to JSON. Parses to native Enso object. - `.to_json` now returns a `Text` of the JSON. - Json methods now `parse`, `stringify` and `from_pairs`. - New `JSON_Object` representing a JavaScript Object. - `.to_js_object` allows for types to custom serialize. Returning a `JS_Object`. - Default JSON format for Atom now has a `type` and `constructor` property (or method to call for as needed to deserialise). - Removed `.into` support for now. - Added JSON File Format and SPI to allow `Data.read` to work. - Added `Data.fetch` API for easy Web download. - Default visualization for JS Object trunctes, and made Vector default truncate children too. Fixes defect where types with no constructor crashed on `to_json` (e.g. `Matching_Mode.Last.to_json`. Adjusted default visualisation for Vector, so it doesn't serialise an array of arrays forever. Likewise, JS_Object default visualisation is truncated to a small subset. New convention: - `.get` returns `Nothing` if a key or index is not present. Takes an `other` argument allowing control of default. - `.at` error if key or index is not present. - `Nothing` gains a `get` method allowing for easy propagation.	2022-12-20 10:33:46 +00:00
Hubert Plociniczak	49204e92cf	Simplify exception handling for polyglot exceptions (#3981 ) This removes the special handling of polyglot exceptions and allows matching on Java exceptions in the same way as for any other types. `Polyglot_Error`, `Panic.catch_java` and `Panic.catch_primitive` are gone The change mostly deals with the backslash of removing `Polyglot_Error` and two `Panic` methods. `Panic.catch` was implemented as a builtin instead of delegating to `Panic.catch_primitive` builtin that is now gone. This fixes https://www.pivotaltracker.com/story/show/182844611	2022-12-19 19:16:43 +00:00
Jaroslav Tulach	4b4167fc06	Curried and lambda function invocation should be both fast (#3979 ) Benchmark to compare _curried and lambda_ based function invocations and a fix to make _curried_ invocation (at least) as fast as the _lambda_ one. Allows us to use _curried invocations_ in standard library again without loosing any speed. # Important Notes Execute as: ``` sbt:runtime> benchOnly CurriedFunctionBenchmarks ``` Prior to subsequent bugfixes in this PR the benchmark results were: - `averageCurried` runs in 0.290 ms - `averageLambda` runs in 0.122 ms e.g. _curried invocations_ is more than twice slow. That confirms our findings from the `Array_Proxy` vector benchmarks. The problem is that _function object is not compilation final_. After fixing it we have following results: - `averageCurried` runs in 0.102 ms - `averageLambda` runs in 0.111 ms e.g. both operations are of similar complexity.	2022-12-16 07:12:24 +00:00
James Dunkerley	77fe69dfd9	JSON Improvements, small Table stuff, Statistic in Enso not Java and few other minor bits. (#3964 ) - Aligned `compare_to` so returns `Type_Error` if `that` is wrong type for `Text`, `Ordering` and `Duration`. - Add `empty_object`, `empty_array`. `get_or_else`, `at`, `field_names` and `length` to `Json`. - Fix `Json` serialisation of NaN and Infinity (to "null"). - Added `length`, `at` and `to_vector` to Pair (allowing it to be treated as a Vector). - Added `running_fold` to the `Vector` and `Range`. - Added `first` and `last` to the `Vector.Builder`. - Allow `order_by` to take a single `Sort_Column` or have a mix of `Text` and `Sort_Column.Name` in a `Vector`. - Allow `select_columns_helper` to take a `Text` value. Allows for a single field in group_by in cross_tab. - Added `Patch` and `Custom` to HTTP_Method. - Added running `Statistic` calculation and moved more of the logic from Java to Enso. Performance seems similar to pure Java version now.	2022-12-14 19:40:27 +00:00
Jaroslav Tulach	ec047d6ac1	Speeding up Array_Proxy.from_proxy_object twice (#3969 ) Using lambda instead of higher order function. # Important Notes Running: ``` sbt:runtime> benchOnly VectorBenchmarks.averageOverArrayProxy ``` speeds up from `0.038 ms/op` to `0.016 ms/op` on my computer. Which seems good enough.	2022-12-10 10:35:14 +00:00
James Dunkerley	8f30fcf376	Fix issue where constructor was becoming a value. Fixes to_json. (#3955 ) If a constructor is fully specified the Meta constructor was becoming the atom in a few places. This broke to_json and hence viz.	2022-12-07 17:20:56 +00:00
Hubert Plociniczak	0855b74875	Vector should preserve warnings (#3938 ) * Sequence literal (Vector) should preserve warnings When Vector was created via a sequence literal, we simply dropped any associated any warnings associated with it. This change propagates Warnings during the creation of the Vector. Ideally, it would be sufficient to propagate warnings from the individual elements to the underlying storage but doesn't go well with `Vector.fromArray`. * update changelog * Array-like structures preserver warnings Added a WarningsLibrary that exposes `hasWarnings` and `getWarnings` messages. That way we can have a single storage that defines how to extract warnings from an Array and the others just delegate to it. This simplifies logic added to sequence literals to handle warnings. * Ensure polyglot method calls are warning-free Since warnings are no longer automatically extracted from Array-like structures, we delay the operation until an actual polyglot method call is performed. Discovered a bug in `Warning.detach_selected_warnings` which was missing any usage or tests. * nits * Support multi-dimensional Vectors with warnings * Propagate warnings from case branches * nit * Propagate all vector warnings when reading element Previously, accessing an element of an Array-like structure would only return warnings of that element or of the structure itself. Now, accessing an element also returns warnings from all its elements as well.	2022-12-07 11:10:11 +01:00
James Dunkerley	4cbd72a4eb	Some more tidying based on remaining tickets and PR comments. (#3946 ) - Moved `to_default_visualization_data` to `Standard.Visualization`. - Remove the use of `is_a` in favour of case statements. - Stop exporting Standard.Base.Error.Common. - Separate errors to own files. - Change constructors to be called `Error`. - Rename `Caught_Panic.Caught_Panic_Data` -> `Caught_Panic.Panic`. - Rename `Project_Description.Project_Description_Data` ->`Project_Description.Value` - Rename `Regex_Matcher.Regex_Matcher_Data` -> `Regex_Matcher.Value` (can't come up with anything better!). - Rename `Range.Value` -> `Range.Between`. - Rename `Interval.Value` -> `Interval.Between`. - Rename `Column.Column_Data` -> `Column.Value`. - Rename `Table.Table_Data` -> `Table.Value`. - Align all the Error types in Table. - Removed GEO Json bits from Table. - `Json.to_table` doesn't have the GEO bits anymore. - Added `Json.geo_json_to_table` to add the functions back in. # Important Notes No more exports from anywhere but Main! No more `_Data` constructors!	2022-12-06 18:35:18 +00:00
James Dunkerley	0ad70c6332	Tidy Standard.Base part 5 of n ... (hopefully the end...) (#3929 ) - Moved `Any`, `Error` and `Panic` to `Standard.Base`. - Separated `Json` and `Range` extensions into own modules. - Tidied `Case`, `Case_Sensitivity`, `Encoding`, `Matching`, `Regex_Matcher`, `Span`, `Text_Matcher`, `Text_Ordering` and `Text_Sub_Range` in `Standard.Base.Data.Text`. - Tidied `Standard.Base.Data.Text.Extensions` and stopped it re-exporting anything. - Tidied `Regex_Mode`. Renamed `Option` to `Regex_Option` and added type to export. - Tidied up `Regex` space. - Tidied up `Meta` space. - Remove `Matching` from export. - Moved `Standard.Base.Data.Boolean` to `Standard.Base.Boolean`. # Important Notes - Moved `to_json` and `to_default_visualization_data` from base types to extension methods.	2022-12-02 18:08:14 +00:00
Edward Kmett	d9cdb32121	implement compare_to for Ordering (#3936 ) Ordering should itself be ordered. # Important Notes Implement the obvious Ord instance for Ordering and supply a test.	2022-12-01 18:42:46 +00:00
Jaroslav Tulach	099f045178	Integer.parse and Decimal.parse improvements (#3934 ) Converting `Integer.parse` into a builtin and making sure it can parse big values like `100!`. Adding `locale` parameter to `Locale.parse` and making sure it parses `32,5` as `32.5` double in Czech locale.	2022-12-01 11:25:28 +00:00
Hubert Plociniczak	06bd69436b	Import modules' extension methods only with unqualified import statements (#3906 ) # Important Notes Note that one cannot ``` import Standard.Table as Table_Module ``` because of the 2-component name restriction that gets desugared to `Standard.Table.Main` and we have to write ``` import Standard.Table.Main as Table_Module ``` in a few places. Once we move `Json.to_table` extension this can be improved.	2022-12-01 10:13:34 +00:00
Hubert Plociniczak	20c22f2422	from/all import must not include module in name resolution (#3931 ) It appears that when were doing `from XYZ import all` the module `XYZ` was also being taken into account during name resolution. This was unfortunate and became problematic when one had a type with the same name defined in it. During pattern matching one could not simply do ``` from XYZ import all ... case ... of _ : XYZ -> ... ``` since the compiler would complain that we try to pattern match on a type but give it a module. The module is now excluded from the name resolution, when importing everything from the module. It appears that this "feature" was used in a number of our tests, so they had to be adapted. This fixes task 4 in https://www.pivotaltracker.com/story/show/183833055	2022-11-30 16:28:57 +00:00
James Dunkerley	c37eade954	Use `Vector.new` for `fill` and tuning... (#3744 ) - Make `Vector.fill` use the `Vector.new` method. - Tuning of some Range methods to try and get better performance. \| Test \| Old \| Current \| Change \| \| --- \| --- \| --- \| --- \| \| New Vector \| 77.5 \| 72.5 \| 94% \| \| Fill Constant \| 71.8 \| 42.1 \| 59% \| \| Fill Random \| 156.5 \| 124.2 \| 79% \| \| Append Single \| 13.3 \| 3.9 \| 29% \| \| Append Large \| 13.0 \| 4.9 \| 38% \| \| Sum \| 146.4 \| 122.3 \| 84% \| \| Drop First 20 and Sum \| 148.0 \| 132.7 \| 90% \| \| Drop Last 20 and Sum \| 145.3 \| 138.0 \| 95% \| \| Filter \| 79.4 \| 68.5 \| 86% \| \| Filter With Index \| 152.9 \| 158.5 \| 104% \| \| Map & Filter \| 438.0 \| 440.7 \| 101% \| \| Partition \| 256.4 \| 296.7 \| 116% \| \| Partition With Index \| 410.0 \| 392.0 \| 96% \| \| Each \| 117.4 \| 103.8 \| 88% \|	2022-11-30 09:20:07 +00:00
James Dunkerley	4e30b3036d	Tidy Standard.Base part 4 of n ... (#3898 ) - Export all for `Problem_Behavior` (allowing for Report_Warning, Report_Error and Ignore to be trivially used). - Renamed `Range.Range_Data` to `Range.Value` moved to using `up_to` wherever possible. - Reviewed `Function`, `IO`, `Polyglot`, `Random`, `Runtime`, `System`. - `File` now published as type. Some static methods moved to `Data` others into type. Removed `read_bytes` static. - New `Data` module for reading input data in one place (e.g. `Data.read_file`) will add `Data.connect` later. - Added `Random` module to the exports. - Move static methods into `Warning` type and exporting the type not the module. # Important Notes - Sorted a few imports into order (ordering by direct import in project, then by from import in project then polyglot and finally self imports).	2022-11-25 02:00:16 +00:00
Hubert Plociniczak	4c5868cf0e	Fix imports for Index_Sub_Range's constructors (#3902 ) The change that is now allowed due to https://github.com/enso-org/enso/pull/3897 This fixes the first problem mentioned in https://www.pivotaltracker.com/n/projects/2539304/stories/183833055	2022-11-24 22:36:58 +00:00
Dmitry Bushev	5a0aad16eb	Check that type names are resolved (#3895 ) RuntimeStdlibTest now checks that names in type signatures are qualified (aka poor man's typechecker) Will be merged after the stdlib tidying by James is complete.	2022-11-21 22:33:36 +00:00
James Dunkerley	93fee3a51f	Tidy Standard.Base part 3 of n ... (#3893 ) Here we go again... - Tidied up `Pair` and stopped exporting `Pair_Data`. Adjusted so type exported. - Tidy imports for `Json`, `Json.Internal`, `Locale`. - Tidy imports Ordering.*. Export `Sort_Direction` and `Case_Sensitivity` as types. - Move methods of `Statistics` into `Statistic`. Publishing the types not the module. - Added a `compute` to a `Rank_Method`. - Tidied the `Regression` module. - Move methods of `Date`, `Date_Time`, `Duration`, `Time_Of_Day` and `Time_Zone` into type. Publishing types not modules. - Added exporting `Period`, `Date_Period` and `Time_Period` as types. Static methods moved into types. # Important Notes - Move `compare_to_ignore_case`, `equals_ignore_case` and `to_case_insensitive_key` from Extensions into `Text`. - Hiding polyglot java imports from export all in `Main.enso`.	2022-11-21 15:30:18 +00:00
James Dunkerley	99bacc5c06	Tidy Standard.Base Part 2 of n... (#3889 ) - Moved static methods into `Locale` type. Publishing type not module. - Stop publishing `Nil` and `Cons` from `List`. - Tidied up `Json` and merged static in to type. Sorted out various type signatures which used a `Constructor`. Now exporting type and extensions. - Tidied up `Noise` and merge `Generator` into file. Export type not module. - Moved static method of `Map` into type. Publishing type not module. # Important Notes - Move `Text.compare_to` into `Text`. - Move `Text.to_json` into `Json`.	2022-11-19 08:01:45 +00:00
Radosław Waśko	5b6fd74929	Data analysts should be able to `Text.match`, `Text.match_all`, `Text.is_match` to find or check matches (#3841 ) Implements https://www.pivotaltracker.com/story/show/181266092 # Important Notes Also renaming `Text.location_of` and `Text.location_of_all` to `Text.locate` and `Text.locate_all`.	2022-11-18 22:17:42 +00:00
James Dunkerley	14dbe7287b	Tidy Standard.Base Part 1 of n... (#3884 ) * Tidy Bound and Interval. * Fix Interval tests. * Fix Interval tests. * Restructure Index_Sub_Range to new Type/Statics. * Adjust for Vector exported as a type and static methods on it. * Tidy Maybe. * Fix issue with Line_Ending_Style. * Revert Filter_Condition change. Fix benchmark test issue. Tidy imports on Index_Sub_Range. * Revert Filter_Condition change. Fix benchmark test issue. Tidy imports on Index_Sub_Range. * Can't export constructors unless exported from type in module. * Fix failing tests.	2022-11-18 08:57:41 +00:00
James Dunkerley	c868ed5efe	Some minor fixes (#3874 ) - Allow `Map` to store a `Nothing` key (fixes `Vector.distinct` with a `Nothing`). - Add `column_names` method to `Table` as a shorthand. - Return data flow error when comparing with Nothing (not a Panic or a Polyglot exception). - Allow milli and micro second for DateTime and Time Of Day # Important Notes - Added a load of tests for the various comparison operators to Numbers_Spec.	2022-11-17 07:11:18 +00:00
Hubert Plociniczak	7b0759f8b3	Don't add module's builtins to the scope of a builtin type (#3791 ) It appears that we were always adding builtin methods to the scope of the module and the builtin type that shared the same name. This resulted in some methods being accidentally available even though they shouldn't. This change treats differently builtins of types and modules and introduces auto-registration feature for builtins. By default all builtin methods are registered with a type, unless explicitly defined in the annotation property. Builtin methods that are auto-registered do not have to be explicitly defined and are registered with the underlying type. Registration correctly infers the right type, depending whether we deal with static or instance methods. Builtin methods that are not auto-registered have to be explicitly defined always. Modules' builtin methods are the prime example. # Important Notes Builtins now carry information whether they are static or not (inferred from the lack of `self` parameter). They also carry a `autoRegister` property to determine if a builtin method should be automatically registered with the type.	2022-11-16 10:23:52 +00:00
Kaz Wesley	a1db36b57c	Support mixed constructors/bindings in types (#3870 ) Libraries: Revert changes that were necessitated by a new rule we have decided not to introduce. Parser: - Support mixed constructors/bindings in types. - Disallow zero-length hex sequences in character escapes: `\x`, `\u`, `\u{}`, `\U`, `\U{}` are no longer legal synonyms for `\0` (matches old parser behavior).	2022-11-14 20:24:07 +00:00
Jaroslav Tulach	ecd1fdc3f8	Caching the grapheme_length of a Text (#3864 ) Computing length of a text takes time. Let's cache it after first computation. # Important Notes Wrote `StringBenchmarks` that sums lengths of (the same) `Text` present many time in a `Vector`. Initially it took `383.673 ms` per operation. Then it took `0.031 ms/op`. Looks like the `length` calls are returning instantly as they get cached.	2022-11-14 15:53:10 +00:00
James Dunkerley	b5881efdf0	Allow integers for take and drop. (#3854 ) Allows passing an integer to take or drop as a shorthand.	2022-11-04 14:03:28 +00:00
Kaz Wesley	330612119a	Parse the standard library (#3830 ) Fix bugs in `TreeToIr` (rewrite) and parser. Implement more undocumented features in parser. Emulate some old parser bugs and quirks for compatibility. Changes in libs: - Fix some bugs. - Clean up some odd syntaxes that the old parser translates idiosyncratically. - Constructors are now required to precede methods. # Important Notes Out of 221 files: - 215 match the old parser - 6 contain complex types the old parser is known not to handle correctly So, compared to the old parser, the new parser parses 103% of files correctly.	2022-10-31 16:19:12 +00:00
Pavel Marek	f8a4e2a9d2	Add `Period` type (#3818 ) This PR adds `Period` type, which is a date-only complement to `Duration` builtin type. # Important Notes - `Period` replaces `Date_Period`, and `Time_Period`. - Added shorthand constructors for `Duration` and `Period`. For example: `Period.days 10` instead of `Period.new days=10`. - `Period` can be compared to other `Period` in some cases, other cases throw an error.	2022-10-28 17:27:20 +00:00
Pavel Marek	28243a0fd1	Define Enso epoch start as 15th October 1582. (#3804 ) Define start of Enso epoch as 15th of October 1582 - start of the Gregorian calendar. # Important Notes - Some (Gregorian) calendar related functionalities within `Date` and `Date_Time` now produces a warning if the receiving Date/Date_Time is before the epoch start, e.g., `week_of_year`, `is_leap_year`, etc.	2022-10-27 10:16:43 +00:00
Marcin Kostrzewa	901760816c	State rework & IO Contexts (#3828 ) 1. Changes how we do monadic state – rather than a haskelly solution, we now have an implicit env with mutable data inside. It's better for the JVM. It also opens the possibility to have state ratained on exceptions (previously not possible) – both can now be implemented. 2. Introduces permission check system for IO actions.	2022-10-26 16:22:08 +00:00
Radosław Waśko	bb29833da5	Create a Table `Row` Type and expose as a Vector on In-Memory Table with `.rows` property (#3827 ) Implements https://www.pivotaltracker.com/story/show/182307026	2022-10-26 11:21:33 +00:00
Radosław Waśko	2bc0611869	Add support for using Columns within `Is_In` (#3822 ) Implements https://www.pivotaltracker.com/story/show/183560222	2022-10-24 12:51:15 +00:00
James Dunkerley	f0f6deef2a	Load the File_Format types via a ServiceLoader (#3813 ) Moves the File.read method into the `File` type. Uses the ServiceLoader to find all types for the File_Format.	2022-10-24 09:55:18 +00:00
Jaroslav Tulach	d8882f606d	Few more properly parsed files (#3826 ) Another part of #3611 with few more `TreeToIr` improvements. # Important Notes Unofficial `LoadParser.sh` check from #3611 of all library files now reports just 54 failures out of 222 files - e.g. 75% success rate.	2022-10-24 08:53:37 +00:00
Hubert Plociniczak	6c440beecc	Move logic calculating the index in Vector.at to a builtin method to make the performance of Vector to be on par with Array (#3811 ) The main culprit of a Vector slowdown (when compared to Array) was the normalization of the index when accessing the elements. Turns out that the Graal was very persistent on not inlining that particular fragment and that was degrading the results in benchmarks. Being unable to force it to do it (looks like a combination of thunk execution and another layer of indirection) we resorted to just moving the normalization to the builtin method. That makes Array and Vector perform roughly the same. Moved all handling of invalid index into the builtin as well, simplifying the Enso implementation. This also meant that `Vector.unsafe_at` is now obsolete. Additionally, added support for negative indices in Array, to behave in the same way as for Vector. # Important Notes Note that this workaround only addresses this particular perf issue. I'm pretty sure we will have more of such scenarios. Before the change `averageOverVector` benchmark averaged around `0.033 ms/op` now it does consistently `0.016 ms/op`, similarly to `averageOverArray`.	2022-10-20 12:50:44 +00:00
Radosław Waśko	cc76e7d36a	Add support for `Blank_Columns` to Table and Database (#3812 ) Implements https://www.pivotaltracker.com/story/show/183390281 and https://www.pivotaltracker.com/story/show/183390394	2022-10-20 09:11:08 +00:00
Pavel Marek	a53fbc79be	Improve Unsupported_Argument_Types message. (#3803 ) Improve `Unsupported_Argument_Types` error so that it includes the message from the original exception. `arguments` field is retained, but not included in `to_display_text` method.	2022-10-18 12:03:25 +00:00
James Dunkerley	701c644d0e	Tidy up the remaining ones except Base... (#3797 ) - Removed `Dubious constructor export` from Examples, Geo, Google_Api, Image and Test. - Updated Google_Api project to meet newer code standards. - Restructured `Standard.Test`: - `Main.enso` now exports `Bench`, `Faker`, `Problems`, `Test`, `Test_Suite` - `Test.Suite` methods moved into a `Test_Suite` type. - Moved `Bench.measure` into `Bench` type. - Separated the reporting to a `Test_Reporter` module. - Moved `Faker` methods into `Faker` type. - Removed `Verbs` and `.should` method. - Added `should_start_with` and `should_contain` extensions to `Any`. - Restructured `Standard.Image`: - Merged Codecs methods into `Image`. - Export `Image`, `Read_Flag`, `Write_Flag` and `Matrix` as types from `Main.enso`. - Merged the internal methods into `Matrix` and `Image`. - Fixed `Day_Of_Week` to be exported as a type and sort the `from` method.	2022-10-17 11:27:27 +00:00
Radosław Waśko	82de8f88bd	Add support for `Is_In` and `Not_In` to `Filter_Condition` (#3790 ) Implements https://www.pivotaltracker.com/story/show/183389945	2022-10-15 11:29:59 +00:00
Pavel Marek	e9260227c4	Duration type is a builtin type (#3759 ) - Reimplement the `Duration` type to a built-in type. - `Duration` is an interop type. - Allow Enso method dispatch on `Duration` interop coming from different languages. # Important Notes - The older `Duration` type should now be split into new `Duration` builtin type and a `Period` type. - This PR does not implement `Period` type, so all the `Period`-related functionality is currently not working, e.g., `Date - Period`. - This PR removes `Integer.milliseconds`, `Integer.seconds`, ..., `Integer.years` extension methods.	2022-10-14 18:08:08 +00:00
Paweł Grabarz	ce6267f098	Add `replace_text` method to In-Memory `Table` (#3793 ) Implements https://www.pivotaltracker.com/n/projects/2539304/stories/183415329	2022-10-14 17:42:29 +02:00
Radosław Waśko	592a8516a8	Add `Is_Empty`, `Not_Empty`, `Like` and `Not_Like` to `Filter_Condition` (#3775 ) Implements https://www.pivotaltracker.com/story/show/183389890	2022-10-10 23:11:04 +00:00
Radosław Waśko	7afaf8c6cc	Add filtering by Filter_Condition to `Vector`, `Range` and `List` (#3770 ) Implements https://www.pivotaltracker.com/story/show/183389901	2022-10-07 04:02:54 +00:00
James Dunkerley	185378f07c	Moving library statics to type for Table. (#3760 ) - Generally export types not modules from the `Standard.Table` import. - Moved `new`, `from_rows` the `Standard.Table` library into the `Table` type. - Renames `Standard.Table.Data.Storage.Type` to `Standard.Table.Data.Storage.Storage` - Removed the internal `from_columns` method. - Removed `join` and `concat` and merged into instance methods. - Removed `Table` and `Column` from the `Standard.Database` exports. - Removed `Standard.Table.Data.Column.Aggregate_Column` as not used any more.	2022-10-06 17:01:18 +00:00
Radosław Waśko	d201959a36	Try fixing JUnit reports for CI (#3761 ) - Adds a `details` field to `Failure` for additional contextual information. - Stacktraces are moved from main message (which should generally be short and fit in one line) to the `details`. - Ensuring that the attribute does not contain multiple lines fixes the CI viewer which seems to have been breaking on multiline attributes. - Additionally, test execution time is now measured and printed in the CLI as well as included in the JUnit report for the CI - we can use this to catch tests running unexpectedly slowly.	2022-10-05 21:37:26 +00:00
Radosław Waśko	503d3eb5a3	Update Table API with new `filter` design (#3750 ) Implements https://www.pivotaltracker.com/story/show/183389855 # Important Notes Implements basic filter operations both In-Memory and for the Database backend, ensuring that existing tests can be adapted and keep working. Not all `Filter_Condition`s are implemented yet. Also implements significant part of https://www.pivotaltracker.com/story/show/183390314	2022-10-05 11:40:10 +00:00
Dmitry Bushev	f6b5438e9e	Fix Warnings visualization (#3754 ) Changelog - fix reporting of runtime type for values annotated with warning - fix visualizations of values annotated with warnings - fix `Runtime.get_stack_trace` failure in interactive mode	2022-10-04 17:27:13 +00:00
Hubert Plociniczak	ae66087bb9	By-type pattern matching (#3742 ) This change brings by-type pattern matching to Enso. One can pattern match on Enso types as well as on polyglot types. For example, ``` case x of _ : Integer -> ... _ : Text -> ... _ -> ... ``` as well as Java's types ``` case y of _ : ArrayList -> ... _ : List -> ... _ : AbstractList -> ... _ -> .. ``` It is no longer possible to match a value with a corresponding type constructor. For example ``` case Date.now of Date -> ... ``` will no longer match and one should match on the type (`_ : Date`) instead. ``` case Date of Date -> ... ``` is fine though, as requested in the ticket. The change required further changes to `type_of` logic which wasn't dealing well with polyglot values. Implements https://www.pivotaltracker.com/story/show/183188846 # Important Notes ~I discovered late in the game that nested patterns involving type patterns, such as `Const (f : Foo) tail -> ...` are not possible due to the old parser logic. I would prefer to add it in a separate PR because this one is already getting quite large.~ This is now supported!	2022-10-04 09:23:58 +00:00
Radosław Waśko	61a4120cfb	Fix date comparisons and test sorting of tables and vectors with dates (#3745 ) Implements https://www.pivotaltracker.com/story/show/183402892 # Important Notes - Fixes inconsistent `compare_to` vs `==` behaviour in date/time types and adds test for that. - Adds test for `Table.order_by` on dates and custom types. - Fixes an issue with `Table.order_by` for custom types. - Unifies how incomparable objects are reported by `Table.order_by` and `Vector.sort`. - Adds benchmarks comparing `Table.order_by` and `Vector.sort` performance.	2022-09-29 08:48:00 +00:00
Radosław Waśko	7da4d61484	Update Case Sensitivity arguments (#3741 ) Implements https://www.pivotaltracker.com/story/show/183314956	2022-09-27 20:15:52 +00:00
Marcin Kostrzewa	726bfeb883	Wip/mk/statics (#3740 ) Makes statics static. A type and its instances have different methods defined on them, as it should be. Constructors are now scoped in types, and can be imported/exported. # Important Notes The method of fixing stdlib chosen here is to just not. All the conses are exported to make all old code work. All such instances are marked with `TODO Dubious constructor export` so that it can be found and fixed.	2022-09-27 14:23:06 +00:00
Hubert Plociniczak	7a6ee0c200	Implement `type_of` (#3722 ) This change implements a simple `type_of` method that returns a type of a given value, including for polyglot objects. The change also allows for pattern matching on various time-related instances. It is a nice-to-have on its own, but it was primarily needed here to write some tests. For equality checks on types we currently can't use `==` due to a known _feature_ which essentially does wrong dispatching. This will be improved in the upcoming statics PR so we agreed that there is no point in duplicating that work and we can replace it later. Also, note that this PR changes `Meta.is_same_object`. Comparing types revealed that it was wrong when comparing polyglot wrappers over the same value.	2022-09-26 16:01:39 +00:00
Radosław Waśko	cd10b5d34d	Add `Date_Period.Week` to `start_of` and `end_of` methods (#3733 ) Implements https://www.pivotaltracker.com/story/show/183349732	2022-09-23 22:14:35 +00:00
James Dunkerley	a3de3c6128	Use ArraySlice to slice a Vector (#3724 ) Use an `ArraySlice` to slice `Vector`. Avoids memory copying for the slice function. # Important Notes \| Test \| Ref \| New \| \| --- \| --- \| --- \| \| New Vector \| 71.9 \| 71.0 \| \| Append Single \| 26.0 \| 27.7 \| \| Append Large \| 15.1 \| 14.9 \| \| Sum \| 156.4 \| 165.8 \| \| Drop First 20 and Sum \| 171.2 \| 165.3 \| \| Drop Last 20 and Sum \| 170.7 \| 163.0 \| \| Filter \| 76.9 \| 76.9 \| \| Filter With Index \| 166.3 \| 168.3 \| \| Partition \| 278.5 \| 273.8 \| \| Partition With Index \| 392.0 \| 393.7 \| \| Each \| 101.9 \| 102.7 \| - Note: the performance of New and Append has got slower from previous tests.	2022-09-23 15:13:16 +00:00
Radosław Waśko	e9ebc663c1	Add business days functions to Date and Date_Time (#3726 ) Implements https://www.pivotaltracker.com/story/show/183082087 # Important Notes - Removed unnecessary invocations of `Error.throw` improving performance of `Vector.distinct`. The time of the `add_work_days and work_days_until should be consistent with each other` test suite came down from 15s to 3s after the changes.	2022-09-22 08:31:15 +00:00
James Dunkerley	d6346e9d66	Renaming various constructors and moving types around for Database. (#3715 ) Repairing the constructor name following the types work. Some general tiding up as well. - Remove `Standard.Database.Data.Column.Aggregate_Column_Builder`. - Remove `Standard.Database.Data.Dialect.Dialect.Dialect_Data`. - Remove unused imports and update some type definitions. - Rename `Postgres.Postgres_Data` => `Postgres_Options.Postgres`. - Rename `Redshift.Redshift_Data` => `Redshift_Options.Redshift`. - Rename `SQLite.SQLite_Data` => `SQLite_Options.SQLite`. - Rename `Credentials.Credentials_Data` => `Credentials.Username_And_Password`. - Rename `Sql` to `SQL` across the board. - Merge `Standard.Database.Data.Internal` into `Standard.Database.Internal`. - Move dialects into `Internal` and merge the function in `Helpers` into `Base_Generator`.	2022-09-19 12:39:40 +00:00
Radosław Waśko	8fa8d12cc3	String functionality in std-table should use std-base (#3717 ) Implements https://www.pivotaltracker.com/story/show/181754646	2022-09-17 14:38:02 +00:00
Hubert Plociniczak	0e5df935d3	Don't rename imported Main module that only imports names (#3710 ) Turns that if you import a two-part import we had special code that would a) add Main submodule b) add an explicit rename. b) is problematic because sometimes we only want to import specific names. E.g., ``` from Bar.Foo import Bar, Baz ``` would be translated to ``` from Bar.Foo.Main as Foo import Bar, Baz ``` and it should only be translated to ``` from Bar.Foo.Main import Bar, Baz ``` This change detects this scenario and does not add renames in that case. Fixes [183276486](https://www.pivotaltracker.com/story/show/183276486).	2022-09-16 13:01:06 +00:00
Radosław Waśko	ca950fbf78	Rename `Standard.Base.Data.Text.Regex.Mode` to `Regex_Mode` (#3713 ) Implements https://www.pivotaltracker.com/story/show/183032392	2022-09-16 00:40:40 +00:00
James Dunkerley	0126f02e7b	Restructure File.read into the new design (#3701 ) Changes following Marcin's work. Should be back to very similar public API as before. - Add an "interface" type: `Standard.Base.System.File_Format.File_Format`. - All `File_Format` types now have a `can_read` method to decide if they can read a file. - Move `Standard.Table.IO.File_Format.Text.Text_Data` to `Standard.Base.System.File_Format.Plain_Text_Format.Plain_Text`. - Move `Standard.Table.IO.File_Format.Bytes` to `Standard.Base.System.File_Format.Bytes`. - Move `Standard.Table.IO.File_Format.Infer` to `Standard.Base.System.File_Format.Infer`. (doesn't belong here...) - Move `Standard.Table.IO.File_Format.Unsupported_File_Type` to `Standard.Base.Error.Common.Unsupported_File_Type`. - Add `Infer`, `File_Format`, `Bytes`, `Plain_Text`, `Plain_Text_Format` to `Standard.Base` exports. - Fold extension methods of `Standard.Base.Meta.Unresolved_Symbol` into type. - Move `Standard.Table.IO.File_Format.Auto` to `Standard.Table.IO.Auto_Detect.Auto_Detect`. - Added a `types` Vector of all the built in formats. - `Auto_Detect` asks each type if they `can_read` a file. - Broke up and moved `Standard.Table.IO.Excel` into `Standard.Table.Excel`: - Moved `Standard.Table.IO.File_Format.Excel.Excel_Data` to `Standard.Table.Excel.Excel_Format.Excel_Format.Excel`. - Renamed `Sheet` to `Worksheet`. - Internal types `Reader` and `Writer` providing the actual read and write methods. - Created `Standard.Table.Delimited` with similar structure to `Standard.Table.Excel`: - Moved `Standard.Table.IO.File_Format.Delimited.Delimited_Data` to `Standard.Table.Delimited.Delimited_Format.Delimited_Format.Delimited`. - Moved `Standard.Table.IO.Quote_Style` to `Standard.Table.Delimited.Quote_Style`. - Moved the `Reader` and `Writer` internal types into here. Renamed methods to have unique names. - Add `Aggregate_Column`, `Auto_Detect`, `Delimited`, `Delimited_Format`, `Excel`, `Excel_Format`, `Sheet_Names`, `Range_Names`, `Worksheet` and `Cell_Range` to `Standard.Table` exports.	2022-09-15 14:48:46 +00:00
Radosław Waśko	b304402d8e	Add Period Start and End functions to Date and DateTime (#3695 ) Implements https://www.pivotaltracker.com/story/show/183081152	2022-09-13 09:51:08 +00:00
Hubert Plociniczak	fba5047acc	Improved Vector/Array interop (#3667 ) `Vector` type is now a builtin type. This requires a bunch of additional builtin methods for its creation: - Use `Vector.from_array` to convert any array-like structure into a `Vector` [by copy](`f628b28f5f`) - Use (already existing) `Vector.from_polyglot_array` to convert any array-like structure into a `Vector` without copying - Use (already existing) `Vector.fill 1 item` to create a singleton `Vector` Additional, for pattern matching purposes, we had to implement a `VectorBranchNode`. Use following to match on `x` being an instance of `Vector` type: ``` import Standard.Base.Data.Vector size = case x of Vector.Vector -> x.length _ -> 0 ``` Finally, `VectorLiterals` pass that transforms `[1,2,3]` to (roughly) ``` a1 = 1 a2 = 2 a3 = 3 Vector (Array (a1,a2, a3)) ``` had to be modified to generate ``` a1 = 1 a2 = 2 a3 = 3 Vector.from_array (Array (a1, a2, a3)) ``` instead to accomodate to the API changes. As of `025acaa676` all the known CI checks passes. Let's start the review. # Important Notes Matching in `case` statement is currently done via `Vector_Data`. Use: ``` case x of Vector.Vector_Data -> True ``` until a better alternative is found.	2022-09-13 03:07:17 +00:00
James Dunkerley	4c82b657de	Tidy up type signatures and error types (#3693 ) Small clean up PR. - Aligns a few type signatures with their functions. - Some formatting fixes. - Remove a few unused types. - Make error extension functions be standard methods.	2022-09-09 11:11:46 +00:00
Radosław Waśko	551100af3b	Add `Table.distinct` function to In-Memory table (#3684 ) Implements https://www.pivotaltracker.com/story/show/182307143 # Important Notes - Modified standard library Java helpers dependencies so that `std-table` module depends on `std-base`, as a provided dependency. This is allowed, because `std-table` is used by the `Standard.Table` Enso module which depends on `Standard.Base` which ensures that the `std-base` is loaded onto the classpath, thus whenever `std-table` is loaded by `Standard.Table`, so is `std-base`. Thus we can rely on classes from `std-base` and its dependencies being _provided_ on the classpath. Thanks to that we can use utilities like `Text_Utils` also in `std-table`, avoiding code duplication. Additional advantage of that is that we don't need to specify ICU4J as a separate dependency for `std-table`, since it is 'taken' from `std-base` already - so we avoid including it in our build packages twice.	2022-09-07 12:28:41 +00:00
Marcin Kostrzewa	4fc6dcced0	Get rid of free-floating atoms. Everything has a type now! (#3671 ) This is a step towards the new language spec. The `type` keyword now means something. So we now have ``` type Maybe a Some (from_some : a) None ``` as a thing one may write. Also `Some` and `None` are not standalone types now – only `Maybe` is. This halfway to static methods – we still allow for things like `Number + Number` for backwards compatibility. It will disappear in the next PR. The concept of a type is now used for method dispatch – with great impact on interpreter code density. Some APIs in the STDLIB may require re-thinking. I take this is going to be up to the libraries team – some choices are not as good with a semantically different language. I've strived to update stdlib with minimal changes – to make sure it still works as it did. It is worth mentioning the conflicting constructor name convention I've used: if `Foo` only has one constructor, previously named `Foo`, we now have: ``` type Foo Foo_Data f1 f2 f3 ``` This is now necessary, because we still don't have proper statics. When they arrive, this can be changed (quite easily, with SED) to use them, and figure out the actual convention then. I have also reworked large parts of the builtins system, because it did not work at all with the new concepts. It also exposes the type variants in SuggestionBuilder, that was the original tiny PR this was based on. PS I'm so sorry for the size of this. No idea how this could have been smaller. It's a breaking language change after all.	2022-08-30 22:54:53 +00:00
Radosław Waśko	d7ebc4a338	Add `Table.take` and `Table.drop` functions to In-Memory table (#3647 ) Implements https://www.pivotaltracker.com/story/show/182307347	2022-08-26 19:41:36 +00:00
James Dunkerley	a20d43390e	Adding DateTime part functions (#3669 ) - Added `Zone`, `Date_Time` and `Time_Of_Day` to `Standard.Base`. - Renamed `Zone` to `Time_Zone`. - Added `century`. - Added `is_leap_year`. - Added `length_of_year`. - Added `length_of_month`. - Added `quarter`. - Added `day_of_year`. - Added `Day_Of_Week` type and `day_of_week` function. - Updated `week_of_year` to support ISO. # Important Notes - Had to pass locale to formatter for date/time tests to work on my PC. - Changed default of `week_of_year` to use ISO.	2022-08-26 15:47:58 +00:00
Radosław Waśko	fd318cfa96	Remove `Array.set_at` (#3634 ) Implements https://www.pivotaltracker.com/story/show/182879865 # Important Notes Note that removing `set_at` still does not make our arrays fully immutable - `Array.copy` can still be used to mutate them.	2022-08-26 09:34:33 +00:00
Hubert Plociniczak	d87a32d019	Builtin Date_Time, Time_Of_Day, Zone (#3658 ) * Builtin Date_Time, Time_Of_Day, Zone Improved polyglot support for Date_Time (formerly Time), Time_Of_Day and Zone. This follows the pattern introduced for Enso Date. Minor caveat - in tests for Date, had to bend a lot for JS Date to pass. This is because JS Date is not really only a Date, but also a Time and Timezone, previously we just didn't consider the latter. Also, JS Date does not deal well with setting timezones so the trick I used is to first call foreign function returning a polyglot JS Date, which is converted to ZonedDateTime and only then set the correct timezone. That way none of the existing tests had to be changes or special cased. Additionally, JS deals with milliseconds rather than nanoseconds so there is loss in precision, as noted in Time_Spec. * Add tests for Java's LocalTime * changelog * Make date formatters in table happy * PR review, add more tests for zone * More tests and fixed a bug in column reader Column reader didn't take into account timezone but that was a mistake since then it wouldn't map to Enso's Date_Time. Added tests that check it now. * remove redundant conversion * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Time.enso Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * First round of addressing PR review * don't leak java exceptions in Zone * Move Date_Time to top-level module * PR review Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> Co-authored-by: Jaroslav Tulach <jaroslav.tulach@enso.org>	2022-08-24 12:31:29 +02:00
Hubert Plociniczak	4b9c91626e	Use Vector.from_polyglot_array to make Vectors backed by polyglot arrays (#3628 ) Use Proxy_Polyglot_Array as a proxy for polyglot arrays, thus unifying the way the underlying array is accessed in Vector. Used the opportunity to cleanup builtin lookup, which now actually respects what is defined in the body of @Builtin_Method annotation. Also discovered that polyglot null values (in JS, Python and R) were leaking to Enso. Fixed that by doing explicit translation to `Nothing`. https://www.pivotaltracker.com/story/show/181123986	2022-08-23 21:13:16 +00:00
James Dunkerley	684adcb7fb	Tidy up the default imports for Standard.Table (#3660 ) - Removed various unnecessary `Standard.Base` imports still left behind. - Added `Regex` to default `Standard.Base`. - Removed aliasing from the examples as no longer needed (case coercion no long occurs). - Remove `import Standard.Table` from within the Table library (directly importing types). - Reviewed what was in `Standard.Database` - a few tweaks and removals. - Removed various un-needed aliasing following Hubert's import work.	2022-08-22 19:21:54 +00:00
Radosław Waśko	3dca738cf7	Add `Vector.take` and `Vector.drop` functions (#3629 ) Implements https://www.pivotaltracker.com/story/show/182307048	2022-08-10 16:02:02 +00:00
Dmitry Bushev	5e114acbb5	Update Scala to 2.13.8 (#3631 ) Update Scala compiler and libraries.	2022-08-08 19:32:55 +00:00
Radosław Waśko	0a2fea925c	Create `Index_Sub_Range` type and update `Text.take` and `Text.drop` (#3617 )	2022-08-03 11:41:34 +00:00
Hubert Plociniczak	d59714a29d	Support module imports using a qualified name (#3608 ) This change allows for importing modules using a qualified name and deals with any conflicts on the way. Given a module C defined at `A/B/C.enso` with ``` type C type C a ``` it is now possible to import it as ``` import project.A ... val x = A.B.C 10 ``` Given a module located at `A/B/C/D.enso`, we will generate intermediate, synthetic, modules that only import and export the successor module along the path. For example, the contents of a synthetic module B will look like ``` import <namespace>.<pkg-name>.A.B.C export <namespace>.<pkg-name>.A.B.C ``` If module B is defined already by the developer, the compiler will _inject_ the above statements to the IR. Also removed the last elements of some lowercase name resolution that managed to survive recent changes (`Meta.Enso_Project` would now be ambiguous with `enso_project` method). Finally, added a pass that detects shadowing of the synthetic module by the type defined along the path. We print a warning in such a situation. Related to https://www.pivotaltracker.com/n/projects/2539304 # Important Notes There was an additional request to fix the annoying problem with `from` imports that would always bring the module into the scope. The changes in stdlib demonstrate how it is now possible to avoid the workaround of ``` from X.Y.Z as Z_Module import A, B ``` (i.e. `as Z_Module` part is almost always unnecessary).	2022-07-29 14:19:07 +00:00
Hubert Plociniczak	f63e40df1b	Explicit self (#3569 ) This change modifies the current language by requiring explicit `self` parameter declaration for methods. Methods without `self` parameter in the first position should be treated as statics although that is not yet part of this PR. We add an implicit self to all methods This obviously required updating the whole stdlib and its components, tests etc but the change is pretty straightforward in the diff. Notice that this change does not change method dispatch, which was removed in the last changes. This was done on purpose to simplify the implementation for now. We will likely still remove all those implicit selfs to bring true statics. Minor caveat - since `main` doesn't actually need self, already removed that which simplified a lot of code.	2022-07-27 17:45:36 +00:00
James Dunkerley	a54a7d5553	Tidying up what is in Standard.Base (#3603 ) - Added various of the types from the new APIs to the Standard.Base export. - Removed Syntax_Error types for Regex and Uri and used the common one.	2022-07-27 13:28:00 +00:00
Radosław Waśko	ee91656f30	Remove duplicate `Line_Ending_Style` and update defaults (#3597 ) Implements https://www.pivotaltracker.com/story/show/182749831	2022-07-27 09:43:51 +00:00
James Dunkerley	be311457bd	Add Linear Regression support for Vectors. (#3601 ) Adds least squares regression APIs. Covers the basic 4 trend line types from Excel (doesn't cover Polynomial or Moving Average). Removes the old `Model` from the `Standard.Table`.	2022-07-22 08:41:17 +00:00
Radosław Waśko	16fd038c1a	Add support for `.pgpass` to PostgreSQL (#3593 ) Implements https://www.pivotaltracker.com/story/show/182582924	2022-07-21 13:32:37 +00:00
Jaroslav Tulach	4465d63dd8	Improved polyglot Date support (#3559 ) Significantly improves the polyglot Date support (as introduced by #3374). It enhances the `Date_Spec` to run it in four flavors: - with Enso Date (as of now) - with JavaScript Date - with JavaScript Date wrapped in (JavaScript) array - with Java LocalDate allocated directly The code is then improved by necessary modifications to make the `Date_Spec` pass. # Important Notes James has requested in [#181755990](https://www.pivotaltracker.com/n/projects/2539304/stories/181755990) - e.g. _Review and improve InMemory Table support for Dates, Times, DateTimes, BigIntegers_ the following program to work: ``` foreign js dateArr = """ return [1, new Date(), 7] main = IO.println <\| (dateArr.at 1).week_of_year ``` the program works with here in provided changes and prints `27` as of today. @jdunkerley has provided tests for proper behavior of date in `Table` and `Column`. Those tests are working as of [`f16d07e`](`f16d07e640`). One just needs to accept `List<Value>` and then query `Value` for `isDate()` when needed. Last round of changes is related to exception handling. `8b686b12bd` makes sure `makePolyglotError` accepts only polyglot values. Then it wraps plain Java exceptions into `WrapPlainException` with `has_type` method - `60da5e70ed` - the remaining changes in the PR are only trying to get all tests working in the new setup. The support for `Time` isn't part of this PR yet.	2022-07-21 06:32:40 +00:00
James Dunkerley	5e4083978f	Type name case fixes: (#3590 ) - MacOS => Mac_OS - PostgreSQL => Postgres - SQLite => SQLite (align a few) - InMemory => In_Memory - PointData => Point_Data - Io_Error => IO_Error - Standard.Table.Io => Standard.Table.IO In Tests: - MyError => My_Error - NotFoo => Not_Foo	2022-07-19 14:09:09 +00:00
Radosław Waśko	35ddd2a89e	Add new options to the Delimited format (#3581 ) Implements https://www.pivotaltracker.com/story/show/182662195 and https://www.pivotaltracker.com/story/show/182651884	2022-07-14 15:01:26 +00:00
James Dunkerley	9578dc1e43	Move `write_bytes` to be part of `Vector`. (#3583 ) Updates `write_bytes` API to be part of `Vector` and to conform to `write` APIs. # Important Notes Ensures doesn't touch the file if an invalid byte array.	2022-07-14 11:30:40 +00:00
Jaroslav Tulach	735053c218	Implementing basic functions (#3554 ) The language specification suggests to add [five basic functions into the standard library](https://github.com/enso-org/design/blob/wip/wd/enso-spec/epics/enso-spec-1.0/05.%20Functions.md#useful-functions-in-the-standard-library). `identity`, `flip`, `const`, `curry` & `uncurry`. # Important Notes The new functions are being added into existing `Function.enso` file. That may not be the best place, but it is not clear from the [design spec](https://github.com/enso-org/design/blob/wip/wd/enso-spec/epics/enso-spec-1.0/05.%20Functions.md#useful-functions-in-the-standard-library) how they are supposed to be imported. I can move them wherever needed. There is a documentation provided for each of the functions, but I am not sure how to verify it is correct. Do we generate the documentation for stdlib somehow?	2022-07-11 10:30:44 +00:00
Radosław Waśko	28513a3389	Allow filtering caught error type in `Error.catch` (#3574 ) More and more often I need a way to only recover a specific type of a dataflow error (in a similar manner as with panics). So the API for `Error.catch` has been amended to more closely resemble `Panic.catch`, allowing to handle only specific types of dataflow errors, passing others through unchanged. The default is `Any`, meaning all errors are caught by default, and the behaviour of `x.catch` remains unchanged.	2022-07-11 08:26:44 +00:00
Radosław Waśko	d8dddf40c6	Fix `Meta.Polyglot.get_language` (#3568 )	2022-07-07 13:29:38 +00:00
Hubert Plociniczak	96e50648dd	Remove 'here' and make method name resolution case-sensitive (#3538 ) Modified UppercaseNames to now resolve methods without an explicit `here` to point to the current module. `here` was also often used instead of `self` which was allowed by the compiler. Therefore UppercaseNames pass is now GlobalNames and does some extra work - it translated method calls without an explicit target into proper applications. # Important Notes There was a long-standing bug in scopes usage when compiling standalone expressions. This resulted in AliasAnalysis generating incorrect graphs and manifested itself only in unit tests and when running `eval`, thus being a bit hard to locate. See `runExpression` for details. Additionally, method name resolution is now case-sensitive. Obsolete passes like UndefinedVariables and ModuleThisToHere were removed. All tests have been adapted.	2022-07-07 10:31:06 +00:00
James Dunkerley	16e6f2fa08	Adding Append support to Excel.Write (#3558 ) Adds support for appending to an existing Excel table. # Important Notes - Renamed `Column_Mapping` to `Column_Name_Mapping` - Changed new type name to `Map_Column` - Added last modified time and creation time to `File`.	2022-07-07 06:41:33 +00:00
Radosław Waśko	7c94fa6a77	Custom Encoding support when writing Delimited files (#3564 ) Implements https://www.pivotaltracker.com/story/show/182545847	2022-07-07 00:20:00 +00:00
James Dunkerley	4ca2097488	Adding write support to `File_Format.Excel` (#3551 ) Support for writing tables to Excel. # Important Notes Has custom support for Error mode as will allow appending a new table in this mode to the file.	2022-07-04 18:32:16 +00:00
Radosław Waśko	972b34d1a9	Implement value formatting and writing new files in Delimited format. (#3528 ) Implements https://www.pivotaltracker.com/story/show/182309429 and https://www.pivotaltracker.com/story/show/182309573	2022-06-23 16:51:52 +00:00
Hubert Plociniczak	22a371a9c6	Substitute this with self (#3524 ) A semi-manual s/this/self appied to the whole standard library. Related to https://www.pivotaltracker.com/story/show/182328601 In the compiler promoted to use constants instead of hardcoded `this`/`self` whenever possible. # Important Notes The PR does not require explicit `self` parameter declaration for methods as this part of the design is still under consideration.	2022-06-21 10:53:52 +00:00
Hubert Plociniczak	fd46e84e8d	Towards a full-blown builtins DSL (part 3) (#3471 ) Auto-generate all builtin methods for builtin `File` type from method signatures. Similarly, for `ManagedResource` and `Warning`. Additionally, support for specializations for overloaded and non-overloaded methods is added. Coverage can be tracked by the number of hard-coded builtin classes that are now deleted. ## Important notes Notice how `type File` now lacks `prim_file` field and we were able to get rid off all of those propagating method calls without writing a single builtin node class. Similarly `ManagedResource` and `Warning` are now builtins and `Prim_Warnings` stub is now gone.	2022-06-13 11:48:34 +00:00
Radosław Waśko	a04825a5ce	Add `Text.write` Function (#3518 ) Implements https://www.pivotaltracker.com/story/show/182309026	2022-06-13 09:11:46 +00:00
James Dunkerley	e97d27e1e0	Adjusting First and Last order_by to use Sort_Column_Selector (#3517 )	2022-06-10 09:59:03 +00:00
James Dunkerley	8afba43add	Implement In-Memory Table order_by (#3515 ) Implemented the `order_by` function with support for all modes of operation. Added support for case insensitive natural order. # Important Notes - Improved MultiValueIndex/Key to not create loads of arrays. - Adjusted HashCode for MultiValueKey to have a simple algorithm. - Added Text_Utils.compare_normalized_ignoring_case to allow for case insensitive comparisons. - Fixed issues with ObjectComparator and added some unit tests for it.	2022-06-08 12:30:50 +00:00
Radosław Waśko	2af970fe52	Basic changes to File_Format (#3516 ) Implements https://www.pivotaltracker.com/story/show/182308987	2022-06-08 09:53:18 +00:00
Radosław Waśko	a382e0c15e	Improve database `Table.order_by` (#3514 ) Implements https://www.pivotaltracker.com/story/show/182195405 Adds support for the Postgres dialect and simple case insensitive collation for SQLite.	2022-06-07 12:31:55 +00:00
Radosław Waśko	7d94efa6f2	Implement `Table.order_by` for SQLite and the common scaffolding for all backends (#3502 ) Implements the common and SQLite parts of https://www.pivotaltracker.com/story/show/182195405	2022-06-06 10:56:52 +00:00
Hubert Plociniczak	e43325bfe1	Short-circuiting \|\| and && (#3492 ) Short-circuiting \|\| and && is typically taken for granted by users of other PLs. This change makes it happen for Enso. Related to https://www.pivotaltracker.com/story/show/182261401	2022-06-02 16:58:38 +00:00
James Dunkerley	1aa0bb3552	Rank Data, Correlation, Covariance, R Squared (#3484 ) - Added new `Statistic`s: Covariance, Pearson, Spearman, R Squared - Added `covariance_matrix` function - Added `pearson_correlation` function to compute correlation matrix - Added `rank_data` and Rank_Method type to create rankings of a Vector - Added `spearman_correlation` function to compute Spearman Rank correlation matrix # Important Notes - Added `Panic.throw_wrapped_if_error` and `Panic.handle_wrapped_dataflow_error` to help with errors within a loop. - Removed `Array.set_at` use from `Table.Vector_Builder`	2022-05-30 17:13:06 +00:00
Radosław Waśko	f0f3a343eb	Adjust Table.sort_columns to use Text_Ordering design (#3487 ) Implements https://www.pivotaltracker.com/story/show/182195306	2022-05-30 12:26:29 +00:00
James Dunkerley	ec30793ecb	Remove duplicate methods (#3480 ) Fix the cases of overloaded methods. - `Text.==` - `Integer.up_to` and `Integer.down_to` - `Nothing.is_nothing`	2022-05-25 14:30:51 +00:00
Hubert Plociniczak	12d6ef799f	Refactor methods of Managed_Resource (#3460 ) Promoted `with`, `take`, `finalize` to be methods of Managed_Resource rather than static methods always taking `resource`, for consistency reasons. This required function dispatch boilerplate, similarly to `Ref`. In future iterations we will address this boilerplate code. Related to https://www.pivotaltracker.com/story/show/182212217	2022-05-18 17:27:42 +00:00
Radosław Waśko	8430ce2625	Parsing values with known types (#3455 ) Implements https://www.pivotaltracker.com/story/show/181824146	2022-05-18 15:27:48 +00:00
Hubert Plociniczak	6b6b1430bc	Cleanup Ref - get/put (#3457 ) The change promotes static methods of `Ref`, `get` and `put`, to be methods of `Ref` type. The change also removes `Ref` module from the default namespace. Had to mostly c&p functional dispatch for now, in order for the methods to be found. Will auto-generate that code as part of builtins system. Related to https://www.pivotaltracker.com/story/show/182138899	2022-05-17 10:26:36 +00:00
James Dunkerley	4f3a76817c	Statistics on a Vector (#3442 ) - Implements various statistics on Vector # Important Notes Some minor codebase improvements: - Some tweaks to Any/Nothing to improve performance - Fixed bug in ObjectComparator - Added if_nothing - Removed Group_By_Key	2022-05-11 13:25:06 +00:00
Radosław Waśko	64f178f7a8	Delimited File Encoding (#3430 ) Implements https://www.pivotaltracker.com/story/show/181998375	2022-05-10 22:44:05 +00:00
Hubert Plociniczak	4bbabc00be	Move Builtin Types and Methods to stdlib (#3363 ) This PR replaces hard-coded `@Builtin_Method` and `@Builtin_Type` nodes in Builtins with an automated solution that a) collects metadata from such annotations b) generates `BuiltinTypes` c) registers builtin methods with corresponding constructors. The main differences are: 1) The owner of the builtin method does not necessarily have to be a builtin type 2) You can now mix regular methods and builtin ones in stdlib 3) No need to keep track of builtin methods and types in various places and register them by hand (a source of many typos or omissions as it found during the process of this PR) Related to #181497846 Benchmarks also execute within the margin of error. ### Important Notes The PR got a bit large over time as I was moving various builtin types and finding various corner cases. Most of the changes however are rather simple c&p from Builtins.enso to the corresponding stdlib module. Here is the list of the most crucial updates: - `engine/runtime/src/main/java/org/enso/interpreter/runtime/builtin/Builtins.java` - the core of the changes. We no longer register individual builtin constructors and their methods by hand. Instead, the information about those is read from 2 metadata files generated by annotation processors. When the builtin method is encountered in stdlib, we do not ignore the method. Instead we lookup it up in the list of registered functions (see `getBuiltinFunction` and `IrToTruffle`) - `engine/runtime/src/main/java/org/enso/interpreter/runtime/callable/atom/AtomConstructor.java` has now information whether it corresponds to the builtin type or not. - `engine/runtime/src/main/scala/org/enso/compiler/codegen/RuntimeStubsGenerator.scala` - when runtime stubs generator encounters a builtin type, based on the @Builtin_Type annotation, it looks up an existing constructor for it and registers it in the provided scope, rather than creating a new one. The scope of the constructor is also changed to the one coming from stdlib, while ensuring that synthetic methods (for fields) also get assigned correctly - `engine/runtime/src/main/scala/org/enso/compiler/codegen/IrToTruffle.scala` - when a builtin method is encountered in stdlib we don't generate a new function node for it, instead we look it up in the list of registered builtin methods. Note that Integer and Number present a bit of a challenge because they list a whole bunch of methods that don't have a corresponding method (instead delegating to small/big integer implementations). During the translation new atom constructors get initialized but we don't want to do it for builtins which have gone through the process earlier, hence the exception - `lib/scala/interpreter-dsl/src/main/java/org/enso/interpreter/dsl/MethodProcessor.java` - @Builtin_Method processor not only generates the actual code fpr nodes but also collects and writes the info about them (name, class, params) to a metadata file that is read during builtins initialization - `lib/scala/interpreter-dsl/src/main/java/org/enso/interpreter/dsl/MethodProcessor.java` - @Builtin_Method processor no longer generates only (root) nodes but also collects and writes the info about them (name, class, params) to a metadata file that is read during builtins initialization - `lib/scala/interpreter-dsl/src/main/java/org/enso/interpreter/dsl/TypeProcessor.java` - Similar to MethodProcessor but handles @Builtin_Type annotations. It doesn't, yet, generate any builtin objects. It also collects the names, as present in stdlib, if any, so that we can generate the names automatically (see generated `types/ConstantsGen.java`) - `engine/runtime/src/main/java/org/enso/interpreter/node/expression/builtin` - various classes annotated with @BuiltinType to ensure that the atom constructor is always properly registered for the builitn. Note that in order to support types fields in those, annotation takes optional `params` parameter (comma separated). - `engine/runtime/src/bench/scala/org/enso/interpreter/bench/fixtures/semantic/AtomFixtures.scala` - drop manual creation of test list which seemed to be a relict of the old design	2022-05-05 20:18:06 +02:00
Radosław Waśko	8219dca400	Improve support for reading Delimited files (#3424 ) Implements https://www.pivotaltracker.com/story/show/181823957	2022-04-29 17:12:19 +00:00
Radosław Waśko	14257d07aa	Data analysts should be able to use `Text.split`, `Text.lines` and `Text.words` to break up strings (#3415 ) Implements https://www.pivotaltracker.com/story/show/181266184 ### Important Notes Changed example image download to only proceed if the file did not exist before - thus cutting on the build time (the build used to download it _every_ time - which completely failed the build if network is down). A redownload can be forced by performing a fresh repository checkout.	2022-04-26 17:22:53 +02:00
Radosław Waśko	fecaa81551	Review Range and Interval, resolve infinite loop issue (#3408 ) Implements: https://www.pivotaltracker.com/story/show/181652841	2022-04-20 16:22:01 +00:00
James Dunkerley	5a6b6749cc	Restructuring for File.read (#3390 ) - Added Encoding type - Added `Text.bytes`, `Text.from_bytes` with Encoding support - Renamed `File.read` to `File.read_text` - Renamed `File.write` to `File.write_text` - Added Encoding support to `File.read_text` and `File.write_text` - Added warnings to invalid encodings	2022-04-19 16:50:03 +00:00
Jaroslav Tulach	ab692b3b74	Enso Date shall be converted to java.time.LocalDate when passed to Java (#3374 )	2022-04-15 06:02:05 +02:00
Dmitry Bushev	998d078b9a	Fill component groups of standard libraries (#3391 ) Changelog: - add: component groups to package descriptions - add: `executionContext/getComponentGroups` method that returns component groups of libraries that are currently loaded - doc: cleanup unimplemented undo/redo commands - refactor: internal component groups datatype	2022-04-14 13:12:18 +00:00
Radosław Waśko	0ea5dc2a6f	Data analysts should be able to use `Text.replace` to substitute parts of the text (#3393 ) Implements https://www.pivotaltracker.com/story/show/181266274	2022-04-13 19:21:47 +00:00
Radosław Waśko	891f064a6a	Extend Aggregate_Spec test suite with tests for missed edge-cases to ensure the feature is well-tested on all backends (#3383 ) Implements https://www.pivotaltracker.com/story/show/181805693 and finishes the basic set of features of the Aggregate component. Still not all aggregations are supported everywhere, because for example SQLite has quite limited support for aggregations. Currently the workaround is to bring the table into memory (if possible) and perform the computation locally. Later on, we may add more complex generator features to emulate the missing aggregations with complex sub-queries.	2022-04-12 11:02:01 +00:00
James Dunkerley	bade0c31de	First and Last ordering (#3380 ) Add the missing `order_by` support to First and Last aggregations for InMemory table.	2022-04-06 12:36:46 +00:00
Nikita Pekin	22e3941371	Data analysts should be able to reverse strings using `Text.reverse` (#3377 ) This commit implements `Text.reverse` as an extension on `Text`. `Text.reverse` reverses strings. For example: `"Hello World!".reverse` results in `"!dlroW olleH"`. Strings are reversed by their Extended Grapheme Clusters not by their characters. This has some performance implications because we need to find these grapheme cluster boundaries when iterating. To do so, `BreakIterator.getCharacterInstance` is used. Implements: https://www.pivotaltracker.com/n/projects/2539304/stories/181265419	2022-04-05 16:45:56 +00:00
Radosław Waśko	43265f10a8	Implement Error-Handling for Database aggregations, unify some error helpers across backends (#3371 )	2022-03-31 12:10:22 +00:00
Dmitry Bushev	23e5216922	Fix docparser (#3370 ) Changelog: - fix the docparser case when there is an unclosed formatted section before a list - add the test checking the doc generation on the stdlib	2022-03-31 04:44:34 +00:00
Dmitry Bushev	a34c2bcfe6	Implement Documentation Sections API (#3360 )	2022-03-25 10:49:37 +03:00
James Dunkerley	02bcfbb2a8	Refactor Aggregate Column (#3349 ) - Make it easier to understand the computations. - Fix issue with First. - Improve quote handling in Concatenate - Added validation and warnings to input	2022-03-22 18:18:46 +00:00
Radosław Waśko	cc7333812d	The library developer should be able to handle specific types of Panics while passing through others (#3344 ) Implements https://www.pivotaltracker.com/story/show/181569176 Also ensures that Dataflow Errors have proper stack traces (earlier they did not point at the right location).	2022-03-18 16:57:06 +00:00
Radosław Waśko	08183f59f2	Minor fixes for Text (#3340 ) * Avoid unnecessary copies * Add tests for conversions * Add guidelines for Text tests Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2022-03-15 16:11:46 +00:00
Radosław Waśko	dedd1eac96	Refactor library warnings to use the new system (#3337 ) Implements https://www.pivotaltracker.com/story/show/181536964	2022-03-15 12:52:57 +01:00
Radosław Waśko	247b284316	Data analysts should be able to use `Text.location_of` to find indexes within string using various matchers (#3324 ) Implements https://www.pivotaltracker.com/n/projects/2539304/stories/181266029	2022-03-12 19:42:00 +00:00
Marcin Kostrzewa	4653bfeeab	Decorate values with arbitrary warnings (#3248 )	2022-03-09 16:40:02 +01:00
James Dunkerley	65465fb8ef	Restructuring the Faker type and creating tests for Group_By (#3318 ) - Added Minimum, Maximum, Longest. Shortest, Mode, Percentile - Added first and last to Map - Restructured Faker type more inline with FakerJS - Created 2,500 row data set - Tests for group_by - Performance tests for group_by	2022-03-09 10:31:02 +00:00
Hubert Plociniczak	8bdca89917	New Text.insert function (#3311 ) Implements https://www.pivotaltracker.com/n/projects/2539304	2022-03-04 16:40:34 +01:00
James Dunkerley	fb68f18739	Within Vector, use Array.Copy wherever possible (#3236 ) Following the Slice and Array.Copy experiment, took just the Array.Copy parts out and built into the Vector class. This gives big performance wins in common operations: \| Test \| Ref \| New \| \| --- \| --- \| --- \| \| New Vector \| 41.5 \| 41.4 \| \| Append Single \| 26.6 \| 4.2 \| \| Append Large \| 26.6 \| 4.2 \| \| Sum \| 230.1 \| 99.1 \| \| Drop First 20 and Sum \| 343.5 \| 96.9 \| \| Drop Last 20 and Sum \| 311.7 \| 96.9 \| \| Filter \| 240.2 \| 92.5 \| \| Filter With Index \| 364.9 \| 237.2 \| \| Partition \| 772.6 \| 280.4 \| \| Partition With Index \| 912.3 \| 427.9 \| \| Each \| 110.2 \| 113.3 \| Benchmarks run on an AWS EC2 r5a.xlarge with 1,000,000 item count, 100 iteration size run 10 times. # Important Notes Have generally tried to push the `@Tail_Call` down from the Vector class and move to calling functions on the range class. - Expanded benchmarks on Vector - Added `take` method to Vector - Added `each_with_index` method to Vector - Added `filter_with_index` method to Vector	2022-03-03 15:40:48 +00:00
James Dunkerley	ad1130587d	Updating `Text.repeat` and adding `Text.` (#3310 ) Updating the `Text.repeat` function: - fix issue with negative count - add operator Add tests of the function.	2022-03-02 19:00:47 +00:00
Radosław Waśko	40c851bf8b	Text.pad and Text.trim (#3309 ) Implements https://www.pivotaltracker.com/story/show/181265516	2022-03-02 17:19:39 +00:00
Radosław Waśko	0d96f59f44	Data analysts should be able to use `Text.to_case` to change the case of Text values (#3302 ) * Move to_upper_case and to_lower_case into to_case * Add an export, not sure about it * Implement title case TODO: some more tests would be good * Add more tests * explain title case * fix todo * changelog	2022-02-28 23:20:41 +00:00
Radosław Waśko	b03416f907	Update Column_Selector and Column_Mapping to use Matcher over Matching_Strategy (#3299 ) Implements https://www.pivotaltracker.com/story/show/181339748	2022-02-25 18:39:10 +00:00
Radosław Waśko	2ae636f63c	Data analysts should be able to use `Text.starts_with` and `Text.ends_with` (#3292 ) Implements https://www.pivotaltracker.com/story/show/181265900	2022-02-23 16:48:33 +00:00
James Dunkerley	2e2c5562a8	Text.take and Text.drop (#3287 ) Implementation of the Text take and drop APIs - Added `Range.contains` function - Added `Text_Sub_Range` type - Added `Text_Utils.index_of` and `Text_Utils.last_index_of` based on ICU StringSearcher	2022-02-22 18:50:59 +00:00
Radosław Waśko	ae9d51555f	Data analysts should be able to use `Text.contains` to check for substring using various matcher techniques. (#3285 ) * Add matching mode definitions * Add stub for new method API and an initial test suite * Fix tests, implement exact matching * Implement Regex matching * changelog * Add benchmarks * Wokraround for case insensitive regex locale support * minor tweaks * Unify Case_Insensitive * Update edge cases * Fix other affected places * minor style change * Add a problematic test * Add a regex test for a similar situation * Migrate to StringSearch:wq * Add test cases for scharfes S edge case * Add problematic Regex Unicode normalization test * Document the regex accents peculiarity * Do not apply the normalization in ASCII only mode * cr	2022-02-22 15:41:56 +00:00
Radosław Waśko	14f57271a2	Ensure that `Text.compare_to` compares strings according to grapheme clusters (#3282 ) https://www.pivotaltracker.com/story/show/181175238	2022-02-17 17:09:41 +00:00
James Dunkerley	7afc8c48c5	Adding Integer.Parse (#3283 ) * Integer parse via Longs * Integer parse via Longs * Benchmark for Number Parse * CHANGELOG.md and Natural Order * Expanded test set * Number base tests * Few more negative tests	2022-02-17 15:04:00 +00:00
James Dunkerley	68b85dea82	Improvement to the Natural Order Sort (#3276 ) * Improved Natural Order Data generator for benchmarking * Missing Import Benchmark script * Update Natural_Order.enso Restore missing ToDo * Changelog * PR Comments * PR Comments * Additional comments. * Correction	2022-02-16 17:40:33 +00:00
Marcin Kostrzewa	67b4e59506	Properly expose stacktraces and related data to user code (#3271 )	2022-02-16 10:36:19 +03:00
Radosław Waśko	fbf747d6cf	Implement Vector.flatten (#3259 )	2022-02-15 16:16:08 +01:00
James Dunkerley	585afd83ce	Adding Text.at and Text.is_digit functions (#3269 ) * Add Text.at function * Add tests for Text.at * Add tests for Text.is_digit * Change log * Avoid memory allocation	2022-02-14 09:03:55 +00:00
James Dunkerley	1814d3c4f1	Data analysts should be able to transform a Table using the rename_columns functions (#3249 ) * Implement Natural_Order and sort_columns * Starting on Rename Align Column_Mapping Add By_Position Separating off the validation for By_Index so can reuse for rename By_Position implemented By_Index implemented Adjusted behaviour following discussion with Ned, so that renames dominate untouched columns. Moving to validation style checks for problems Putting accumulator back Rename work * Add Range.find * More work * Regex support Tidy of Unique Name Strategy * Fix Regex support * Warning messages Tests for Unique Naming Strategy Table rename working * Database Table rename_columns Fix for Table Must follow up on slice * Some tests * More tests * Complete test set (and associated fixes) * Functional use_first_row_as_names Tests to go... * Test for use_first_row_as_names * Change log * trailing space Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2022-02-11 10:18:51 +00:00
Radosław Waśko	8b24336604	Data analysts should be able to reorder columns into name order using sort_columns functions (#3250 )	2022-02-08 17:28:46 +01:00
Michał Wawrzyniec Urbańczyk	4baad5f146	Nightly proccess preparations: Setting Enso version through the environment (#3241 ) Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> Co-authored-by: Radosław Waśko <wasko.radek@gmail.com>	2022-02-07 15:14:32 +01:00

... 2 3 4 5 6 ...

325 Commits