enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-11-23 16:18:23 +03:00

Author	SHA1	Message	Date
Radosław Waśko	3222e5af62	Avoid exponential growth of column names (#7934 ) - Fixes #7933 - Avoids duplicating `]` as `]]` in generated column names - now column names grow linearly.	2023-10-02 16:05:28 +00:00
Jaroslav Tulach	6a59fa5e93	Meta.Type.find a type by FQN (#7885 )	2023-10-02 17:02:00 +02:00
James Dunkerley	fb50eb7595	Using conversions in a few places (#7859 ) - Shuffle a few `from`s into correct places: - `Day_Of_Week.from` removing `Day_Of_Week_From` module. - Adding short cut for `http` and `https` in `Data.read` so it calls onto `Data.fetch` giving a single entry point. - Moved `URI` extensions from `Standard.Base.Data` module into `Standard.Base.Network.Extensions`. - Added `post` extension for `URI`. - Added `contains_key` to `JS_Object`. - Restored `into` in `JS_Object`: - Follows old logic populating a constructor. - Will use conversion from `JS_Object` if present. - Added automatic deserialization of `Date`, `Time_Of_Day` and `Date_Time` from JSON. - Uses conversion from `JS_Object`. - Added conversion from `Text` to a `HTTP_Method` and type checking where `HTTP_Method` used in public APIs. - Added support for `Date`, `Time_Of_Day` and `Date_Time` in `Table.from_objects`. - Added `expand_column` to `Table` to expand `JS_Object` to values. - Add type checking for `Table` in `right` arguments (allowing `Column`s to be used). - Use type checking in `Table.set` to allow for conversion to a `Column`. - Remove some unused imports. - Fix for bug in S3 edge case.	2023-10-02 14:54:22 +00:00
Cassandra-Clark	d7258abbf5	Add Text.substring to allow for an easy short hand of Text.take (start.up_to end) (#7913 ) * Add Text.substring function and get_position helper function For #7876 adds a Text.substring function which supports negative indexes and returns a part of a string from 0-based index 'start' and continuing for 'length' * added substring and simplified get function For #7876 adds a Text.substring function which supports negative indexes and returns a part of a string from 0-based index 'start' and continuing for 'length'. Also simplified get function as it looped unnecessarily. * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Text/Extensions.enso punctuation corrections Co-authored-by: GregoryTravis <greg.m.travis@gmail.com> * Update Text_Spec.enso Added test for start index larger than string * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Text/Extensions.enso updated Arguments: section to use consistent style Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Text/Extensions.enso updated Index_Out_Of_Bounds error to reference cached length Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * Removed Slice label and added changelog entry * Re-added slice tag to substring Per conversation with James, added slice back to substring * Update CHANGELOG.md add link --------- Co-authored-by: GregoryTravis <greg.m.travis@gmail.com> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2023-09-29 10:57:57 -06:00
Pavel Marek	9f15b90caa	Implement Enso-specific assert (#7883 ) Implement Enso-specific assert - `Runtime.assert` that works like asserts in any other runtime. # Important Notes - Enso-specific assertions are enabled when JVM assertions are enabled, or when `ENSO_ENABLE_ASSERTIONS` env var is not empty (See `72cd8361cb/engine/runtime/src/main/java/org/enso/interpreter/runtime/EnsoContext.java (L139)`)	2023-09-29 14:46:58 +00:00
Jaroslav Tulach	ffa036411d	Make sure Integer can be treated as Float (#7909 )	2023-09-28 11:56:54 +02:00
Radosław Waśko	8d926166ea	Follow up improvements to `Date_Time_Formatter` (#7875 ) - Closes #7872 - Also closes #7866	2023-09-28 09:38:00 +00:00
Radosław Waśko	c690559ec4	Implement `auto_value_type` operation (#7908 ) Closes #6113	2023-09-27 15:45:34 +00:00
GregoryTravis	b03712390c	Improve HTTP tests (#7847 ) * simple-httpbin encodes response using the Content-encoding header value * Return sent body verbatim	2023-09-27 14:02:32 +00:00
Hubert Plociniczak	18b2491a41	Always log to console and file (#7825 ) * Always log verbose to a file The change adds an option by default to always log to a file with verbose log level. The implementation is a bit tricky because in the most common use-case we have to always log in verbose mode to a socket and only later apply the desired log levels. Previously socket appender would respect the desired log level already before forwarding the log. If by default we log to a file, verbose mode is simply ignored and does not override user settings. To test run `project-manager` with `ENSO_LOGSERVER_APPENDER=console` env variable. That will output to the console with the default `INFO` level and `TRACE` log level for the file. * add docs * changelog * Address some PR requests 1. Log INFO level to CONSOLE by default 2. Change runner's default log level from ERROR to WARN Took a while to figure out why the correct log level wasn't being passed to the language server, therefore ignoring the (desired) verbose logs from the log file. * linter * 3rd party uses log4j for logging Getting rid of the warning by adding a log4j over slf4j bridge: ``` ERROR StatusLogger Log4j2 could not find a logging implementation. Please add log4j-core to the classpath. Using SimpleLogger to log to the console... ``` * legal review update * Make sure tests use test resources Having `application.conf` in `src/main/resources` and `test/resources` does not guarantee that in Tests we will pick up the latter. Instead, by default it seems to do some kind of merge of different configurations, which is far from desired. * Ensure native launcher test log to console only Logging to console and (temporary) files is problematic for Windows. The CI also revealed a problem with the native configuration because it was not possible to modify the launcher via env variables as everything was initialized during build time. * Adapt to method changes * Potentially deal with Windows failures	2023-09-26 11:32:04 +02:00
Radosław Waśko	12c4f2981d	More robust Date/Time format patterns parsing (#7826) - Closes #7461 by introducing a `Date_Time_Formatter` type and making parsing date time formats more robust and safer. - The default ('simple') set of patterns is slightly simplified and made case insensitive (except for `M/m` and `H/h`) to avoid the `YYYY` vs `yyyy` issues and make it less error prone. - The `YYYY` now has the same meaning as `yyyy` in simple mode. The old meaning (week-based year) is moved to a _separate mode_, triggered by `Date_Time_Formatter.from_iso_week_date_pattern`. - Full Java syntax, as well as custom-built Java `DateTimeFormatter` can also be used by `Date_Time_Formatter.from_java`. - Text-based constants (e.g. `ISO_ZONED_DATE_TIME`) have now become methods on `Date_Time_Formatter`, e.g. `Date_Time_Formatter.iso_zoned_date_time`.	2023-09-22 10:12:18 +00:00
Jaroslav Tulach	5150c14afd	Avoid duplicated conversion & search target type scope (#7849 )	2023-09-20 17:23:09 +02:00
James Dunkerley	74d1d0861c	S3 Read Access, Input Stream based reading (#7776) - Added a `FileSystemSPI` allowing protocol resolution to a target type. - Separated `Input_Stream` and `Output_Stream` from `File` to allow use in other spaces. - `File_Format` types `read_web` changed to be `read_stream` working with `InputStream`. - Added directory listing to `Auto_Detect` allowing for `Data.read` to list a folder. - Adjusted HTTP to return an `InputStream` not a `byte[]`: - `Response_Body` adjusted to wrap an `InputStream`. - Added ability to materialize to either an in-memory vector (<4KB) or a temporary file. - `Data.fetch` will materialize if not a recognized mime-type. - Added `HTTP_Error` to handle IO exceptions from the stream. - `Excel_Format` now supports mime-type and reading a stream. - `Excel_Workbook` can now get an `Excel_Section` using `read_section`. - Added S3 APIs: - `parse_uri`: splits an S3 URI into bucket and key. - `list_objects`: list the items in an S3 bucket with specified prefix. - `read_bucket`: list prefixes and keys with a delimiter in an S3 bucket with specified prefix. - `head`: either head_bucket (tests existence) or head_object API (reads object metadata). - `get_object`: gets an object from S3 returning as a `Response_Body`. - Added `S3_File` type acting like a `File`: - No support for writing in this PR. - ToDo: recursive listing, glob filtering, exists, size. - Fixed a few invalid type signature lines. - Moved `create` methods for `Postgres_Connection` and `SQLite_Connection` into type instead of module. - Renamed `Column_Fetcher.Builder` to `Column_Fetcher_Builder`. - Fixed bug with `select_into` in Dry Run mode creating permanent tables. ToDo: Unit tests.	2023-09-20 15:09:11 +00:00
GregoryTravis	b0c1f3b00e	New Data.post for sending a payload to a Web API (#7700 )	2023-09-19 11:26:29 +00:00
Hubert Plociniczak	1ee3d8f4f0	Rename Decimal to Float (#7807 ) Implements #6889.	2023-09-14 15:01:30 +00:00
Radosław Waśko	8b6e70b155	Support for BigInteger values in Table (#7715 ) - Fixes #7354 - And also closes #7712 - Refactors how we handle numeric ops - ensuring that the 'kernels' are placed all in one place and selected based on storage types.	2023-09-12 13:18:04 +00:00
James Dunkerley	f0ae9bf9c5	Fixes issue writing to a dry run Excel File (#7763 ) - Adds `size` to `File`. - If file is empty, then create a new Excel file. - Fixes dry run Excel write issue.	2023-09-08 08:52:00 +00:00
Radosław Waśko	7d424bf8a2	Implement `Table.delete_rows`. (#7709 ) - Closes #7238 - Aligns `update_database_table` to a more consistent and clearer API - `update_rows`. - Adds a `truncate_table` helper function, to pair up with `drop_table`. Both are `PRIVATE` for now. - Adds tests for NULLs in keys in `update_rows` and `delete_rows`. - The behaviour is sometimes unexpected, so instead these fail with `Null_Values_In_Key_Columns`. - Adds a workaround for https://github.com/oracle/graal/issues/7359 - Adds a workaround for a related bug where a stack frame has no name (its `rootNode.getName() == null`). - I could not track down this bug to provide a neat repro.	2023-09-07 11:07:53 +00:00
Radosław Waśko	87ce78615a	Change layout of local library search path in order to be able to move `Round_Spec.enso` back to `Tests` (#7634 ) - Closes #7633 - Moves `Round_Spec.enso` from published `Standard.Test` into our `test/Tests` project; the `Table_Tests` that depend on it, simply `import enso_dev.Tests`. - Changes the layout of the local libraries directory: - It used to be `root/<namespace>/<name>`. - Now it is `root/<dir>` - the namespace and name are now read from `package.yaml` instead. - Adds the parent directory of the current project to the default `ENSO_LIBRARY_PATH`. - It is treated as a secondary path, so the default `ENSO_HOME/lib` still takes precedence. - This allows projects to reference and load 'sibling' projects easily - the only requirement is for the project to enable `prefer-local-libraries: true` or add the other local project to its edition. The edition resolution logic is not changed.	2023-09-01 20:20:04 +00:00
Jaroslav Tulach	1437a671e1	Introducing generic Any.to type conversion method (#7704 )	2023-09-01 08:05:48 +02:00
GregoryTravis	061876e640	Add simple parts of Table.take and Table.drop functions to Database table (#7615 ) Implements database Table and Column take/drop, except While and Sample. Additional features and optimizations are in https://github.com/enso-org/enso/issues/7614.	2023-08-31 18:52:02 +00:00
Radosław Waśko	255b424b72	Add `value_type` to `Column.from_vector` and `expected_value_type` to `Column.map` and `Column.zip` (#7637 ) - Closes #6111 - Aligns semantics of handling Mixed columns. - Now, if an operation like `iif` or `fill_nothing` is given a `Mixed` column, the result will also be `Mixed` regardless of the `inferred_precise_value_type`. - Enables a few old tests that were pending but could be enabled since the types work is advanced enough.	2023-08-31 13:20:49 +00:00
Jaroslav Tulach	6461e20870	Special support for Python Date/Time/Zone interop (#7617 )	2023-08-25 10:27:16 +02:00
GregoryTravis	ddf18f212b	Handle writing to a relative file (#7638 ) Fixes bug in writing to a non-absolute file (with backup).	2023-08-24 21:01:37 +00:00
Jaroslav Tulach	20e18d22df	More descriptive function information (#7629 ) Fixes #7359 by printing more information about the function including partially applied arguments and over-saturated arguments.	2023-08-24 18:04:08 +00:00
James Dunkerley	7d83b3d7b4	Add GROUP to functions (#7622) - Update list of groups to agreed list. - Lower case `ALIAS` names to be consistent with function names. - Add `GROUP` to methods. - All constructors and functions have doc comments. - Correct a few typos (e.g. `PRVIATE`). - Mark some more things as `PRIVATE`. - Use `ToDo:` and `Note:` consistently. - Order tags in doc comment. # Important Notes We don't have all the doc comments on types and will want to add them in the future.	2023-08-23 13:20:38 +00:00
Radosław Waśko	2385f5b357	Add size-limited strings and varying bit-width integer Value_Types to in-memory backend and check for ArithmeticOverflow in LongStorage (#7557 ) - Closes #5159 - Now data downloaded from the database can keep the type much closer to the original type (like string length limits or smaller integer types). - Cast also exposes these types. - The integers are still all stored as 64-bit Java `long`s, we just check their bounds. Changing underlying storage for memory efficiency may come in the future: #6109 - Fixes #7565 - Fixes #7529 by checking for arithmetic overflow in in-memory integer arithmetic operations that could overflow. Adds a documentation note saying that the behaviour for Database backends is unspecified and depends on particular database.	2023-08-22 18:10:46 +00:00
Pavel Marek	a0086bb112	Ability to invoke all std benchmarks via jmh (#7519 ) All the Enso benchmarks in `test/Benchmarks` can be invoked via JMH	2023-08-17 14:48:43 +02:00
GregoryTravis	c9d7c5cb2b	Convert in-memory Column.round to Java (#7521 )	2023-08-16 14:45:23 +00:00
Jaroslav Tulach	aa0413e5a2	Use only Type instances as keys for State (#7585 )	2023-08-16 15:54:17 +02:00
James Dunkerley	296c95d414	Fix for empty column on replace and out of memory catching for join and tab (#7593 ) - Added a Panic.catch to catch heap memory error in joins and cross_tab. - Adjusted column replace so type is correct.	2023-08-15 17:06:51 +00:00
Jaroslav Tulach	7a272ec152	Encapsulating array-like data and operations into a single package (#7544 )	2023-08-15 13:00:47 +02:00
Radosław Waśko	8541a9e1ac	Improve generation of long operation in presence of column name length limit (#7556 ) I planned to do this as part of #7428, but I forgot. Making up for that now.	2023-08-14 16:58:36 +00:00
GregoryTravis	d3436fae70	Implement Number.round as a builtin (#7460 )	2023-08-14 15:43:39 +00:00
Radosław Waśko	b656b336c7	Report `Loss_Of_Integer_Precision` when an integer is not exactly representable as a float during conversion (#7509 ) Closes #7353 I introduce a new type `WithAggregatedProblems`, because `WithProblems` was too simple - it only allowed to hold a `List<Problem>` but `AggregatedProblems` is more than that. Ideally we shouldn't multiply entities like this too much. We should probably unify all to use `WithAggregatedProblems` - but after starting this, I realised it will likely just take too much effort to do for this little PR. So instead, I created a follow-up task for this: #7514	2023-08-08 12:30:44 +00:00
GregoryTravis	758b3b31b9	Avoid indexing the table twice for Cross Tab (#7417 ) Rewrites MultiValueIndex.makeCrossTabTable to build only a single index.	2023-08-04 21:14:18 +00:00
Radosław Waśko	bc9cde6543	Fix column naming edge cases - invalid and duplicated columns, case-insensitive name aliasing for case-insensitive backends (#7495) - Fixes #7412 - Also adds tests and fixes some more edge cases: - Ensures correct handling of existing Database tables whose column names may be invalid from Enso perspective, or clashing from Enso perspective (e.g. for most DBs `ś` and `s\u0301` are different names, but for Enso they are basically the same, so this would cause issues). Thus Enso now renames such columns when accessed (still using the correct column reference in the generated SQL under the hood).	2023-08-04 09:04:38 +00:00
GregoryTravis	037a687401	Expose Unicode normalization methods on Texts (#7425 ) Exposes Text_Utils.normalize().	2023-08-03 18:07:00 +00:00
Radosław Waśko	c61c741476	Respect database backend naming limitations when generating table/column names and validate user-provided names to avoid silent name clashes; process JDBC warnings reported from backends (#7428 ) - Closes #5951 - Ensures any SQL warnings reported by the database through the JDBC driver are processed and forwarded to the user. - These warnings show issues like the implicit name truncation that this PR is also solving. It's good to make sure they are visible as they can help avoid and understand unexpected problems. They should not show up in most standard workflows. - Adds simple history to our REPL.	2023-08-03 09:44:27 +00:00
GregoryTravis	628a51d8e2	Convert Number.round to Java (#7360 )	2023-07-26 12:03:09 +00:00
James Dunkerley	7345f0fd9a	Speed up statistics (#7390) - Allow `parse_to_columns` to take a `Regex` object. - Add `pattern` to the `Regex` object. - Add `column_names` to the `Row` object. - Improve statistics performance. - Add benchmarks for stats. Benchmark results (reference vs. new, improvement): Max (by reduce): 16.4ms vs. 16.3ms (-); Max (stats): 703ms vs. 224ms (68%); Sum (by reduce): 38ms vs. 38ms (-); Sum (stats): 753ms vs. 420ms (44%); Variance (stats): 745ms vs. 553ms (26%). Also tried using a Ref approach for stats but it was slower (`7e13c45224`).	2023-07-26 10:01:18 +00:00
Radosław Waśko	4b5a2e2176	Fixing operations on Mixed types (#7368 ) - Fixes #7231 - Cleans up vectorized operations to distinguish unary and binary operations. - Introduces MixedStorage which may pretend to be a more specialized storage on demand. - Ensures that operations request a more specialized storage on right-hand side to ensure compatibility with reported inferred storage type. - Ensures that a dataflow error returned by an Enso callback in Java is propagated as a polyglot exception and can be caught back in Enso - Tests for comparison of Mixed storages with each other and other types - Started using `Set` for `Filter_Condition.Is_In` for better performance. - ~~Migrated `Column.map` and `Column.zip` to use the Java-to-Enso callbacks.~~ - This does not forward warnings. IMO we should not be losing them. We can switch and add a ticket to fix the warnings, but that would be a regression (current implementation handles them correctly). Instead, we should first gain some ability to work with warnings in polyglot. I created a ticket to get this figured out #7371 - ~~Trying to avoid conversions when calling Enso functions from Java.~~ - Needs extra care as dataflow errors may not be handled right then. So only works for simple functions that should not error. - Not sure how much it really helps. [Benchmarks](https://github.com/enso-org/enso/pull/7270#issuecomment-1635618393) suggested it could improve the performance quite significantly, but the practical solution is not exactly the same as the one measured, so we may have to measure and tune it to get the best results. - Created #7378 to track this.	2023-07-25 23:25:17 +00:00
GregoryTravis	1f6fcf189b	Implement replace on the Database Column (#7275 ) Implements `replace` for database text columns, for text, regex, and column patterns.	2023-07-25 18:09:50 +00:00
James Dunkerley	2dc565b366	Fix failing test (#7394 ) Fix a failing test.	2023-07-25 14:06:11 +00:00
Adam Obuchowicz	1d2371f986	Groups in DocTags (#7337) Fixes #7336 in a quick way. Next to the old way of defining groups, the library can just add a `GROUP` tag to some entities, and they will be added to the group specified in the tag's description. The group name may be qualified (with project name, like `Standard.Base.Input/Output`) or just a name - in the latter case, the IDE will assume a group defined in the same library as the entity. Also moved some entities from the "export" list in package.yaml to the GROUP tag to give an example. I didn't move all of those, as I assume the library team will reorganize those groups anyway. ### Important Notes @jdunkerley @radeusgd @GregoryTravis When you start specifying groups in tags, remember that: * The groups still belong to a concrete project; if some entity outside a project wants to be added to its group, the "qualified" name should be specified. See the `Table.new` example in this PR. * If the group name does not reflect any group in package.yaml, the tag is ignored. * A single entity may be in only a single group. If it's specified both in package.yaml and in a tag, the tag takes precedence. --------- Co-authored-by: Ilya Bogdanov <fumlead@gmail.com>	2023-07-24 15:54:16 +02:00
James Dunkerley	88f32d9b2a	Various small tickets... (#7367) - Added `Text.length` into Text class so CB lists the built in. - Added `File.starts_with` and tests for the built in method. - Add `to_js_object` and `to_display_text` to `Regex`. - Add `to_js_object` and `to_display_text` to `Match`. - Remove the `bit_shift_l` alias from the built-ins. - Add test and Enso wrapper for `Text.is_normalized`.	2023-07-23 09:04:11 +00:00
Radosław Waśko	56635c9a88	Add benchmarks comparing performance of Table operations 'vectorized' in Java vs performed in Enso (#7270 ) The added benchmark is a basis for a performance investigation. We compare the performance of the same operation run in Java vs Enso to see what is the overhead and try to get the Enso operations closer to the pure-Java performance.	2023-07-21 17:25:02 +00:00
Pavel Marek	cab6968340	Update GraalVM to 17.0.7 (23.0.0 JDK17) (#7176 ) - Previous GraalVM update: https://github.com/enso-org/enso/pull/6750 Removed warnings: - Remove deprecated `ConditionProfile.createCountingProfile()`. - Add `@Shared` to some `@Cached` parameters (Truffle now emits warnings about potential `@Share` usage). - Specialization method names should not start with execute - Add limit attribute to some specialization methods - Add `@NeverDefault` for some cached initializer expressions - Add `@Idempotent` or `@NonIdempotent` where appropriate BigInteger and potential Node inlining are tracked in follow-up issues. # Important Notes For `SDKMan` users: ``` sdk install java 17.0.7-graalce sdk use java 17.0.7-graalce ``` For other users - download link can be found at https://github.com/graalvm/graalvm-ce-builds/releases/tag/jdk-17.0.7 Release notes: https://www.graalvm.org/release-notes/JDK_17/ R component was dropped from the release 23.0.0, only `python` is available to install via `gu install python`.	2023-07-20 15:11:30 +00:00
Jaroslav Tulach	a5ec6a9e51	Bench builder API (#7324) Designing new `Bench` API to _collect benchmarks_ first and only execute them then. This is a minimal change to allow implementation of #7323 - e.g. ability to invoke a _single benchmark_ via JMH harness. # Important Notes This is just the basic API skeleton. It can be enhanced, if the basic properties (allowing integration with JMH) are kept. It is not the intent of this PR to make the API 100% perfect and usable. Neither is it the goal of this PR to update existing benchmarks to use it (`74ac8d7` changes only one of them to demonstrate _it all works_ somehow). It is however expected that once this PR is integrated, the newly written benchmarks (like the ones from #7270) are going to use (or even enhance) the new API.	2023-07-19 09:18:28 +00:00
GregoryTravis	2fb5c3710b	Add Fallback to Prim_Text_Helper.compile_regex; accept Regex in Text.parse_to_table (#7297 ) This PR does three related things: - Fails more gracefully when a non-string is passed to compile_regex - Don't pass a non-string to compile_regex - Allow a Regex param to parse_to_table	2023-07-18 19:55:56 +00:00
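
The S3 entry above (#7776) describes `parse_uri` as splitting an S3 URI into bucket and key. The following is only a minimal Python sketch of that idea, not the Enso implementation: the function name `parse_s3_uri`, the choice to return an empty key for a bucket-only URI, and the use of `ValueError` for malformed input are all assumptions made for illustration.

```python
from typing import Tuple

S3_PREFIX = "s3://"


def parse_s3_uri(uri: str) -> Tuple[str, str]:
    """Split 's3://bucket/some/key' into ('bucket', 'some/key').

    Hypothetical helper: the name and the error behaviour are assumptions,
    not taken from the Enso source.
    """
    if not uri.startswith(S3_PREFIX):
        raise ValueError(f"not an S3 URI: {uri!r}")
    rest = uri[len(S3_PREFIX):]
    # Everything up to the first '/' is the bucket; the remainder is the key.
    bucket, _, key = rest.partition("/")
    if not bucket:
        raise ValueError(f"missing bucket name in: {uri!r}")
    return bucket, key


if __name__ == "__main__":
    print(parse_s3_uri("s3://my-bucket/data/file.csv"))
    print(parse_s3_uri("s3://my-bucket"))
```

Splitting on the first `/` by hand, rather than using a general URL parser, keeps characters such as `?` or `#` inside the key instead of treating them as a query or fragment.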


615 Commits