enso-org/enso - enso - gitea: Gitea Service

mirror of https://github.com/enso-org/enso.git synced 2024-11-25 21:25:20 +03:00

Author	SHA1	Message	Date
James Dunkerley	e6ea3b2607	Default Project, Templates and Docs (#10980 ) - Linting updates. - Add an `Examples.welcome` and adjust the start up project to use it. - Merge all of Cass's work into the source code. - Make example render in mono space font.	2024-09-05 14:36:08 +00:00
Jaroslav Tulach	339c275338	Benchmark and speed processing of polyglot java imports up (#10899 )	2024-08-29 17:20:53 +02:00
Radosław Waśko	b1958f8aa3	Adding vectorized implementations to some Column operations (#10795 ) - Part of #6256 - implements operations that could have been vectorized without changes to the overall infrastructure	2024-08-13 08:53:39 +00:00
Kaz Wesley	aafdef1aeb	Improve parser contextualization (#10734 )	2024-08-05 15:46:58 +00:00
Jaroslav Tulach	c20eab2af9	Detect compilation while benchmarking (#10574 ) Enables `engine.TruffleCompilation` in `std-benchmarks`, collects the logs and dumps compilation into to `System.err` when a benchmark is influenced by dynamic compilation.	2024-07-18 15:49:16 +00:00
Jaroslav Tulach	a33f632a73	Make `EnsoOutputStream` fast again! (#10559 ) Will fix #10550 by properly using `@TruffleBoundary` annotation	2024-07-16 17:30:56 +00:00
Jaroslav Tulach	077b86f98c	Speed up EnsoInputStream again (#10515 ) Fix #10503 by creating a benchmark and then speeding it up by making sure usage of `InteropLibrary` reminds in partially evaluated code and isn't hidden behind `@TruffleBoundary`.	2024-07-11 10:08:26 +00:00
Jaroslav Tulach	220b40a1cd	Enforce conversion method return type & introduce Comparable.new (#10468 )	2024-07-11 06:58:51 +02:00
James Dunkerley	4b3e4ae15e	Rename `Map` to `Dictionary` and `Set` to `Hashset`. (#10474 ) - Rename `Map` to `Dictionary`. - Rename `Set` to `Hashset`. - Add a deprecated place holder for the static method of `Map`.	2024-07-09 09:12:23 +00:00
James Dunkerley	4c0647ea29	Stop publishing `First`/`Last` as constructors and use auto-scoping for `take` and `drop`. (#10467 ) - Removes `First` and `Last` from the `Standard.Base` exports. - Enable auto-scoping for all `Index_Sub_Range` and `Text_Sub_Range`. - Update all use of those methods to use auto-scoping.	2024-07-08 10:26:30 +00:00
James Dunkerley	9a2aed92f1	Separating `list` from `read` function and other small tweaks. (#10434 ) - Rename `Faker.string_value` to `Faker.text_value`. - Remove `Regex.pattern_string` as duplicate of `Regex.pattern`. - Sort the Date picker. - Rename `Data.list_directory` to `Data.list`. - Remove support for reading a directory. ![image](https://github.com/enso-org/enso/assets/4699705/b42bb3c9-e63b-49f2-8cdc-4666cb6d968e) ![image](https://github.com/enso-org/enso/assets/4699705/97f49891-5ae5-4f0a-9a41-6200888fcd86)	2024-07-03 22:00:53 +00:00
James Dunkerley	d92078471b	Rename `order_by` to `sort` for `Table` and `DB_Table`. (#10372 ) - Rename `order_by` to `sort` for `Table` and `DB_Table`. - Added deprecated placeholder. - Fixed a couple of minor deprecated mistakes. ![image](https://github.com/enso-org/enso/assets/4699705/96c32fa7-33e5-400a-9d3a-ebf330886911)	2024-06-26 17:46:09 +00:00
Jaroslav Tulach	aaaebcabf8	Implement and benchmark `ArrowOperationPlus` node (#10150 ) Prototype of #10056 showing `+` operation implemented in the _Arrow language_.	2024-06-11 12:50:59 +00:00
Jaroslav Tulach	b5969cf31c	Sort rational numbers and primes first benchmarks (#10142 ) Investigating #6361 - can we speed callback from `java.util.TimSort.sort` to Enso by using `CallTarget`?	2024-06-09 12:46:41 +00:00
GregoryTravis	1c2955655c	Add `no_warning_propagation` flag to `Vector.build` and `Builder.new` (#10079 ) * no_warning_propagation flag * doc * benchmarks * change to propagate_warnings, test * fix java benchmark * add benchmark to main	2024-06-04 09:06:09 -04:00
AdRiley	06327f8fde	Add statistic product (#10122 ) Add Statistic.Product ![image](https://github.com/enso-org/enso/assets/1720119/f7fc7bb5-9efe-4dbe-9150-cd9e5101c553)	2024-05-31 09:29:52 +00:00
GregoryTravis	720d32cbe3	Convert most remaining uses of `Vector.new_builder` to `Vector.build` (#9891 )	2024-05-09 15:52:53 +00:00
James Dunkerley	c0fd6eed2d	Small tweaks from Ned, Mark, Cass and Steve's feedback. (#9859 ) - Rename `.info` to `.column_info`. - Rename `.select_by_type` to `.select_columns_by_type`. (to be merged into `select_columns` functionality) - Rename `.remove_by_type` to `.remove_columns_by_type`. (to be merged into `remove_columns` functionality) - Adding "field" `ALIAS`es. - Making `Table.Value` private (and hence hiding `java_table`). - Making `Column.Value` private (and hence hiding `java_column`, added method to `Storage` to allow tests to work). - Making `DB_Table.Value` private (and hence hiding `connection`, `context` and `internal_columns`). - Making `DB_Column.Value` private (and hence hiding `connection`, `sql_type_reference`, `expression` and `context`). - Add widgets for everywhere `Filter_Condition` used (fix missing entry for lower, can't use auto-scoping). ![image](https://github.com/enso-org/enso/assets/4699705/bad7d5c8-cd01-4384-a1c8-dbcd7a5ad92b) - Fix for widgets in `Simple_Calculation`: `date_add` and `format`. ![image](https://github.com/enso-org/enso/assets/4699705/14eeef2e-5069-4a88-8b68-eb675497addf) ![image](https://github.com/enso-org/enso/assets/4699705/75d4e097-f171-4bf6-a27e-f9d396c44a92) - Fix for nested `Row`, `Column` and `DB_Column` rendering in Table Viz. ![image](https://github.com/enso-org/enso/assets/4699705/8c50c519-7c34-4228-bab7-be938a6ce1bf) ![image](https://github.com/enso-org/enso/assets/4699705/c83ad301-f31e-451d-a8be-b022e1456b4e) # Important Notes - Can't use `.` to access a private constructor field when mapping, have to use a lambda. - We should agree convention for private backing fields which are wrapped by a public getter (went with `internal_` for this pass).	2024-05-06 13:49:12 +00:00
Pavel Marek	7850312340	Do not disable BackgroundCompilation in benchmarks (#9646 ) * Do not disable BackgroundCompilation in benchmarks * Increase warmup for Table sorting benchmarks. #9470	2024-04-08 16:49:57 +02:00
GregoryTravis	973d2c6aea	`Decimal` arithmetic benchmarks (#9582 )	2024-04-04 16:03:35 +00:00
James Dunkerley	19f15b8f97	Small fixes from building up another demo. (#9385 ) - Fix `Excel_Workbook.sheet` and add a test. - Add icon for `Table.row_count` and `DB_Table.row_count`. - Make `join_kind` widget `Display.Always`. - Add expression as an option to `Aggregate_Column`. - Add `Simple_Calculation.Copy` to create a copy. - Add defaults to `Simple_Expression` so less errory. - Set period to default to day for `date_diff` allowing use in expressions. - Add `Text_Left`, `Text_Right`, `Text_Length` and `Format` to `Simple_Expression`.	2024-03-13 18:13:33 +00:00
James Dunkerley	8c197f325b	Update the `aggregate` API to take a separate `group_by` (#9027 ) Separate out the `Group_By` from the column definition in `aggregate`. ![image](https://github.com/enso-org/enso/assets/4699705/6b4f03bc-1c4a-4582-b38a-ba528ae94167) Supports the old API with a warning attached about deprecation: ![image](https://github.com/enso-org/enso/assets/4699705/0cc42ff7-6047-41a5-bb99-c717d06d0d93) Widgets have been updated with `Group_By` removed from the dropdown.	2024-02-13 10:23:59 +00:00
James Dunkerley	0b6db5797c	Refactor OrderMask to avoid memory copying (#8863 ) Goal of this PR is to refactor the design of OrderMask and avoid copying arrays or lists wherever possible. We have removed a few legacy functions which were not being used. On a poor mans benchmark seems to be quicker (13s vs 16s) and memory usage should be lower.	2024-01-26 11:16:16 +00:00
AdRiley	1c6898b19d	Add Join Large Small benchmarks (#8845 ) Adds new benchmark joining a large table to a small table in preparation for a coming optimisation that will only index the smaller of the 2 tables in #8342	2024-01-24 19:13:09 +00:00
AdRiley	23d6fcdd9c	Make max of Random methods inclusive rather than exclusive (#8768 ) * Make max inclusive rather than exclusive * Update names to min max. Remove end_inclusive * Indicies should remain exclusive * Make the tests pass	2024-01-24 16:58:33 +00:00
GregoryTravis	f2cb1f097e	Support on_problems=Problem_Behavior.Report_Warning and Map_Error wrapping in Vector.map (#8595 ) Implements `Warnings.get_all wrap_errors=True` which wraps warnings attached to values inside vectors with `Map_Error`, which includes the position of the value within the vector. See [the documentation](https://github.com/enso-org/enso/blob/develop/docs/semantics/wrapped-errors.md) for more details. `get_all wrap_errors=True` does not change the warnings that are attached to values -- it wraps them before returning them to the caller, but does not change the original warnings attached to the values. Wrapped warnings only appear attached to the vector itself. The values inside the vector do not have their warnings wrapped. Warning propagation is not changed at all; `Warnings.get_all` (with default `wrap_errors=False`) behaves as before. `get_all wrap_errors=True` is meant to be used primarily by the IDE, although it can be used anywhere this wrapping is desired.	2024-01-16 09:36:22 +00:00
Radosław Waśko	f0c2a5fa7f	Opt-in return type checks (#8502 ) - Closes #8240	2023-12-19 15:32:30 +00:00
Pavel Marek	4b65e44ef3	EpbLanguage re-uses other TruffleContext support to run tests with assertions enabled (#7882 )	2023-12-15 13:31:32 +01:00
Hubert Plociniczak	a978d70a9e	Minor improvement to Startup.enso error (#8536 ) Unable to parse the current error message due to newlines. # Important Notes For example https://github.com/enso-org/enso/actions/runs/7174245013/job/19535275294#step:10:16133	2023-12-14 09:38:58 +00:00
GregoryTravis	1c815a3d45	Better Error Trapping in map (#8307 ) * tests * wip * wip * additional warnings * wip * wip * cleanup * nested wrapping * multiple nestings * wraps_error uses looks_for, test for should_fail_with * wip * stack trace line fix * use catch_primitive internally * fix warning mapping, dtf spec * just one wrapper checker, vector spec * missing ctor, back to non-primitive catch * back to c_p * put old map back * wip * unnest tests * Array.map on_problems * wip * Revert "wip" This reverts commit `c30d171457`. * better test names * warning logging * wip * wip * move logic into ALH * doc * constant * My_Error.Error * nested * doc * map_primtiive in warning mapper * composition * ref spec * Remove warnings prior to matching on the value If an expression has warnings and is matched we: 1) extract the warnings 2) execute the branch of a pattern that matches the value 3) attach extracted warnings to the result This caused warnings to reappear when doing the custom warnings manipulation. This is also consistent with how `CaseNode`'s `doWarning` specialization is defined. * fix 1 * do not auto unwrap in test error checkers * nested error matcher * in problems too * dtf * v * statistics * wip * Table_Spec, map_with_index_primitive * Column_Operations_Spec * disable warning wrapping and Report_Warning * unimpl test * Warnings_Spec * DCS * ACG JP * zip_primitive * join_helpers * Lookup_Helpers * Table * Data_Formatter * Value_Type_Helpers * revert check types changes * table_helpers * table tests * remove st * do not remove warnings from value * vec docs, tests for zip, mwi, flat_map * docs, fixes * remove nested_error_matcher * cleanup * benchmark * one error * alter * add bench to main * review * review * review * tail call * changelog * tail call was not a tail call * ws * bad import * Added missing import * Update distribution/lib/Standard/Base/0.0.0-dev/src/Data/Array.enso Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org> * review, ref example * lazy benchmark data * extra paren * check outside of catch * review * vector too * actually lazy * disambiguate Map_Error * finish rename * move to extensions * combine Additional_Warnings error * rename to map_no_wrap * do not catch and rethrow * review * wip * remove _primitives entirely * remove unused should_fail_with function options * remove expected_warning as function in Problems --------- Co-authored-by: Hubert Plociniczak <hubert.plociniczak@gmail.com> Co-authored-by: Radosław Waśko <radoslaw.wasko@enso.org>	2023-12-13 09:38:09 -05:00
Jaroslav Tulach	e4b2b56a40	Turning Sieve benchmarks into Enso benchmarks (#8475 )	2023-12-08 10:27:52 +01:00
Jaroslav Tulach	c6eb61a055	Filter for test/Benchmarks (#8391 ) With herein proposed change one can pass an optional filter to `enso --run test/Benchmarks` to execute only groups and specs that contain given string in its name.	2023-11-27 15:27:12 +00:00
Jaroslav Tulach	893965ed5c	3% speedup with LazyMap and MetadataStorage (#8359 )	2023-11-27 10:28:12 +01:00
Radosław Waśko	c6b6384fe6	Improve performance of anti-join (#8338 ) - Closes #8217	2023-11-24 02:44:57 +00:00
Jaroslav Tulach	4464a15035	Benchmark to measure startup time (#8378 ) While trying to speed `MetadataStorage` up - #8324 - I felt the need to have an independent (on my computer) measurement of startup time. Here is a benchmark that measures how long execution of two simple hello world programs take. # Important Notes There are two benchmarks: - `empty_startup` measures the time needed to boot without using any `Standard` library - basically _an overhead of the JVM and engine_ - `hello_world_startup` measures the use of `IO.println` - it shall take longer than `empty_startup` and show the overhead we have while processing the standard library	2023-11-23 21:31:01 +00:00
Pavel Marek	5a7ad6bfe4	Upgrade enso to GraalVM for jdk 21 (#7991 ) Upgrade to GraalVM JDK 21. ``` > java -version openjdk version "21" 2023-09-19 OpenJDK Runtime Environment GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15) OpenJDK 64-Bit Server VM GraalVM CE 21+35.1 (build 21+35-jvmci-23.1-b15, mixed mode, sharing) ``` With SDKMan, download with `sdk install java 21-graalce`. # Important Notes - After this PR, one can theoretically run enso with any JRE with version at least 21. - Removed `sbt bootstrap` hack and all the other build time related hacks related to the handling of GraalVM distribution. - `project-manager` remains backward compatible - it can open older engines with runtimes. New engines now do no longer require a separate runtime to be downloaded. - sbt does not support compilation of `module-info.java` files in mixed projects - https://github.com/sbt/sbt/issues/3368 - Which means that we can have `module-info.java` files only for Java-only projects. - Anyway, we need just a single `module-info.class` in the resulting `runtime.jar` fat jar. - `runtime.jar` is assembled in `runtime-with-instruments` with a custom merge strategy (`sbt-assembly` plugin). Caching is disabled for custom merge strategies, which means that re-assembly of `runtime.jar` will be more frequent. - Engine distribution contains multiple JAR archives (modules) in `component` directory, along with `runner/runner.jar` that is hidden inside a nested directory. - The new entry point to the engine runner is [EngineRunnerBootLoader](https://github.com/enso-org/enso/pull/7991/files#diff-9ab172d0566c18456472aeb95c4345f47e2db3965e77e29c11694d3a9333a2aa) that contains a custom ClassLoader - to make sure that everything that does not have to be loaded from a module is loaded from `runner.jar`, which is not a module. - The new command line for launching the engine runner is in [distribution/bin/enso](https://github.com/enso-org/enso/pull/7991/files#diff-0b66983403b2c329febc7381cd23d45871d4d555ce98dd040d4d1e879c8f3725) - [Newest version of Frgaal](https://repo1.maven.org/maven2/org/frgaal/compiler/20.0.1/) (20.0.1) does not recognize `--source 21` option, only `--source 20`.	2023-11-17 18:02:36 +00:00
Radosław Waśko	1b8b30a68d	Improve performance of `Join_Condition.Between` by sorting on one dimension (#8212 ) - Closes #5303 - Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`) - Adds benchmarks for join - Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks. - We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the amount of `Between` conditions used in the query and `k` is the result set size). - Follow up ticket for consideration later: #8216 - Closes #8215 - After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task. - Closes #8204 - Unrelated, but I ran into this here: adds type checks to other arguments of `set`. - Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments), lead to a hard to understand `Method `if_then_else` of type Column could not be found.`, instead now it would file with type error 'expected Text got Column`.	2023-11-08 12:59:55 +00:00
GregoryTravis	1480f50207	Overhaul the random number and item generation code (#8127 ) Rewrite most of Random.enso.	2023-10-31 15:25:37 +00:00
Radosław Waśko	c1259cb4d2	Compare performance of Panic / Java Exception / Dataflow error (#8130 ) After a discussion, I was really curious that our panics are supposed to be almost free - and while trusting that statement, it was really hard to believe - so I wanted to see for myself - knowing that an experiment is the most robust source of this kind of information - testing that in practice. So I wrote a benchmark comparing various ways of reporting errors, also testing them both at 'shallow' and 'deep' stack traces (adding 200 additional frames) - to see how stack depth affects them, if at all. The panics are indeed blazing fast! Kudos to the engine team. However, it seems that our dataflow errors are relatively slow (and we tend to use them _more_ than panics and want to be using them more and more). This uncovers a possible optimization opportunity. Can we make them as fast as panics?? Analysis of the benchmark results in comment below.	2023-10-24 12:03:44 +00:00
Radosław Waśko	93a31fcc8b	Add benchmarks related to `add_row_number` performance investigation (#8091 ) - Follow-up of #8055 - Adds a benchmark comparing performance of Enso Map and Java HashMap in two scenarios - _only incremental_ updates (like `Vector.distinct`) and _replacing_ updates (like keeping a counter for each key). These benchmarks can be used as a metric for #8090	2023-10-18 17:21:59 +00:00
Radosław Waśko	e9fa12763e	Improve performance of `add_row_number` (#8076 ) Fixes #8055	2023-10-17 00:42:35 +00:00
Radosław Waśko	0cd446432f	Fix inconsistency when building a Mixed column, fixes to Union (#7919 ) - Fixes #7352 by remembering original value types in type inference mode to be able to reconstruct them for Mixed. - Added more benchmarks for comparing performance of constructing columns. - Fixes missing implementations that caused `Table.union` crashing on some type pairs. - Ensures that `Loss_Of_Integer_Precision` warning is not swallowed when numeric columns are unioned to create a `Float` column. - Adds test for all of the above cases. - Allow to output benchmark results to a CSV by setting an environment variable - useful for quickly comparing benchmarks, e.g. in Enso.	2023-10-03 20:33:34 +02:00
Radosław Waśko	8d926166ea	Follow up improvements to `Date_Time_Formatter` (#7875 ) - Closes #7872 - Also closes #7866	2023-09-28 09:38:00 +00:00
Hubert Plociniczak	7d80ac12bb	Transform Range.iterate test to a benchmark (#7905 ) * Transform Range.iterate test to a benchmark a) it slows down regular unit testing b) it is not a unit test c) it behaves like a benchmark So it should be a benchmark. * missing Range import	2023-09-27 12:24:45 +02:00
Hubert Plociniczak	1ee3d8f4f0	Rename Decimal to Float (#7807 ) Implements #6889.	2023-09-14 15:01:30 +00:00
Radosław Waśko	255b424b72	Add `value_type` to `Column.from_vector` and `expected_value_type` to `Column.map` and `Column.zip` (#7637 ) - Closes #6111 - Aligns semantics of handling Mixed columns. - Now, if an operation like `iif` or `fill_nothing` is given a `Mixed` column, the result will also be `Mixed` regardless of the `inferred_precise_value_type`. - Enables a few old tests that were pending but could be enabled since the types work is advanced enough.	2023-08-31 13:20:49 +00:00
Pavel Marek	c32bfad2b1	Add std-libs benchmarks GH workflow (#7597 ) Adds "Benchmark Standard Libraries" workflow that can be run locally with `sbt std-benchmarks/bench`.	2023-08-23 12:18:36 +02:00
Radosław Waśko	2385f5b357	Add size-limited strings and varying bit-width integer Value_Types to in-memory backend and check for ArithmeticOverflow in LongStorage (#7557 ) - Closes #5159 - Now data downloaded from the database can keep the type much closer to the original type (like string length limits or smaller integer types). - Cast also exposes these types. - The integers are still all stored as 64-bit Java `long`s, we just check their bounds. Changing underlying storage for memory efficiency may come in the future: #6109 - Fixes #7565 - Fixes #7529 by checking for arithmetic overflow in in-memory integer arithmetic operations that could overflow. Adds a documentation note saying that the behaviour for Database backends is unspecified and depends on particular database.	2023-08-22 18:10:46 +00:00
Pavel Marek	a0086bb112	Ability to invoke all std benchmarks via jmh (#7519 ) All the Enso benchmarks in `test/Benchmarks` can be invoked via JMH	2023-08-17 14:48:43 +02:00
GregoryTravis	c9d7c5cb2b	Convert in-memory Column.round to Java (#7521 )	2023-08-16 14:45:23 +00:00

1 2 3

130 Commits