enso/distribution/lib/Standard
Radosław Waśko 1b8b30a68d
Improve performance of Join_Condition.Between by sorting on one dimension (#8212)
- Closes #5303
- Refactors `JoinStrategy` allowing us to 'stack' join strategies on top of each other (to some extent) - currently a `HashJoin` can be followed by another join strategy (currently `SortJoin`)
- Adds benchmarks for join
- Due to limitations of the sorting approach this will still not be as fast as possible for cases where there is more than 1 `Between` condition in a single query - trying to demonstrate that in benchmarks.
- We can replace sorting by d-dimensional [RangeTrees](https://en.wikipedia.org/wiki/Range_tree) to get `O((n + m) log^d n + k)` performance (where `n` and `m` are sizes of joined tables, `d` is the amount of `Between` conditions used in the query and `k` is the result set size).
- Follow up ticket for consideration later:
#8216
- Closes #8215
- After all, it turned out that `TreeSet` was problematic (because of not enough flexibility with duplicate key handling), so the simplest solution was to immediately implement this sub-task.
- Closes #8204
- Unrelated, but I ran into this here: adds type checks to other arguments of `set`.
- Before, putting in a Column as `new_name` (i.e. mistakenly messing up the order of arguments), lead to a hard to understand `Method `if_then_else` of type Column could not be found.`, instead now it would file with type error 'expected Text got Column`.
2023-11-08 12:59:55 +00:00
..
AWS/0.0.0-dev Implement Table.lookup_and_replace in Database (#8146) 2023-10-31 15:19:55 +00:00
Base/0.0.0-dev Limit max_rows that are downloaded in Table.read by default, and warn if more rows are available (#8159) 2023-11-06 16:41:47 +00:00
Database/0.0.0-dev Improve performance of Join_Condition.Between by sorting on one dimension (#8212) 2023-11-08 12:59:55 +00:00
Examples/0.0.0-dev S3 Read Access, Input Stream based reading (#7776) 2023-09-20 15:09:11 +00:00
Geo/0.0.0-dev Rename Decimal to Float (#7807) 2023-09-14 15:01:30 +00:00
Google_Api/0.0.0-dev S3 Read Access, Input Stream based reading (#7776) 2023-09-20 15:09:11 +00:00
Image/0.0.0-dev Add GROUP to functions (#7622) 2023-08-23 13:20:38 +00:00
Searcher/0.0.0-dev New Data.post for sending a payload to a Web API (#7700) 2023-09-19 11:26:29 +00:00
Table/0.0.0-dev Improve performance of Join_Condition.Between by sorting on one dimension (#8212) 2023-11-08 12:59:55 +00:00
Test/0.0.0-dev Overhaul the random number and item generation code (#8127) 2023-10-31 15:25:37 +00:00
Visualization/0.0.0-dev Limit max_rows that are downloaded in Table.read by default, and warn if more rows are available (#8159) 2023-11-06 16:41:47 +00:00