Release 1.3.0 (30 Jun 2021)

Key Features

AreaFeaturePR #s
Task RecoveryFixed a few important bugs and further improved the stability of this function. It now can work with spill-to-disk feature.812,813,837,
838,842,843,
847,863,868,
874,875,885,
889,891,901,
906,917,930,
932
CTE (Common Table Expressions)Additional optimization on top of 1.2.0 CTE optimization. Added cost based decision to decide whether to enable CTE or not. Added support for pushdown of dynamic filters and predicates into CTE nodes.722,811,815,
876,921,927
DM (Data Management)Further improved the performance of Data Management Operations. Exposed performance tuning parameters as:
- metastore -client-service-threads: Parallelize operations to Hive metastore by using multiple clients to send/receive requests
- metastore -write-bach-size: Reduce round trip to hive metastore by packing multiple operation objects per call
888
Star Tree Index1. Star Tree Cube now supports up to 10 billion cardinality.
2. openLooKeng CLI updates to improve cube management experience. User now has to issue a single sql statement to both create and populate data in the cube instead of multiple sql statements. The CLI changes help avoid query exceeded cluster memory limit issue.
3. Bug fixes
a. Merge continuous ranges into single range so cube can be utilized
b. Count distinct issue: Filter source data during cube insertion
834,867,890,
902,907
CBOSupport Sorted Source Aggregator

Added support for sort based aggregator in cases where input source is pre-ordered. This greatly reduces the amount of memory used for hashes and can finalize the majority of the results at the partial aggregation stage itself, thereby reducing the final aggregation load at a higher plan stage.

The openLooKeng optimizer makes choices between Sort Aggregator and Hash Aggregator based on the estimated cost of operation for the given memory.
855,905,906
Hudi ConnectorSnapshot queries for Hudi COW table is supported; snapshot queries and read optimized queries for HUDI MOR table are supported.881,900
GreenPlum ConnectorSupport read and write operations on the GreenPlum datasource. But update and delete operations are not yet supported.689
Oracle ConnectorAdd new capability to support update and delete operation within Oracle datasource.897
ClickHouse ConnectorSupport read and write operations to the ClickHouse datasource.

Also add support for SQL query pushdown, and registration & pushdown of external functions.
920
JDBC ConnectorEnhance JDBC to support the multiple splits so that it can improve the performance of high concurrency scenarios.939
Hive ConnectorUpgrade the Hive dependency from 3.0.0 to 3.1.2 and fixed the compatibility issue of timestamp caused by the upgrade.903
Memory ConnectorMemory Connector Optimizations

- HetuMetaStore integration to persist table info
- New data formation (LogicalParts) to support sorting and indexing
- Predicate pushdown
- Automatic spill-to-disk management
914
ResourceEnhanced resource group to throttle scheduling or kill query based on resource usage and user configurations.779,821,822,
836

Known Issues

CategoryDescriptionGitee issue
Task RecoveryAn error message: “Unsuccessful query retry”, is shown when CTA creates a transaction table and inserts data.I3YF45
A query can hang when there is insufficient memory for a node.I3YF4O
When an exception is thrown during stage 1, the value is doubled.I3YF4V

Obtaining the Document

For details, see https://gitee.com/openlookeng/hetu-core/tree/1.3.0/hetu-docs/en

有奖捉虫

“有虫”文档片段

0/500

存在的问题

文档存在风险与错误

● 拼写,格式,无效链接等错误;

● 技术原理、功能、规格等描述和软件不一致,存在错误;

● 原理图、架构图等存在错误;

● 版本号不匹配:文档版本或内容描述和实际软件不一致;

● 对重要数据或系统存在风险的操作,缺少安全提示;

● 排版不美观,影响阅读;

内容描述不清晰

● 描述存在歧义;

● 图形、表格、文字等晦涩难懂;

● 逻辑不清晰,该分类、分项、分步骤的没有给出;

内容获取有困难

● 很难通过搜索引擎,openLooKeng官网,相关博客找到所需内容;

示例代码有错误

● 命令、命令参数等错误;

● 命令无法执行或无法完成对应功能;

内容有缺失

● 关键步骤错误或缺失,无法指导用户完成任务,比如安装、配置、部署等;

● 逻辑不清晰,该分类、分项、分步骤的没有给出

● 图形、表格、文字等晦涩难懂

● 缺少必要的前提条件、注意事项等;

● 描述存在歧义

0/500

您对文档的总体满意度

非常不满意
非常满意

请问是什么原因让您参与到这个问题中

您的邮箱

创Issue赢奖品
根据您的反馈,会自动生成issue模板。您只需点击按钮,创建issue即可。
有奖捉虫