Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doris Roadmap 2022 #7502

Closed
morningman opened this issue Dec 28, 2021 · 31 comments
Closed

Doris Roadmap 2022 #7502

morningman opened this issue Dec 28, 2021 · 31 comments
Labels
Discuss kind/community Issues or PRs related to Doris community

Comments

@morningman
Copy link
Contributor

morningman commented Dec 28, 2021

No description provided.

@morningman
Copy link
Contributor Author

morningman commented Dec 28, 2021

The following is the Roadmap for the Doris community in 2022.
The plan includes all aspects of code features, documentation, community building, etc. that are to be developed, have already been developed, and have been completed but require ongoing optimization.

The plan is currently under discussion, so if you have comments or suggestions on any aspect of the plan or beyond, please feel free to leave a comment or send an email to dev@doris.apache.org.

We will gradually create issues or jira for each direction of the plan to describe and track the progress in detail. Developers who wish to contribute are also welcome to create issues directly and associate with them (just leave a comment)

The directions marked (Good First Issue) in the plan are more independent modules, which are more suitable for newbie tasks or developers who are new to Doris. If you are interested in the relevant direction, please contact us at dev@doris.apache.org or under this issue, and we will provide detailed guidance, help and discussion.

The directions marked with (Q1) are the current work to be completed in the first quarter of 2022. We will update the schedule and progress of other directions gradually.

The marked (Done & Optimizing) directions are the directions that are currently completed but need continuous optimization. Such as ease of use, feature additions, and documentation additions.

We encourage developers to discuss anything in the dev mailing list, to subscribe to the mailing list please refer to How to subscribe.

Features

Performance Optimization

Stability and Observability

Testing

Functional Optimization

Deployment and Maintenance

Peripheral Ecology

Community

  • [Roadmap] Doris official website #7589 (Q1)

    Refactored Doris official website to provide best practices, community progress, blog posts, FAQ, and more.

  • Doris Documentation (Good First Issue)

    Non-code contributions are as important as code contributions, and the community is very open to developers improving and proofreading the project documentation.

  • Github Action (Good First Issue)

    Introduced more Github Actions to help improve the management of the code base. This includes but is not limited to PR autoresponders, tagging, etc. If you have a good Action to recommend, please leave a comment.

@morningman morningman pinned this issue Dec 28, 2021
@yiguolei
Copy link
Contributor

For regression test and performance test, we could follow clickhouse's test method. If it is allowed, I could do this.

@yiguolei
Copy link
Contributor

Clang compile is already on process, see #7451

@EmmyMiao87
Copy link
Contributor

Could you please open an email to discuss Roadmap 2022 of Doris ?

@EmmyMiao87 EmmyMiao87 added Discuss kind/community Issues or PRs related to Doris community labels Dec 28, 2021
@yangzhg
Copy link
Member

yangzhg commented Dec 28, 2021

支持parquet 文件存储格式也应该加进去吧

@wangshuo128
Copy link
Contributor

希望考虑跨版本升级功能。

@Henry2SS
Copy link
Contributor

What about supporting AVRO format in LOAD function?

@zbtzbtzbt
Copy link
Contributor

zbtzbtzbt commented Dec 28, 2021

Looking forward to push based pipeline engine @morningman-cmy @yiguolei

@hf200012
Copy link
Contributor

hf200012 commented Dec 28, 2021

Doris Manager:
1.Follow-up Doris Manager upgrade
2.User UI interaction improvement
3.Doris Manager supports Doris automated upgrade

@924060929
Copy link
Contributor

我们公司已经有一个回归测试框架。大体是用groovy的dsl去完成测试sql、stream load、安装tpch等功能,大概使用方式如下图。
后续可以提给社区
image

@jackwener
Copy link
Member

既然后续有这么多内容,关于社区部分建一个 RFC 目录挺有必要的,大型的 PR 的 design doc 放进去,一方面是为了社区新人的快速融入,另外也减小PR review的压力

@morningman morningman changed the title [Discussion] Doris Roadmap 2022 Doris Roadmap 2022 Jan 2, 2022
@morningman
Copy link
Contributor Author

既然后续有这么多内容,关于社区部分建一个 RFC 目录挺有必要的,大型的 PR 的 design doc 放进去,一方面是为了社区新人的快速融入,另外也减小PR review的压力

好主意,你是否有一些RFC 模板可供参考?

@jackwener
Copy link
Member

这是 cockroach 的 实践

@caiconghui
Copy link
Contributor

What about supporting AVRO format in LOAD function?

#7650

@Henry2SS
Copy link
Contributor

Henry2SS commented Jan 6, 2022

What about supporting AVRO format in LOAD function?

#7650

Thx for opening an issue.

@morningman
Copy link
Contributor Author

What about supporting AVRO format in LOAD function?

#7650

Thx for opening an issue.

Add to the roadmap

@hf200012
Copy link
Contributor

#7680 Data export function supports exporting to db, kafka, etc.

@hf200012
Copy link
Contributor

#7678 max_by, min_by aggregate function support

@huligong1234
Copy link

huligong1234 commented Jan 18, 2022

support decimal data type for create table as select statement. (detailMessage = Unsupported type 'DECIMAL(9,0)' in create table as select statement)

@morningman
Copy link
Contributor Author

morningman commented Jan 18, 2022

#7680 Data export function supports exporting to db, kafka, etc.
#7678 max_by, min_by aggregate function support

Added to the Roadmap

@morningman
Copy link
Contributor Author

support decimal data type for create table as select statement. (detailMessage = Unsupported type 'DECIMAL(9,0)' in create table as select statement)

Added to the roadmap

@yiguolei
Copy link
Contributor

Could use vectorized method to optimize load process??

@i7xh
Copy link

i7xh commented Jan 18, 2022

Why Doris need push based query execution engine?

@yiguolei
Copy link
Contributor

@i7xh
Two example:

  1. Currently doris‘s concurrency control is based on tablet, one tablet ---> exec fragment, there is only one thread to deal with the data at query engine. If use pushed engine, could expand the computing thread num at run time.
  2. In pull engine, if one fragment contains 3 or more node, like scan--> filter--> agg there is only one node is executing, but in push based engine, node executing could be async, for example scan and agg could execute at same time.

@i7xh
Copy link

i7xh commented Jan 19, 2022

Provides Schemaless semantics for fast analysis of semi-structured data

Json Parsing Optimization

There are a lot of user cases schema-less or semi-structured that lead to support json optimization especially,mainly with decouple change of the schema?

@i7xh
Copy link

i7xh commented Jan 19, 2022

企业微信截图_a18eb192-6b6a-4615-a6a5-eeae4d0430d2

look forward imperatively

@lordk911
Copy link

lordk911 commented Mar 8, 2022

want to know when will vectorized query engine could be released.

@kuncle
Copy link
Contributor

kuncle commented Aug 4, 2022

When will support data type Decimal(38,18) ?

@kpfly
Copy link

kpfly commented Aug 4, 2022

When will support data type Decimal(38,18) ?

This feature will be released as an experimental feature in version 1.2.0,which will be released at the end of this month.

@kuncle
Copy link
Contributor

kuncle commented Aug 4, 2022

When will support data type Decimal(38,18) ?

This feature will be released as an experimental feature in version 1.2.0,which will be released at the end of this month.

cool, thanks.

@mengzhisy
Copy link

请教一下“Pipeline execution engine”是否可以更方便地实现多线程执行算法,然后大幅提升多核利用率?
拿doris测试过tpc-ds,目前对多核心的利用率貌似比较低

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Discuss kind/community Issues or PRs related to Doris community
Projects
None yet
Development

No branches or pull requests