Subquery cache roadmap #22114

Open
1 of 11 tasks
sopel39 opened this issue May 24, 2024 · 6 comments
Labels
performance roadmap Top level issues for major efforts in the project subquery-cache Label for subquery cache relates issues

Comments

@sopel39
Member

sopel39 commented May 24, 2024

#21888 introduces the subquery cache feature to the Trino engine. However, there are many follow-up items that would improve performance and the cache hit rate.

Here are the roadmap items:

Tasks

  1. subquery-cache
    sopel39
  2. subquery-cache
    gaurav8297
  3. subquery-cache
  4. subquery-cache
  5. subquery-cache
  6. subquery-cache
    osscm
  7. subquery-cache
  8. subquery-cache
  9. subquery-cache
  10. subquery-cache
  11. subquery-cache
sopel39 added the subquery-cache label on May 24, 2024
@chenjian2664
Contributor

@sopel39 I am interested in contributing to this topic, but I am not very familiar with the tasks mentioned above. Could you guide me on how to start and suggest which task or tasks might be suitable to start with?

@osscm
Contributor

osscm commented May 28, 2024

Thanks a lot @sopel39 !

We discussed the implementation and approach a couple of times on the old issue.

As discussed, we were also looking into the same problem and would be more than happy to contribute.
Please share your thoughts on where you think I could work.

Maybe:

#22116
Or
#22165
Or

Whichever task you think is suitable.

Thanks.

@sopel39
Member Author

sopel39 commented May 28, 2024

Hi @osscm

Take into consideration that it will probably take some time to land this PR. I'm extracting smaller PRs atm. However, I think we should also progress with the improvements.

#22116 and #22165 are both important. #22116 in particular will improve the cache hit rate for string partition types, which are fairly common.

However, I would start with something simpler like #22120 or #22121 to familiarize yourself with the concepts (the code can still change during review).

@sug-ghosh

Hi @sopel39

I am interested in contributing to this issue. I am going through the code and implementation to understand it. Could you please suggest which sub-task I could take up?

@sopel39
Member Author

sopel39 commented Jun 5, 2024

@sug-ghosh I think we would need to sync. Please ping me on Slack.

@hackeryang
Member

Relevant PR: #21888

hackeryang added the roadmap label on Jul 3, 2024
5 participants