Feature request : add delta cache #769

Closed
djouallah opened this issue Aug 25, 2022 · 5 comments
Labels
enhancement (New feature or request), help wanted (Extra attention is needed)

Comments

@djouallah

I have been playing with Delta tables and Python just for fun, but recently, with the performance of DuckDB, things started to make more sense, and this may even be useful in a production environment. However, I noticed that to get good performance from cloud storage we need some form of local cache.

It would be really useful if a Delta table cache were supported; that would be a killer feature.

@djouallah djouallah added the enhancement New feature or request label Aug 25, 2022
@houqp houqp added the help wanted Extra attention is needed label Aug 28, 2022
@houqp
Member

houqp commented Aug 28, 2022

It might be better to implement the caching at the query engine layer, because delta-rs doesn't read any of the data files. There is definitely value in caching table metadata in delta-rs, though.

@djouallah
Author

"delta-rs doesn't read any of the data files", but it does know which files are needed by the query, correct?

@wjones127
Collaborator

I wonder if this could be implemented at the object-store level, where it would keep an LRU cache of recently loaded files. Though it might not be useful if the query engine always requests specific byte ranges rather than whole files.
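To make the idea concrete, here is a minimal sketch of a whole-file LRU cache in Python. It is not delta-rs or object-store code: the `LRUFileCache` class, its `fetch` callable, and the `max_bytes` budget are all hypothetical names chosen for illustration, and the cache lives in memory rather than on local disk. Range reads are served by slicing the cached file, which only pays off when the same file is read more than once.

```python
from collections import OrderedDict
from typing import Callable


class LRUFileCache:
    """Keep the most recently used whole files in memory, up to max_bytes."""

    def __init__(self, fetch: Callable[[str], bytes], max_bytes: int) -> None:
        self._fetch = fetch            # hypothetical: downloads one object by key
        self._max_bytes = max_bytes    # total size budget for cached files
        self._entries = OrderedDict()  # key -> bytes, least recently used first
        self._size = 0
        self.hits = 0
        self.misses = 0

    def get(self, key: str) -> bytes:
        """Return the whole file, fetching it only on a cache miss."""
        if key in self._entries:
            self._entries.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self._entries[key]

        self.misses += 1
        data = self._fetch(key)
        if len(data) <= self._max_bytes:    # never cache a file larger than the budget
            self._entries[key] = data
            self._size += len(data)
            while self._size > self._max_bytes:
                _, evicted = self._entries.popitem(last=False)  # drop the LRU entry
                self._size -= len(evicted)
        return data

    def get_range(self, key: str, start: int, end: int) -> bytes:
        """Serve bytes [start, end) from the cached whole file."""
        return self.get(key)[start:end]


if __name__ == "__main__":
    # An in-memory dict stands in for a remote bucket in this example.
    bucket = {"part-0.parquet": b"a" * 10, "part-1.parquet": b"b" * 10}
    cache = LRUFileCache(bucket.__getitem__, max_bytes=16)
    cache.get("part-0.parquet")
    cache.get_range("part-0.parquet", 0, 4)  # served from the cache
    print(cache.hits, cache.misses)
```

The trade-off mentioned above shows up in `get_range`: the first range request for a file downloads the whole file, so an engine that reads only a few small ranges per file could end up transferring more data than it would without the cache.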

@djouallah
Author

@wjones127 all I can say is that anything you do will have a massive impact. Object store throughput is problematic, and any cache will make the experience much better! It would also be a competitive advantage for Delta tables!

@wjones127 wjones127 mentioned this issue Feb 5, 2023
21 tasks
@djouallah
Author

Closing, as this is really engine-specific and not really a Delta table responsibility.

3 participants