Open
Labels
bug (Something isn't working)
Description
Describe the bug
You can trivially cause DataFusion to use any amount of memory by simply running:
select generate_series(9876543210);
Memory management functionality, e.g. MemoryPool, doesn't seem to have any effect.
To Reproduce
Run datafusion-cli with a memory limit, then run generate_series:
datafusion-cli -m 1g -c 'select generate_series(9876543210);'
Memory immediately jumps to ~20GB. (Note that this is not limited to datafusion-cli.)
This query also hangs indefinitely, but in production we see pods being OOM-killed for queries like this.
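For a rough sense of scale, here is a back-of-envelope sketch of how large the fully materialized result would be. It rests on assumptions not stated above: that the series starts at 0 and includes the stop value, that each element is an 8-byte integer, and that `-m 1g` means 2^30 bytes. The function names are hypothetical and are not part of DataFusion.

```rust
/// Number of elements in the inclusive series `start..=stop` with step 1.
/// (Assumption: the stop value is included.)
fn series_len(start: i64, stop: i64) -> u128 {
    if stop < start {
        0
    } else {
        // Widen to i128 so the subtraction cannot overflow.
        (stop as i128 - start as i128 + 1) as u128
    }
}

/// Estimated bytes needed to hold the whole series at once,
/// assuming 8-byte integer elements and no other overhead.
fn estimated_bytes(start: i64, stop: i64) -> u128 {
    series_len(start, stop) * std::mem::size_of::<i64>() as u128
}

/// Hypothetical up-front check: would the materialized series fit in the budget?
fn fits_in_budget(start: i64, stop: i64, budget_bytes: u128) -> bool {
    estimated_bytes(start, stop) <= budget_bytes
}
```

Under these assumptions, `estimated_bytes(0, 9_876_543_210)` is 79,012,345,688 bytes (about 79 GB), so `fits_in_budget(0, 9_876_543_210, 1 << 30)` is false by a wide margin.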
Expected behavior
generate_series should either be streamed so it uses very little memory, or should be killed/constrained by the memory pool.
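To illustrate what "streamed" could mean here, the following is a minimal standalone sketch of a generator that yields the series in fixed-size batches, so that only one batch is held in memory at a time. It is not DataFusion code and does not use its APIs; `SeriesBatches`, its fields, and the batch size are hypothetical names chosen for illustration, and an inclusive stop value is assumed.

```rust
/// Hypothetical batched generator for the inclusive series `start..=stop`.
/// Only one batch of at most `batch_size` values is alive at a time.
struct SeriesBatches {
    current: i64,
    stop: i64,
    batch_size: usize,
    done: bool,
}

impl SeriesBatches {
    fn new(start: i64, stop: i64, batch_size: usize) -> Self {
        assert!(batch_size > 0, "batch_size must be positive");
        SeriesBatches {
            current: start,
            stop,
            batch_size,
            // An empty series (start > stop) yields no batches.
            done: start > stop,
        }
    }
}

impl Iterator for SeriesBatches {
    type Item = Vec<i64>;

    fn next(&mut self) -> Option<Vec<i64>> {
        if self.done {
            return None;
        }
        let mut batch = Vec::with_capacity(self.batch_size);
        while batch.len() < self.batch_size {
            batch.push(self.current);
            if self.current == self.stop {
                // Last value emitted; stop before incrementing to avoid overflow.
                self.done = true;
                break;
            }
            self.current += 1;
        }
        Some(batch)
    }
}
```

With a batch size of 8192, for example, `SeriesBatches::new(0, 9_876_543_210, 8192)` holds roughly 64 KiB of values at any moment regardless of how long the series is, which is the property the expected behavior asks for.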
Additional context
The same presumably applies to the range UDF.
cc @davidhewitt