[VL] Fix sort based shuffle oom in spill when compress was disabled #7553
@@ -85,7 +85,7 @@ arrow::Status RssPartitionWriter::doEvict(
       inMemoryPayload->toBlockPayload(
           payloadType, payloadPool_.get(), codec_ ? codec_.get() : nullptr, std::move(compressed)));
   // Copy payload to arrow buffered os.
-  ARROW_ASSIGN_OR_RAISE(auto rssBufferOs, arrow::io::BufferOutputStream::Create(options_.pushBufferMaxSize, pool_));
+  ARROW_ASSIGN_OR_RAISE(auto rssBufferOs, arrow::io::BufferOutputStream::Create(options_.pushBufferMaxSize));
Thanks for working on a fix. However, wouldn't this leave the buffer's memory untracked?
cc @marin-ma
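To illustrate the concern: when the `pool_` argument is dropped, `BufferOutputStream::Create` falls back to Arrow's default memory pool, so the push buffer is no longer charged to the pool that the writer reports against. The sketch below is not Gluten or Arrow code; `TrackedPool` and `createPushBuffer` are hypothetical stand-ins that only show the difference in accounting.

```cpp
#include <cstddef>
#include <cstdint>
#include <memory>

// Hypothetical stand-in for a pool whose usage is reported to a memory accountant.
class TrackedPool {
 public:
  std::unique_ptr<uint8_t[]> allocate(std::size_t size) {
    bytesAllocated_ += size;  // the accountant sees this allocation
    return std::make_unique<uint8_t[]>(size);
  }

  std::size_t bytesAllocated() const {
    return bytesAllocated_;
  }

 private:
  std::size_t bytesAllocated_ = 0;
};

// Hypothetical helper: allocates through the pool when one is given,
// otherwise straight from the heap, which the accountant never sees.
std::unique_ptr<uint8_t[]> createPushBuffer(std::size_t size, TrackedPool* pool) {
  if (pool != nullptr) {
    return pool->allocate(size);
  }
  return std::make_unique<uint8_t[]>(size);
}
```

With a pool, the 1024 bytes show up in `bytesAllocated()`; with `nullptr`, the buffer still exists but the counter does not move, which is what "untracked" means here.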
It would. Actually, I'm a little confused about delegating the timing of the spill trigger to the Spark framework, since that means we can't make any other memory allocation while spilling.
Maybe we should have a threshold that decides whether to trigger a spill actively?
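A rough sketch of what such a threshold could look like, assuming the writer knows its memory limit and current usage. `SpillPolicy`, `memoryLimit`, and `activeSpillPercent` are hypothetical names for illustration, not existing Gluten options.

```cpp
#include <cstddef>

// Hypothetical policy: spill proactively once usage plus the pending request
// would exceed a fixed percentage of the limit, leaving headroom for the
// allocations that the spill itself needs.
struct SpillPolicy {
  std::size_t memoryLimit;         // total bytes the writer may use
  std::size_t activeSpillPercent;  // e.g. 80 means spill above 80% of the limit

  bool shouldSpillActively(std::size_t bytesInUse, std::size_t bytesRequested) const {
    // Integer arithmetic avoids floating-point rounding at the boundary.
    const std::size_t threshold = memoryLimit / 100 * activeSpillPercent;
    return bytesInUse + bytesRequested > threshold;
  }
};
```

With a limit of 1000 bytes and a threshold of 80 percent, a request that brings usage to 900 bytes triggers a spill, while one that brings it to 600 bytes does not.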
Thanks for opening a pull request! Could you open an issue for this pull request on GitHub Issues? https://github.com/apache/incubator-gluten/issues Then could you also rename the commit message and pull request title in the following format?
See also:
Thanks. Can we pre-allocate a buffer of size
That's a better solution, I'll update later. btw, I have another question, is it possible that
related
This PR is stale because it has been open 45 days with no activity. Remove stale label or comment or this will be closed in 10 days.
This PR was auto-closed because it has been stalled for 10 days with no activity. Please feel free to reopen if it is still valid. Thanks.
@clay4megtr Let's merge this one first. Perhaps consider this change in the future. Thanks!
What changes were proposed in this pull request?
Fix an OOM during spill when compression is disabled.