fix: oom during remediation annotations on large number of issues #5284
When there are a large number of SCA issues, remediation information is added in nested loops. Excessive allocation in the innermost loop was causing an OOM when the product of vulnerabilities, issues, and paths was sufficiently high.
This modifies issue data in-place rather than creating a copy. This probably violates type constraints; casting those concerns away in the interest of memory resources.
Fixes CLI-261, CLI-248.
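To illustrate the difference between copying and mutating in place, here is a minimal sketch. The `Remediation`, `Issue`, and `Vulnerability` shapes and both function names are hypothetical and much simpler than the real types in the codebase; the sketch only shows why the copying variant allocates in proportion to vulnerabilities × issues × paths while the in-place variant does not.

```typescript
// Hypothetical, simplified shapes (assumptions, not the real CLI types).
interface Remediation { upgradeTo: string }
interface Issue { id: string; paths: string[][]; remediation?: Remediation }
interface Vulnerability { id: string; issues: Issue[] }

// Copying variant: allocates a new object per issue and a new array per path,
// so memory grows with vulnerabilities x issues x paths.
function annotateByCopy(
  vulns: Vulnerability[],
  fixes: Map<string, Remediation>,
): Vulnerability[] {
  return vulns.map((v) => ({
    ...v,
    issues: v.issues.map((issue) => ({
      ...issue,
      paths: issue.paths.map((p) => [...p]),
      remediation: fixes.get(issue.id),
    })),
  }));
}

// In-place variant: no new issue objects, only a property write on the
// existing ones. The caller's data is mutated.
function annotateInPlace(
  vulns: Vulnerability[],
  fixes: Map<string, Remediation>,
): void {
  for (const v of vulns) {
    for (const issue of v.issues) {
      const fix = fixes.get(issue.id);
      if (fix !== undefined) {
        issue.remediation = fix;
      }
    }
  }
}
```

The trade-off is the one described above: the in-place variant changes objects that other code may assume are immutable, in exchange for a much smaller peak heap.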
Notes
If I run this with `--json-file-output` AND comment out the `JSON.stringify` in `jsonStringifyLargeObject`, I can get a large container to write JSON output.

`npm run dev -- container test --platform=linux/amd64 --json-file-output=acryldata.json acryldata/datahub-ingestion:v0.11.0.4` produces a lot of text output, and a rather large JSON output.

Writing the JSON output took about 10 minutes longer to complete on an M1 Mac, compared to running without JSON output.
So I think this needs more work... replacing that JSON.stringify with streaming output should address the bug completely.
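As a rough idea of what streaming output could look like, here is a minimal sketch of a generator that emits JSON text in small pieces, so the full document never has to exist as a single string. The name `jsonChunks` is hypothetical and this is not what the CLI currently does. It is deliberately simplified: no indentation, no cycle detection, and no `toJSON` support (so `Date` values would not serialise the way `JSON.stringify` serialises them).

```typescript
// Yields the JSON text of `value` piece by piece (hypothetical helper).
function* jsonChunks(value: unknown): Generator<string> {
  if (value === null || typeof value !== "object") {
    // Primitives are small, so stringifying them one at a time is cheap.
    // JSON.stringify returns undefined for undefined, functions and symbols;
    // emit "null" for those, which matches its behaviour inside arrays.
    const text: string | undefined = JSON.stringify(value);
    yield text === undefined ? "null" : text;
    return;
  }
  if (Array.isArray(value)) {
    yield "[";
    for (let i = 0; i < value.length; i++) {
      if (i > 0) yield ",";
      yield* jsonChunks(value[i]);
    }
    yield "]";
    return;
  }
  yield "{";
  let first = true;
  for (const [key, child] of Object.entries(value)) {
    // Like JSON.stringify, skip properties that cannot be serialised.
    if (
      child === undefined ||
      typeof child === "function" ||
      typeof child === "symbol"
    ) {
      continue;
    }
    if (!first) yield ",";
    first = false;
    yield JSON.stringify(key) + ":";
    yield* jsonChunks(child);
  }
  yield "}";
}
```

Each chunk could then be written to a Node write stream (for example one from `fs.createWriteStream`), waiting for the `drain` event whenever `write()` returns `false`, so memory use stays bounded by the chunk size rather than by the size of the whole output.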
The `JSON.stringify`, even in a try block, still crashes `ts-node` on my machine.

Pull Request Submission
Please check the boxes once done.
The pull request must:
`feat:` or `fix:`, others might be used in rare occasions as well, if there is no need to document the changes in the release notes. The changes or fixes should be described in detail in the commit message for the changelog & release notes.

Pull Request Review
All pull requests must undergo a thorough review process before being merged.
The review process of the code PR should include code review, testing, and any necessary feedback or revisions.
Pull request reviews of functionality developed in other teams only review the given documentation and test reports.
Manual testing will not be performed by the reviewing team, and is the responsibility of the author of the PR.
For Node projects: It’s important to make sure changes in `package.json` are also affecting `package-lock.json` correctly.

If a dependency is not necessary, don’t add it.
When adding a new package as a dependency, make sure that the change is absolutely necessary. We would like to refrain from adding new dependencies when possible.
Documentation PRs in gitbook are reviewed by Snyk's content team. They will also advise on the best phrasing and structuring if needed.
Pull Request Approval
Once a pull request has been reviewed and all necessary revisions have been made, it is approved for merging into
the main codebase. The merging of the code PR is performed by the code owners, the merging of the documentation PR
by our content writers.
What does this PR do?
Where should the reviewer start?
How should this be manually tested?
Any background context you want to provide?
What are the relevant tickets?
Screenshots
Additional questions