-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add functionality to parse variants from VCF files #6
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
* Initial commit * Update DESCRIPTION * bootstrap tests * Test CI * Prototype function to load gene expression data to SE * Stub functions * Add progress bar * Add pre-commit config * Add personalis functions to package and run roxygen * Add relevant documentation on personalis outputs * Update exploratory notebook * Fix pre-commit prettier * Format files according to pre-commit checks * Draft function to read small variant data * configure lintr * Implement basic functionality to read variant and GEX data into MAE * Implement basic error handling for missing samples * Generalize warning for missing samplese * Read CNV data * Fix CNV IO function in case of empty tables * Add function to read personalis HLA data * Add function to read in TCR data * Scrape TCR summary statistics from HTML * Implement function to read somatic variant statistics * Read summary stats for somatic variants * Implement bumpy_matrix_to_df * Read CNV summary statistics * Read MSI info * refactor * Workaround for samples with no col in bumpy matrix * Apply the fix also to small variant data * Use "Genomic Variant" instead of pos as unique variant identifier * Fix issue with reading non-somatic variants * Handle case when there are no samples for a modality * Fix duplicated mutation ids * Fix column name incompatibility in newer HTML report versions * stub vignette * Add vignette * Update vignette * Ensure bumpy matrix, row and coldata have consistent order * Fix alternative gex filename and CNV import * Support alternative TCR path * Fix column conversion in CNV reader * Fix paths * add function for parsing VCF files * add functionality for reading and storing VCF data * add/change comments * add option to read small variant reports of type all * Angewendeter Vorschlag * Angewendeter Vorschlag * add sample type check * Angewendeter Vorschlag * Angewendeter Vorschlag * add report_type parameter * Update README * Fix reading CNV report * Roxygenize * Fix parse copy number report --------- Co-authored-by: Christopher Mohr <[email protected]> Co-authored-by: grst <[email protected]>
christopher-mohr
approved these changes
Mar 15, 2024
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 👍
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Porting changes originally made by @christopher-mohr.