-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Incorporate uniprot id into gene info (#160)
* Added uniprotkb_accession column to gene_info dataset * Updated gx validation for gene_info * Added the uniprot mapping file to the gene_info provenance * Updating gx validation * Updated gx validation to include uniprotkb_accessions * Finally got the gx validation to use the plural uniprotkb_accessions * Pytests passing * Removed rename_unknown_column() and related code * gene_info test_add_uniprot_id_to_gene_info_should_pass passing * uniprot fail test is failing * Removing testing yaml file that we do not need * Removing unnecessary changes to utils.py * Updated version number of syn54113663 to syn54113663.3 in config and test_config * Made the ensembl to uniprot test mapping file smaller * Improved uniprot code integration with expected flow of existing codebase * Removed unused regex for uniprot gx validation * Adding duplicate ensembl ID to uniprot testing file. Removing unused testing files. * Updating testing_gene_info docstring * Removed unused testing file * Removed unused variables in gene_info testing script --------- Co-authored-by: Beatriz Saldana <[email protected]>
- Loading branch information
1 parent
6f41530
commit ae4cf82
Showing
9 changed files
with
198 additions
and
49 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
20 changes: 20 additions & 0 deletions
20
tests/test_assets/gene_info/input/ensg_to_uniprot_mapping_good.tsv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
uniprotkb_accessions ensembl_gene_id | ||
P08603 ENSG00000000971 | ||
Q9H2S6 ENSG00000000005 | ||
O00522 ENSG00000001631 | ||
Q9P2G1 ENSG00000001629 | ||
Q9BTY2 ENSG00000001036 | ||
Q5TH74 ENSG00000001460 | ||
P23511 ENSG00000001167 | ||
Q6P499 ENSG00000001461 | ||
Q9Y6X5 ENSG00000001561 | ||
P48506 ENSG00000001084 | ||
Q8IZE3 ENSG00000000457 | ||
Q9NSG2 ENSG00000000460 | ||
P09769 ENSG00000000938 | ||
O60762 ENSG00000000419 | ||
Q16850 ENSG00000001630 | ||
Q9Y4W2 ENSG00000001497 | ||
P13569 ENSG00000001626 | ||
Q13275 ENSG00000001617 | ||
Q1TEST ENSG00000001617 |
Oops, something went wrong.