Fixup some inserts data to make compatible with what is in production. #1918

Open: wants to merge 11 commits into base: master
Changes from 4 commits
8 changes: 8 additions & 0 deletions CHANGELOG.rst
@@ -7,6 +7,14 @@ Change Log
----------


8.4.5
=====
* 2024-11-18/dmichaels
* Fixed up some inserts data to match 4dn/data/staging.
* Updated dcicutils to latest version (8.16.4).
* Updated dcicsnovault to latest version (11.23.0).


8.4.4
=====

16 changes: 8 additions & 8 deletions poetry.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 3 additions & 3 deletions pyproject.toml
@@ -1,7 +1,7 @@
[tool.poetry]
# Note: Various modules refer to this system as "encoded", not "fourfront".
name = "encoded"
version = "8.4.4"
version = "8.4.5"
description = "4DN-DCIC Fourfront"
authors = ["4DN-DCIC Team <[email protected]>"]
license = "MIT"
@@ -49,8 +49,8 @@ colorama = "0.3.3"
# we get odd 'pyo3_runtime.PanicException: Python API call failed' error on import
# of cryptography.hazmat.bindings._rust in cryptography package. 2023-04-21.
# cryptography = "39.0.2"
dcicsnovault = "^11.22.0"
dcicutils = "^8.16.1"
dcicsnovault = "^11.23.0"
dcicutils = "^8.16.4"
elasticsearch = "7.13.4"
elasticsearch-dsl = "^7.0.0" # TODO: port code from cgap-portal to get rid of uses
execnet = "1.4.1"
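
A note on the caret constraints bumped above: as Poetry documents them, `^11.23.0` accepts any release from 11.23.0 up to, but not including, 12.0.0. The following is a minimal sketch of that rule, not Poetry's own implementation. `caret_bounds` and `satisfies` are hypothetical helper names, and the sketch assumes plain three-part numeric versions with at least one non-zero component.

```python
from typing import Tuple

Version = Tuple[int, int, int]


def caret_bounds(spec: str) -> Tuple[Version, Version]:
    """Return (inclusive lower, exclusive upper) for a caret spec like "^8.16.4".

    Assumption: the spec is "^MAJOR.MINOR.PATCH" with numeric parts only.
    """
    parts = [int(p) for p in spec.lstrip("^").split(".")]
    if len(parts) != 3 or not any(parts):
        raise ValueError(f"unsupported caret spec in this sketch: {spec!r}")
    lower = (parts[0], parts[1], parts[2])
    # The upper bound bumps the left-most non-zero component and zeroes the rest.
    for i, p in enumerate(parts):
        if p != 0:
            bumped = parts[:i] + [p + 1] + [0] * (2 - i)
            break
    upper = (bumped[0], bumped[1], bumped[2])
    return lower, upper


def satisfies(version: str, spec: str) -> bool:
    """Check whether a three-part version string falls inside the caret range."""
    lower, upper = caret_bounds(spec)
    major, minor, patch = (int(p) for p in version.split("."))
    return lower <= (major, minor, patch) < upper
```

With these helpers, `satisfies("11.22.0", "^11.23.0")` is false, which is why the old pin alone would not guarantee the newer dcicsnovault release.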
2 changes: 1 addition & 1 deletion src/encoded/tests/data/inserts/award.json
@@ -40,7 +40,7 @@
"status": "current",
"title": "EXPLORING HOW THE GENOME FOLDS THROUGH PROXIMITY LIGATION AND SEQUENCING",
"url": "https://projectreporter.nih.gov/project_info_details.cfm?aid=8146738&icde=30734626",
"uuid": "b0b9c607-f8b4-4f02-93f4-9895b461334c",
"uuid": "36a06537-7831-494d-b10d-3e9fea931021",
"project": "External"
},
{
4 changes: 2 additions & 2 deletions src/encoded/tests/data/inserts/bio_feature.json
@@ -5,7 +5,7 @@
"feature_type": "0231da78-1adc-4b8a-89b9-4f91908412a3",
"uuid": "a8ab8bca-4840-41b7-88d7-04cc19a54657",
"genome_location": ["d1115d5e-40aa-43bc-b81c-32c70c9afb01"],
"relevant_genes": ["3f3496f7-31bf-429f-8da7-7f1fdf840dcc"],
"relevant_genes": ["594e3125-a9cb-4ffa-bc2b-17870c1690f0"],
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
@@ -17,7 +17,7 @@
"description": "A protein feature",
"feature_type": "91f427e6-5246-4992-8123-b4f8fa9eef01",
"uuid": "5a5b6f55-0b54-441c-86bf-292d41e443a0",
"relevant_genes": ["d5ee3bf3-63b0-4032-b133-314173b3cc4d"],
"relevant_genes": ["a093769c-d596-4a26-8916-06ae491575ba"],
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
20 changes: 10 additions & 10 deletions src/encoded/tests/data/inserts/biosource.json
@@ -4,24 +4,24 @@
"description": "GM12878 test cells 1",
"biosource_type": "immortalized cell line",
"individual":"4DNINOOOAAQ1",
"cell_line": "530036bc-8535-4448-903e-854af460b24c",
"cell_line": "b9668b9a-be39-47de-8eab-5bf1b0854417",
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"status": "in review by lab",
"biosource_vendor": "b31106bc-8535-4448-903e-854af460b21e"
"biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"
},
{
"uuid": "331111bc-8535-4448-903e-854af460a254",
"description": "GM12878 test cells 2",
"biosource_type": "immortalized cell line",
"individual":"4DNINOOOAAQ1",
"cell_line": "530036bc-8535-4448-903e-854af460b24c",
"cell_line": "b9668b9a-be39-47de-8eab-5bf1b0854417",
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"status": "in review by lab",
"biosource_vendor": "b31106bc-8535-4448-903e-854af460b21e"
"biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"
},
{
"uuid": "331111bc-8535-4448-903e-854ab460b254",
@@ -32,18 +32,18 @@
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"status": "in review by lab",
"biosource_vendor": "b31106bc-8535-4448-903e-854af460b21e"
"biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"
},
{
"uuid": "331111bc-8535-2248-903e-854af460b254",
"description": "test modified cells",
"biosource_type": "immortalized cell line",
"individual":"4DNINOOOAAQ1",
"cell_line": "530036bc-8535-4448-903e-854af460b24c",
"cell_line": "b9668b9a-be39-47de-8eab-5bf1b0854417",
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"biosource_vendor": "b31106bc-8535-4448-903e-854af460b21e",
"biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f",
"status": "in review by lab",
"modifications": ["431106bc-8535-4448-903e-854af460b265"]
},
@@ -56,7 +56,7 @@
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"status": "in review by lab",
"biosource_vendor": "b31106bc-8535-4448-903e-854af460b21e"
"biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"
},
{
"uuid": "111116bc-8535-4448-903e-854af460b254",
@@ -76,7 +76,7 @@
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"biosource_vendor": "b31106bc-8535-4448-903e-854af460b21e"
"biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"
},
{
"uuid": "0f011b1e-b772-4f2a-8c24-cc55de28a994",
@@ -86,7 +86,7 @@
"award": "1U01CA200059-01",
"lab": "dcic-testing-lab",
"submitted_by": "[email protected]",
"biosource_vendor": "b31106bc-8535-4448-903e-854af460b21e"
"biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"
},
{
"uuid": "c9165aa4-2ab5-428d-b5a7-db86dbb2d815",
4 changes: 2 additions & 2 deletions src/encoded/tests/data/inserts/enzyme.json
@@ -53,7 +53,7 @@
},
{
"name": "DpnII",
"aliases":["dcic:dpnII_neb"],
"aliases":["4dn-dcic-lab:dpnII_neb", "dcic:dpnII_neb"],
"enzyme_source": "new-england-biolabs",
"catalog_number":"R0543",
"recognition_sequence": "GATC",
@@ -63,7 +63,7 @@
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"url": "https://www.neb.com/products/r0543-dpnii",
"uuid": "557d7469-d1f6-4200-8ed4-c40374383dd3"
"uuid": "356a57a1-1f1d-463d-a972-27742f79a6a5"
},
{
"name": "NcoI_MspI_BspHI",
4 changes: 2 additions & 2 deletions src/encoded/tests/data/inserts/experiment_seq.json
@@ -2,7 +2,7 @@
{
"accession":"4DNEXO6777A1",
"biosample":"234411bc-8535-4448-903e-854af460b255",
"experiment_type": "e27b17cd-544a-3a3b-abc3-17a544544b3c",
"experiment_type": "47a593da-c458-422a-a974-82b3302e89cb",
"files": ["4DNFIN232JB1", "4DNFIN232JB2"],
"description": "Biorep 1 Techrep 1 ChIP-seq on GM12878 batch 1",
"award": "1U01CA200059-01",
@@ -23,7 +23,7 @@
{
"accession":"4DNEXO6777B1",
"biosample":"234511bc-8535-4448-903e-854af460b255",
"experiment_type": "e27b17cd-544a-3a3b-abc3-17a544544b3c",
"experiment_type": "47a593da-c458-422a-a974-82b3302e89cb",
"files": ["4DNFIN232JB3"],
"description": "Biorep 1 Techrep 2 ChIP-seq on GM12878 batch 1",
"award": "1U01CA200059-01",
6 changes: 3 additions & 3 deletions src/encoded/tests/data/inserts/gene.json
@@ -3,13 +3,13 @@
"geneid": "10664",
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"uuid": "3f3496f7-31bf-429f-8da7-7f1fdf840dcc"
"uuid": "594e3125-a9cb-4ffa-bc2b-17870c1690f0"
},
{
"geneid": "5885",
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"uuid": "d5ee3bf3-63b0-4032-b133-314173b3cc4d",
"preferred_symbol": "YFG"
"uuid": "a093769c-d596-4a26-8916-06ae491575ba",
"preferred_symbol": "RAD21"
}
]
4 changes: 2 additions & 2 deletions src/encoded/tests/data/inserts/lab.json
@@ -42,8 +42,8 @@
"phone1": "713-798-4897",
"postal_code": "77030",
"state": "TX",
"title": "Erez Lieberman Aiden Lab, BCM",
"uuid": "828cd4fe-ebb0-4b36-a94a-d2e3a36cc98a"
"title": "Erez Lieberman Aiden, BCM",
"uuid": "5771d772-1d10-43ea-bec1-0ea8c5a58503"
},
{
"awards": ["Test-4DN", "1U01CA200059-01", "Test-NOT-4DN"],
2 changes: 1 addition & 1 deletion src/encoded/tests/data/inserts/ontology_term.json
@@ -1,6 +1,6 @@
[
{
"uuid": "530036bc-8535-4448-903e-854af460b24c",
"uuid": "b9668b9a-be39-47de-8eab-5bf1b0854417",
"preferred_name": "GM12878",
"term_name": "GM12878",
"term_id": "EFO:0002784",
4 changes: 2 additions & 2 deletions src/encoded/tests/data/inserts/vendor.json
@@ -13,13 +13,13 @@
{
"title": "New England Biolabs",
"name": "new-england-biolabs",
"aliases":["dcic:neb"],
"aliases":["dcic:neb", "4dn-dcic-lab:neb"],
"description": "",
"url": "https://www.neb.com",
"award": "1U01CA200059-01",
"lab": "4dn-dcic-lab",
"submitted_by": "[email protected]",
"uuid": "b31106bc-8535-4448-903e-854af460b21e"
"uuid": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"
},
{
"title": "ThermoFisher Scientific",
2 changes: 1 addition & 1 deletion src/encoded/tests/data/master-inserts/award.json
@@ -2,7 +2,7 @@
{
"start_date": "2015-09-07",
"schema_version": "1",
"title": "4D NUCLEOME NETWORK DATA COORDINATION AND INTEGRATION CENTER",
"title": "4D NUCLEOME NETWORK DATA COORDINATION AND INTEGRATION CENTER - PHASE I",
"uuid": "b0b9c607-f8b4-4f02-93f4-9895b461334b",
"description": "DCIC: The goals of the 4D Nucleome (4DN) Data Coordination and Integration Center (DCIC) are to collect, store, curate, display, and analyze data generated in the 4DN Network. We have assembled a team of investigators with a strong track record in analysis of chromatin interaction data, image processing and three-dimensional data visualization, integrative analysis of genomic and epigenomic data, data portal development, large-scale computing, and development of secure and flexible cloud technologies. In Aim 1, we will develop efficient submission pipelines for data and metadata from 4DN data production groups. We will define data/metadata requirements and quality metrics in conjunction with the production groups and ensure that high-quality, well- annotated data become available to the wider scientific community in a timely manner. In Aim 2, we will develop a user-friendly data portal for the broad scientific community. This portal will provide an easy-to-navigate interface for accessing raw and intermediate data files, allow for programmatic access via APIs, and will incorporate novel analysis and visualization tools developed by DCIC as well as other Network members. For computing and storage scalability and cost-effectiveness, significant efforts will be devoted to development and deployment of cloud-based technology. We will conduct tutorials and workshops to facilitate the use of 4DN data and tools by external investigators. In Aim 3, we will coordinate and assist in conducting integrative analysis of the multiple data types. These efforts will examine key questions in higher-order chromatin organization using both sequence and image data, and the tools and algorithms developed here will be incorporated into the data portal for use by other investigators. These three aims will ensure that the data generated in 4DN will have maximal impact for the scientific community.",
"url": "https://projectreporter.nih.gov/project_info_description.cfm?aid=8987140&icde=30570219",
2 changes: 1 addition & 1 deletion src/encoded/tests/data/master-inserts/experiment_type.json
@@ -185,7 +185,7 @@
"lab": "4dn-dcic-lab",
"award": "1U01CA200059-01",
"status": "released",
"uuid": "e27b17cd-544a-3a3b-abc3-17a544544b3c",
"uuid": "47a593da-c458-422a-a974-82b3302e89cb",
"valid_item_types": ["ExperimentSeq"]
},
{
4 changes: 2 additions & 2 deletions src/encoded/tests/data/master-inserts/ontology.json
@@ -14,7 +14,7 @@
{
"uuid": "530016bc-8535-4448-903e-854af460b254",
"ontology_name": "Uberon",
"ontology_url": "http://uberon.github.io/",
"ontology_url": "http://obophenotype.github.io/uberon/",
"download_url": "http://purl.obolibrary.org/obo/uberon/composite-metazoan.owl",
"namespace_url": "http://purl.obolibrary.org/obo/",
"ontology_prefix": "UBERON",
@@ -25,7 +25,7 @@
{
"uuid": "530026bc-8535-4448-903e-854af460b254",
"ontology_name": "Ontology for Biomedical Investigations",
"ontology_url": "https://obi-ontology.org/",
"ontology_url": "http://obi-ontology.org",
"download_url": "http://purl.obolibrary.org/obo/obi.owl",
"namespace_url": "http://purl.obolibrary.org/obo/",
"ontology_prefix": "OBI",
5 changes: 3 additions & 2 deletions src/encoded/tests/data/perf-testing/vendor.json
@@ -44,8 +44,9 @@
"public_release": "2017-04-10",
"date_created": "2017-04-09T17:24:14.968417+00:00",
"aliases": [
"dcic:neb"
"dcic:neb",
"4dn-dcic-lab:neb"
],
"schema_version": "1"
}
]
]
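
Several of the UUIDs changed in this PR are referenced from other insert files: `relevant_genes` in bio_feature.json points at gene.json, `cell_line` and `biosource_vendor` in biosource.json point at ontology_term.json and vendor.json, and `experiment_type` in experiment_seq.json points at master-inserts/experiment_type.json. A quick check along the following lines could catch a reference left on an old UUID. This is a minimal sketch, not part of the repo or of dcicutils/dcicsnovault: `REF_FIELDS`, `find_dangling_refs`, and `SAMPLE` are hypothetical names, the field-to-type mapping is inferred only from this diff, and the sketch only looks at UUID-valued references (fields that use names, aliases, or accessions are out of scope).

```python
from typing import Dict, List, Tuple

# Hypothetical mapping, inferred from this diff: (item type, field) -> target item type.
REF_FIELDS: Dict[Tuple[str, str], str] = {
    ("bio_feature", "relevant_genes"): "gene",
    ("biosource", "cell_line"): "ontology_term",
    ("biosource", "biosource_vendor"): "vendor",
    ("experiment_seq", "experiment_type"): "experiment_type",
}


def find_dangling_refs(inserts: Dict[str, List[dict]]) -> List[Tuple[str, str, str]]:
    """Return (item type, field, uuid) for every reference with no matching target uuid."""
    known = {
        item_type: {item["uuid"] for item in items if "uuid" in item}
        for item_type, items in inserts.items()
    }
    dangling: List[Tuple[str, str, str]] = []
    for (source, field), target in REF_FIELDS.items():
        targets = known.get(target, set())
        for item in inserts.get(source, []):
            value = item.get(field)
            if value is None:
                continue
            # A field may hold one uuid or a list of uuids.
            refs = value if isinstance(value, list) else [value]
            dangling.extend((source, field, ref) for ref in refs if ref not in targets)
    return dangling


# In-memory excerpt of the post-change values shown in this diff (not loaded from disk).
SAMPLE: Dict[str, List[dict]] = {
    "gene": [
        {"uuid": "594e3125-a9cb-4ffa-bc2b-17870c1690f0"},
        {"uuid": "a093769c-d596-4a26-8916-06ae491575ba"},
    ],
    "bio_feature": [
        {
            "uuid": "a8ab8bca-4840-41b7-88d7-04cc19a54657",
            "relevant_genes": ["594e3125-a9cb-4ffa-bc2b-17870c1690f0"],
        },
    ],
    "ontology_term": [{"uuid": "b9668b9a-be39-47de-8eab-5bf1b0854417"}],
    "vendor": [{"uuid": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f"}],
    "biosource": [
        {
            "uuid": "331111bc-8535-4448-903e-854af460a254",
            "cell_line": "b9668b9a-be39-47de-8eab-5bf1b0854417",
            "biosource_vendor": "11f94a17-51ed-4a0f-93b1-1cac2fd2844f",
        },
    ],
}

if __name__ == "__main__":
    for source, field, ref in find_dangling_refs(SAMPLE):
        print(f"{source}.{field} -> {ref} has no matching uuid")
```

In practice the dictionary would be built by loading each JSON file under the inserts directories; the in-memory excerpt just keeps the sketch self-contained.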