Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add I1 and Other as valid values for Read Indicator #298

Merged
merged 4 commits into from
Sep 28, 2023
Merged
Show file tree
Hide file tree
Changes from 3 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
50 changes: 25 additions & 25 deletions HTAN.model.csv
Original file line number Diff line number Diff line change
Expand Up @@ -77,7 +77,7 @@ Imaging Level 3 Image,Quality controlled imaging data,,"Component, Filename, Fil
10x Visium Spatial Transcriptomics - RNA-seq Level 2,Alignment workflows downstream of Spatial Transcriptomics RNA-seq Level 1.,,"Component, Filename, File Format, Checksum, HTAN Data File ID, HTAN Parent Data File ID, UMI Tag, Whitelist Spatial Barcode File Link, Spatial Barcode Tag, Applied Hard Trimming, Workflow Version, Workflow Link, Genomic Reference, Genomic Reference URL, Genome Annotation URL, HTAN Parent Biospecimen ID, Run ID, Capture Area",,FALSE,Spatial Transcriptomics,10x Visium Spatial Transcriptomics - RNA-seq Level 1,,
10x Visium Spatial Transcriptomics - Auxiliary Files,"Auxiliary data associated with spot/slide analysis (aligned Images, quality control files, etc) from Spatial Transcriptomics.",,"Component, Filename, File Format, HTAN Data File ID, HTAN Parent Biospecimen ID, HTAN Parent Data File ID, Run ID, Visium File Type, Slide ID, Capture Area, Workflow Version, Workflow Link",,FALSE,Spatial Transcriptomics,"10x Visium Spatial Transcriptomics - RNA-seq Level 1, 10x Visium Spatial Transcriptomics - RNA-seq Level 2",,
10x Visium Spatial Transcriptomics - RNA-seq Level 3,Processed data files based on Spatial Transcriptomics RNA-seq Level 2 and Spatial Transcriptomics Auxiliary files.,,"Component, Filename, File Format, HTAN Data File ID, HTAN Parent Biospecimen ID, HTAN Parent Data File ID, Run ID, Visium File Type, Workflow Version, Workflow Link, Capture Area, Spots under tissue, Mean Reads per Spatial Spot, Median Number Genes per Spatial Spot, Sequencing Saturation, Proportion Reads Mapped, Proportion Reads Mapped to Transcriptome, Median UMI Counts per Spot",,FALSE,Spatial Transcriptomics,"10x Visium Spatial Transcriptomics - RNA-seq Level 2, 10x Visium Spatial Transcriptomics - Auxiliary Files",,
10x Visium Spatial Transcriptomics - RNA-seq Level 4,Processed data files based on Spatial Transcriptomics RNA-seq Level 3.,,"Component, Filename, File Format, HTAN Data File ID, HTAN Parent Biospecimen ID, HTAN Parent Data File ID, Run ID, Workflow Version, Workflow Link, Visium Workflow Type, Visium Workflow Parameters Description",,FALSE,Spatial Transcriptomics,"10x Visium Spatial Transcriptomics - RNA-seq Level 3",,
10x Visium Spatial Transcriptomics - RNA-seq Level 4,Processed data files based on Spatial Transcriptomics RNA-seq Level 3.,,"Component, Filename, File Format, HTAN Data File ID, HTAN Parent Biospecimen ID, HTAN Parent Data File ID, Run ID, Workflow Version, Workflow Link, Visium Workflow Type, Visium Workflow Parameters Description",,FALSE,Spatial Transcriptomics,10x Visium Spatial Transcriptomics - RNA-seq Level 3,,
Visium File Type,The file type generated for the visium experiment.,"reference png, reference jpg, json scale factors, probe dataset csv, qc result html, filtered mex, unfiltered mex, tissue_positions, barcodes, features, fiducial image png, fiducial image jpg, detected image png, detected jpg, high res image, low res image, json scale factors, probe dataset csv",,,TRUE,Spatial Transcriptomics,,,
Run ID,A unique identifier for this individual run (typically associated with a single slide) of the spatial transcriptomic processing workflow.,,,,TRUE,Spatial Transcriptomics,,,
Capture Area,"Area (or Capture Area) - One of the either four or two active regions where tissue can be placed on a Visium slide. Each area is intended to contain only one tissue sample. Slide areas are named consecutively from top to bottom: A1, B1, C1, D1 for Visium slides with 6.5 mm Capture Area and A, B for CytAssist slides with 11 mm Capture Area. Both CytAssist slides with 6.5 mm Capture Area and Gateway Slides contain only two slide areas, A1 and D1.","A, B, C, D, A1, B1, C1, D1",,,FALSE,Spatial Transcriptomics,,,
Expand Down Expand Up @@ -139,8 +139,8 @@ NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,GeoMx ROI and Segment M
Slide name,"Similar to a Run ID, the slide name indicates the slide a given ROI is linked to (as reported in Segment Summary).",,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
Raw reads,Reads not yet analyzed in any way to be used for data analysis. The number of reads that pass filter from the flow cell represented in the FASTQ file. ,,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
Stitched reads,"Represents consensus from the overlapping sequence of read 1 and 2. This is a % of the aligned reads that were overlapped and consensus confirmed, usually upward of 80% but less in terms of number of reads than aligned reads",,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
Aligned reads,"Is a sequence that has been aligned to a gene/probe. Typically these reads can number from the hundreds of thousands to tens of millions. In GeoMx alignment is via mapping the RTS ID to a white list of sequences that represent targets.",,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
Deduplicated reads,"Is the replacement of blocks of duplicate data with a Virtual Index Pointer linking the new sub-block to the existing block of data in a duplicate repository. This is used to reduce the amount of space need to store the data.",,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
Aligned reads,Is a sequence that has been aligned to a gene/probe. Typically these reads can number from the hundreds of thousands to tens of millions. In GeoMx alignment is via mapping the RTS ID to a white list of sequences that represent targets.,,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
Deduplicated reads,Is the replacement of blocks of duplicate data with a Virtual Index Pointer linking the new sub-block to the existing block of data in a duplicate repository. This is used to reduce the amount of space need to store the data.,,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
In Situ Negative median,Is the median of all negative control probes for a given segment. A measure of signal to background for each segment.,,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
Biological probe median,Is the median count from all probes except the negative control probes. A measure of signal to background for each segment,,,,FALSE,NanoString GeoMx DSP ROI DCC Segment Annotation Metadata,,,
HI-C-seq Level 1,Unaligned sequence data,,"Component, HTAN Parent Biospecimen ID, HTAN Data File ID, Filename, File Format, Genomic Reference, Sequencing Platform, Nucleic Acid Source, Technical Replicate Group, Transposition Reaction, Crosslinking Condtion, DNA Digestion Condition, Nuclei Permeabilization Method, Ligation Condition, Biotin Enrichment, DNA Input Amount, Total Reads, Protocol Link",,FALSE,Sequencing,Biospecimen,,
Expand Down Expand Up @@ -269,7 +269,7 @@ Median Fraction of Reads in Annotated cis DNA Elements,Median fraction of reads
Median Passing Read Percentage,Non-PCR duplicate nuclear genomic sequence reads not aligning to unanchored contigs out of total reads assigned to the nucleus barcode,,,,TRUE,scATAC-seq Level 1,,,
Median Percentage of Mitochondrial Reads per Nucleus,Contamination from mitochondrial sequences,,,,TRUE,scATAC-seq Level 1,,,
Peaks Calling Software,Generic name of peaks calling tool,,,,TRUE,scATAC-seq Level 1,,,
Read Indicator,Indicate if this is Read 1 (R1) or Read 2 (R2),"R1, R2, R1&R2",,,TRUE,Sequencing,,,
Read Indicator,"Indicate if this is Read 1 (R1), Read 2 (R2), Index Reads (I1), or Other","R1, R2, R1&R2, I1, Other",,,TRUE,Sequencing,,,
Read1,Read 1 content description,"Cell Barcode and UMI, cDNA",,,TRUE,scRNA-seq Level 1,,,
Read2,Read 2 content description,"Cell Barcode and UMI, cDNA",,,TRUE,scRNA-seq Level 1,,,
cDNA,"Complementary DNA. A DNA copy of an mRNA or complex sample of mRNAs, made using reverse transcriptase",,"cDNA Offset, cDNA Length",,FALSE,Sequencing,,,
Expand Down Expand Up @@ -878,7 +878,7 @@ RPPA Validation Status,Valid = RPPA and WB correlation > 0.7; Use with Caution =
Antibody Notes,Notes on antibodies replacements and antibody recognition observations.,,,,FALSE,HTAN RPPA Antibody Table,,,
Pre-processing Completed,Pre-processing steps completed to convert level 1 raw data to a single level 2 image,"Illumination correction, Tile Stitching, Channel/Cycle Registration, TMA de-arraying, None, Other",,,TRUE,Imaging,,https://www.miti-consortium.org/,
Pre-processing Required,Pre-processing steps required to convert level 1 raw data to a single level 2 image,"Illumination correction, Tile Stitching, Channel/Cycle Registration, TMA de-arraying, None, Other",,,TRUE,Imaging,,https://www.miti-consortium.org/,
Publication,"An empty parent attribute for publications",,,,FALSE,,,,
Publication,An empty parent attribute for publications,,,,FALSE,,,,
Publication Manifest,Publication specific attributes.,,"Publication-associated HTAN Parent Data File ID, HTAN Grant ID, HTAN Center ID, Publication Content Type, DOI, Title, Authors, Corresponding Author, Corresponding Author ORCID, Year of Publication, Location of Publication, Publication Abstract, License, PMID, Publication contains HTAN ID, Data Type, Tool, Supporting Link, Supporting Link Description",,FALSE,Publication,Publication,,
Publication-associated HTAN Parent Data File ID,HTAN Data File Identifier(s) of the files associated with the content presented/published. Should be comma-separated lists.,,,,TRUE,Publication,,,list like
HTAN Grant ID,HTAN grant number(s) (i.e. CA------ format) associated with the content presented/published.,,,,TRUE,Publication,,,list like
Expand Down Expand Up @@ -984,23 +984,23 @@ DMC data file format,Format of the data files,"BED, bedGraph",,,TRUE,Bulk Methyl
DMR data file Format,Format of the data files.,"BED, bedGraph",,,TRUE,Bulk Methylation-seq Level 3,,,
MS Assay Category,Type of Mass Spectrometry performed.,,,,TRUE,Mass Spectrometry Level 2,,,
Publication contains HTAN ID,HTAN IDs are used in the publication.,"Yes, No",,,TRUE,Publication,,,
Electron Microscopy Level 1,"Raw electron microscopy data as one TIFF file per plane for a 3D image stack or per tile for a 2D large area montage",,"Component, Filename, File Format, HTAN Parent Biospecimen ID, HTAN Data File ID, EM method, EM signal or contrast mech, EM instrument, Protocol Link, Software and Version, SizeX, SizeY, SizeC, SizeZ, PhysicalSizeX, PhysicalSizeY, PhysicalSizeZ, EM dwell or exposure time,EM voltage, EM beam current, EM spot size, EM stage tilt, EM signal processing, EM contrast type",,FALSE,Assay,Biospecimen,,
Electron Microscopy Level 2,"Processed electron microscopy data as one OME-TIFF image per plane or montage",,"Component, Filename, File Format, HTAN Parent Data File ID, HTAN Data File ID,Tile overlap X, Tile overlap Y,EM contrast type",,FALSE,Assay,Electron Microscopy Level 1,,
Electron Microscopy Level 3,"Segmented electron microscopy data as .am or .tiff formats",,"Component, Filename, File Format, HTAN Parent Data File ID, HTAN Data File ID, Imaging Object Class",,FALSE,Assay,Electron Microscopy Level 2,,
Electron Microscopy Level 4,"Movies or other derived files from electron microscopy data",,"Component, Filename, File Format, HTAN Parent Data File ID, HTAN Data File ID, Comment",,FALSE,Assay,Electron Microscopy Level 3,,
EM instrument,"Make and model of the EM instrument used",,,,TRUE,Electron Microscopy Level 1,,,str
EM method,"Electron microscopy method used","SEM, FIB-SEM, SBF-SEM, AT, TEM, ET, cryo ET, ssTEM",,,TRUE,Electron Microscopy Level 1,,,
EM signal or contrast mech,"How the electron microscopy signal is generated from the sample","BSE, SE, phase, bright-field, dark-field",,,TRUE,Electron Microscopy Level 1,,,
EM dwell or exposure time,"Duration in microseconds (µs) of electron beam data collection per pixel or frame",,,,FALSE,Electron Microscopy Level 1,,,num
EM voltage,"Accelerating voltage in kiloelectronvolts (keV)",,,,FALSE,Electron Microscopy Level 1,,,num
EM beam current,"Beam current in nanoamps (nA)",,,,FALSE,Electron Microscopy Level 1,,,num
EM spot size,"Beam spot size in micrometers (µm)",,,,FALSE,Electron Microscopy Level 1,,,num
EM stage tilt,"Physical stage tilt in degrees with respect to the electron beam",,,,FALSE,Electron Microscopy Level 1,,,num
EM signal processing,"SNR improvement strategies used","None, line integration, frame integration, frame averaging",,,FALSE,Electron Microscopy Level 1,,,
EM contrast type,"Does the image use standard SEM contrast or TEM contrast","SEM,TEM",,,FALSE,Electron Microscopy Level 1,,,
Tile overlap X,"Percentage of image overlap to allow tile stitching in x direction",,,,TRUE,Electron Microscopy Level 1,,,num
Tile overlap Y,"Percentage of image overlap to allow tile stitching in x direction",,,,TRUE,Electron Microscopy Level 1,,,num
Barretts Esophagus Goblet Cells Present,"Presence or absennce of Barretts esophagus goblet cells.","Yes, No",,,FALSE,Follow Up,,,
Pancreatitis Onset Year,"Date of onset of pancreatitis.",,,,FALSE,Follow Up,,,num
HTAN Parent Channel Metadata ID,"HTAN ID for a level 3 channels table.",,,,TRUE, Imaging Level 4,,,
Single Nucleus Capture, "Nuclei isolation method.","Plates, 10x, droplet",,,FALSE,scmC-seq Level 1,,,
Electron Microscopy Level 1,Raw electron microscopy data as one TIFF file per plane for a 3D image stack or per tile for a 2D large area montage,,"Component, Filename, File Format, HTAN Parent Biospecimen ID, HTAN Data File ID, EM method, EM signal or contrast mech, EM instrument, Protocol Link, Software and Version, SizeX, SizeY, SizeC, SizeZ, PhysicalSizeX, PhysicalSizeY, PhysicalSizeZ, EM dwell or exposure time,EM voltage, EM beam current, EM spot size, EM stage tilt, EM signal processing, EM contrast type",,FALSE,Assay,Biospecimen,,
Electron Microscopy Level 2,Processed electron microscopy data as one OME-TIFF image per plane or montage,,"Component, Filename, File Format, HTAN Parent Data File ID, HTAN Data File ID,Tile overlap X, Tile overlap Y,EM contrast type",,FALSE,Assay,Electron Microscopy Level 1,,
Electron Microscopy Level 3,Segmented electron microscopy data as .am or .tiff formats,,"Component, Filename, File Format, HTAN Parent Data File ID, HTAN Data File ID, Imaging Object Class",,FALSE,Assay,Electron Microscopy Level 2,,
Electron Microscopy Level 4,Movies or other derived files from electron microscopy data,,"Component, Filename, File Format, HTAN Parent Data File ID, HTAN Data File ID, Comment",,FALSE,Assay,Electron Microscopy Level 3,,
EM instrument,Make and model of the EM instrument used,,,,TRUE,Electron Microscopy Level 1,,,str
EM method,Electron microscopy method used,"SEM, FIB-SEM, SBF-SEM, AT, TEM, ET, cryo ET, ssTEM",,,TRUE,Electron Microscopy Level 1,,,
EM signal or contrast mech,How the electron microscopy signal is generated from the sample,"BSE, SE, phase, bright-field, dark-field",,,TRUE,Electron Microscopy Level 1,,,
EM dwell or exposure time,Duration in microseconds (µs) of electron beam data collection per pixel or frame,,,,FALSE,Electron Microscopy Level 1,,,num
EM voltage,Accelerating voltage in kiloelectronvolts (keV),,,,FALSE,Electron Microscopy Level 1,,,num
EM beam current,Beam current in nanoamps (nA),,,,FALSE,Electron Microscopy Level 1,,,num
EM spot size,Beam spot size in micrometers (µm),,,,FALSE,Electron Microscopy Level 1,,,num
EM stage tilt,Physical stage tilt in degrees with respect to the electron beam,,,,FALSE,Electron Microscopy Level 1,,,num
EM signal processing,SNR improvement strategies used,"None, line integration, frame integration, frame averaging",,,FALSE,Electron Microscopy Level 1,,,
EM contrast type,Does the image use standard SEM contrast or TEM contrast,"SEM,TEM",,,FALSE,Electron Microscopy Level 1,,,
Tile overlap X,Percentage of image overlap to allow tile stitching in x direction,,,,TRUE,Electron Microscopy Level 1,,,num
Tile overlap Y,Percentage of image overlap to allow tile stitching in x direction,,,,TRUE,Electron Microscopy Level 1,,,num
Barretts Esophagus Goblet Cells Present,Presence or absennce of Barretts esophagus goblet cells.,"Yes, No",,,FALSE,Follow Up,,,
Pancreatitis Onset Year,Date of onset of pancreatitis.,,,,FALSE,Follow Up,,,num
HTAN Parent Channel Metadata ID,HTAN ID for a level 3 channels table.,,,,TRUE, Imaging Level 4,,,
Single Nucleus Capture," ""Nuclei isolation method.""","Plates, 10x, droplet",,,FALSE,scmC-seq Level 1,,,
25 changes: 24 additions & 1 deletion HTAN.model.jsonld
Original file line number Diff line number Diff line change
Expand Up @@ -13686,7 +13686,7 @@
{
"@id": "bts:ReadIndicator",
"@type": "rdfs:Class",
"rdfs:comment": "Indicate if this is Read 1 (R1) or Read 2 (R2)",
"rdfs:comment": "Indicate if this is Read 1 (R1), Read 2 (R2), Index Reads (I1), or Other",
"rdfs:label": "ReadIndicator",
"rdfs:subClassOf": [
{
Expand All @@ -13705,6 +13705,12 @@
},
{
"@id": "bts:R1&R2"
},
{
"@id": "bts:I1"
},
{
"@id": "bts:Other"
}
],
"sms:displayName": "Read Indicator",
Expand Down Expand Up @@ -56245,6 +56251,23 @@
"sms:required": "sms:false",
"sms:validationRules": []
},
{
"@id": "bts:I1",
"@type": "rdfs:Class",
"rdfs:comment": "TBD",
"rdfs:label": "I1",
"rdfs:subClassOf": [
{
"@id": "bts:ReadIndicator"
}
],
"schema:isPartOf": {
"@id": "http://schema.biothings.io"
},
"sms:displayName": "I1",
"sms:required": "sms:false",
"sms:validationRules": []
},
{
"@id": "bts:Sci-ATAC-seq",
"@type": "rdfs:Class",
Expand Down