-
Notifications
You must be signed in to change notification settings - Fork 5
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #45 from puja-trivedi/create_docs_20241003
bkbit documentation
- Loading branch information
Showing
6 changed files
with
293 additions
and
23 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,77 @@ | ||
.. _genome_annotation: | ||
|
||
Annotated Genome Data | ||
---------------------- | ||
|
||
Overview | ||
......... | ||
|
||
Generate JSON-LD files for annotated genes from a given GFF3 file. Currently GFF3 files from ENSEMBL and NCBI are supported. | ||
|
||
Each JSON-LD file will contain: | ||
|
||
- GeneAnnotation objects | ||
- 1 GenomeAnnotation object | ||
- 1 GenomeAssembly object | ||
- 1 OrganismTaxon object | ||
- 1 Checksum object | ||
|
||
Command Line | ||
............. | ||
|
||
``bkbit gff2jsonld`` | ||
,,,,,,,,,,,,,,,,,,,,, | ||
|
||
.. code-block:: bash | ||
$ bkbit gff2jsonld [OPTIONS] GFF3_URL | ||
Options | ||
,,,,,,,, | ||
|
||
``-a, --assembly_accession`` | ||
ID assigned to the genomic assembly used in the GFF3 file. | ||
**Note: Must be provided when using ENSEMBL GFF3 files** | ||
|
||
``-s, --assembly_strain`` | ||
Specific strain of the organism associated with the GFF3 file. | ||
|
||
``-l, --log_level`` | ||
Logging level. | ||
|
||
Default: | ||
WARNING | ||
Options: | ||
DEBUG | INFO | WARNING | ERROR | CRITICIAL | ||
|
||
``-f, --log_to_file`` | ||
Log to a file instead of the console. | ||
|
||
Default: | ||
FALSE | ||
|
||
Arguments | ||
,,,,,,,,,,, | ||
|
||
``GFF3_URL`` | ||
URL to the GFF3 file. | ||
|
||
Examples | ||
......... | ||
|
||
Example 1: NCBI GFF3 file | ||
,,,,,,,,,,,,,,,,,,,,,,,,,, | ||
|
||
.. code-block:: bash | ||
# Run gff2jsonld command | ||
$ bkbit gff2jsonld 'https://ftp.ncbi.nlm.nih.gov/genomes/all/annotation_releases/9823/106/GCF_000003025.6_Sscrofa11.1/GCF_000003025.6_Sscrofa11.1_genomic.gff.gz' > output.jsonld | ||
Example 2: ENSEMBL GFF3 file | ||
,,,,,,,,,,,,,,,,,,,,,,,,,,,,, | ||
|
||
.. code-block:: bash | ||
# Run gff2jsonld command | ||
$ bkbit gff2jsonld -a 'GCF_003339765.1' 'https://ftp.ensembl.org/pub/release-104/gff3/macaca_mulatta/Macaca_mulatta.Mmul_10.104.gff3.gz' > output.jsonld |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,68 @@ | ||
.. _specimen_file_manifest: | ||
|
||
Specimen File Manifest | ||
---------------------- | ||
|
||
Overview | ||
......... | ||
|
||
Generates a JSON-LD file containing specimen file data using the BICAN Library Generation Schema. | ||
|
||
Command Line | ||
............. | ||
|
||
``bkbit filemanifest2jsonld`` | ||
,,,,,,,,,,,,,,,,,,,,,,,,,,,,, | ||
|
||
.. code-block:: bash | ||
$ bkbit filemanifest2jsonld [OPTIONS] FILE_MANIFEST_CSV | ||
**Options** | ||
|
||
``--list_library_aliquots`` | ||
A boolean flag that, when provided, generates a list of unique library aliquots contained in the given file manifest and saves output in file called 'file_manifest_library_aliquots.txt'. | ||
If this flag is not set (DEFAULT), then only the JSON-LD output will be generated. | ||
|
||
**Arguments** | ||
|
||
``FILE_MANIFEST_CSV`` | ||
Required argument. | ||
FILE_MANIFEST_CSV can be optained from Brain Knowledge Platform and **must** contains the following columns: | ||
|
||
- Project ID | ||
- Specimen ID | ||
- File Name | ||
- Checksum | ||
- File Type | ||
- Archive | ||
- Archive URI | ||
|
||
Examples | ||
......... | ||
|
||
Example 1: Only generate JSON-LD output | ||
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, | ||
|
||
.. code-block:: bash | ||
# Run filemanifest2jsonld command | ||
$ bkbit filemanifest2jsonld file_manifest.csv > output.jsonld | ||
Example 2: Generate JSON-LD output and list of library aliquots | ||
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, | ||
|
||
.. code-block:: bash | ||
# Run filemanifest2jsonld command | ||
$ bkbit filemanifest2jsonld --list_library_aliquots file_manifest.csv > output.jsonld | ||
# Generated output files | ||
$ ls . | ||
output.jsonld | ||
file_manifest_library_aliquots.txt | ||
# Contents of file_manifest_library_aliquots.txt | ||
$ cat file_manifest_library_aliquots.txt | ||
LP-123 | ||
LP-345 |
Oops, something went wrong.