-
Notifications
You must be signed in to change notification settings - Fork 61
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge remote-tracking branch 'origin/staging'
- Loading branch information
Showing
1,880 changed files
with
279,844 additions
and
68,830 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
.. image:: images/pVACbind_logo_trans-bg_sm_v4b.png | ||
:align: right | ||
:alt: pVACbind logo | ||
|
||
.. _pvacbind: | ||
|
||
pVACbind | ||
==================================== | ||
|
||
This component of the pVACtools is used to predict neoantigens for the peptides in a FASTA file. | ||
|
||
.. toctree:: | ||
:glob: | ||
|
||
pvacbind/prerequisites | ||
pvacbind/getting_started | ||
pvacbind/run | ||
pvacbind/output_files | ||
pvacbind/filter_commands | ||
pvacbind/additional_commands |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,25 @@ | ||
.. image:: ../images/pVACbind_logo_trans-bg_sm_v4b.png | ||
:align: right | ||
:alt: pVACbind logo | ||
|
||
Additional Commands | ||
=================== | ||
|
||
To make using pVACbind easier, several convenience methods are included in the package. | ||
|
||
.. _pvacbind_example_data: | ||
|
||
Download Example Data | ||
--------------------- | ||
|
||
.. program-output:: pvacbind download_example_data -h | ||
|
||
List Valid Alleles | ||
------------------ | ||
|
||
.. program-output:: pvacbind valid_alleles -h | ||
|
||
List Allele-Specific Cutoffs | ||
---------------------------- | ||
|
||
.. program-output:: pvacbind allele_specific_cutoffs -h |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,50 @@ | ||
.. image:: ../images/pVACbind_logo_trans-bg_sm_v4b.png | ||
:align: right | ||
:alt: pVACbind logo | ||
|
||
Filtering Commands | ||
============================= | ||
|
||
pVACbind currently offers two filters: a binding filter and a top score filter. | ||
|
||
These filters are always run automatically as part | ||
of the pVACbind pipeline using default cutoffs. | ||
|
||
All filters can also be run manually on the filtered.tsv file to narrow the results down further, | ||
or they can be run on the all_epitopes.tsv file to apply different filtering thresholds. | ||
|
||
The binding filter is used to remove neoantigen candidates that do not meet desired peptide:MHC binding criteria. | ||
The top score filter is used to select the most promising peptide candidate for each variant. | ||
Multiple candidate peptides from a single somatic variant can be caused by multiple peptide lengths, registers, HLA alleles, | ||
and transcript annotations. | ||
|
||
Further details on each of these filters is provided below. | ||
|
||
Binding Filter | ||
-------------- | ||
|
||
.. program-output:: pvacbind binding_filter -h | ||
|
||
The binding filter removes variants that don't pass the chosen binding threshold. | ||
The user can chose whether to apply this filter to the ``lowest`` or the ``median`` binding | ||
affinity score by setting the ``--top-score-metric`` flag. The ``lowest`` binding | ||
affinity score is recorded in the ``Best MT Score`` column and represents the lowest | ||
ic50 score of all prediction algorithms that were picked during the previous pVACseq run. | ||
The ``median`` binding affinity score is recorded in the ``Median MT Score`` column and | ||
corresponds to the median ic50 score of all prediction algorithms used to create the report. | ||
Be default, the binding filter runs on the ``median`` binding affinity. | ||
|
||
By default, entries with ``NA`` values will be included in the output. This | ||
behavior can be turned off by using the ``--exclude-NAs`` flag. | ||
|
||
Top Score Filter | ||
---------------- | ||
|
||
.. program-output:: pvacbind top_score_filter -h | ||
|
||
This filter picks the top epitope for a variant. By default the | ||
``--top-score-metric`` option is set to ``median`` which will apply this | ||
filter to the ``Median MT Score`` column and pick the epitope with the lowest | ||
median mutant ic50 score for each variant. If the ``--top-score-metric`` | ||
option is set to ``lowest``, the ``Best MT Score`` column is instead used to | ||
make this determination. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
.. image:: ../images/pVACbind_logo_trans-bg_sm_v4b.png | ||
:align: right | ||
:alt: pVACbind logo | ||
|
||
Getting Started | ||
--------------- | ||
|
||
pVACbind provides a set of example data to show the expected format of input and output files. | ||
You can download the data set by running the ``pvacbind download_example_data`` :ref:`command <pvacbind_example_data>`. | ||
|
||
The example data output can be reproduced by running the following command: | ||
|
||
.. code-block:: none | ||
pvacbind run \ | ||
<example_data_dir>/input.fasta \ | ||
Test \ | ||
HLA-A*02:01,HLA-B*35:01,DRB1*11:01 \ | ||
MHCflurry MHCnuggetsI MHCnuggetsII NNalign NetMHC PickPocket SMM SMMPMBEC SMMalign \ | ||
<output_dir> \ | ||
-e 8,9,10 | ||
A detailed description of all command options can be found on the :ref:`Usage <pvacbind_run>` page. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,89 @@ | ||
.. image:: ../images/pVACbind_logo_trans-bg_sm_v4b.png | ||
:align: right | ||
:alt: pVACbind logo | ||
|
||
Output Files | ||
============ | ||
|
||
The pVACbind pipeline will write its results in separate folders depending on | ||
which prediction algorithms were chosen: | ||
|
||
- ``MHC_Class_I``: for MHC class I prediction algorithms | ||
- ``MHC_Class_II``: for MHC class II prediction algorithms | ||
- ``combined``: If both MHC class I and MHC class II prediction algorithms were run, this folder combines the neoeptiope predictions from both | ||
|
||
Each folder will contain the same list of output files (listed in the order | ||
created): | ||
|
||
.. list-table:: | ||
:header-rows: 1 | ||
|
||
* - File Name | ||
- Description | ||
* - ``<sample_name>.tsv`` | ||
- An intermediate file with variant information parsed from the input files. | ||
* - ``<sample_name>.tsv_<chunks>`` (multiple) | ||
- The above file but split into smaller chunks for easier processing with IEDB. | ||
* - ``<sample_name>.all_epitopes.tsv`` | ||
- A list of all predicted epitopes and their binding affinity scores, with | ||
additional variant information from the ``<sample_name>.tsv``. | ||
* - ``<sample_name>.filtered.tsv`` | ||
- The above file after applying all filters, with cleavage site and stability | ||
predictions added. | ||
|
||
all_epitopes.tsv and filtered.tsv Report Columns | ||
------------------------------------------------ | ||
|
||
.. list-table:: | ||
:header-rows: 1 | ||
|
||
* - Column Name | ||
- Description | ||
* - ``Mutation`` | ||
- The FASTA ID of the peptide sequence the epitope belongs to | ||
* - ``HLA Allele`` | ||
- The HLA allele for this prediction | ||
* - ``Sub-peptide Position`` | ||
- The one-based position of the epitope in the protein sequence used to make the prediction | ||
* - ``Epitope Seq`` | ||
- The epitope sequence | ||
* - ``Median Score`` | ||
- Median ic50 binding affinity of the epitope of all prediction algorithms used | ||
* - ``Best Score`` | ||
- Lowest ic50 binding affinity of all prediction algorithms used | ||
* - ``Best Score Method`` | ||
- Prediction algorithm with the lowest ic50 binding affinity for this epitope | ||
* - ``Individual Prediction Algorithm Scores`` (multiple) | ||
- ic50 scores for the ``Epitope Seq`` for the individual prediction algorithms used | ||
* - ``cterm_7mer_gravy_score`` | ||
- Mean hydropathy of last 7 residues on the C-terminus of the peptide | ||
* - ``max_7mer_gravy_score`` | ||
- Max GRAVY score of any kmer in the amino acid sequence. Used to determine if there are any extremely | ||
hydrophobic regions within a longer amino acid sequence. | ||
* - ``difficult_n_terminal_residue`` (T/F) | ||
- Is N-terminal amino acid a Glutamine, Glutamic acid, or Cysteine? | ||
* - ``c_terminal_cysteine`` (T/F) | ||
- Is the C-terminal amino acid a Cysteine? | ||
* - ``c_terminal_proline`` (T/F) | ||
- Is the C-terminal amino acid a Proline? | ||
* - ``cysteine_count`` | ||
- Number of Cysteines in the amino acid sequence. Problematic because they can form disulfide bonds across | ||
distant parts of the peptide | ||
* - ``n_terminal_asparagine`` (T/F) | ||
- Is the N-terminal amino acid a Asparagine? | ||
* - ``asparagine_proline_bond_count`` | ||
- Number of Asparagine-Proline bonds. Problematic because they can spontaneously cleave the peptide | ||
* - ``Best Cleavage Position`` (optional) | ||
- Position of the highest predicted cleavage score | ||
* - ``Best Cleavage Score`` (optional) | ||
- Highest predicted cleavage score | ||
* - ``Cleavage Sites`` (optional) | ||
- List of all cleavage positions and their cleavage score | ||
* - ``Predicted Stability`` (optional) | ||
- Stability of the pMHC-I complex | ||
* - ``Half Life`` (optional) | ||
- Half-life of the pMHC-I complex | ||
* - ``Stability Rank`` (optional) | ||
- The % rank stability of the pMHC-I complex | ||
* - ``NetMHCstab allele`` (optional) | ||
- Nearest neighbor to the ``HLA Allele``. Used for NetMHCstab prediction |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
.. image:: ../images/pVACbind_logo_trans-bg_sm_v4b.png | ||
:align: right | ||
:alt: pVACbind logo | ||
|
||
Prerequisites | ||
============= | ||
|
||
The input to pVACbind is a FASTA file of peptide sequences. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
.. image:: ../images/pVACbind_logo_trans-bg_sm_v4b.png | ||
:align: right | ||
:alt: pVACbind logo | ||
|
||
.. _pvacbind_run: | ||
|
||
Usage | ||
==================================== | ||
|
||
.. warning:: | ||
Using a local IEDB installation is strongly recommended for larger datasets | ||
or when the making predictions for many alleles, epitope lengths, or | ||
prediction algorithms. More information on how to install IEDB locally can | ||
be found on the :ref:`Installation <iedb_install>` page. | ||
|
||
.. program-output:: pvacbind run -h |
Oops, something went wrong.