...
Currently, the ptm-predictor
API provides predictions for Asn deamidation, Asp isomerization, N-linked glycosylation, and Met photooxidation. :
Asparagine Deamidation
Aspartic Acid Isomerization
Methionine Oxidation
Hyper-reactive Cysteines
N-Linked Glycosylation
Lysine Glycation
Pyroglutamylation
N-Term Cyclization
C-Term Lysine Processing
For more sophisticated N-linked glycosylation prediction, use our Glycosylation Prediction API
While the reported findings are trained and informed by experimental data and observations found in literature, recommendations provided should supplement rather than replace domain expertise and insights of the user.
Info |
---|
|
Table of Contents | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
|
...
Flag known PTM motifs for a given sequence
Code Block |
---|
cyrus engine runsubmit ptm-prediction --fasta-file input.fasta |
Predict propensities for Asn deamidation and Met oxidation PTMs for multiple fasta files
Code Block |
---|
cyrus engine submit ptm-prediction --fasta-file *.fasta |
Predict PTMs for a given structure and return results with residue number offset by 12
Code Block |
---|
cyrus engine runsubmit ptm-prediction --pdb-file input.pdb --offset 12 |
Predict propensities for Asn deamidation and Met oxidation and include raw prediction data in outputPTMS for multiple structures
Code Block |
---|
cyrus engine runsubmit ptm-prediction --pdb-file inputinput1.pdb --raw |
...
input2.pdb |
Inputs
Submitting a ptm-prediction
job requires either FASTA or PDB inputs (not both). Results for FASTA inputs will not include oxidation predictions since this requires structural features for prediction. Results for PDB inputs will include both sequence and structure based predictions.
Sequence PTM Prediction
--fasta-file
(str)FASTA file(s) containing sequence(s) of interest
One or more FASTA files may be submitted
*.fasta
orinput1.fasta input2.fasta ...
When multiple FASTA files are provided, they are combined into one FASTA file for submission; therefore it is important to properly label input sequences using the FASTA header lines.
Structural PTM Prediction
--pdb-file
(str)Input PDB file(s) for predictions
One or more PDB files may be submitted.
Ideally the PDB is cleaned using the Clean PDB API and doesn’t not have regions of missing density as this would impact the quality of results
Options
For Structural PTM prediction only--offset
(int32)int64)
During feature extraction, the API automatically considers sequences and structures to start amino acid residue numbering at 1. Results are reported with this numbering by default. Sometimes it is desirable to renumber residue numbers to a specific numbering scheme.
Providing an integer N for this option will renumber residue numbers in output reports by N
Adjusts output residue numbering to original numbering scheme by offset provided
The API will automatically convert a given input PDB to sequential numbering (first residue starts at 1) internally to extract necessary features for predictions.
--raw
(boolean)For Structural PTM prediction only
Output raw prediction data used for both deamidation and oxidation as CSVs
Outputs
Sequence PTM Prediction
...
Outputs
All output files (listed below) will be returned as a tarball output.tgz
.
ptm_report.tsvcsv
TSV CSV file containing the following columns:
structure_ptm_report.csv
CSV file containing residue numbers and corresponding predicted PTM
Residue numbers will be offset by value provided if
--offset
is used
structure_ptm.pml
PyMOL script generated to highlight liable residues predicted to have PTMs
See Notes for more details and how to use this script to visualize your results
deamidation_report.csv
(if--raw
)CSV file containing the raw structural features used to predict for Asn deamidation for all Asn residues in the provided structure
Includes both positive and negative predictions
oxidation_report.csv
(if--raw
)CSV file containing the raw structural features used to predict for Met oxidation for all Met residues in the provided structure
- Includes both positive and negative predictions
seqID
ID
- header of sequence in input FASTAsequence
- input sequencedeamidation_score
- score of deamidation motif counts weighted by known motif hierarchydeamidation_count
- number of Asn deamidation motifs found in sequencedeamidation_hits
- list of detected deamidation motifs and sequence positiondeamidation_prone_hits
- highly deamidation prone motifs if detected and sequence position (See Notes for more information)isomerization_score
- score of isomerization motif counts weighted by known motif hierarchyisomerization_count
- number of Asp isomerization motifs found in sequenceisomerization_hits
- list of detected isomerization motifs and sequence positionglycosylation_count
- number of N-linked glycosylation motifs found in sequenceglycosylation_hits
- list of detected glycosylation motifs and sequence position
Structural PTM Prediction
sequence or structure ID
ptm
- PTM being reportedN
- number of residues or motifs predicted from sequence/structurehits
- List of residue or motif numbers for predicted PTMFor Asn deamidation and Asp isomerization, hits report the specific motif given experimental data attributing the N+1 residue to risk of PTM liability. For example,
NG_67
indicates that the asparagine at position 67 is at risk of deamidation, and the N+1 residue at that position is a glycine. (See Notes for more details)
ptm_report_<ID>.md
Markdown file containing a formalized report on the predicted PTMs for each sequence/structure (identified by ID).
Markdown files can be visualized in most IDEs or converted to a preferred text format by the user.
The report contains a summary of the predicted hits, and more detailed descriptions for each PTM including background on how it may manifest, potential mitigation strategies, and details on how it was predicted.
<ID>_ptm_report.pml
- only for PDB inputsWhen input structures are provided for
ptm-prediction
, predicted labile residues will be mapped onto the structures using an automatically generated PyMOL script.The script will color code residues as sticks and create individual scenes for each PTM predicted.
See Notes for more details
*pdb
- only for PDB inputsFor convenience, input PDBs will be included in the output data packet so that the generated PyMOL script will work directly in the output directory without concern for local paths to input PDB files.
Notes
Visualizing Liable Residues in PyMOL
...
The PTM predictor API, when running the structural PTM predictions, will automatically generate a
.pml
script to visualize predicted liable residues on the input structure in PyMOL.To visualize the predicted PTM residues, run the following command:
Code Block |
---|
pymol structure<ID>_ptm_report.pml |
...
Residues will be color coded and shown as sticks in separate scenes as follows, for example:
Deamidation: color = cyan (
util.cbac
), selection/scene =deamidation_pred
Oxidation:color = magenta (
util.cbam
), selection/scene =oxidation_pred
Upon starting the session, the current view of the structure would be of all predictions (scene =
all_predictions
)Clicking on a specific PTM scene would change views to only show residues of that specific PTM
See References for a tutorial on running PyMOL
...
Detection of Asp isomerization by mass spectrometry is challenging due to the same molecular mass of IsoAsp compared to Asp resulting in limited available experimental data. Therefore, the API for isomerization prediction is currently limited to sequence based flagging.