Find top 5 The Tolerance Identification API finds the top N closest (by blosum62Blosum62) 9mers N-mers in the human genome against your a given protein of sequence of interest.
https://github.com/CyrusBiotechnology/tolerance-identification

Running tolerance-identification

cyrus engine submit tolerance-identification NLYIQWLKDGGPSSGRPPPS --top-n 10

Inputs:

...

top-n: Collect top-N matches (defaults to 20)

...

Table of Contents

Quickstart

Get the top 10 closest 9-mers:

Code Block
cyrus engine submit tolerance-identification NLYIQWLKDGGPSSGRPPPS --top-n 10

Get the top 5 closet 9 and 15-mers:

Code Block
cyrus engine submit tolerance-identification NLYIQWLKDGGPSSGRPPPS --top-n 5 --nmer-sizes 9,15

--top-n (int)
- Collect the top N matches
- default = 20
--nmer-sizes
- Nmer size(s) to run this on (Comma separated string ex: 9,10,11,12)
- default = 9

Running this protocol takes between 4 and 5 GB of memory per CPU