ProteinMPNN

The ProteinMPNN API provides an interface to the ProteinMPNN protein design tool. This tool takes as input a PDB file and rapidly generates new sequences predicted to fold to the backbone of the input PDB.

Quickstart

Predict a single new sequence for an input PDB

cyrus engine submit protein-mpnn 1ubq.pdb

Predict 1000 new sequences for an input PDB

cyrus engine submit protein-mpnn 1ubq.pdb --n-mpnn-designs 1000

Predict 1000 new sequences for an input PDB using an elevated temperature (default temperature is 0.1

cyrus engine submit protein-mpnn 1ubq.pdb --n-mpnn-designs 1000 --sampling-temperature 0.2

Inputs

  • --pdb-file

    • The path to a PDB file containing the protein backbone you want to design sequences for

  • --n-mpnn-designs

    • The number of sequences to design, default is 1

  • --sampling-temperature

    • The sampling temperature of the model, default is 0.1, higher values will result in more variation of output sequenes

  • --gpu-type

    • The GPU to run the model on, default is t4. Set this to a100 if you are generating a very large number of sequences

Outputs

  • designed_sequences.fasta

    • A FASTA file containing all designed sequences. The first record in the file is the native sequence of the protein in the PDB file. The headers of the FASTA file contain score and sequence recovery values for each designed sequence