MutPred-PPI

Inputs | Variant format | Structure files | Output

Required inputs

Protein complex structure — a mmCIF (.cif / .mmcif) or PDB (.pdb) file for the protein complex of interest.
Protein A ID — identifier for the protein in chain A. Can be a UniProt accession, gene name, or any alphanumeric label.
Protein B ID — identifier for the protein in chain B.
Variants — a tab-separated list of missense variants.

Chain assignment: Protein A must correspond to chain A in your structure file, and Protein B to chain B. If they are reversed, swap your Protein A and Protein B labels accordingly.

Variant format

Three tab-separated columns per line, no header row:

MUTATED_PROTEIN_ID	VARIANT	PARTNER_ID

Column 1: ID of the protein carrying the mutation (must be Protein A or B).
Column 2: Missense variant in single-letter notation — [WT residue][1-based position][MT residue]. Example: V123A (Val→Ala at position 123). Only the standard 20 amino acids are supported.
Column 3: ID of the interaction partner (the other protein).

Position numbering: Positions are 1-based indices counting sequentially through the residues present in the structure file (first residue in the structure = 1). For AlphaFold3 outputs these match canonical sequence positions. For PDB structures with missing or non-sequentially numbered residues, the position in the variant notation should reflect the order of residues in the uploaded file, which may differ from the PDB residue numbers or UniProt canonical positions. After uploading a structure, the page shows the number of residues in each chain to help you verify the numbering.

Example

O00548	A653T	P46531
O00548	R661S	P46531
O00548	N34I	P46531
P46531	W89R	O00548
P46531	E100K	O00548

The first three rows specify mutations on Protein A (O00548). The last two rows specify mutations on Protein B (P46531) — note that the mutated protein is listed first.

Obtaining a structure file

AlphaFold3 Server — alphafoldserver.com generates predicted complex structures in mmCIF format.
AlphaFold3 outputs are subject to Google DeepMind’s Output Terms of Use (non-commercial use only).
Protein Data Bank — experimental structures from rcsb.org in mmCIF or PDB format.
Other prediction tools — AlphaFold-Multimer, RoseTTAFold, or any tool producing mmCIF or PDB output.

The structure must contain chains labeled A and B.

Output

A tab-separated file with one row per variant:

complex_id	variant	score
O00548_P46531	A653T	0.0852
O00548_P46531	R661S	0.0680
O00548_P46531	N34I	0.3541

complex_id: underscore-joined protein IDs (mutated protein first)
variant: variant notation as provided
score: predicted probability of interaction disruption (0–1). Scores ≥ 0.5 indicate predicted disruption.