Predicting the Tolerated Sequences for Proteins and Protein Interfaces Using RosettaBackrub Flexible Backbone Design
2011

Predicting Tolerated Sequences for Proteins Using Rosetta

Sample size: 171 publication Evidence: moderate

Author Information

Author(s): Colin A. Smith, Tanja Kortemme

Primary Institution: University of California San Francisco

Hypothesis

Can we predict the set of sequences that are tolerated by a protein or protein interface while maintaining a desired function?

Conclusion

The study provides a method for estimating tolerated sequences using flexible backbone protein design, which can be applied to various protein engineering tasks.

Supporting Evidence

  • The method was tested on three datasets with a significant number of tolerated sequences determined experimentally.
  • Predictions showed that 57% of frequently observed amino acids were found in the top five predicted amino acids.
  • The protocol provides a consistent set of parameters tested across several systems.

Takeaway

This study helps scientists understand which changes to a protein's sequence will still allow it to work properly, making it easier to design new proteins.

Methodology

The method involves using Monte Carlo simulations and genetic algorithms to predict low-energy sequences for proteins based on their backbone conformations.

Potential Biases

There is a risk of bias towards the native sequence in predictions.

Limitations

The method may be limited by the number of sequences that can be produced and analyzed, and it may not capture all potential sequence variations.

Digital Object Identifier (DOI)

10.1371/journal.pone.0020451

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication