Predicting Tolerated Sequences for Proteins Using Rosetta
Author Information
Author(s): Colin A. Smith, Tanja Kortemme
Primary Institution: University of California San Francisco
Hypothesis
Can we predict the set of sequences that are tolerated by a protein or protein interface while maintaining a desired function?
Conclusion
The study provides a method for estimating tolerated sequences using flexible backbone protein design, which can be applied to various protein engineering tasks.
Supporting Evidence
- The method was tested on three datasets with a significant number of tolerated sequences determined experimentally.
- Predictions showed that 57% of frequently observed amino acids were found in the top five predicted amino acids.
- The protocol provides a consistent set of parameters tested across several systems.
Takeaway
This study helps scientists understand which changes to a protein's sequence will still allow it to work properly, making it easier to design new proteins.
Methodology
The method involves using Monte Carlo simulations and genetic algorithms to predict low-energy sequences for proteins based on their backbone conformations.
Potential Biases
There is a risk of bias towards the native sequence in predictions.
Limitations
The method may be limited by the number of sequences that can be produced and analyzed, and it may not capture all potential sequence variations.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website