A method to improve protein subcellular localization prediction by integrating various biological data sources

2009

Improving Protein Localization Prediction

Sample size: 3552 publication Evidence: moderate

Author Information

Author(s): Tung Thai Quang, Lee Doheon

Primary Institution: Department of Bio & Brain Engineering, KAIST, Daejeon City, Republic of Korea

Can integrating various biological data sources improve the prediction of protein subcellular localization?

The proposed method can enhance prediction performance by incorporating neighborhood information from functional gene networks.

This study found a better way to guess where proteins are located in cells by looking at similar proteins nearby.

The study used a fuzzy k-NN classification method combined with neighborhood information from a probabilistic gene network.

The prediction may be biased towards major locations due to the imbalanced distribution of proteins.

The method may still struggle with imbalanced datasets and proteins without GO annotations.

The dataset consisted of yeast proteins with various subcellular localizations.

Access the complete publication on the publisher's website