ConiferEST: an integrated bioinformatics system for data reprocessing and mining of conifer expressed sequence tags (ESTs)
2007

ConiferEST: A Bioinformatics System for Conifer EST Data

Sample size: 172229 publication Evidence: moderate

Author Information

Author(s): Liang Chun, Wang Gang, Liu Lin, Ji Guoli, Fang Lin, Liu Yuansheng, Carter Kikia, Webb Jason S, Dean Jeffrey FD

Primary Institution: Miami University

Hypothesis

ConiferEST aims to improve data quality control and validation of conifer expressed sequence tags (ESTs) through an integrated bioinformatics system.

Conclusion

ConiferEST provides biologists with powerful tools for data verification, visualization of abnormalities, and exploration of large EST datasets.

Supporting Evidence

  • ConiferEST houses 172,229 loblolly pine EST sequence reads.
  • Only 30.03% of designated 3' ESTs had an authenticated 5' terminus.
  • Fewer than 5.34% of designated 5' ESTs had a verified 5' terminus.
  • ConiferEST integrates data from multiple public resources for enhanced analysis.
  • WebTraceMiner was used to reprocess raw DNA traces for better data quality.

Takeaway

ConiferEST helps scientists understand and analyze conifer DNA sequences better by fixing errors and showing important details about the data.

Methodology

The system reprocesses raw DNA traces using WebTraceMiner to identify and verify sequence features.

Potential Biases

There may be biases in the data due to the inherent deficiencies of EST sequences and the methods used for their construction.

Limitations

The study may not account for all potential errors in EST sequences due to the complexity of cDNA library construction.

Digital Object Identifier (DOI)

10.1186/1471-2164-8-134

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication