Parallel Method for Transcriptome Assembly

Sample size: 925000000 publication Evidence: high

Author Information

Author(s): Jackson Benjamin G, Schnable Patrick S, Aluru Srinivas

Primary Institution: Iowa State University

Can a parallel method effectively assemble transcriptomes from large short sequence data sets?

The method successfully assembled 925 million sequences from 40 billion nucleotides in a few minutes using a 1024 processor Blue Gene/L.

The method can scale to large problem sizes.
It processes high coverage data sets quickly.
Validation was performed by aligning assembled contigs back to the reference genome.

This study shows a new way to quickly piece together DNA sequences from lots of tiny bits, helping scientists understand genes better.

The method constructs a distributed bidirected graph to capture overlap information and uses parallel computing to manage complexity.

The method may require integration of clone pairs for better genome assembly.

Access the complete publication on the publisher's website