Comparative analysis of long DNA sequences by per element information content using different contexts
2007

Analyzing Long DNA Sequences Using Information Content

Publication. Evidence: moderate

Author Information

Author(s): Trevor I Dix, David R Powell, Lloyd Allison, Julie Bernal, Samira Jaeger, Linda Stern

Primary Institution: Monash University

Hypothesis

Can a compression model reveal significant features in long DNA sequences?

Conclusion

The methodology finds significant features in DNA sequences that biologists can confirm.

Supporting Evidence

  • The methodology allows for fast exploration of long DNA sequences.
  • It highlights features of DNA sequences that are significant and confirmable by biologists.
  • The tool developed can analyze sequences in different contexts to reveal new information.

Takeaway

This study shows that measuring the information content of each position in a DNA sequence can reveal significant patterns, such as similarities between different regions of the sequence.

Methodology

The study applies a Bayesian compression model to long DNA sequences and identifies features from the per-element information content, that is, the number of bits the model needs to encode each base given its context.
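
To make "per-element information content" concrete, the following is a minimal sketch in Python. It does not reproduce the authors' Bayesian compression model, which this summary does not describe in detail. Instead it assumes a simple adaptive order-k Markov model with additive smoothing as a stand-in. The function name `per_base_information` and the parameters `order` and `pseudocount` are hypothetical, chosen for illustration only. Each base is charged -log2 of the probability the model assigned to it before seeing it, so predictable (for example, repetitive) regions cost few bits and surprising regions cost close to 2 bits.

```python
import math
from collections import defaultdict

ALPHABET = "ACGT"  # assumption: input contains only upper-case A, C, G, T


def per_base_information(seq, order=2, pseudocount=1.0):
    """Return the information content, in bits, of each base in seq.

    Illustrative stand-in model (not the paper's): an adaptive order-k
    Markov model whose counts are updated only after each base is scored.
    """
    # counts[context][base] = times `base` has followed `context` so far
    counts = defaultdict(lambda: dict.fromkeys(ALPHABET, 0))
    info = []
    for i, base in enumerate(seq):
        # The context is the preceding `order` bases (shorter at the start).
        context = seq[max(0, i - order):i]
        ctx_counts = counts[context]
        total = sum(ctx_counts.values()) + pseudocount * len(ALPHABET)
        p = (ctx_counts[base] + pseudocount) / total
        info.append(-math.log2(p))
        # Learn from this base only after it has been scored.
        ctx_counts[base] += 1
    return info


if __name__ == "__main__":
    repetitive = "ACGT" * 50
    bits = per_base_information(repetitive)
    print(f"mean bits per base: {sum(bits) / len(bits):.3f}")
```

With no prior knowledge a base costs 2 bits (one of four equally likely symbols). Plotting the returned values along the sequence, usually after smoothing over a window, shows low-information stretches as dips, which is the kind of feature the methodology is meant to surface.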

Digital Object Identifier (DOI)

10.1186/1471-2105-8-S2-S10
