Title: | Efficient data structures in DNA sequence alignment |
Speaker: |
Jay Zhang UBC |
Abstract |
The invention of Next Generation Sequencing (NGS) machines has made
sequencing DNA cheaper and faster. Consequently, scientists are now
able to sequence whole genomes at a fraction of the effort. However,
NGS machines tend to produce large amounts of short reads--typically
less than a hundred bases--compared to traditional Sanger sequencing
methods, which produce small amounts of long reads. The short read
lengths, coupled with the massive amounts of data, make efficient
alignment and subsequent analysis difficult for traditional sequence
aligners and new techniques must be developed to compensate.
|