Technical Reports

The ICICS/CS Reading Room

UBC CS TR-2006-06 Summary

Finding local RNA motifs using covariance models, April 03, 2006 Sohrab P. Shah and Anne E. Condon, 29 pages

We present DISCO, an algorithm to detect conserved motifs in sets of unaligned RNA sequences. Our algorithm uses covariance models (CM) to represent motifs. We introduce a novel approach to initialise a CM using pairwise and multiple sequence alignment. The CM is then iteratively refined. We tested our algorithm on 26 data sets derived from Rfam seed alignments of microRNA (miRNA) precursors and conserved elements in the untranslated regions of mRNAs (UTR elements). Our algorithm outperformed RNAProfile and FOLDALIGN in measures of sensitivity and positive predictive value, although the running time of RNAProfile was considerably faster. The accuracy of our algorithm was unaffected by properties of the input data and performed consistently under different settings of key parameters. The running time of DISCO is $O(N^2L^2W2 + NL3)$ where $W$ is the approximate width of the motif, $L$ is the length of the longest sequence in the input data, and $N$ is the number of sequences. Supplemental material is available at:\~{}sshah/disco.

If you have any questions or comments regarding this page please send mail to