Title: Improved RNA secondary structure prediction by maximizing expected pair accuracy
Speaker: Monir Hajiaghayi
Department of Computer Science, University of British Columbia
Abstract

Free energy minimization has been the most popular method for RNA secondary structure prediction for decades. It is based on a set of empirical free energy change parameters derived from experiments using a nearest-neighbor model. In this talk, we will introduce a program, MaxExpect obtained by Lu et al., that predicts RNA secondary structure by maximizing the expected base-pair accuracy. This approach was first pioneered in the program CONTRAfold, using pair probabilities predicted with a statistical learning method. Here, a partition function calculation that utilizes the free energy change nearest-neighbor parameters is used to predict base-pair probabilities as well as probabilities of nucleotides being single-stranded. MaxExpect predicts both the optimal structure (having highest expected pair accuracy) and suboptimal structures to serve as alternative hypotheses for the structure. Tested on a large database of different types of RNA, the maximum expected accuracy structures are, on average, of higher accuracy than minimum free energy structures. Using MaxExpect, the average Positive predictive value (PPV) of optimal structure is improved from 66% to 68% at the same sensitivity level (73%) compared with free energy minimization.