Title: | MapReduce Algorithms for Summarizing Evolutionary Trees on Multi-Core Platforms |
Speaker: |
Tiffani Williams Department of Computer Science, Texas A&M University |
Abstract |
Phylogenetics is concerned with inferring the genealogical relationships between a group of organisms (or taxa) and this relationship is usually expressed as an evolutionary tree. However, obtaining such trees is very difficult (most approaches use NP-hard optimization criteria). As a result, most phylogenetic analyses rely on heuristics to obtain accurate (best-scoring) trees. It is not uncommon for heuristics to return thousands of best-scoring trees. Hence, fast post-processing techniques are needed in order to summarize effectively the relationships depicted among the evolutionary trees. In this talk, I will present new post-processing algorithms for phylogenetics based on MapReduce, a parallel framework popularized by Google to design parallel applications for large-scale data applications on large computing clusters. Finally, I will discuss additional applications of our post-processing algorithms to facilitate the reconstruction of accurate evolutionary trees. |