Jakub Kovac. On the Complexity of Rearrangement Problems under the Breakpoint Distance. Technical Report arXiv:1112.2172, arXiv.org, 2011.
Download preprint: not available
Download from publisher: http://arxiv.org/abs/1112.2172
Related web page: not available
Bibliography entry: BibTeX
Abstract:
Tannier et al. introduced a generalization of breakpoint distance for multichromosomal genomes. They showed that the median problem under the breakpoint distance is solvable in polynomial time in the multichromosomal circular and mixed models. This is intriguing, since in all other rearrangement models (DCJ, reversal, unichromosomal or multilinear breakpoint models), the problem is NP-hard. The complexity of the small or even the large phylogeny problem under the breakpoint distance remained an open problem. We improve the algorithm for the median problem and show that it is equivalent to the problem of finding maximum cardinality non-bipartite matching (under linear reduction). On the other hand, we prove that the more general small phylogeny problem is NP-hard. Surprisingly, we show that it is already NP-hard (or even APX-hard) for 4 species (a quartet phylogeny). In other words, while finding an ancestor for 3 species is easy, already finding two ancestors for 4 species is hard. We also show that, in the unichromosomal and the multilinear breakpoint model, the halving problem is NP-hard, thus refuting the conjecture of Tannier et al. Interestingly, this is the first problem which is harder in the breakpoint model than in the DCJ or reversal models.