PaperSwipe

Tropical Principal Component Analysis and its Application to Phylogenetics

Published 8 years agoVersion 2arXiv:1710.02682

Authors

Ruriko Yoshida, Leon Zhang, Xu Zhang

Categories

math.COq-bio.PE

Abstract

Principal component analysis is a widely-used method for the dimensionality reduction of a given data set in a high-dimensional Euclidean space. Here we define and analyze two analogues of principal component analysis in the setting of tropical geometry. In one approach, we study the Stiefel tropical linear space of fixed dimension closest to the data points in the tropical projective torus; in the other approach, we consider the tropical polytope with a fixed number of vertices closest to the data points. We then give approximative algorithms for both approaches and apply them to phylogenetics, testing the methods on simulated phylogenetic data and on an empirical dataset of Apicomplexa genomes.

Tropical Principal Component Analysis and its Application to Phylogenetics

8 years ago
v2
3 authors

Categories

math.COq-bio.PE

Abstract

Principal component analysis is a widely-used method for the dimensionality reduction of a given data set in a high-dimensional Euclidean space. Here we define and analyze two analogues of principal component analysis in the setting of tropical geometry. In one approach, we study the Stiefel tropical linear space of fixed dimension closest to the data points in the tropical projective torus; in the other approach, we consider the tropical polytope with a fixed number of vertices closest to the data points. We then give approximative algorithms for both approaches and apply them to phylogenetics, testing the methods on simulated phylogenetic data and on an empirical dataset of Apicomplexa genomes.

Authors

Ruriko Yoshida, Leon Zhang, Xu Zhang

arXiv ID: 1710.02682
Published Oct 7, 2017

Click to preview the PDF directly in your browser