Simplicial principal component analysis for density functions in Bayes spaces

Probability density functions are frequently used to characterize the distributional properties of large-scale database systems. As functional compositions, densities primarily carry relative information. As such, standard methods of functional data analysis (FDA) are not appropriate for their stati...

Full description

Saved in:
Bibliographic Details
Published inComputational statistics & data analysis Vol. 94; pp. 330 - 350
Main Authors Hron, K., Menafoglio, A., Templ, M., Hrůzová, K., Filzmoser, P.
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.02.2016
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Probability density functions are frequently used to characterize the distributional properties of large-scale database systems. As functional compositions, densities primarily carry relative information. As such, standard methods of functional data analysis (FDA) are not appropriate for their statistical processing. The specific features of density functions are accounted for in Bayes spaces, which result from the generalization to the infinite dimensional setting of the Aitchison geometry for compositional data. The aim is to build up a concise methodology for functional principal component analysis of densities. A simplicial functional principal component analysis (SFPCA) is proposed, based on the geometry of the Bayes space B2 of functional compositions. SFPCA is performed by exploiting the centred log-ratio transform, an isometric isomorphism between B2 and L2 which enables one to resort to standard FDA tools. The advantages of the proposed approach with respect to existing techniques are demonstrated using simulated data and a real-world example of population pyramids in Upper Austria.
ISSN:0167-9473
1872-7352
DOI:10.1016/j.csda.2015.07.007