Springer Series in the Data Sciences
Series Editors
David Banks
Duke University, Durham, NC, USA
Jianqing Fan
Department of Financial Engineering, Princeton University, Princeton, NJ, USA
Michael Jordan
University of California, Berkeley, CA, USA
Ravi Kannan
Microsoft Research Labs, Bangalore, India
Yurii Nesterov
CORE, Universite Catholique de Louvain, Louvain-la-Neuve, Belgium
Christopher R
Department of Computer Science, Stanford University, Stanford, USA
Ryan J. Tibshirani
Department of Statistics, Carnegie Melon University, Pittsburgh, PA, USA
Larry Wasserman
Department of Statistics, Carnegie Mellon University, Pittsburgh, PA, USA
Springer Series in the Data Sciences focuses primarily on monographs and graduate level textbooks. The target audience includes students and researchers working in and across the fields of mathematics, theoretical computer science, and statistics. Data Analysis and Interpretation is a broad field encompassing some of the fastest-growing subjects in interdisciplinary statistics, mathematics and computer science. It encompasses a process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information, suggesting conclusions, and supporting decision making. Data analysis has multiple facets and approaches, including diverse techniques under a variety of names, in different business, science, and social science domains. Springer Series in the Data Sciences addresses the needs of a broad spectrum of scientists and students who are utilizing quantitative methods in their daily research. The series is broad but structured, including topics within all core areas of the data sciences. The breadth of the series reflects the variation of scholarly projects currently underway in the field of machine learning.
More information about this series at http://www.springer.com/series/13852
Nickolay Trendafilov and Michele Gallo
Multivariate Data Analysis on Matrix Manifolds
(with Manopt)
1st ed. 2021
Logo of the publisher
Nickolay Trendafilov
School of Mathematics and Statistics, Open University, Milton Keynes, Buckinghamshire, UK
Michele Gallo
Department of Human and Social Sciences, University of Naples LOrientale, Naples, Italy
ISSN 2365-5674 e-ISSN 2365-5682
Springer Series in the Data Sciences
ISBN 978-3-030-76973-4 e-ISBN 978-3-030-76974-1
https://doi.org/10.1007/978-3-030-76974-1
Mathematics Subject Classication (2010): 58A05 58C05 62-07 62H12 62H25 62H30 62H99 65C60 65Fxx 65K99 90C26 90C51
Springer Nature Switzerland AG 2021
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
To the memory of my mother, Zdravka, and my father, Trendafil
and to my family
wife, Irina, son, Iassen, and grandchildren, Veronica and Christian
Preface
We want to start with few remarks predating considerably the emerging of the idea for writing this book and our collaboration in general. They are related to the first authors own experience which made him explore matrix manifolds in data analysis problems.
NTT was in an early stage of his research career, after his Ph.D. was completed on a completely different topic. He was assigned to study factor analysis (FA) and do some programming for a particular software product. While working on FA, NTT realized that the most interesting part for him is the FA interpretation and the so-called rotation methods (see Section 4.5). NTT recognized that the main goal of FA is to produce simple for interpretation results, which is achieved by orthogonal rotation of some initial, usually difficult to interpret, FA solution (known as factor loadings matrix). However, how we can define what is simple for interpretation results? The problem is really tough, especially if you try to capture its meaning in a single mathematical expression/formula. Because of that, a huge number of different formulas were proposed each of them claiming to approximate in some sense the idea for simple for interpretation results. In FA, these formulas are called simple structure criteria. They are supposed to measure/quantify the simplicity of a certain FA solution. Let be such a criterion. Then, if is simpler than , and if is even simpler than , then we should have , or depending on the sign of