- Nov 21 2017 stat.ME arXiv:1711.06746v1Principal manifolds are used to represent high-dimensional data in a low-dimensional space. They are high-dimensional generalizations of principal curves and surfaces. The existing methods for fitting principal manifolds have several shortcomings: model bias, heavy computational burden, sensitivity to outliers, and difficulty of use in applications. We propose a novel method for modeling principal manifolds that addresses these limitations. It is based on minimization of penalized mean squared error functionals, providing a nonlinear summary of the data points in Euclidean spaces. We introduce the framework in the context of principal manifolds of middles and develop an estimate by proposing a high-dimensional mixture density estimation procedure. The Sobolev embedding theorem guarantees the regularity of the derived manifolds and analytical expressions of the embedding maps are obtained. The algorithm is computationally efficient and robust to outliers. We used simulation studies to illustrate the comparative performance of the proposed method in low-dimensions and found that it performs better than competitors. In addition, we analyze computed tomography images of lung cancer tumors focusing on two important clinical questions - estimation of the tumor surface and identification of tumor interior classifier. We used the obtained analytic expressions of embedding maps to construct a tumor interior classifier.
- Nov 21 2017 stat.ME arXiv:1711.07357v1Assuming stationarity is unrealistic in many time series applications. A more realistic alternative is to allow for piecewise stationarity, where the model is allowed to change at given time points. We propose a three-stage procedure for consistent estimation of both structural change points and parameters of high-dimensional piecewise vector autoregressive (VAR) models. In the first step, we reformulate the change point detection problem as a high-dimensional variable selection one, and propose a penalized least square estimator using a total variation penalty. We show that the proposed penalized estimation method over-estimates the number of change points. We then propose a backward selection criterion in conjunction with a penalized least square estimator to tackle this issue. In the last step of our procedure, we estimate the VAR parameters in each of the segments. We prove that the proposed procedure consistently detects the number of change points and their locations. We also show that the procedure consistently estimates the VAR parameters. The performance of the method is illustrated through several simulation scenarios and real data examples.
- Nov 21 2017 stat.ME arXiv:1711.07137v1
- Nov 21 2017 stat.ME arXiv:1711.06912v1