This article is concerned with the fitting of multinomial regression models using the so-called "Poisson Trick". The work is motivated by Chen & Kuo (2001) and Malchow-Møller & Svarer (2003) which have been criticized for being computationally inefficient and sometimes producing nonsense results. We first discuss the case of independent data and offer a parsimonious fitting strategy when all covariates are categorical. We then propose a new approach for modelling correlated responses based on an extension of the Gamma-Poisson model, where the likelihood can be expressed in closed-form. The parameters are estimated via an Expectation/Conditional Maximization (ECM) algorithm, which can be implemented using functions for fitting generalized linear models readily available in standard statistical software packages. Compared to existing methods, our approach avoids the need to approximate the intractable integrals and thus the inference is exact with respect to the approximating Gamma-Poisson model. The proposed method is illustrated via a reanalysis of the yogurt data discussed by Chen & Kuo (2001).
Repeated measures analyses require proper choice of the correlation model to ensure accurate inference and optimal efficiency. The linear exponent autoregressive (LEAR) correlation model provides a flexible two-parameter correlation structure that accommodates a variety of data types in which the correlation within-sampling unit decreases exponentially in time or space. The LEAR model subsumes three classic temporal correlation structures, namely compound symmetry, continuous-time AR(1), and MA(1), while maintaining parsimony and providing appealing statistical and computational properties. It also supplies a plausible correlation structure for power analyses across many experimental designs. However, no commonly used statistical packages provide a straightforward way to implement the model, limiting its use to those with the appropriate programming skills. Here we present a reparameterization of the LEAR model that allows easily implementing it in standard software for the special case of data with equally spaced temporal or spatial intervals.
Identifying undocumented or potential future interactions among species is a challenge facing modern ecologists. Our aim is to guide the sampling of host-parasite networks by identifying the most likely undocumented interactions. Recent link prediction methods rely on trait data, however these data are limited to only a fraction of species found in large interaction databases. On the other hand, evolutionary relationships among species, encoded as phylogenetic trees, can act as proxies for underlying traits and historical patterns of parasite sharing among hosts. We show that using a network-based conditional model, phylogenetic information provides significant predictive power in a recently published global database of host-parasite interactions. Drawing from evolutionary biology, we find that applying alternative evolutionary models to the phylogeny greatly improves it. To further improve on the phylogeny-only model, we use a hierarchical Bayesian latent score framework for bipartite graphs that accounts for the number of interactions per species, as well as the host dependence informed by phylogeny. Combining the two information sources yields significant improvement in predictive accuracy over each of the submodels alone. As many interaction networks are constructed from presence-only data, we extend the model by integrating a correction mechanism for missing interactions, which proves valuable in reducing uncertainty in unobserved interactions.
We combine Bayesian prediction and weighted inference as a unified approach to survey inference. The general principles of Bayesian analysis imply that models for survey outcomes should be conditional on all variables that affect the probability of inclusion. We incorporate the weighting variables under the framework of multilevel regression and poststratification, as a byproduct generating model-based weights after smoothing. We investigate deep interactions and introduce structured prior distributions for smoothing and stability of estimates. The computation is done via Stan and implemented in the open source R package "rstanarm" ready for public use. Simulation studies illustrate that model-based prediction and weighting inference outperform classical weighting. We apply the proposal to the New York Longitudinal Study of Wellbeing. The new approach generates robust weights and increases efficiency for finite population inference, especially for subsets of the population.
Jul 27 2017 stat.AP
Baseline correction plays an important role in past and current methodological debates in ERP research (e.g. the Tanner v. Maess debate in Journal of Neuroscience Methods), serving as a potential alternative to strong highpass filtering. However, the very assumptions that underlie traditional baseline also undermine it, making it statistically unnecessary and even undesirable and reducing signal-to-noise ratio. Including the baseline interval as a predictor in a GLM-based statistical approach allows the data to determine how much baseline correction is needed, including both full traditional and no baseline correction as subcases, while reducing the amount of variance in the residual error term and thus potentially increasing statistical power.