We consider the utilization of a computational model to guide the optimal acquisition of experimental data to inform the stochastic description of model input parameters. Our formulation is based on the recently developed consistent Bayesian approach for solving stochastic inverse problems which seeks a posterior probability density that is consistent with the model and the data in the sense that the push-forward of the posterior (through the computational model) matches the observed density on the observations almost everywhere. Given a set a potential observations, our optimal experimental design (OED) seeks the observation, or set of observations, that maximizes the expected information gain from the prior probability density on the model parameters. We discuss the characterization of the space of observed densities and a computationally efficient approach for rescaling observed densities to satisfy the fundamental assumptions of the consistent Bayesian approach. Numerical results are presented to compare our approach with existing OED methodologies using the classical/statistical Bayesian approach and to demonstrate our OED on a set of representative PDE-based models.
Since the 2005 American Statistical Association's (ASA) endorsement of the Guidelines for Assessment and Instruction in Statistics Education (GAISE) College Report, changes in the statistics field and statistics education have had a major impact on the teaching and learning of statistics. We now live in a world where "Statistics - the science of learning from data - is the fastest-growing science, technology, engineering, and math (STEM) undergraduate degree in the United States," according to the ASA, and where many jobs demand an understanding of how to explore and make sense of data. In light of these new reports and other changes and demands on the discipline, a group of volunteers revised the 2005 GAISE College Report. The updated report was endorsed by the Board of Directors of the American Statistical Association in July 2016. To help shed additional light on the revision process and subsequent changes in the report, we review the report and share insights into the committee's thoughts and assumptions.
Discrete statistical models supported on labelled event trees can be specified using so-called interpolating polynomials which are generalizations of generating functions. These admit a nested representation. A new algorithm exploits the primary decomposition of monomial ideals associated with an interpolating polynomial to quickly compute all nested representations of that polynomial. It hereby determines an important subclass of all trees representing the same statistical model. To illustrate this method we analyze the full polynomial equivalence class of a staged tree representing the best fitting model inferred from a real-world dataset.