- May 29 2017 stat.AP arXiv:1705.09575v1: The FIFA ranking of national (male) soccer teams suffers from various drawbacks that cause heated debates, all the more so because this ranking governs the seeding in international tournaments and the associated qualification rounds. In the present paper, we propose five alternative models based on strength parameters that are determined by maximum likelihood estimation. We compare these five statistical models, together with the ELO model used for the women's FIFA ranking, on the basis of their predictive performance. Indeed, contrary to regular league rankings, the FIFA ranking is designed to measure the current strength of teams and hence should allow accurate predictions of future matches. Our comparison is based on fifteen English Premier League seasons. The best-performing model is then used to build the alternative FIFA ranking. We furthermore show its predictive power by analyzing the 2016 European Championship (EURO2016) and comparing our predictions to the bookmaker ratings, usually considered the gold standard.
- May 29 2017 stat.AP arXiv:1705.09563v1: Background: Prognostic predictive models are used in the delivery of primary care to estimate a patient's risk of future disease development. Electronic medical record (EMR) data can be used for the construction of these models. Objectives: To provide a framework for those seeking to develop prognostic predictive models using EMR data, and to illustrate these steps using osteoarthritis risk estimation as an example. FRAMR-EMR: The FRAmework for Modelling Risk from EMR data (FRAMR-EMR) was created, which outlines step-by-step guidance for the construction of a prognostic predictive model using EMR data. Throughout these steps, several potential pitfalls specific to using EMR data for predictive purposes are described and methods for addressing them are suggested. Case Study: We used the DELPHI (DELiver Primary Healthcare Information) database to develop our prognostic predictive model for estimation of osteoarthritis risk. We constructed a retrospective cohort of 28447 eligible primary care patients. Patients were included if they had an encounter with their primary care practitioner between 1 January 2008 and 31 December 2009. Patients were excluded if they had a diagnosis of osteoarthritis prior to baseline. Following FRAMR-EMR yielded a predictive model capable of estimating the 5-year risk of osteoarthritis diagnosis. Logistic regression was used to predict osteoarthritis based on age, sex, BMI, previous leg injury, and osteoporosis. Internal validation of the model's performance demonstrated good discrimination and moderate calibration. Conclusions: This study provides guidance to those interested in developing prognostic predictive models based on EMR data. The production of high-quality prognostic predictive models allows practitioners to communicate accurately estimated risks of developing future disease to primary care patients.
- May 29 2017 stat.AP physics.soc-ph arXiv:1705.09393v1: To assess the presence of gerrymandering, one can consider the shapes of districts or the distribution of votes. The "efficiency gap," which does the latter, plays a central role in a 2016 federal court case on the constitutionality of Wisconsin's state legislative district plan. Unfortunately, however, the efficiency gap reduces to proportional representation, an expectation that is not a constitutional right. We present a new measure of partisan asymmetry that does not rely on the shapes of districts, is simple to compute, is provably related to the "packing and cracking" integral to gerrymandering, and avoids the constitutionality issue presented by the efficiency gap. In addition, we introduce a generalization of the efficiency gap that also avoids the equivalence to proportional representation. We apply the first function to US congressional and state legislative plans from recent decades to identify candidate gerrymanders.
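
For readers unfamiliar with the efficiency gap named in the last abstract, here is a minimal sketch of the commonly cited wasted-vote definition for a two-party election. It is not taken from the paper and does not implement the new asymmetry measure the authors propose. The function name `efficiency_gap`, the sign convention (positive values favor party A), the handling of tied districts, and the sample vote counts are all illustrative assumptions.

```python
from typing import Sequence, Tuple


def efficiency_gap(districts: Sequence[Tuple[float, float]]) -> float:
    """Wasted-vote efficiency gap for a two-party plan (illustrative sketch).

    Each district is a pair (votes_a, votes_b). A losing party wastes all of
    its votes; a winning party wastes the votes beyond half the district total.
    Assumed sign convention: positive means party B wasted more votes,
    so the plan favors party A.
    """
    wasted_a = 0.0
    wasted_b = 0.0
    total = 0.0
    for votes_a, votes_b in districts:
        district_total = votes_a + votes_b
        threshold = district_total / 2.0  # votes needed to win, ignoring the +1
        if votes_a > votes_b:
            wasted_a += votes_a - threshold  # winner's surplus
            wasted_b += votes_b              # loser's votes are all wasted
        elif votes_b > votes_a:
            wasted_b += votes_b - threshold
            wasted_a += votes_a
        # Assumption: an exact tie adds nothing to either side's wasted votes.
        total += district_total
    if total == 0:
        raise ValueError("no votes cast")
    return (wasted_b - wasted_a) / total


# Hypothetical plan: five districts of 100 voters each.
plan = [(75, 25), (60, 40), (43, 57), (48, 52), (49, 51)]
gap = efficiency_gap(plan)
print(f"efficiency gap: {gap:+.2f}")
```

In this made-up plan, party A receives 275 of 500 votes but wins only two of five districts; it wastes 175 votes against party B's 75, so the gap under the assumed sign convention is -0.20, indicating a plan that favors party B.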

P-values: misunderstood and misused

Noon van der Silk Jan 27 2016 03:39 UTC

Chris Granade Sep 22 2015 19:15 UTC

Thank you for the kind comments, I'm glad that our paper, source code, and tutorial are useful!

Travis Scholten Sep 21 2015 17:05 UTC

This was a really well-written paper! Am very glad to see this kind of work being done.

In addition, the openness about source code is refreshing. By explicitly relating the work to [QInfer](https://github.com/csferrie/python-qinfer), this paper makes it easier to check the authors' work. Furthe

Chris Granade Sep 15 2015 02:40 UTC

As a quick addendum, please note that the [supplementary video](https://www.youtube.com/watch?v=22ejRV0Kx2g) for this work is available [on YouTube](https://www.youtube.com/watch?v=22ejRV0Kx2g). Thank you!