We study the problem of designing models for machine learning tasks defined on \emph{sets}. In contrast to the traditional approach of operating on fixed-dimensional vectors, we consider objective functions defined on sets that are invariant to permutations. Such problems are widespread, ranging from estimation of population statistics \cite{poczos13aistats}, to anomaly detection in piezometer data of embankment dams \cite{Jung15Exploration}, to cosmology \cite{Ntampaka16Dynamical,Ravanbakhsh16ICML1}. Our main theorem characterizes the permutation invariant functions and provides a family of functions to which any permutation invariant objective function must belong. This family of functions has a special structure which enables us to design a deep network architecture that can operate on sets and which can be deployed in a variety of scenarios, including both unsupervised and supervised learning tasks. We also derive the necessary and sufficient conditions for permutation equivariance in deep models. We demonstrate the applicability of our method on population statistic estimation, point cloud classification, set expansion, and outlier detection.
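The abstract does not spell out the special structure it refers to. It is commonly summarized as a sum decomposition, f(X) = rho(sum of phi(x) over x in X), together with an equivariant layer that maps each element x_i to lam * x_i + gam * (sum of all x_j). The following is a minimal sketch of both ideas in plain Python, under stated assumptions: the names `phi`, `invariant_f`, `equivariant_layer`, the sizes, and the random weights are all hypothetical and chosen only for illustration, not taken from the paper's code.

```python
import math
import random

random.seed(0)

D_IN, D_HID = 3, 4  # hypothetical sizes, chosen only for illustration

# Hypothetical random weights; nothing here is taken from the paper.
W_PHI = [[random.gauss(0, 1) for _ in range(D_HID)] for _ in range(D_IN)]
W_RHO = [random.gauss(0, 1) for _ in range(D_HID)]


def phi(x):
    """Embed one set element: R^D_IN -> R^D_HID."""
    return [math.tanh(sum(x[i] * W_PHI[i][j] for i in range(D_IN)))
            for j in range(D_HID)]


def invariant_f(xs):
    """Sum-decomposed set function rho(sum_i phi(x_i))."""
    # Pool the per-element embeddings by summing each hidden coordinate.
    pooled = [math.fsum(col) for col in zip(*(phi(x) for x in xs))]
    # rho: a tanh followed by a linear readout to a scalar.
    return math.fsum(w * math.tanh(p) for w, p in zip(W_RHO, pooled))


def equivariant_layer(xs, lam=0.7, gam=-0.2):
    """Map each element x_i to lam * x_i + gam * sum_j x_j."""
    totals = [math.fsum(col) for col in zip(*xs)]
    return [[lam * v + gam * t for v, t in zip(x, totals)] for x in xs]


xs = [[random.gauss(0, 1) for _ in range(D_IN)] for _ in range(5)]
perm = [3, 0, 4, 1, 2]
xs_perm = [xs[i] for i in perm]

# Invariance: reordering the elements leaves the output unchanged.
invariant_ok = math.isclose(invariant_f(xs), invariant_f(xs_perm), abs_tol=1e-12)

# Equivariance: permuting the input permutes the output rows the same way.
out = equivariant_layer(xs)
out_perm = equivariant_layer(xs_perm)
equivariant_ok = all(
    math.isclose(a, b, abs_tol=1e-12)
    for i, row in zip(perm, out_perm)
    for a, b in zip(out[i], row)
)
print(invariant_ok, equivariant_ok)
```

Summation is used as the pooling step because it does not depend on the order of its arguments, which is what makes the whole function insensitive to how the set is listed. Other order-independent poolings (for example a coordinate-wise maximum) could be substituted in this sketch.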
Submitted 10 Mar 2017 to Learning
Published 20 Mar 2017
Updated 14 Apr 2018
Author comments: NIPS 2017
http://arxiv.org/abs/1703.06114
http://arxiv.org/pdf/1703.06114.pdf