- arXiv.org
- Biological Physics
- History and Philosophy of Physics
- Applied Physics
- General Physics
- Fluid Dynamics
- Optics
- Physics and Society
- Data Analysis, Statistics and Probability
- Popular Physics
- Plasma Physics
- Atomic and Molecular Clusters
- Atomic Physics
- Space Physics
- Medical Physics
- Physics Education
- Computational Physics
- Chemical Physics
- Classical Physics
- Atmospheric and Oceanic Physics
- Instrumentation and Detectors
- Accelerator Physics
- Geophysics

- Information Theory
- Analysis of PDEs
- Statistics Theory
- Number Theory
- History and Overview
- General Mathematics
- Mathematical Physics
- Probability
- Operator Algebras
- Representation Theory
- Complex Variables
- Combinatorics
- Group Theory
- Algebraic Geometry
- Symplectic Geometry
- Numerical Analysis
- Dynamical Systems
- Logic
- General Topology
- Optimization and Control
- Geometric Topology
- Differential Geometry
- Quantum Algebra
- Metric Geometry
- Classical Analysis and ODEs
- Algebraic Topology
- Spectral Theory
- Functional Analysis
- Category Theory
- Rings and Algebras
- Commutative Algebra
- K-Theory and Homology

- General Literature
- Information Theory
- Neural and Evolutionary Computing
- Hardware Architecture
- Mathematical Software
- Symbolic Computation
- Learning
- Computer Vision and Pattern Recognition
- Sound
- Operating Systems
- Information Retrieval
- Programming Languages
- Software Engineering
- Databases
- Multiagent Systems
- Formal Languages and Automata Theory
- Social and Information Networks
- Cryptography and Security
- Systems and Control
- Human-Computer Interaction
- Other Computer Science
- Artificial Intelligence
- Computational Complexity
- Distributed, Parallel, and Cluster Computing
- Discrete Mathematics
- Numerical Analysis
- Computer Science and Game Theory
- Emerging Technologies
- Robotics
- Computation and Language
- Computational Engineering, Finance, and Science
- Computers and Society
- Networking and Internet Architecture
- Data Structures and Algorithms
- Logic in Computer Science
- Computational Geometry
- Multimedia
- Digital Libraries
- Performance
- Graphics

- The cornerstone underpinning deep learning is the guarantee that gradient descent on an objective converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, where there are multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-objective architectures proliferate. In this paper, we develop new techniques to understand and control the dynamics in general games. The key result is to decompose the second-order dynamics into two components. The first is related to potential games, which reduce to gradient descent on an implicit function; the second relates to Hamiltonian games, a new class of games that obey a conservation law, akin to conservation laws in classical mechanical systems. The decomposition motivates Symplectic Gradient Adjustment (SGA), a new algorithm for finding stable fixed points in general games. Basic experiments show SGA is competitive with recently proposed algorithms for finding local Nash equilibria in GANs -- whilst at the same time being applicable to -- and having guarantees in -- much more general games.
- Existing multi-agent reinforcement learning methods are limited typically to a small number of agents. When the agent number increases largely, the learning becomes intractable due to the curse of the dimensionality and the exponential growth of user interactions. In this paper, we present Mean Field Reinforcement Learning where the interactions within the population of agents are approximated by those between a single agent and the average effect from the overall population or neighboring agents; the interplay between the two entities is mutually reinforced: the learning of the individual agent's optimal policy depends on the dynamics of the population, while the dynamics of the population change according to the collective patterns of the individual policies. We develop practical mean field Q-learning and mean field Actor-Critic algorithms and analyze the convergence of the solution. Experiments on resource allocation, Ising model estimation, and battle game tasks verify the learning effectiveness of our mean field approaches in handling many-agent interactions in population.

A Rational Agent Controlling an Autonomous Vehicle: Implementation and Fo...

Martin Henessey Oct 03 2017 01:48 UTC- Supported by Silverpond.