Integer factorization has been one of the cornerstone applications of the field of quantum computing since the discovery of an efficient algorithm for factoring by Peter Shor. Unfortunately, factoring via Shor's algorithm is well beyond the capabilities of today's noisy intermediate-scale quantum (NISQ) devices. In this work, we revisit the problem of factoring, developing an alternative to Shor's algorithm, which employs established techniques to map the factoring problem to the ground state of an Ising Hamiltonian. The proposed variational quantum factoring (VQF) algorithm starts by simplifying equations over Boolean variables in a preprocessing step to reduce the number of qubits needed for the Hamiltonian. Then, it seeks an approximate ground state of the resulting Ising Hamiltonian by training variational circuits using the quantum approximate optimization algorithm (QAOA). We benchmark the VQF algorithm on various instances of factoring and present numerical results on its performance.
Gerrymandering is a long-standing issue within the U.S. political system, and it has received scrutiny recently by the U.S. Supreme Court. In this note, we prove that deciding whether there exists a fair redistricting among legal maps is NP-hard. To make this precise, we use simplified notions of "legal" and "fair" that account for desirable traits such as geographic compactness of districts and sufficient representation of voters. The proof of our result is inspired by the work of Mahanjan, Minbhorkar and Varadarajan that proves that planar k-means is NP-hard.
Taira Giordani, Emanuele Polino, Sabrina Emiliani, Alessia Suprano, Luca Innocenti, Helena Majury, Lorenzo Marrucci, Mauro Paternostro, Alessandro Ferraro, Nicolò Spagnolo, Fabio Sciarrino The capability to generate and manipulate quantum states in high-dimensional Hilbert spaces is a crucial step for the development of quantum technologies, from quantum communication to quantum computation. One-dimensional quantum walk dynamics represents a valid tool in the task of engineering arbitrary quantum states. Here we affirm such potential in a linear-optics platform that realizes discrete-time quantum walks in the orbital angular momentum degree of freedom of photons. Different classes of relevant qudit states in a six-dimensional space are prepared and measured, confirming the feasibility of the protocol. Our results represent a further investigation of quantum walk dynamics in photonics platforms, paving the way for the use of such a quantum state-engineering toolbox for a large range of applications.
The conditional Entropy Power Inequality is a fundamental inequality in information theory, stating that the conditional entropy of the sum of two conditionally independent vector-valued random variables each with an assigned conditional entropy is minimum when the random variables are Gaussian. We prove the conditional Entropy Power Inequality in the scenario where the conditioning system is quantum. The proof is based on the heat semigroup and on a generalization of the Stam inequality in the presence of quantum conditioning. The Entropy Power Inequality with quantum conditioning will be a key tool of quantum information, with applications in distributed source coding protocols with the assistance of quantum entanglement.
We study the joint distribution of the set of all marginals of a random Wishart matrix acting on a tensor product Hilbert space. We compute the limiting free mixed cumulants of the marginals, and we show that in the balanced asymptotical regime, the marginals are asymptotically free. We connect the matrix integrals relevant to the study of operators on tensor product spaces with the corresponding classes of combinatorial maps, for which we develop the combinatorial machinery necessary for the asymptotic study. Finally, we present some applications to the theory of random quantum states in quantum information theory.
I compare the role of the information in the classical and quantum dynamics by examining the relation between information flows in measurements and the ability of observers to reverse evolutions. I show that in the Newtonian dynamics reversibility is unaffected by the observer's retention of the information about the measurement outcome. By contrast -- even though quantum dynamics is unitary, hence, reversible -- reversing quantum evolution that led to a measurement becomes in principle impossible for an observer who keeps the record of its outcome. Thus, quantum irreversibility can result from the information gain rather than just its loss -- rather than just an increase of the (von Neumann) entropy. Recording of the outcome of the measurement resets, in effect, initial conditions within the observer's (branch of) the Universe. Nevertheless, I also show that observer's friend -- an agent who knows what measurement was successfully carried out and can confirm that the observer knows the outcome but resists his curiosity and does not find out the result -- can, in principle, undo the measurement. This relativity of quantum reversibility sheds new light on the origin of the arrow of time and elucidates the role of information in classical and quantum physics. Quantum discord appears as a natural measure of the extent to which dissemination of information about the outcome affects the ability to reverse the measurement.
Aug 28 2018
cs.CL arXiv:1808.08946v1
Recently, non-recurrent architectures (convolutional, self-attentional) have outperformed RNNs in neural machine translation. CNNs and self-attentional networks can connect distant words via shorter network paths than RNNs, and it has been speculated that this improves their ability to model long-range dependencies. However, this theoretical argument has not been tested empirically, nor have alternative explanations for their strong performance been explored in-depth. We hypothesize that the strong performance of CNNs and self-attentional networks could also be due to their ability to extract semantic features from the source text, and we evaluate RNNs, CNNs and self-attention networks on two tasks: subject-verb agreement (where capturing long-range dependencies is required) and word sense disambiguation (where semantic feature extraction is required). Our experimental results show that: 1) self-attentional networks and CNNs do not outperform RNNs in modeling subject-verb agreement over long distances; 2) self-attentional networks perform distinctly better than RNNs and CNNs on word sense disambiguation.
I demonstrate the potential of Reinforcement Learning (RL) to prepare quantum states of strongly periodically-driven non-linear single-particle models. The ability of Q-Learning to control systems far away from equilibrium is exhibited by steering the quantum Kapitza oscillator to the Floquet-engineered stable inverted position in the presence of a strong periodic drive within several shaking cycles. The study reveals the potential of the intra-period (micromotion) dynamics, often neglected in Floquet engineering, to take advantage over pure stroboscopic control at moderate drive frequencies. Without any knowledge about the underlying physical system, the algorithm is capable of learning solely from tried protocols and directly from simulated noisy quantum measurement data, and is stable to noise in the initial state, and sources of random failure events in the control sequence. Model-free RL can provide novel insights into automating experimental setups for out-of-equilibrium systems undergoing complex dynamics, with potential applications in quantum information, quantum optics, ultracold atoms, trapped ions, and condensed matter.
Aug 28 2018
cs.LO arXiv:1808.08759v1
Dependency quantified Boolean formulas (DQBF) is a logic admitting existential quantification over Boolean functions, which allows us to elegantly state synthesis problems in planning and verification. In this paper, we lift the clausal abstraction algorithm for quantified Boolean formulas (QBF) to DQBF. Clausal abstraction for QBF is an abstraction refinement algorithm that operates on a sequence of abstractions that represent the different quantifiers. For DQBF we need to generalize this principle to partial orders of abstractions. The two challenges to overcome are: (1) Clauses may contain literals with incomparable dependencies, which we address by the recently proposed proof rule called fork extension, and (2) existential variables may have spurious dependencies, which we prevent by tracking (partial) Skolem functions during the execution. Our prototype implementation shows improved performance compared to previous algorithms.
We prove that a quantum spin chain with half-odd-integral spin cannot have a unique ground state with a gap, provided that the interaction is short ranged, translation invariant, and possesses time-reversal symmetry or ${\mathbb Z}_2 \times {\mathbb Z}_2$ symmetry (i.e., the symmetry with respect to the $\pi$ rotations of spins about the three orthogonal axes). The proof is based on the deep analogy between the matrix product state formulation and the representation of the Cuntz algebra in the von Neumann algebra $\pi({\mathcal A}_{R})''$ constructed from the ground state restricted to the right half-infinite chain.
In this article we want to demonstrate the effectiveness of the new D-Wave quantum annealer, D-Wave 2000Q, in dealing with real world problems. In particular, it is shown how the quantum annealing process is able to find global optima even in the case of problems that do not directly involve binary variables. The problem addressed in this work is the following: taking a matrix V, find two matrices W and H such that the norm between V and the matrix product WH is as small as possible. The work is inspired by O'Malley's article [1], where the author proposed an algorithm to solve a problem very similar to ours, where however the matrix H was formed by only binary variables. In our case neither of the two matrices W or H is a binary matrix. In particular, the factorization foresees that the matrix W is composed of real numbers between 0 and 1 and that the sum of its rows is equal to 1. The QUBO problem associated with this type of factorization generates a potential composed of many local minima. We show that simple forward-annealing techniques are not sufficient to solve the problem. The new D-Wave 2000Q has introduced new solution refinement techniques, including reverse-annealing. Reverse-annealing allows to explore the configuration space starting from a point chosen by the user, for example a local minimum obtained with a precedent forward-annealing. In this article we propose an algorithm based on the reverse annealing technique (that we called adaptive reverse annealing) able to reach global minimum even in the case of QUBO problems where the classic forward annealing, or uncontrolled reverse annealing, can not reach satisfactory solutions.
We extend random matrix theory to consider randomly interacting spin systems with spatial locality. We develop several methods by which arbitrary correlators may be systematically evaluated in a limit where the local Hilbert space dimension $N$ is large. First, the correlators are given by sums over 'stacked' planar diagrams which are completely determined by the spectra of the individual interactions and a dependency graph encoding the locality in the system. We then introduce 'heap freeness' as a generalization of free independence, leading to a second practical method to evaluate the correlators. Finally, we generalize the cumulant expansion to a sum over 'dependency partitions', providing the third and most succinct of our methods. Our results provide tools to study dynamics and correlations within extended quantum many-body systems which conserve energy. We further apply the formalism to show that quantum satisfiability at large-$N$ is determined by the evaluation of the independence polynomial on a wide class of graphs.
Ci-Yu Wang, Jun Gao, Zhi-Qiang Jiao, Lu-Feng Qiao, Ruo-Jing Ren, Zhen Feng, Yuan Chen, Zeng-Quan Yan, Yao Wang, Hao Tang, Xian-Min Jin Quantum key distribution (QKD), harnessing quantum physics and optoelectronics, may promise unconditionally secure information exchange in theory. Recently, theoretical and experimental advances in measurement-device-independent (MDI-) QKD have successfully closed the physical backdoor in detection terminals. However, the issues of scalability, stability, cost and loss prevent QKD systems from widespread application in practice. Here, we propose and experimentally demonstrate a solution to build a star-topology quantum access network with an integrated server. By using femtosecond laser direct writing technique, we construct integrated circuits for all the elements of Bell state analyzer together and are able to integrate several such analyzers on single photonic chip. The measured high-visibility Bell state analysis suggests integrated server a promising platform for the practical application of MDI-QKD network.
We propose a new type of topological states of matter exhibiting topologically nontrivial edge states (ESs) within gapless bulk states (GBSs) protected by both spin rotational and reflection symmetries. A model presenting such states is simply comprised of a one-dimensional reflection symmetric superlattice in the presence of spin-orbit coupling containing odd number of sublattices per unit cell. We show that the system has a rich phase diagram including a topological metal (TM) phase where nontrivial ESs coexist with nontrivial GBSs at Fermi level. Topologically distinct phases can be reached through subband gap closing-reopening transition depending on the relative strength of inter and intra unit cell spin-orbit couplings. Moreover, topological class of the system is AI with an integer topological invariant called $\mathbb{Z}$ index. The stability of TM states is also analyzed against Zeeman magnetic fields and on-site potentials resulting in that the spin rotational symmetry around the lattice direction is a key requirement for the appearance of such states. Also, possible experimental realizations are discussed.
Aug 28 2018
cs.AI arXiv:1808.08497v1
Artificial intelligence (AI) is the core technology of technological revolution and industrial transformation. As one of the new intelligent needs in the AI 2.0 era, financial intelligence has elicited much attention from the academia and industry. In our current dynamic capital market, financial intelligence demonstrates a fast and accurate machine learning capability to handle complex data and has gradually acquired the potential to become a "financial brain". In this work, we survey existing studies on financial intelligence. First, we describe the concept of financial intelligence and elaborate on its position in the financial technology field. Second, we introduce the development of financial intelligence and review state-of-the-art techniques in wealth management, risk management, financial security, financial consulting, and blockchain. Finally, we propose a research framework called FinBrain and summarize four open issues, namely, explainable financial agents and causality, perception and prediction under uncertainty, risk-sensitive and robust decision making, and multi-agent game and mechanism design. We believe that these research directions can lay the foundation for the development of AI 2.0 in the finance field.
Quantum computers are different from binary digital electronic computers based on transistors. Common digital computing encodes the data into binary digits (bits), each of which is always in one of two definite states (0 or 1), quantum computation uses quantum bits (qubits). A circuit-based qubit quantum computer exists and is available for experiments via cloud, the IBM quantum experience project. We implemented a Quantum Tabu Search in order to obtain a quantum combinatorial optimisation, suggesting that an entanglement-metaheuristic can display optimal solutions and accelerate the optimisation process by using entangled states. We show by building optimal coupling maps that the distribution of our results gave similar shape as shown previous results in an existing teleport circuit. Our research aims to find which graph of coupling better matches a quantum circuit.
For the complex quadratic family $q_c:z\mapsto z^2+c$, it is known that every point in the Julia set $J(q_c)$ moves holomorphically on $c$ except at the boundary points of the Mandelbrot set. In this note, we present short proofs of the following derivative estimates of the motions near the boundary points $1/4$ and $-2$: for each $z = z(c)$ in the Julia set, the derivative $dz(c)/dc$ is uniformly $O(1/\sqrt{1/4-c})$ when real $c\nearrow 1/4$; and is uniformly $O(1/\sqrt{-2-c})$ when real $c\nearrow -2$. These estimates of the derivative imply Hausdorff convergence of the Julia set $J(q_c)$ when $c$ approaches these boundary points. In particular, the Hausdorff distance between $J(q_c)$ with $0\le c<1/4$ and $J(q_{1/4})$ is exactly $\sqrt{1/4-c}$.
Aug 28 2018
cs.CL arXiv:1808.08949v1
Contextual word representations derived from pre-trained bidirectional language models (biLMs) have recently been shown to provide significant improvements to the state of the art for a wide range of NLP tasks. However, many questions remain as to how and why these models are so effective. In this paper, we present a detailed empirical study of how the choice of neural architecture (e.g. LSTM, CNN, or self attention) influences both end task accuracy and qualitative properties of the representations that are learned. We show there is a tradeoff between speed and accuracy, but all architectures learn high quality contextual representations that outperform word embeddings for four challenging NLP tasks. Additionally, all architectures learn representations that vary with network depth, from exclusively morphological based at the word embedding layer through local syntax based in the lower contextual layers to longer range semantics such coreference at the upper layers. Together, these results suggest that unsupervised biLMs, independent of architecture, are learning much more about the structure of language than previously appreciated.
Many applications require an understanding of an image that goes beyond the simple detection and classification of its objects. In particular, a great deal of semantic information is carried in the relationships between objects. We have previously shown that the combination of a visual model and a statistical semantic prior model can improve on the task of mapping images to their associated scene description. In this paper, we review the model and compare it to a novel conditional multi-way model for visual relationship detection, which does not include an explicitly trained visual prior model. We also discuss potential relationships between the proposed methods and memory models of the human brain.
Aug 28 2018
cs.RO arXiv:1808.08939v1
Environmental monitoring of marine environments presents several challenges: the harshness of the environment, the often remote location, and most importantly, the vast area it covers. Manual operations are time consuming, often dangerous, and labor intensive. Operations from oceanographic vessels are costly and limited to open seas and generally deeper bodies of water. In addition, with lake, river, and ocean shoreline being a finite resource, waterfront property presents an ever increasing valued commodity, requiring exploration and continued monitoring of remote waterways. In order to efficiently explore and monitor currently known marine environments as well as reach and explore remote areas of interest, we present a design of an autonomous surface vehicle (ASV) with the power to cover large areas, the payload capacity to carry sufficient power and sensor equipment, and enough fuel to remain on task for extended periods. An analysis of the design and a discussion on lessons learned during deployments is presented in this paper.
Aug 28 2018
cs.CL arXiv:1808.08933v1
Multilingual Word Embeddings (MWEs) represent words from multiple languages in a single distributional vector space. Unsupervised MWE (UMWE) methods acquire multilingual embeddings without cross-lingual supervision, which is a significant advantage over traditional supervised approaches and opens many new possibilities for low-resource languages. Prior art for learning UMWEs, however, merely relies on a number of independently trained Unsupervised Bilingual Word Embeddings (UBWEs) to obtain multilingual embeddings. These methods fail to leverage the interdependencies that exist among many languages. To address this shortcoming, we propose a fully unsupervised framework for learning MWEs that directly exploits the relations between all language pairs. Our model substantially outperforms previous approaches in the experiments on multilingual word translation and cross-lingual word similarity. In addition, our model even beats supervised approaches trained with cross-lingual resources.
Aug 28 2018
cs.CL arXiv:1808.08932v1
In this paper we present the RuSentRel corpus including analytical texts in the sphere of international relations. For each document we annotated sentiments from the author to mentioned named entities, and sentiments of relations between mentioned entities. In the current experiments, we considered the problem of extracting sentiment relations between entities for the whole documents as a three-class machine learning task. We experimented with conventional machine-learning methods (Naive Bayes, SVM, Random Forest).
Dilated convolutions, also known as atrous convolutions, have been widely explored in deep convolutional neural networks (DCNNs) for various tasks like semantic image segmentation, object detection, audio generation, video modeling, and machine translation. However, dilated convolutions suffer from the gridding artifacts, which hampers the performance of DCNNs with dilated convolutions. In this work, we propose two simple yet effective degridding methods by studying a decomposition of dilated convolutions. Unlike existing models, which explore solutions by focusing on a block of cascaded dilated convolutional layers, our methods address the gridding artifacts by smoothing the dilated convolution itself. By analyzing them in both the original operation and the decomposition views, we further point out that the two degridding approaches are intrinsically related and define separable and shared (SS) operations, which generalize the proposed methods. We evaluate our methods thoroughly on two datasets and visualize the smoothing effect through effective receptive field analysis. Experimental results show that our methods yield significant and consistent improvements on the performance of DCNNs with dilated convolutions, while adding negligible amounts of extra training parameters.
This paper introduces a convolutional recurrent network with attention for speech command recognition. Attention models are powerful tools to improve performance on natural language, image captioning and speech tasks. The proposed model establishes a new state-of-the-art accuracy of 94.1% on Google Speech Commands dataset V1 and 94.5% on V2 (for the 20-commands recognition task), while still keeping a small footprint of only 202K trainable parameters. Results are compared with previous convolutional implementations on 5 different tasks (20 commands recognition (V1 and V2), 12 commands recognition (V1), 35 word recognition (V1) and left-right (V1)). We show detailed performance results and demonstrate that the proposed attention mechanism not only improves performance but also allows inspecting what regions of the audio were taken into consideration by the network when outputting a given category.
Aug 28 2018
cs.DS arXiv:1808.08925v1
Motivated by hybrid graph representations, we introduce and study the following beyond-planarity problem, which we call $h$-Clique2Path Planarity: Given a graph $G$, whose vertices are partitioned into subsets of size at most $h$, each inducing a clique, remove edges from each clique so that the subgraph induced by each subset is a path, in such a way that the resulting subgraph of $G$ is planar. We study this problem when $G$ is a simple topological graph, and establish its complexity in relation to $k$-planarity. We prove that $h$-Clique2Path Planarity is NP-complete even when $h=4$ and $G$ is a simple $3$-plane graph, while it can be solved in linear time, for any $h$, when $G$ is $1$-plane.
We propose a realistic scheme to construct topological insulators with nonvanishing Chern numbers using spin-1/2 particles carrying out a discrete-time quantum walk in a two-dimensional lattice. By Floquet engineering the quantum-walk protocol, an Aharonov--Bohm geometric phase is imprinted onto closed-loop paths in the lattice, thus realizing an abelian gauge field --- the analog of a magnetic flux threading a two-dimensional electron gas. We show that in the strong field regime, when the flux per plaquette is a sizable fraction of the flux quantum, magnetic quantum walks give rise to nearly flat energy bands featuring nonvanishing Chern numbers. We discuss an implementation of this scheme using neutral atoms in two-dimensional spin-dependent optical lattices, which enables the generation of arbitrary magnetic-field landscapes, including those with sharp boundaries. The robust atom transport, which is observed along boundaries separating regions of different field strength, reveals the topological character of the Chern bands. Magnetic quantum walks with nearly flat energy bands hold the promise to explore novel interaction-driven topological phases such as fractional Floquet Chern insulators.
Aug 28 2018
math.AP arXiv:1808.08919v1
This paper presents a novel affine Sobolev trace inequality with the sharp constant generated by the Poisson extension of a Sobolev function with the fractional antiderivative.
This research presents a deep learning based approach to predict stress fields in the solid material elastic deformation using convolutional neural networks (CNN). Two different architectures are proposed to solve the problem. One is Feature Representation embedded Convolutional Neural Network (FR-CNN) with a single input channel, and the other is Squeeze-and-Excitation Residual network modules embedded Fully Convolutional Neural network (SE-Res-FCN) with multiple input channels. Both the tow architectures are stable and converged reliably in training and testing on GPUs. Accuracy analysis shows that SE-Res-FCN has a significantly smaller mean squared error (MSE) and mean absolute error (MAE) than FR-CNN. Mean relative error (MRE) of the SE-Res-FCN model is about 0.25% with respect to the average ground truth. The validation results indicate that the SE-Res-FCN model can accurately predict the stress field. For stress field prediction, the hierarchical architecture becomes deeper within certain limits, and then its prediction becomes more accurate. Fully trained deep learning models have higher computational efficiency over conventional FEM models, so they have great foreground and potential in structural design and topology optimization.
Aug 28 2018
math.AP arXiv:1808.08898v1
The upscaling of a system of screw dislocations in a material subject to an external strain is studied. The $\Gamma$-limit of a suitable rescaling of the renormalized energy is characterized in the space of probability measures. This corresponds to a discrete-to-continuum limit of the dislocations, which, as a byproduct, provides information on their distribution when the circulation of the tangential component of the external strain becomes larger and larger. In particular, dislocations are shown to concentrate at the boundary of the material and to distribute as the limiting external strain.
Aug 28 2018
cs.CV arXiv:1808.08891v1
Emojis have evolved as complementary sources for expressing emotion in social-media platforms where posts are mostly composed of texts and images. In order to increase the expressiveness of the social media posts, users associate relevant emojis with their posts. Incorporating domain knowledge has improved machine understanding of text. In this paper, we investigate whether domain knowledge for emoji can improve the accuracy of emoji recommendation task in case of multimedia posts composed of image and text. Our emoji recommendation can suggest accurate emojis by exploiting both visual and textual content from social media posts as well as domain knowledge from Emojinet. Experimental results using pre-trained image classifiers and pre-trained word embedding models on Twitter dataset show that our results outperform the current state-of-the-art by 9.6\%. We also present a user study evaluation of our recommendation system on a set of images chosen from MSCOCO dataset.
Aug 28 2018
cs.CV arXiv:1808.08885v1
We explore the use of deep learning for breast mass segmentation in mammograms. By integrating the merits of residual learning and probabilistic graphical modelling with standard U-Net, we propose a new deep network, Conditional Residual U-Net (CRU-Net), to improve the U-Net segmentation performance. Benefiting from the advantage of probabilistic graphical modelling in the pixel-level labelling, and the structure insights of a deep residual network in the feature extraction, the CRU-Net provides excellent mass segmentation performance. Evaluations based on INbreast and DDSM-BCRP datasets demonstrate that the CRU-Net achieves the best mass segmentation performance compared to the state-of-art methodologies. Moreover, neither tedious pre-processing nor post-processing techniques are not required in our algorithm.
Coherence is a basic phenomenon in quantum mechanics and considered to be an essential resource in quantum information processing. Although the quantification of coherence has attracted a lot of interest, the lack of efficient methods to measure the coherence in experiments limits the applications. We address this problem by introducing an experiment-friendly method for coherence and spectrum estimation. This method is based on the theory of majorization and can not only be used to prove the presence of coherence, but also result in a rather precise lower bound of the amount of coherence. As an illustration, we show how to characterize the freezing phenomenon of coherence with only two local measurements for any $N$-qubit quantum systems. Our approach also has other applications in quantum information processing, such as the characterization of distillability and entanglement transformations.
In this paper, we characterize the rectifiability (both uniform and not) of an Ahlfors regular set, E, of arbitrary co-dimension by the behavior of a regularized distance function in the complement of that set. In particular, we establish a certain version of the Riesz transform characterization of rectifiability for lower-dimensional sets. We also uncover a special situation in which the regularized distance is itself a solution to a degenerate elliptic operator in the complement of E. This allows us to precisely compute the harmonic measure of those sets associated to this degenerate operator and prove that, in a sharp contrast with the usual setting of co-dimension one, a converse to the Dahlberg's theorem (see [Da] and [DFM2]) must be false on lower dimensional boundaries without additional assumptions.
Many real-world objects are designed by smooth curves, especially in the domain of aerospace and ship, where aerodynamic shapes (e.g., airfoils) and hydrodynamic shapes (e.g., hulls) are designed. To facilitate the design process of those objects, we propose a deep learning based generative model that can synthesize smooth curves. The model maps a low-dimensional latent representation to a sequence of discrete points sampled from a rational Bézier curve. We demonstrate the performance of our method in completing both synthetic and real-world generative tasks. Results show that our method can generate diverse and realistic curves, while preserving consistent shape variation in the latent space, which is favorable for latent space design optimization or design space exploration.
Aug 28 2018
cs.CV arXiv:1808.08867v1
Over the past decades, a large number of techniques have emerged in modern imaging systems to capture the exact information of the original scene regardless of shake, motion, lighting conditions and etc., These developments have progressively addressed the acquisition of images in high speed and high resolutions. However, the various ineradicable real-time factors cause the degradation of the information and the quality of the acquired images. The available techniques are not intelligent enough to generalize this complex phenomenon. Hence, it is necessary to develop an intellectual framework to recover the possible information presented in the original scene. In this article, we propose a kernel free framework based on conditional-GAN to recover the information from the heavily damaged images. The degradation of images is assumed to be occurred by the combination of a various blur. Learning parameter of the cGAN is optimized by multi-component loss function that includes improved wasserstein loss with regression loss function. The generator module of this network is developed by using U-Net architecture with local Residual connections and global skip connection. Local connections and a global skip connection are implemented for the utilization of all stages of features. Generated images show that the network has the potential to recover the probable information of blurred images from the learned features. This research work is carried out as a part of our IOP studio software 'Facial recognition module'.
Recent studies have shown that reinforcement learning (RL) is an effective approach for improving the performance of neural machine translation (NMT) system. However, due to its instability, successfully RL training is challenging, especially in real-world systems where deep models and large datasets are leveraged. In this paper, taking several large-scale translation tasks as testbeds, we conduct a systematic study on how to train better NMT models using reinforcement learning. We provide a comprehensive comparison of several important factors (e.g., baseline reward, reward shaping) in RL training. Furthermore, to fill in the gap that it remains unclear whether RL is still beneficial when monolingual data is used, we propose a new method to leverage RL to further boost the performance of NMT systems trained with source/target monolingual data. By integrating all our findings, we obtain competitive results on WMT14 English- German, WMT17 English-Chinese, and WMT17 Chinese-English translation tasks, especially setting a state-of-the-art performance on WMT17 Chinese-English translation task.
LHCb collaboration, I. Bediaga, M. Cruz Torres, J.M. De Miranda, A. Gomes, A. Massafferri, J. Molina Rodriguez, A.C. dos Reis, l. Soares Lavra, R. Tourinho Jadallah Aoude, S. Amato, K. Carvalho Akiba, F. Da Cunha Marinho, L. De Paula, F. Ferreira Rodrigues, M. Gandelman, A. Hicheur, J.H. Lopes, I. Nasteva, J.M. Otalora Goicochea, et al (814) Aug 28 2018
hep-ex arXiv:1808.08865v1
The LHCb Upgrade II will fully exploit the flavour-physics opportunities of the HL-LHC, and study additional physics topics that take advantage of the forward acceptance of the LHCb spectrometer. The LHCb Upgrade I will begin operation in 2020. Consolidation will occur, and modest enhancements of the Upgrade I detector will be installed, in Long Shutdown 3 of the LHC (2025) and these are discussed here. The main Upgrade II detector will be installed in long shutdown 4 of the LHC (2030) and will build on the strengths of the current LHCb experiment and the Upgrade I. It will operate at a luminosity up to $ 2 \times 10^{34} \rm cm^{-2}s^{-1}$, ten times that of the Upgrade I detector. New detector components will improve the intrinsic performance of the experiment in certain key areas. An Expression Of Interest proposing Upgrade II was submitted in February 2017. The physics case for the Upgrade II is presented here in more depth. $CP$-violating phases will be measured with precisions unattainable at any other envisaged facility. The experiment will probe $b\to s \ell^+\ell^-$ and $b\to d \ell^+\ell^-$ transitions in both muon and electron decays in modes not accessible at Upgrade I. Minimal flavour violation will be tested with a precision measurement of the ratio of $B(B^0\to\mu^+\mu^-)/B(B_s^0\to \mu^+\mu^-)$. Probing charm $CP$ violation at the $10^{-5}$ level may result in its long sought discovery. Major advances in hadron spectroscopy will be possible, which will be powerful probes of low energy QCD. Upgrade II potentially will have the highest sensitivity of all the LHC experiments on the Higgs to charm-quark couplings. Generically, the new physics mass scale probed, for fixed couplings, will almost double compared with the pre-HL-LHC era; this extended reach for flavour physics is similar to that which would be achieved by the HE-LHC proposal for the energy frontier.
Aug 28 2018
math.GT arXiv:1808.08864v1
We show that for any simply connected topological closed 4-manifold with trivial Kirby-Siebenmann invariant punctured along any prescribed compact, totally disconnected tame subset there exists an uncountable set of smoothings which are not diffeomorphic to any leaf of a $C^2$ codimension one foliation on a compact manifold. This includes the remarkable case of $S^4$ punctured along a tame Cantor set. If the Kirby-Siebenmann invariant is non-trivial the result also holds if the puncture set is not a Cantor set. In the specific case of a simply connected topological closed 4-manifold whose infinite puncture set is countable and closed we get a continuum of different smooth structures which are not diffeomorphic to leaves of any $C^{1,0}$ codimension one foliation. This is the lowest reasonable regularity for this realization problem. Some of our smooth non-leaves are homeomorphic to proper leaves of $C^\infty$ codimension one foliations on a compact manifold, these are what we call exotic non-leaves. Of independent interest, we also show that a euclidean space cannot be homeomorphic to a proper leaf of any Reebless $C^2$ codimension one foliation
Aug 28 2018
cs.CL arXiv:1808.08859v1
In order to extract the best possible performance from asynchronous stochastic gradient descent one must increase the mini-batch size and scale the learning rate accordingly. In order to achieve further speedup we introduce a technique that delays gradient updates effectively increasing the mini-batch size. Unfortunately with the increase of mini-batch size we worsen the stale gradient problem in asynchronous stochastic gradient descent (SGD) which makes the model convergence poor. We introduce local optimizers which mitigate the stale gradient problem and together with fine tuning our momentum we are able to train a shallow machine translation system 27% faster than an optimized baseline with negligible penalty in BLEU.
We present a neural framework for opinion summarization from online product reviews which is knowledge-lean and only requires light supervision (e.g., in the form of product domain labels and user-provided ratings). Our method combines two weakly supervised components to identify salient opinions and form extractive summaries from multiple reviews: an aspect extractor trained under a multi-task objective, and a sentiment predictor based on multiple instance learning. We introduce an opinion summarization dataset that includes a training set of product reviews from six diverse domains and human-annotated development and test sets with gold standard aspect annotations, salience labels, and opinion summaries. Automatic evaluation shows significant improvements over baselines, and a large-scale study indicates that our opinion summaries are preferred by human judges according to multiple criteria.
Aug 28 2018
cs.CL arXiv:1808.08850v1
Sentence Boundary Detection (SBD) has been a major research topic since Automatic Speech Recognition transcripts have been used for further Natural Language Processing tasks like Part of Speech Tagging, Question Answering or Automatic Summarization. But what about evaluation? Do standard evaluation metrics like precision, recall, F-score or classification error; and more important, evaluating an automatic system against a unique reference is enough to conclude how well a SBD system is performing given the final application of the transcript? In this paper we propose Window-based Sentence Boundary Evaluation (WiSeBE), a semi-supervised metric for evaluating Sentence Boundary Detection systems based on multi-reference (dis)agreement. We evaluate and compare the performance of different SBD systems over a set of Youtube transcripts using WiSeBE and standard metrics. This double evaluation gives an understanding of how WiSeBE is a more reliable metric for the SBD task.
In this paper, we study a class $\mathcal{A}(\lambda ,n,m)$ of self-similar sets with $m$ exact overlaps generated by $n$ similitudes of the same ratio $ \lambda $. We obtain a necessary condition for a self-similar set in $\mathcal{A}(\lambda ,n,m)$ to be Lipschitz equivalent to a self-similar set satisfying the strong separation condition, i.e., there exists an integer $ k\geq 2$ such that $x^{2k}-mx^{k}+n$ is reducible, in particular, $m$ belongs to $\{a^{i}:a\in \mathbb{N}$ with $i\geq 2\}.$
Manipulating light by adding and subtracting individual photons is a powerful approach with a principal drawback: the operations are fundamentally probabilistic and the probability is often small. This limits not only the fundamental scalability but also the number of operations that can be applied in realistic experimental settings. We propose and analyze an adaptive technique which can significantly increase the probability of success while preserving the quality of the photon subtraction. We show the improvement both in single mode preparation and manipulation of non-Gaussian states with negative Wigner functions and in two-mode entanglement distillation protocol with Gaussian states of light.
Aug 28 2018
math.FA arXiv:1808.08844v1
For each $ \alpha > 0 $, the $\alpha$-Bloch space is consisting of all analytic functions $f$ on the unit disk satisfying $ \sup_{|z|<1} (1-|z|^2)^\alpha |f'(z)| < + \infty.$ In this paper, we consider the following complex integral operator, namely the $\beta$-Cesàro operator \beginequation C_\beta(f)(z)=\int_0^z\fracf(w)w(1-w)^\betadw \nonumber \endequation and its generalization, acting from the $\alpha$-Bloch space to itself, where $f(0)=0$ and $\beta\in\mathbb{R}$. We investigate the boundedness and compactness of the $\beta$-Cesàro operators and their generalization. Also we calculate the essential norm and spectrum of these operators.
Aug 28 2018
cs.CV arXiv:1808.08834v1
We present a fast and accurate visual tracking algorithm based on the multi-domain convolutional neural network (MDNet). The proposed approach accelerates feature extraction procedure and learns more discriminative models for instance classification; it enhances representation quality of target and background by maintaining a high resolution feature map with a large receptive field per activation. We also introduce a novel loss term to differentiate foreground instances across multiple domains and learn a more discriminative embedding of target objects with similar semantics. The proposed techniques are integrated into the pipeline of a well known CNN-based visual tracking algorithm, MDNet. We accomplish approximately 25 times speed-up with almost identical accuracy compared to MDNet. Our algorithm is evaluated in multiple popular tracking benchmark datasets including OTB2015, UAV123, and TempleColor, and outperforms the state-of-the-art real-time tracking methods consistently even without dataset-specific parameter tuning.
This paper proposes Power Slow Feature Analysis, a gradient-based method to extract temporally-slow features from a high-dimensional input stream that varies on a faster time-scale, and a variant of Slow Feature Analysis (SFA). While displaying performance comparable to hierarchical extensions to the SFA algorithm, such as Hierarchical Slow Feature Analysis, for a small number of output-features, our algorithm allows end-to-end training of arbitrary differentiable approximators (e.g., deep neural networks). We provide experimental evidence that PowerSFA is able to extract meaningful and informative low-dimensional features in the case of a) synthetic low-dimensional data, b) visual data, and also for c) a general dataset for which symmetric non-temporal relations between points can be defined.
Aug 28 2018
cs.NE arXiv:1808.08818v1
Surrogate-based optimization and nature-inspired metaheuristics have become the state-of-the-art in solving real-world optimization problems. Still, it is difficult for beginners and even experts to get an overview that explains their advantages in comparison to the large number of available methods in the scope of continuous optimization. Available taxonomies lack the integration of surrogate-based approaches and thus their embedding in the larger context of this broad field. This article presents a taxonomy of the field, which further matches the idea of nature-inspired algorithms, as it is based on the human behavior in path finding. Intuitive analogies make it easy to conceive the most basic principles of the search algorithms, even for beginners and non-experts in this area of research. However, this scheme does not oversimplify the high complexity of the different algorithms, as the class identifier only defines a descriptive meta-level of the algorithm search strategies. The taxonomy was established by exploring and matching algorithm schemes, extracting similarities and differences, and creating a set of classification indicators to distinguish between five distinct classes. In practice, this taxonomy allows recommendations for the applicability of the corresponding algorithms and helps developers trying to create or improve their own algorithms.
Quantum holonomies of closed paths on the torus $T^2$ are interpreted as elements of the Heisenberg group $H_1$. Group composition in $H_1$ corresponds to path concatenation and the group commutator is a deformation of the relator of the fundamental group $\pi_1$ of $T^2$, making explicit the signed area phases between quantum holonomies of homotopic paths. Inner automorphisms of $H_1$ adjust these signed areas, and the discrete symplectic transformations of $H_1$ generate the modular group of $T^2$.
Aug 28 2018
cs.CV arXiv:1808.08803v1
We propose a novel attentive sequence to sequence translator (ASST) for clip localization in videos by natural language descriptions. We make two contributions. First, we propose a bi-directional Recurrent Neural Network (RNN) with a finely calibrated vision-language attentive mechanism to comprehensively understand the free-formed natural language descriptions. The RNN parses natural language descriptions in two directions, and the attentive model attends every meaningful word or phrase to each frame, thereby resulting in a more detailed understanding of video content and description semantics. Second, we design a hierarchical architecture for the network to jointly model language descriptions and video content. Given a video-description pair, the network generates a matrix representation, i.e., a sequence of vectors. Each vector in the matrix represents a video frame conditioned by the description. The 2D representation not only preserves the temporal dependencies of frames but also provides an effective way to perform frame-level video-language matching. The hierarchical architecture exploits video content with multiple granularities, ranging from subtle details to global context. Integration of the multiple granularities yields a robust representation for multi-level video-language abstraction. We validate the effectiveness of our ASST on two large-scale datasets. Our ASST outperforms the state-of-the-art by $4.28\%$ in Rank$@1$ on the DiDeMo dataset. On the Charades-STA dataset, we significantly improve the state-of-the-art by $13.41\%$ in Rank$@1,IoU=0.5$.
Aug 28 2018
cs.CV arXiv:1808.08802v1
Three discriminative representations for face presentation attack detection are introduced in this paper. Firstly we design a descriptor called spatial pyramid coding micro-texture (SPMT) feature to characterize local appearance information. Secondly we utilize the SSD, which is a deep learning framework for detection, to excavate context cues and conduct end-to-end face presentation attack detection. Finally we design a descriptor called template face matched binocular depth (TFBD) feature to characterize stereo structures of real and fake faces. For accurate presentation attack detection, we also design two kinds of representation combinations. Firstly, we propose a decision-level cascade strategy to combine SPMT with SSD. Secondly, we use a simple score fusion strategy to combine face structure cues (TFBD) with local micro-texture features (SPMT). To demonstrate the effectiveness of our design, we evaluate the representation combination of SPMT and SSD on three public datasets, which outperforms all other state-of-the-art methods. In addition, we evaluate the representation combination of SPMT and TFBD on our dataset and excellent performance is also achieved.