We introduce a class of generic spike-and-slab priors for high-dimensional linear regression with grouped variables and present a Coordinate-ascent Variational Inference (CAVI) algorithm for obtaining an optimal varia...
详细信息
Recent advances in distributed optimization and learning have shown that communication compression is one of the most effective means of reducing communication. While there have been many results on convergence rates ...
详细信息
We consider the problem of sequentially making decisions that are rewarded by "successes" and "failures" which can be predicted through an unknown relationship that depends on a partially controlla...
详细信息
ISBN:
(纸本)9781510829008
We consider the problem of sequentially making decisions that are rewarded by "successes" and "failures" which can be predicted through an unknown relationship that depends on a partially controllable vector of attributes for each instance. The learner takes an active role in selecting samples from the instance pool. The goal is to maximize the probability of success, either after the offline training phase or minimizing regret in online learning. Our problem is motivated by real-world applications where observations are time consuming and/or expensive. With the adaptation of an online Bayesian linear classifier, we develop a knowledge-gradient type policy to guide the experiment by maximizing the expected value of information of labeling each alternative, in order to reduce the number of expensive physical experiments. We provide a finite-time analysis of the estimated error and demonstrate the performance of the proposed algorithm on both synthetic problems and benchmark UCI datasets.
Several classification methods assume that the underlying distributions follow tree-structured graphical models. Indeed, trees capture statistical dependencies between pairs of variables, which may be crucial to attai...
详细信息
Several classification methods assume that the underlying distributions follow tree-structured graphical models. Indeed, trees capture statistical dependencies between pairs of variables, which may be crucial to attaining low classification errors. In this setting, the optimal classifier is linear in the log-transformed univariate and bivariate densities that correspond to the tree edges. In practice, observed data may not be well approximated by trees. Yet, motivated by the importance of pairwise dependencies for accurate classification, here we propose to approximate the optimal decision boundary by a sparse linear combination of the univariate and bivariate log-transformed densities. Our proposed approach is semi-parametric in nature: we non-parametrically estimate the univariate and bivariate densities, remove pairs of variables that are nearly independent using the Hilbert-Schmidt independence criterion, and finally construct a linear SVM using the retained log-transformed densities. We demonstrate on synthetic and real data sets, that our classifier, named SLB (sparse log-bivariate density), is competitive with other popular classification methods.
Building efficient, accurate, and generalizable reduced-order models of developed turbulence remains a major challenge. This manuscript approaches this problem by developing a hierarchy of parameterized reduced Lagran...
详细信息
Building efficient, accurate, and generalizable reduced-order models of developed turbulence remains a major challenge. This manuscript approaches this problem by developing a hierarchy of parameterized reduced Lagrangian models for turbulent flows, and it investigates the effects of enforcing physical structure through smoothed particle hydrodynamics (SPH) versus relying on neural networks (NNs) as universal function approximators. Starting from NN parametrizations of a Lagrangian acceleration operator, this hierarchy of models gradually incorporates a weakly compressible and parameterized SPH framework, which enforces physical symmetries, such as Galilean, rotational, and translational invariances. Within this hierarchy, two new parameterized smoothing kernels are developed to increase the flexibility of the learn-able SPH simulators. For each model we experiment with different loss functions which are minimized using gradient based optimization, where efficient computations of gradients are obtained by using automatic differentiation and sensitivity analysis. Each model within the hierarchy is trained on two data sets associated with weakly compressible homogeneous isotropic turbulence: (1) a validation set using weakly compressible SPH; and (2) a high-fidelity set from direct numerical simulations. Numerical evidence shows that encoding more SPH structure improves generalizability to different turbulent Mach numbers and time shifts, and that including the novel parameterized smoothing kernels improves the accuracy of SPH at the resolved scales.
Supervised learning in function spaces is an emerging area of machine learning research with applications to the prediction of complex physical systems such as fluid flows, solid mechanics, and climate modeling. By di...
详细信息
We introduce a machine-learning-based framework for constructing continuum non-Newtonian fluid dynamics model directly from a micro-scale description. Polymer solution is used as an example to demonstrate the essentia...
详细信息
Sparse Matrix Vector Multiplication (SpMV) is one of the most basic problems in scientific and engineering computations. It is the basic operation in many realms, such as solving linear systems or eigenvalue problems....
详细信息
Sparse Matrix Vector Multiplication (SpMV) is one of the most basic problems in scientific and engineering computations. It is the basic operation in many realms, such as solving linear systems or eigenvalue problems. Nowadays, more than 90 percent of the world’s highest performance parallel computers in the top 500 use multicore architecture. So it is important practically to design the efficient methods of computing SpMV on multicore parallel computers. Usually, algorithms based on compressed sparse row (CSR) format suffer from a number of nonzero elements on each row so hardly as to use the multicore structure efficiently. Compressed Sparse Block (CSB) format is an effective storage format which can compute SpMV efficiently in a multicore computer. This paper presents a parallel multicore CSB format and SpMV based on it. We carried out numerical experiments on a parallel multicore computer. The results show that our parallel multicore CSB format and SpMV algorithm can reach high speedup, and they are highly scalable for banded matrices.
Hyperuniform systems, which include crystals, quasicrystals, and special disordered systems, have attracted considerable recent attention, but rigorous analyses of the hyperuniformity of quasicrystals have been lackin...
详细信息
Hyperuniform systems, which include crystals, quasicrystals, and special disordered systems, have attracted considerable recent attention, but rigorous analyses of the hyperuniformity of quasicrystals have been lacking because the support of the spectral intensity is dense and discontinuous. We employ the integrated spectral intensity Z(k) to quantitatively characterize the hyperuniformity of quasicrystalline point sets generated by projection methods. The scaling of Z(k) as k tends to zero is computed for one-dimensional quasicrystals and shown to be consistent with independent calculations of the variance, σ2(R), in the number of points contained in an interval of length 2R. We find that one-dimensional quasicrystals produced by projection from a two-dimensional lattice onto a line of slope 1/τ fall into distinct classes determined by the width of the projection window. For a countable dense set of widths, Z(k)∼k4; for all others, Z(k)∼k2. This distinction suggests that measures of hyperuniformity define new classes of quasicrystals in higher dimensions as well.
We consider the trajectory planning of a 6-Degree-of-Freedom (DOF) robot manipulator using computer algebra, with controlling the orientation of the end-effector. As a first step towards the objective, we present a so...
详细信息
暂无评论