A fairly comprehensive analysis is presented for the gradient descent dynamics for training two-layer neural network models in the situation when the parameters in both layers are updated. General initialization schem...
详细信息
The weak-field limit of Einstein-Cartan (EC) relativity is studied. The equations of EC theory are rewritten such that they formally resemble those of Einstein general relativity (EGR); this allows ideas from post-New...
详细信息
The weak-field limit of Einstein-Cartan (EC) relativity is studied. The equations of EC theory are rewritten such that they formally resemble those of Einstein general relativity (EGR); this allows ideas from post-Newtonian theory to be imported without essential change. The equations of motion are then written both at first post-Newtonian (1PN) order and at 1.5PN order. EC theory’s 1PN equations of motion are found to be those of a micropolar/Cosserat elastic medium, along with a decoupled evolution equation for nonclassical, spin-related fields. It seems that a necessary condition for these results to hold is that one chooses the nonclassical fields to scale with the speed of light in a certain empirically reasonable way. Finally, the 1.5PN equations give greater insight into the coupling between energy-momentum and spin within slowly moving, weakly gravitating matter. Specifically, the weakly relativistic modifications to Cosserat theory involve a gravitational torque and an augmentation of the gravitational force due to a dynamic mass moment density with an accompanying dynamic mass moment density flux, and new forms of linear momentum density captured by a dynamic mass density flux and a dynamic momentum density.
This paper presents a novel algorithm for the 3D tomographic inversion problem that arises in single-particle electron cryomicroscopy (Cryo-EM). It is based on two key components: 1) a variational formulation that pro...
详细信息
This paper presents a novel algorithm for the 3D tomographic inversion problem that arises in single-particle electron cryomicroscopy (Cryo-EM). It is based on two key components: 1) a variational formulation that promotes sparsity in the wavelet domain and 2) the Toeplitz structure of the combined projection/back-projection operator. The first idea has proven to be very effective for the recovery of piecewise-smooth signals, which is confirmed by our numerical experiments. The second idea allows for a computationally efficient implementation of the reconstruction procedure, using only one circulant convolution per iteration.
Optimal a priori estimates are derived for the population risk, also known as the generalization error, of a regularized residual network model. An important part of the regularized model is the usage of a new path no...
详细信息
We present a continuous formulation of machine learning, as a problem in the calculus of variations and differential-integral equations, very much in the spirit of classical numerical analysis and statistical physics....
详细信息
We study the solutions of the one-phase supercooled Stefan problem with kinetic undercooling, which describes the freezing of a supercooled liquid, in one spatial dimension. Assuming that the initial temperature lies ...
详细信息
The behavior of the gradient descent (GD) algorithm is analyzed for a deep neural network model with skip-connections. It is proved that in the over-parametrized regime, for a suitable initialization, with high probab...
详细信息
The isotope effects in x-ray absorption spectra of liquid water are studied by a many-body approach within electron-hole excitation theory. The molecular structures of both light and heavy water are modeled by path-in...
详细信息
The isotope effects in x-ray absorption spectra of liquid water are studied by a many-body approach within electron-hole excitation theory. The molecular structures of both light and heavy water are modeled by path-integral molecular dynamics based on the advanced deep-learning technique. The neural network is trained on ab initio data obtained with SCAN density functional theory. The experimentally observed isotope effect in x-ray absorption spectra is reproduced semiquantitatively in theory. Compared to the spectrum in normal water, the blueshifted and less pronounced pre- and main-edge in heavy water reflect that the heavy water is more structured at short- and intermediate-range of the hydrogen-bond network. In contrast, the isotope effect on the spectrum is negligible at post-edge, which is consistent with the identical long-range ordering in both liquids as observed in the diffraction experiment.
We consider a probabilistic formulation of a singular two-phase Stefan problem in one space dimension, which amounts to a coupled system of two McKean-Vlasov stochastic differential equations. In the financial context...
详细信息
We study the one-phase one-dimensional supercooled Stefan problem with oscillatory initial conditions. In this context, the global existence of so-called physical solutions has been shown recently in [CRSF23], despite...
详细信息
暂无评论