The truncated singular value decomposition has been widely used in many areas of science including engineering,and statistics,*** this paper,the original truncated complex singular value decomposition problem is formu...
详细信息
The truncated singular value decomposition has been widely used in many areas of science including engineering,and statistics,*** this paper,the original truncated complex singular value decomposition problem is formulated as a Riemannian optimiza-tion problem on a product of two complex Stiefel manifolds,a practical algorithm based on the generic Riemannian trust-region method of Absil et *** presented to solve the underlying problem,which enjoys the global convergence and local superlinear conver-gence *** experiments are provided to illustrate the efficiency of the proposed *** with some classical Riemannian gradient-type methods,the existing Riemannian version of limited-memory BFGS algorithms in the MATLAB toolbox Manopt and the Riemannian manifold optimization library ROPTLIB,and some latest infeasible methods for solving manifold optimization problems,are also provided to show the merits of the proposed approach.
Adam has become one of the most favored optimizers in deep learning problems. Despite its success in practice, numerous mysteries persist regarding its theoretical understanding. In this paper, we study the implicit b...
ISBN:
(纸本)9798331314385
Adam has become one of the most favored optimizers in deep learning problems. Despite its success in practice, numerous mysteries persist regarding its theoretical understanding. In this paper, we study the implicit bias of Adam in linear logistic regression. Specifically, we show that when the training data are linearly separable, the iterates of Adam converge towards a linear classifier that achieves the maximum ℓ∞ -margin in direction. Notably, for a general class of diminishing learning rates, this convergence occurs within polynomial time. Our result shed light on the difference between Adam and (stochastic) gradient descent from a theoretical perspective.
Mobility is the key for people with disabilities to have full participation in life. To support their mobility, previous work primarily focused on accessibility as an attribute of the external environment to be evalua...
详细信息
Investors and financial analysts require accurate predictions of stock prices to make informed decisions. We present Momentum Spillover Network (MSNet), a deep learning model that predicts stock prices on the National...
详细信息
Finding an ideal sequence in games is crucial for maximizing gains in various scenarios. This process requires extensive investigation, which is both time consuming and demands significant domain expertise. The emerge...
详细信息
Finding an ideal sequence in games is crucial for maximizing gains in various scenarios. This process requires extensive investigation, which is both time consuming and demands significant domain expertise. The emergence of large language models (LLMs) represents a significant turning point in addressing this issue, attributable to their potent analytical capabilities. In this context, LLMs can serve to substantially alleviate the human labor needed to manage these complexities. Through comprehensive simulations of coin-tossing games, we have demonstrated that the adaptive switching strategies formulated by LLMs surpass predefined sequences in profitability when applied to certain paradoxical games. Furthermore, our experimental findings indicate that the proposed method not only automates the identification of effective strategies but also provides adaptability to various forms of these paradoxical games.
Open-vocabulary semantic segmentation (OVSS) is a challenging computer vision task that labels each pixel within an image based on text descriptions. Recent advancements in OVSS are largely attributed to the increased...
详细信息
It is often challenging to obtain large number of labeled data for retinal layer segmentation in optical coherence tomography scans due to the need for expert ophthalmologists. On the other hand, huge quantities of un...
详细信息
Neural architecture search (NAS) automates neural network design by using optimization algorithms to navigate architecture spaces, reducing the burden of manual architecture design. While NAS has achieved success, app...
详细信息
The Current Mirror (CM) is a basic building block commonly used in analogue and mixed-signal integrated circuits. Its significance lies in its ability to replicate and precisely regulate the current, making it crucial...
详细信息
The aging of operational reactors leads to increased mechanical vibrations in the reactor *** vibration of the incore sensors near their nominal locations is a new problem for neutronic field *** field-reconstruction ...
详细信息
The aging of operational reactors leads to increased mechanical vibrations in the reactor *** vibration of the incore sensors near their nominal locations is a new problem for neutronic field *** field-reconstruction methods fail to handle spatially moving *** this study,we propose a Voronoi tessellation technique in combination with convolutional neural networks to handle this *** from movable in-core sensors were projected onto the same global field structure using Voronoi tessellation,holding the magnitude and location information of the *** convolutional neural networks were used to learn maps from observations to the global *** proposed method reconstructed multi-physics fields(including fast flux,thermal flux,and power rate)using observations from a single field(such as thermal flux).Numerical tests based on the IAEA benchmark demonstrated the potential of the proposed method in practical engineering applications,particularly within an amplitude of 5 cm around the nominal locations,which led to average relative errors below 5% and 10% in the L_(2) and L_(∞)norms,respectively.
暂无评论