In this paper, we propose a provably efficient natural policy gradient algorithm called Spectral Dynamic Embedding Policy Optimization (SDEPO) for two-player zero-sum stochastic Markov games with continuous state spac...
ISBN:
(纸本)9798331314385
In this paper, we propose a provably efficient natural policy gradient algorithm called Spectral Dynamic Embedding Policy Optimization (SDEPO) for two-player zero-sum stochastic Markov games with continuous state space and finite action space. In the policy evaluation procedure of our algorithm, a novel kernel embedding method is employed to construct a finite-dimensional linear approximations to the state-action value function. We explicitly analyze the approximation error in policy evaluation, and show that SDEPO achieves an Õ(1/(1-γ)3ε) last-iterate convergence to the ε-optimal Nash equilibrium, which is independent of the cardinality of the state space. The complexity result matches the best-known results for global convergence of policy gradient algorithms for single agent setting. Moreover, we also propose a practical variant of SDEPO to deal with continuous action space and empirical results demonstrate the practical superiority of the proposed method.
A major open problem in computational complexity is the existence of a one-way function, namely a function from strings to strings which is computationally easy to compute but hard to invert. Levin (2023) formulated t...
详细信息
This paper investigates the effective utilization of unlabeled data for large-area cross-view gee-localization (CVGL), encompassing both unsupervised and semi-supervised settings. Common approaches to CVGL rely on gro...
详细信息
ISBN:
(数字)9798350353006
ISBN:
(纸本)9798350353013
This paper investigates the effective utilization of unlabeled data for large-area cross-view gee-localization (CVGL), encompassing both unsupervised and semi-supervised settings. Common approaches to CVGL rely on ground-satellite image pairs and employ label-driven supervised training. However, the cost of collecting precise cross-view image pairs hinders the deployment of CVGL in real-life scenarios. Without the pairs, CVGL will be more challenging to handle the significant imaging and spatial gaps between ground and satellite images. To this end, we propose an unsupervised framework including a cross-view projection to guide the model for retrieving initial pseudo-labels and a fast re-ranking mechanism to refine the pseudo-labels by leveraging the fact that “the perfectly paired ground-satellite image is located in a unique and identical scene”. The framework exhibits competitive performance compared with supervised works on three open-source benchmarks. Our code and models will be released on https://***/liguopeng0923/UCVGL.
Arranging the bits of a random string or real into k columns of a two-dimensional array or higher dimensional structure is typically accompanied with loss in the Kolmogorov complexity of the columns, which depends on ...
详细信息
This paper investigates the effective utilization of unlabeled data for large-area cross-view geo-localization (CVGL), encompassing both unsupervised and semi-supervised settings. Common approaches to CVGL rely on gro...
详细信息
Transformer is a new kind of neural architecture which encodes the input data as powerful features via the attention mechanism. Basically, the visual transformers first divide the input images into several local patch...
ISBN:
(纸本)9781713845393
Transformer is a new kind of neural architecture which encodes the input data as powerful features via the attention mechanism. Basically, the visual transformers first divide the input images into several local patches and then calculate both representations and their relationship. Since natural images are of high complexity with abundant detail and color information, the granularity of the patch dividing is not fine enough for excavating features of objects in different scales and locations. In this paper, we point out that the attention inside these local patches are also essential for building visual transformers with high performance and we explore a new architecture, namely, Transformer iN Transformer (TNT). Specifically, we regard the local patches (e.g., 16×16) as "visual sentences" and present to further divide them into smaller patches (e.g., 4×4) as "visual words". The attention of each word will be calculated with other words in the given visual sentence with negligible computational costs. Features of both words and sentences will be aggregated to enhance the representation ability. Experiments on several benchmarks demonstrate the effectiveness of the proposed TNT architecture, e.g., we achieve an 81.5% top-1 accuracy on the ImageNet, which is about 1.7% higher than that of the state-of-the-art visual transformer with similar computational cost. The PyTorch code is available at https://***/huawei-noah/CV-Backbones, and the MindSpore code is available at https://***/mindspore/models/tree/master/research/cv/TNT.
This study presents an energy consumption(EC)forecasting method for laser melting manufacturing of metal artifacts based on fusionable transfer learning(FTL).To predict the EC of manufacturing products,particularly fr...
详细信息
This study presents an energy consumption(EC)forecasting method for laser melting manufacturing of metal artifacts based on fusionable transfer learning(FTL).To predict the EC of manufacturing products,particularly from scale-down to scale-up,a general paradigm was first developed by categorizing the overall process into three main *** operating electrical power was further formulated as a combinatorial function,based on which an operator learning network was adopted to fit the nonlinear relations between the fabricating arguments and ***-arranged networks were constructed to investigate the impacts of fabrication variables and devices on *** the interconnections among these factors,the outputs of the neural networks were blended and fused to jointly predict the electrical *** innovatively,large artifacts can be decomposed into timedependent laser-scanning trajectories,which can be further transformed into fusionable information via neural networks,inspired by large language ***,transfer learning can deal with either scale-down or scale-up forecasting,namely,FTL with scalability within artifact *** effectiveness of the proposed FTL was verified through physical fabrication experiments via laser powder bed *** relative error of the average and overall EC predictions based on FTL was maintained below 0.83%.The melting fusion quality was examined using metallographic *** proposed FTL framework can forecast the EC of scaled structures,which is particularly helpful in price estimation and quotation of large metal products towards carbon peaking and carbon neutrality.
Assessing the action quality of cyborg animals helps to adjust control strategies, guide the development of control algorithm, and enhance the efficiency of navigation and military applications. However, existing rese...
详细信息
As feature size continues to shrink and light source wavelengths remain unchanged, the optical diffraction effects seriously degrade chip yield. Optical proximity correction (OPC) has become an essential step for chip...
详细信息
Graphical User Interface (GUI) testing is one of the primary approaches for testing mobile apps. Test scripts serve as the main carrier of GUI testing, yet they are prone to obsolescence when the GUIs change with the ...
详细信息
暂无评论