检索结果-内蒙古大学图书馆

Federated Offline Reinforcement Learning with Proximal Policy Evaluation

Chinese Journal of Electronics 2024年第6期33卷 1360-1372页

作者： Sheng YUE Yongheng DENG Guanbo WANG Ju REN Yaoxue ZHANG Department of Computer Science and Technology BNRist Tsinghua University Zhongguancun Laboratory

Offline reinforcement learning(RL) has gathered increasing attention in recent years, which seeks to learn policies from static datasets without active online exploration. However, the existing offline RL approaches often require a large amount of pre-collected data and hence are hardly implemented by a single agent in practice. Inspired by the advancement of federated learning(FL), this paper studies federated offline reinforcement learning(FORL),whereby multiple agents collaboratively carry out offline policy learning with no need to share their raw ***, a straightforward solution is to simply retrofit the off-the-shelf offline RL methods for FL, whereas such an approach easily overfits individual datasets during local updating, leading to instability and subpar performance. To overcome this challenge, we propose a new FORL algorithm, named model-free(MF)-FORL, that exploits novel“proximal local policy evaluation” to judiciously push up action values beyond local data support, enabling agents to capture the individual information without forgetting the aggregated knowledge. Further, we introduce a model-based variant, MB-FORL, capable of improving the generalization ability and computational efficiency via utilizing a learned dynamics model. We evaluate the proposed algorithms on a suite of complex and high-dimensional offline RL benchmarks, and the results demonstrate significant performance gains over the baselines.

关键词： Federated learning Computational modeling Heuristic algorithms Reinforcement learning Performance gain Benchmark testing Data models Computational efficiency Trajectory Servers

来源：评论

学校读者我要写书评

暂无评论

A secret sharing scheme based on integer decomposition and hexagonal structure

引用

International Journal of Information and Communication Technology 2024年第4期24卷 482-501页

作者： Rouia, Zender Lemnouar, Noui Rida, Abdessemed Mohamed LAMIE Laboratory Department of Computer Science Faculty of Mathematics and Computer Science University of Batna 2 Algeria

Security is a major challenge in storage and transmission of digital data. Secret sharing scheme is a fundamental primitive used in multiparty computations, access control and key management, which is based here on two concepts, namely: hexagonal structure and integer decomposition. Use of hexagonal structure is common in biological modelling. For integer decomposition, the oldest known method is Fermat’s factorisation, while for the proposed decomposition, the factorisation uniqueness of positive integer into two factors is exploited. Experimental results obtained from the applied scheme to digital images reveal interesting properties;this scheme turns out to be lossless, ideal, flexible, extensible, and even can detect and identify cheater;in sum, it has a good security. © 2024 Inderscience Enterprises Ltd.

关键词： Factorization

来源：评论

学校读者我要写书评

暂无评论

An infrastructure software perspective toward computation offloading between executable specifications and foundation models

引用

science China(Information sciences) 2025年第4期68卷 380-382页

作者： Dezhi RAN Mengzhou WU Yuan CAO Assaf MARRON David HAREL Tao XIE Key Laboratory of High Confidence Software Technologies (PKU) Ministry of Education School of Computer SciencePeking University School of Electronics Engineering and Computer Science Peking University Department of Computer Science and Applied Mathematics Weizmann Institute of Science

Foundation models(FMs) [1] have revolutionized software development and become the core components of large software systems. This paradigm shift, however, demands fundamental re-imagining of software engineering theories and methodologies [2]. Instead of replacing existing software modules implemented by symbolic logic, incorporating FMs' capabilities to build software systems requires entirely new modules that leverage the unique capabilities of ***, while FMs excel at handling uncertainty, recognizing patterns, and processing unstructured data, we need new engineering theories that support the paradigm shift from explicitly programming and maintaining user-defined symbolic logic to creating rich, expressive requirements that FMs can accurately perceive and implement.

关键词：

来源：评论

学校读者我要写书评

暂无评论

GPS: graph contrastive learning via multi-scale augmented views from adversarial pooling

引用

science China(Information sciences) 2025年第1期68卷 145-158页

作者： Wei JU Yiyang GU Zhengyang MAO Ziyue QIAO Yifang QIN Xiao LUO Hui XIONG Ming ZHANG School of Computer Science National Key Laboratory for Multimedia Information ProcessingPeking University Artificial Intelligence Thrust The Hong Kong University of Science and Technology Department of Computer Science University of California

Self-supervised graph representation learning has recently shown considerable promise in a range of fields, including bioinformatics and social networks. A large number of graph contrastive learning approaches have shown promising performance for representation learning on graphs, which train models by maximizing agreement between original graphs and their augmented views(i.e., positive views). Unfortunately, these methods usually involve pre-defined augmentation strategies based on the knowledge of human experts. Moreover, these strategies may fail to generate challenging positive views to provide sufficient supervision signals. In this paper, we present a novel approach named graph pooling contrast(GPS) to address these *** by the fact that graph pooling can adaptively coarsen the graph with the removal of redundancy, we rethink graph pooling and leverage it to automatically generate multi-scale positive views with varying emphasis on providing challenging positives and preserving semantics, i.e., strongly-augmented view and weakly-augmented view. Then, we incorporate both views into a joint contrastive learning framework with similarity learning and consistency learning, where our pooling module is adversarially trained with respect to the encoder for adversarial robustness. Experiments on twelve datasets on both graph classification and transfer learning tasks verify the superiority of the proposed method over its counterparts.

关键词： graph representation learning graph neural networks graph contrastive learning graph augmentations graph pooling

来源：评论

学校读者我要写书评

暂无评论

Re-quantization based binary graph neural networks

引用

science China(Information sciences) 2024年第7期67卷 160-171页

作者： Kai-Lang YAO Wu-Jun LI National Key Laboratory for Novel Software Technology Department of Computer Science and TechnologyNanjing University

Binary neural networks have become a promising research topic due to their advantages of fast inference speed and low energy consumption. However, most existing studies focus on binary convolutional neural networks, while less attention has been paid to binary graph neural networks. A common drawback of existing studies on binary graph neural networks is that they still include lots of inefficient full-precision operations in multiplying three matrices and are therefore not efficient enough. In this paper, we propose a novel method, called re-quantization-based binary graph neural networks(RQBGN), for binarizing graph neural networks. Specifically, re-quantization, a necessary procedure contributing to the further reduction of superfluous inefficient full-precision operations, quantizes the results of multiplication between any two matrices during the process of multiplying three matrices. To address the challenges introduced by requantization, in RQBGN we first study the impact of different computation orders to find an effective one and then introduce a mixture of experts to increase the model capacity. Experiments on five benchmark datasets show that performing re-quantization in different computation orders significantly impacts the performance of binary graph neural network models, and RQBGN can outperform other baselines to achieve state-of-the-art performance.

关键词： graph neural networks binary neural networks mixture of experts computation-efficient algorithms

来源：评论

学校读者我要写书评

暂无评论

Stochastic normalized gradient descent with momentum for large-batch training

引用

science China(Information sciences) 2024年第11期67卷 77-91页

作者： Shen-Yi ZHAO Chang-Wei SHI Yin-Peng XIE Wu-Jun LI National Key Laboratory for Novel Software Technology Department of Computer Science and TechnologyNanjing University

Stochastic gradient descent(SGD) and its variants have been the dominating optimization methods in machine learning. Compared with SGD with small-batch training, SGD with large-batch training can better utilize the computational power of current multi-core systems such as graphics processing units(GPUs)and can reduce the number of communication rounds in distributed training settings. Thus, SGD with large-batch training has attracted considerable attention. However, existing empirical results showed that large-batch training typically leads to a drop in generalization accuracy. Hence, how to guarantee the generalization ability in large-batch training becomes a challenging task. In this paper, we propose a simple yet effective method, called stochastic normalized gradient descent with momentum(SNGM), for large-batch training. We prove that with the same number of gradient computations, SNGM can adopt a larger batch size than momentum SGD(MSGD), which is one of the most widely used variants of SGD, to converge to an?-stationary point. Empirical results on deep learning verify that when adopting the same large batch size,SNGM can achieve better test accuracy than MSGD and other state-of-the-art large-batch training methods.

关键词： non-convex problems large-batch training stochastic normalized gradient descent momentum

来源：评论

学校读者我要写书评

暂无评论

AmplitudeArrow: On-the-Go AR Menu Selection Using Consecutive Simple Head Gestures and Amplitude Visualization

引用

IEEE Transactions on Visualization and computer graphics 2025年第05期31卷 3408-3417页

作者： Tian, Yang Zhang, Youpeng Yan, Yukang Zhao, Shengdong Ma, Xiaojuan Shi, Yuanchun Guangxi University Department of Computer Science Guangxi Key Laboratory of Multimedia Communications and Network Technology China University of Rochester Department of Computer Science United States City University of Hong Kong School of Creative Media and the Department of Computer Science Hong Kong Hong Kong University of Science and Technology Department of Computer Science and Engineering Hong Kong Tsinghua University Department of Computer Science and Technology China

Heads-up computing aims to provide synergistic digital assistance that minimally interferes with users' on-the-go daily activities. Currently, the input modalities of heads-up computing are mainly voice and finger gestures. In this work, we propose and evaluate the AmplitudeArrow (AA) technique designed for on-the-go AR menu selection to demonstrate that consecutive simple head gestures can also be an effective input modality for heads-up computing. Specifically, AA arranges menu icons into one/two row(s). To select a target icon, the user first makes their head yaw to pre-select the target icon or the column containing it and then makes their head pitch to make the arrow in the target icon expand until the arrow covers the target icon completely, i.e., the pitch amplitude surpasses the selection confirmation threshold. User studies indicated that AA demonstrated robust resistance to walking-caused head perturbation and external factors such as other people/obstacles, delivering high accuracy (error rate © 1995-2012 IEEE.

关键词： Augmented reality

来源：评论

学校读者我要写书评

暂无评论

Static video summarization based on genetic algorithm and deep learning approach

引用

Multimedia Tools and Applications 2025年第13期84卷 12487-12512页

作者： Benoughidene, Abdelhalim Titouna, Faiza Boughida, Adil Computer Science Department LaSTIC Laboratory University Batna 2 Batna05000 Algeria Computer Science Department LabSTIC Laboratory University of 08 may 1945 Guelma Guelma24000 Algeria

The development of information technology has led to the rise of big data. A large portion of this big data comes in the form of video information. The automatic analysis of this exponential growth in video content has become a popular research area. This research focuses on finding a video’s keyframes through a proposed static video summarization method. The method uses a deep learning-based shot boundary detection approach as a pre-processing step and exploits DBSCAN clustering to extract keyframes. A genetic algorithm is used to optimize the hyper-parameters of DBSCAN rather than having the user pre-tune them because the number of keyframes in a video can vary depending on the content of the video. The experimental results on standard databases Open Video Project (OVP) and YouTube (YT) show that the proposed method produces better results than existing methods. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Genetic algorithms

来源：评论

学校读者我要写书评

暂无评论

Movements Classification Based on Surface Electromyography Using Time-frequency Domain Features 27

Movements Classification Based on Surface Electromyography U...

引用

27th International Conference on Soft Computing and Measurements, SCM 2024

作者： Kuznetsov, Ivan V. Far Eastern State Transport University Department of Computer Science and Computer Graphics Khabarovsk Russia

ISBN: (纸本)9798350363708

Feature selection makes significant role in movement classification based on electromyography data. It is assumed that the efficiency of movement classification is improved when time-domain (TD) and frequency-time-domain (TFD) features are used together. To validate the hypothesis, classifiers based on Support Vector Machines (SVM), K Nearest Neighbors (KNN), Random Forest (RF) were trained using NinaPro DB5 dataset. Classification efficiencies of 91.3% and 92.8% were achieved for Recall and Precision metrics, respectively, using KNN. The proposed methods can be used to create human-machine interfaces using muscle activity data. © 2024 IEEE.

关键词： Classification (of information)

来源：评论

学校读者我要写书评

暂无评论

A systematic and comprehensive review on low power wide area network: characteristics, architecture, applications and research challenges

引用

Discover Internet of Things 2025年第1期5卷 1-26页

作者： Diane, Ass Diallo, Ousmane Ndoye, El Hadji Malick Laboratory Department of Computer Science Assane SECK University of Ziguinchor Ziguinchor Senegal

The Internet of Things (IoT) has become a rapidly growing research field. This is due to the advancement of digital technologies, miniaturization, and the reduction of the cost of IoT devices and wireless connectivity, among others. Despite the plethora of technologies used for the Internet of Things, the trade-off between long data transmission range and low power consumption was not found until the advent of Low Power Wide Area Network (LPWAN) technologies. This paper reviews the main aspects of LPWANs and their technologies based on an exhaustive search in several online scientific databases, such as Springer, IEEE Xplore, the ACM digital library, and Google Scholar. This research methodology enabled us to gather recent work on LPWANs, which forms the basis of this article. It is informative and knowledge-updating support in the LPWANs’ environment that broadly covers LPWANs. This research work has developed the characteristics of LPWANs and the techniques used to achieve long-range energy efficiency, high scalability, and low cost. In addition, it presents the application areas of LPWAN technologies with use-case network architectures for each area, addresses spectrum and energy optimization, and discusses open research challenges that need to be focused to provide guidelines for further contributions. © The Author(s) 2025.

关键词： Applications Architecture IoT LPWAN Research challenges Standardization

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：