检索结果-内蒙古大学图书馆

Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications

IEEE/CAA Journal of Automatica Sinica 2024年第1期11卷 18-36页

作者： Ding Wang Ning Gao Derong Liu Jinna Li Frank L.Lewis IEEE the Faculty of Information Technology Beijing Key Laboratory of Computational Intelligence and Intelligent SystemBeijing Laboratory of Smart Environmental Protectionand Beijing Institute of Artificial IntelligenceBeijing University of TechnologyBeijing 100124China the School of System Design and Intelligent Manufacturing Southern University of Science and TechnologyShenzhen 518055China the Department of Electrical and Computer Engineering University of Illinois at ChicagoChicago IL 60607 USA the School of Information and Control Engineering Liaoning Petrochemical UniversityFushun 113001China the UTA Research Institute the University of Texas at ArlingtonArlington TX 76118 USA

Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, ***, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation ***, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.

关键词： Adaptive dynamic programming(ADP) advanced control complex environment data-driven control event-triggered design intelligent control neural networks nonlinear systems optimal control reinforcement learning(RL)

来源：评论

学校读者我要写书评

暂无评论

Digital Communications and networks

引用

Digital Communications and networks 2022年第3期8卷 235-236页

作者： Ruidong Li Shiwen Mao Periklis Chatzimisios Constandinos X.Mavromoustakis Intelligent computation and network laboratory(ICNL)at Kanazawa University Japan the Wireless Engineering Research and Education Center(WEREC) Department of Information and Electronic Engineering at the International Hellenic University(Greece) Constandinos X.Mavromoustakis is currently a Professor at the Department of Computer Science at the University of Nicosia Cyprus

Guest editorial The emerging applications,suchas Augmented and Virtual Realities(AR/VR),InternetofThings(IoT),4K/8Kstreaming,raisestrongrequirementsto movecomputationfrom thecloudtotheedgestobecloser *** are tremendous possibilities for the network edge,which may includeavariety ofentities,such as small datacenters,end devices,and resource-abundant network *** together provide the network computation and intelligence to users.

关键词： network IoT Communications

来源：评论

学校读者我要写书评

暂无评论

Bonding and Failure Mechanisms of Sintered-silver Die-attach on Electroless Nickel (Phosphorus) Surface Finish for Packaging Power Electronics Modules

引用

IEEE Transactions on Power Electronics 2024年第9期40卷 13086-13098页

作者： Wang, Meiyu Zhang, Haobo Hu, Weibo Mei, Yunhui Lu, Guo-Quan Nankai University College of Electronic Information and Optical Engineering Tianjin300350 China Shenzhen research institute of Nankai University Shenzhen518083 China Nankai University Tianjin Key Laboratory of Optoelectronic Sensor and Sensing Network Technology Tianjin300350 China Tiangong University School of Electrical Engineering Tianjin300387 China The Bradley Department of Electrical and Computer Engineering The Department of Materials Science and Engineering BlacksburgVA24061 United States

The low-temperature silver sintering technology has been increasingly applied for die-attach in power electronics modules. Most reported studies of the technology involved bonding on silver (Ag) or gold (Au) surface finish. Compared to Ag or Au, electroless nickel (phosphorus) or Ni(P) is a cost-effective and widely-used process for surface-finishing module substrates. However, there is a lack of studies on sintered-Ag bonding on the Ni(P) surface. In this study, IGBT devices were bonded using pressureless Ag-sintering on electroless Ni(P)-plated substrates. Micro-morphologies and macro-properties were then characterized before and after three reliability tests including high temperature storage life (HTSL), steady-state temperature-humidity bias life (STHBL), and temperature cycling (TC) tests. The as-sintered joints measured strong adhesion of 39 MPa, resulting from Ag-NiO-Ni chemical bonds across the (111) crystal planes with reduced lattice misfit. The Ag-Ni(P) joints exhibited high reliability under high-temperature and thermo-mechanical stress in the HTSL and TC tests, but performed poorly when exposed to high humidity in the STHBL test. The founding in this study helps understand the physical nature of bonding between sintered-Ag and Ni(P), and its failure mechanisms under the influence of different physical stresses. © 1986-2012 IEEE.

关键词： Bond strength (chemical)

来源：评论

学校读者我要写书评

暂无评论

On the Effectiveness of Function-Level Vulnerability Detectors for Inter-Procedural Vulnerabilities 24

On the Effectiveness of Function-Level Vulnerability Detecto...

引用

44th ACM/IEEE International Conference on Software engineering, ICSE 2024

作者： Li, Zhen Wang, Ning Zou, Deqing Li, Yating Zhang, Ruqian Xu, Shouhuai Zhang, Chao Jin, Hai Hubei Key Laboratory of Distributed System Security Hubei Engineering Research Center on Big Data Security Cluster and Grid Computing Lab National Engineering Research Center for Big Data Technology and System Services Computing Technology and System Lab Hong Kong Jin YinHu Laboratory Wuhan China School of Cyber Science and Engineering Huazhong University of Science and Technology Wuhan China University of Colorado Colorado Springs Department of Computer Science Colorado Springs Colorado United States Institute for Network Sciences and Cyberspace Tsinghua University Beijing China School of Computer Science and Technology Huazhong University of Science and Technology Wuhan China

ISBN: (纸本)9798400702174

Software vulnerabilities are a major cyber threat and it is important to detect them. One important approach to detecting vulnerabilities is to use deep learning while treating a program function as a whole, known as function-level vulnerability detectors. However, the limitation of this approach is not understood. In this paper, we investigate its limitation in detecting one class of vulnerabilities known as inter-procedural vulnerabilities, where the to-be-patched statements and the vulnerability-triggering statements belong to different functions. For this purpose, we create the first Inter -Procedural Vulnerability Dataset (InterPVD) based on C/C++ open-source software, and we propose a tool dubbed VulTrigger for identifying vulnerability-triggering statements across functions. Experimental results show that VulTrigger can effectively identify vulnerability-triggering statements and inter-procedural vulnerabilities. Our findings include: (i) inter-procedural vulnerabilities are prevalent with an average of 2.8 inter-procedural layers;and (ii) function-level vulner-ability detectors are much less effective in detecting to-be-patched functions of inter-procedural vulnerabilities than detecting their counterparts of intra-procedural vulnerabilities. © 2024 ACM.

关键词： Open source software

来源：评论

学校读者我要写书评

暂无评论

Pushing AI to wireless network edge: an overview on integrated sensing, communication, and computation towards 6G

引用

Science China(Information Sciences) 2023年第3期66卷 7-25页

作者： Guangxu ZHU Zhonghao LYU Xiang JIAO Peixi LIU Mingzhe CHEN Jie XU Shuguang CUI Ping ZHANG Shenzhen Research Institute of Big Data Future Network of Intelligence Institute (FNii) The Chinese University of Hong Kong (Shenzhen) School of Science and Engineering (SSE) The Chinese University of Hong Kong (Shenzhen) State Key Laboratory of Advanced Optical Communication Systems and Networks School of Electronics Peking University Department of Electrical and Computer Engineering and Institute for Data Science and Computing University of Miami Peng Cheng Laboratory State Key Laboratory of Networking and Switching Technology Beijing University of Posts and Telecommunications

Pushing artificial intelligence(AI) from central cloud to network edge has reached board consensus in both industry and academia for materializing the vision of artificial intelligence of things(AIoT) in the sixth-generation(6G) era. This gives rise to an emerging research area known as edge intelligence, which concerns the distillation of human-like intelligence from the vast amount of data scattered at the wireless network edge. Typically, realizing edge intelligence corresponds to the processes of sensing, communication,and computation, which are coupled ingredients for data generation, exchanging, and processing, ***, conventional wireless networks design the three mentioned ingredients separately in a task-agnostic manner, which leads to difficulties in accommodating the stringent demands of ultra-low latency, ultra-high reliability, and high capacity in emerging AI applications like auto-driving and metaverse. This thus prompts a new design paradigm of seamlessly integrated sensing, communication, and computation(ISCC) in a taskoriented manner, which comprehensively accounts for the use of the data in downstream AI tasks. In view of its growing interest, this study provides a timely overview of ISCC for edge intelligence by introducing its basic concept, design challenges, and enabling techniques, surveying the state-of-the-art advancements, and shedding light on the road ahead.

关键词： sixth-generation (6G) edge intelligence artificial intelligence of things (AIoT) integrated sensing communication and computation (ISCC)

来源：评论

学校读者我要写书评

暂无评论

Video2Reward: Generating Reward Function from Videos for Legged Robot Behavior Learning 27

Video2Reward: Generating Reward Function from Videos for Leg...

引用

27th European Conference on Artificial Intelligence, ECAI 2024

作者： Zeng, Runhao Zhou, Dingjie Liang, Qiwei Liu, Junlin Li, Hui Huang, Changxin Li, Jianqiang Hu, Xiping Sun, Fuchun Artificial Intelligence Research Institute Shenzhen MSU-BIT University China College of Mechatronics and Control Engineering Shenzhen University China College of Computer Science and Software Engineering Shenzhen University China National Engineering Laboratory for Big Data System Computing Technology Shenzhen University China Department of Computer Science and Technology Tsinghua University China

ISBN: (纸本)9781643685489

Learning behavior in legged robots presents a significant challenge due to its inherent instability and complex constraints. Recent research has proposed the use of a large language model (LLM) to generate reward functions in reinforcement learning, thereby replacing the need for manually designed rewards by experts. However, this approach, which relies on textual descriptions to define learning objectives, fails to achieve controllable and precise behavior learning with clear directionality. In this paper, we introduce a new video2reward method, which directly generates reward functions from videos depicting the behaviors to be mimicked and learned. Specifically, we first process videos containing the target behaviors, converting the motion information of individuals in the videos into keypoint trajectories represented as coordinates through a video2text transforming module. These trajectories are then fed into an LLM to generate the reward function, which in turn is used to train the policy. To enhance the quality of the reward function, we develop a video-assisted iterative reward refinement scheme that visually assesses the learned behaviors and provides textual feedback to the LLM. This feedback guides the LLM to continually refine the reward function, ultimately facilitating more efficient behavior learning. Experimental results on tasks involving bipedal and quadrupedal robot motion control demonstrate that our method surpasses the performance of state-of-the-art LLM-based reward generation methods by over 37.6% in terms of human normalized score. More importantly, by switching video inputs, we find our method can rapidly learn diverse motion behaviors such as walking and running. © 2024 The Authors.

关键词： Robots

来源：评论

学校读者我要写书评

暂无评论

Deep simulated annealing for the discovery of novel dental anesthetics with local anesthesia and anti-inflammatory properties

引用

Acta Pharmaceutica Sinica B 2024年第7期14卷 3086-3109页

作者： Yihang Hao Haofan Wang Xianggen Liu Wenrui Gai Shilong Hu Wencheng Liu Zhuang Miao Yu Gan Xianghua Yu Rongjia Shi Yongzhen Tan Ting Kang Ao Hai Yi Zhao Yihang Fu Yaling Tang Ling Ye Jin Liu Xinhua Liang Bowen Ke State Key Laboratory of Oral Diseases and National Clinical Research Center for Oral Diseases West China Hospital of StomatologySichuan UniversityChengdu 610041China College of Computer Science Sichuan UniversityChengdu 610065China Department of Anesthesiology Laboratory of Anesthesia and Critical Care MedicineNational-Local Joint Engineering Research Centre of Translational Medicine of AnesthesiologyFrontiers Science Center for Disease-Related Molecular NetworkWest China HospitalSichuan UniversityChengdu 610041China

Multifunctional therapeutics have emerged as a solution to the constraints imposed by drugs with singular or insufficient therapeutic *** primary challenge is to integrate diverse pharmacophores within a single-molecule *** address this,we introduced DeepSA,a novel edit-based generative framework that utilizes deep simulated annealing for the modification of articaine,a wellknown local *** integrates deep neural networks into metaheuristics,effectively constraining molecular space during compound *** framework employs a sophisticated objective function that accounts for scaffold preservation,anti-inflammatory properties,and covalent *** a sequence of local editing to navigate the molecular space,DeepSA successfully identified AT-17,a derivative exhibiting potent analgesic properties and significant anti-inflammatory activity in various animal *** insights into AT-17 revealed its dual mode of action:selective inhibition of NaV1.7 and 1.8 channels,contributing to its prolonged local anesthetic effects,and suppression of inflammatory mediators via modulation of the NLRP3 inflammasome *** findings not only highlight the efficacy of AT-17 as a multifunctional drug candidate but also highlight the potential of DeepSA in facilitating AI-enhanced drug discovery,particularly within stringent chemical constraints.

关键词： Multifunctional drugs Deep simulated annealing Molecule generation Articaine derivatives AI-enhanced drug discovery

来源：评论

学校读者我要写书评

暂无评论

AQROM:A quality of service aware routing optimization mechanism based on asynchronous advantage actor-critic in software-defined networks

引用

Digital Communications and networks 2024年第5期10卷 1405-1414页

作者： Wei Zhou Xing Jiang Qingsong Luo Bingli Guo Xiang Sun Fengyuan Sun Lingyu Meng School of Information and Communication Guilin University of Electronic TechnologyGuilin541004China The 34th Research Institute of China Electronics Technology Group Corporation Guilin541004China Guangxi Key Laboratory of Optical Network and Optical Information Security Guilin541004China State Key Laboratory of Information Photonics and Optical Communications BeijingUniversity of Posts and TelecommunicationsBeijing100876China Department of Electrical and Computer Engineering University of New MexicoAlbuquerqueNM87131USA

In Software-Defined networks(SDNs),determining how to efficiently achieve Quality of Service(QoS)-aware routing is challenging but critical for significantly improving the performance of a network,where the metrics of QoS can be defined as,for example,average latency,packet loss ratio,and *** SDN controller can use network statistics and a Deep Reinforcement Learning(DRL)method to resolve this *** this paper,we formulate dynamic routing in an SDN as a Markov decision process and propose a DRL algorithm called the Asynchronous Advantage Actor-Critic QoS-aware Routing Optimization Mechanism(AQROM)to determine routing strategies that balance the traffic loads in the *** can improve the QoS of the network and reduce the training time via dynamic routing strategy updates;that is,the reward function can be dynamically and promptly altered based on the optimization objective regardless of the network topology and traffic *** can be considered as one-step optimization and a black-box routing mechanism in high-dimensional input and output sets for both discrete and continuous states,and actions with respect to the operations in the *** simulations were conducted using OMNeT++and the results demonstrated that AQROM 1)achieved much faster and stable convergence than the Deep Deterministic Policy Gradient(DDPG)and Advantage Actor-Critic(A2C),2)incurred a lower packet loss ratio and latency than Open Shortest Path First(OSPF),DDPG,and A2C,and 3)resulted in higher and more stable throughput than OSPF,DDPG,and A2C.

关键词： Software-defined networks Asynchronous advantage actor-critic QoS-aware routing optimization mechanism

来源：评论

学校读者我要写书评

暂无评论

Understanding the Impact of Unobservable Variables on the Performance of Predictive Models: The Need for Feature Space Partitioning and Fusion

Understanding the Impact of Unobservable Variables on the Pe...

引用

AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2025

作者： Juarez Garcia, Ezequiel Stephens, Chad L. Napoli, Nicholas J. Laboratory Department of Electrical and Computer Engineering University of Florida GainesvilleFL32611 United States Crew Systems & Aviation Operations Branch and System-Wide Safety Project NASA Langley Research Center HamptonVA23681 United States

ISBN: (数字)9781624107238

ISBN: (纸本)9781624107238

When developing predictive models over a dataset, the model is globally optimized across the entire feature space to learn a decision boundary. However, when unobservable variables—which cannot be measured or estimated—interact with the observable variables, this can negatively impact the optimization applied to the decision boundary since the data samples introduced by unobservable variables may have little to no association with the applied global optimization. This, consequently, penalizes the entire decision boundary and model performance. This paper examines some of the detrimental effects of unobservable variables, particularly their role in creating new modes in the distribution of observable variables and reducing the separ ability of class distributions. Such challenges result in unrepresentative training data, complex skewedor warped decision boundaries, and decreased accuracy of model predictions, particularly for interpretable models like logistic regression and decision trees. Through two illustrative case examples, we highlight the need to address the challenges imposed by unobservable variables. We propose a strategy to mitigate these challenges by creating local regions with in the feature space through partitioning. This enables the optimization of local models with inthe regions to overcome the impact of un observ ability and complex decision boundaries indifferent feature space localities. research into a more sophisticated partitioning strategy and where the partition should be relative to the sample of interest is left as future work. Through the analysis of the impact of un observability and the development of a partitioning method, we demonstrate the clear need for a partitioning strategy that integrates knowledge from multiple local models to estimate risk factors using information fusion. Thus, we establish the foundation and motivation for using partitioning and information fusion to overcome the effects of unobserv ability in predictive mo

关键词： Information fusion

来源：评论

学校读者我要写书评

暂无评论

Generative AI-Enhanced Cooperative MEC of UAVs and Ground Stations for Unmanned Surface Vehicles 59

Generative AI-Enhanced Cooperative MEC of UAVs and Ground St...

引用

59th Annual Conference on Information Sciences and systems, CISS 2025

作者： You, Jiahao Jia, Ziye Dong, Chao Wu, Qihui Han, Zhu Nanjing University of Aeronautics and Astronautics The Key Laboratory of Dynamic Cognitive System of Electromagnetic Spectrum Space Ministry of Industry and Information Technology Nanjing211106 China Southeast University National Mobile Communications Research Laboratory Nanjing211111 China University of Houston Department of Electrical and Computer Engineering HoustonTX77004 United States Kyung Hee University Department of Computer Science and Engineering Seoul446-701 Korea Republic of

ISBN: (纸本)9798331513269

The increasing deployment of unmanned surface vehicles (USVs) require computational support and coverage in applications such as maritime search and rescue. Unmanned aerial vehicles (UAVs) can offer low-cost, flexible aerial services, and ground stations (GSs) can provide powerful supports, which can cooperate to help the USVs in complex scenarios. However, the collaboration between UAVs and GSs for USVs faces challenges of task uncertainties, USVs trajectory uncertainties, heterogeneities, and limited computational resources. To address these issues, we propose a cooperative UAV and GS based robust multi-access edge computing framework to assist USVs in completing computational tasks. Specifically, we formulate the optimization problem of joint task offloading and UAV trajectory to minimize the total execution time, which is in the form of mixed integer nonlinear programming and NP-hard to tackle. Therefore, we propose the algorithm of generative artificial intelligence-enhanced heterogeneous agent proximal policy optimization (GAI-HAPPO). The proposed algorithm integrates GAI models to enhance the actor network ability to model complex environments and extract high-level features, thereby allowing the algorithm to predict uncertainties and adapt to dynamic conditions. Additionally, GAI stabilizes the critic network, addressing the instability of multi-agent reinforcement learning approaches. Finally, extensive simulations demonstrate that the proposed algorithm outperforms the existing benchmark methods, thus highlighting the potentials in tackling intricate, cross-domain issues in the considered scenarios. © 2025 IEEE.

关键词： Unmanned surface vehicles

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：