检索结果-内蒙古大学图书馆

One model, two skills: active vision and action learning model for robotic manipulation

Science China(Information Sciences) 2025年第6期68卷 331-349页

作者： Guokang WANG Yanhong LIU Huaping LIU School of Electrical and Information Engineering Zhengzhou University Department of Computer Science and Technology Tsinghua University

The perception in most existing vision-based reinforcement learning(RL) models for robotic manipulation relies heavily on static third-person or hand-mounted first-person cameras. In scenarios with occlusions and limited maneuvering space, these carefully positioned cameras often struggle to provide effective visual observations during manipulation. Taking inspiration from human capabilities, we introduce a novel RL-based dual-arm active visual-guided manipulation model(DAVMM), which simultaneously infers “eye” actions and “hand” actions for two separate robotic arms(referred to as the vision-arm and the worker-arm) based on current observations, empowering the robot with the ability to actively perceive and interact with its environment. To handle the extensive redundant observation-action space, we propose a decouplable target-centric reward paradigm to offer stable guidance for the training process. For making fine-grained manipulation action decisions, alongside a global scene image encoder, we utilize an independent encoder to extract local target texture features,enabling the simultaneous acquisition of both global and detailed local information. Additionally, we employ residual-RL and curriculum learning techniques to further enhance our model's sample efficiency and training stability. We conducted comparative experiments and analyses of DAVMM against a set of strong baselines on three occluded and narrow-space manipulation tasks. DAVMM notably improves the success rates across all manipulation tasks and showcases rapid learning capabilities.

关键词： robotic manipulation visual learning reinforcement learning active sensing machine vision

来源：评论

学校读者我要写书评

暂无评论

Privacy-preserving filtering, control and optimization for industrial cyber-physical systems

引用

Science China(Information Sciences) 2025年第4期68卷 267-283页

作者： Derui DING Qing-Long HAN Xiaohua GE Xian-Ming ZHANG Jun WANG Department of Control Science and Engineering University of Shanghai for Science and Technology School of Engineering Swinburne University of Technology Department of Computer Science City University of Hong Kong

Industrial cyber-physical systems closely integrate physical processes with cyberspace, enabling real-time exchange of various information about system dynamics, sensor outputs, and control decisions. The connection between cyberspace and physical processes results in the exposure of industrial production information to unprecedented security risks. It is imperative to develop suitable strategies to ensure cyber security while meeting basic performance *** the perspective of control engineering, this review presents the most up-to-date results for privacy-preserving filtering,control, and optimization in industrial cyber-physical systems. Fashionable privacy-preserving strategies and mainstream evaluation metrics are first presented in a systematic manner for performance evaluation and engineering *** discussion discloses the impact of typical filtering algorithms on filtering performance, specifically for privacy-preserving Kalman filtering. Then, the latest development of industrial control is systematically investigated from consensus control of multi-agent systems, platoon control of autonomous vehicles as well as hierarchical control of power systems. The focus thereafter is on the latest privacy-preserving optimization algorithms in the framework of consensus and their applications in distributed economic dispatch issues and energy management of networked power systems. In the end, several topics for potential future research are highlighted.

关键词： industrial cyber-physical systems privacy preservation distributed control distributed optimization power systems

来源：评论

学校读者我要写书评

暂无评论

DNACDS:Cloud IoE big data security and accessing scheme based on DNA cryptography

引用

Frontiers of computer Science 2024年第1期18卷 157-170页

作者： Ashish SINGH Abhinav KUMAR Suyel NAMASUDRA School of Computer Engineering KIIT Deemed to be UniversityBhubaneshwar 751024India Department of Computer Science and Engineering Indian Institute of Information Technology SuratSurat 394190India Department of Computer Science and Engineering National Institute of Technology AgartalaAgartala 799046India

The Internet of Everything(IoE)based cloud computing is one of the most prominent areas in the digital big data *** approach allows efficient infrastructure to store and access big real-time data and smart IoE services from the *** IoE-based cloud computing services are located at remote locations without the control of the data *** data owners mostly depend on the untrusted Cloud Service Provider(CSP)and do not know the implemented security *** lack of knowledge about security capabilities and control over data raises several security *** Acid(DNA)computing is a biological concept that can improve the security of IoE big *** IoE big data security scheme consists of the Station-to-Station Key Agreement Protocol(StS KAP)and Feistel cipher *** paper proposed a DNA-based cryptographic scheme and access control model(DNACDS)to solve IoE big data security and access *** experimental results illustrated that DNACDS performs better than other DNA-based security *** theoretical security analysis of the DNACDS shows better resistance capabilities.

关键词： IoE based cloud computing DNA cryptography IoE big data security StS KAP feistel cipher IoE big data access

来源：评论

学校读者我要写书评

暂无评论

Road Surface Analysis through Machine Learning Techniques

引用

IEIE Transactions on Smart Processing and Computing 2024年第4期13卷 344-353页

作者： Singh, Prabhat Sharma, Shilpi Kamal, Ahmed E. Kumar, Sunil Department of Computer Science and Engineering Amity School of Engineering & Technology Uttar Pradesh Noida India Department of Electrical and Computer Engineering Iowa State University Ames United States Computer Science and Engineering Amity School of Engineering & Technology Uttar Pradesh Noida India

Roads are an important part of transporting goods and products from one place to another. In developing countries, the main challenge is to maintain road conditions regularly. Roads can deteriorate from time to time. Monitoring the conditions of the roads, which may degrade with time, is very difficult, resulting in a delay in transportation and damage to the vehicles moving on the roads. Poor road conditions cause road accidents. A model is being proposed to monitor the conditions of the road surface by smartphone sensors. Accelerometer, gyroscope, and GPS sensors are deployed in the mobile phones, which will help to collect data on the road conditions. After collecting the data about the road conditions, various machine learning approaches, such as supervised, multi-layered, and multiclass, are applied to data filtration. Road conditions are divided into three categories to achieve this methodology: potholes, deep traverse cracks, and smooth roads. This categorization helped in analyzing the road surface condition through smartphone sensors over all three axes instead of taking it over a single axis. Neural networks helped analyze data or road conditions more accurately than Decision Tree and SVM. Copyrights © 2024 The Institute of Electronics and Information Engineers.

关键词： Decision trees

来源：评论

学校读者我要写书评

暂无评论

MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity

引用

Science China(Information Sciences) 2024年第12期67卷 36-51页

作者： Yangzhou LIU Yue CAO Zhangwei GAO Weiyun WANG Zhe CHEN Wenhai WANG Hao TIAN Lewei LU Xizhou ZHU Tong LU Yu QIAO Jifeng DAI School of Computer Science Nanjing University School of Electronic Information and Electrical Engineering Shanghai Jiao Tong University Shanghai AI Laboratory School of Computer Science Fudan University Department of Information Engineering The Chinese University of Hong Kong SenseTime Research Department of Electronic Engineering Tsinghua University

Despite the effectiveness of vision-language supervised fine-tuning in enhancing the performance of vision large language models(VLLMs), existing visual instruction tuning datasets include the following limitations.(1) Instruction annotation quality: despite existing VLLMs exhibiting strong performance,instructions generated by those advanced VLLMs may still suffer from inaccuracies, such as hallucinations.(2) Instructions and image diversity: the limited range of instruction types and the lack of diversity in image data may impact the model's ability to generate diversified and closer to real-world scenarios outputs. To address these challenges, we construct a high-quality, diverse visual instruction tuning dataset MMInstruct,which consists of 973k instructions from 24 domains. There are four instruction types: judgment, multiplechoice, long visual question answering, and short visual question answering. To construct MMInstruct, we propose an instruction generation data engine that leverages GPT-4V, GPT-3.5, and manual correction. Our instruction generation engine enables semi-automatic, low-cost, and multi-domain instruction generation at 1/6 the cost of manual construction. Through extensive experiment validation and ablation experiments,we demonstrate that MMInstruct could significantly improve the performance of VLLMs, e.g., the model fine-tuning on MMInstruct achieves new state-of-the-art performance on 10 out of 12 benchmarks. The code and data shall be available at https://***/yuecao0119/MMInstruct.

关键词： instruction tuning multi-modal multi-domain dataset vision large language model

来源：评论

学校读者我要写书评

暂无评论

UAV-Assisted Internet of Vehicles over Licensed and Unlicensed Spectrum: Architecture, Intelligent Resource Management, and Challenges

引用

IEEE Internet of Things Magazine 2023年第3期6卷 78-84页

作者： Su, Yuhan Liwang, Minghui Chen, Zhong Wang, Xianbin School of Electronic Science and Engineering Xiamen University China School of Informatics Xiamen University China Western University Department of Electrical and Computer Engineering London Canada

Benefited from their flexibility and on-demand deployment capability, unmanned aerial vehicles (UAVs) have emerged as critical aerial communication platforms in future Internet of Vehicles (IoV). However, limited spectrum resources can lead to unsatisfying data rate of IoV, which thus incur large latency, especially under congested IoV network conditions. Although UAVs and road side units (RSUs) can work within the same spectrum and increase spectral efficiency, mutual interference becomes unavoidable. To this end, this article develops a heterogeneous network architecture, in which a UAV-assisted IoV system coexists with a Wi-Fi system: the RSUs can properly occupy unlicensed spectrum to increase the capacity of the UAV-assisted IoV system while mitigating interference, without affecting the performance of the Wi-Fi system. A case study of resource management over licensed and unlicensed spectrum is investigated under the proposed architecture, where time and power are jointly optimized to maximize the sum user satisfaction of the system. We further provide an intelligent solution to tackle the problem in the considered case study. Simulations demonstrate that our proposed case can efficiently improve the sum user satisfaction of the system. Key challenges and opportunities for UAV-assisted IoV over licensed and unlicensed spectrum are discussed, while recommendable future research directions are investigated. © 2018 IEEE.

关键词： Wi-Fi

来源：评论

学校读者我要写书评

暂无评论

Development of a machine-learning-based method for early fault detection in photovoltaic systems

引用

Journal of engineering and Applied Science 2023年第1期70卷 27页

作者： Voutsinas, Stylianos Karolidis, Dimitrios Voyiatzis, Ioannis Samarakou, Maria Department of Informatics and Computer Engineering University of West Attica Athens Greece

In the process of the decarbonization of energy production, the use of photovoltaic systems (PVS) is an increasing trend. In order to optimize the power generation, the fault detection and identification in PVS is significant. The purpose of this work is the study and implementation of such an algorithm, for the detection as many as faults arising on the DC side of a photovoltaic system. A machine learning technique was chosen. The dataset used to train the algorithm was based on a year’s worth of irradiance and temperature data, as well as data from the PV cell used. The method uses logistic regression with cross validation as a new approach to detect and identify faults in PVS. It is applied to smart PV arrays, that can transmit voltage and current measurements from each PV cell of the array individually. The results are satisfactory since the algorithm can detect the majority of faults that occur on the DC side of a photovoltaic (open-circuit fault, short-circuit fault, mismatch faults). The accuracy of the algorithm (97.11%) is comparable to other methods presented by the literature. Moreover, the computational cost of the proposed method is significantly lower than the methods presented in the literature. In summary, the performance of the implemented algorithm is considered particularly satisfactory and can be easily applied to PVS. © 2023, The Author(s).

关键词： Machine learning

来源：评论

学校读者我要写书评

暂无评论

Improved Subsynchronous Oscillation Parameter Identification with Synchrophasor Based on Matrix Pencil Method in Power Systems

引用

Journal of Modern Power Systems and Clean Energy 2024年第1期12卷 22-33页

作者： Xiaoxue Zhang Fang Zhang Wenzhong Gao Jinghan He the School of Electrical Engineering Beijing Jiaotong UniversityBeijing 100044China the Department of Electrical and Computer Engineering University of DenverDenverUSA

The subsynchronous oscillations(SSOs)related to renewable generation seriously affect the stability and safety of the power *** realize the dynamic monitoring of SSOs by utilizing the high computational efficiency and noise-resilient features of the matrix pencil method(MPM),this paper propos es an improved MPM-based parameter identification with syn *** MPM is enhanced by the angular frequency fitting equations based on the characteristic polynomial coeffi cients of the matrix pencil to ensure the accuracy of the identi fied parameters,since the existing eigenvalue solution of the MPM ignores the angular frequency conjugation constraints of the two fundamental modes and two oscillation ***,the identification and recovery of bad data are proposed by uti lizing the difference in temporal continuity of the synchropha sors before and after noise *** proposed parameter identification is verified with synthetic,simulated,and actual measured phase measurement unit(PMU)*** with the existing MPM,the improved MPM achieves better accuracy for parameter identification of each component in SSOs,better real-time performance,and significantly reduces the effect of bad data.

关键词： Subsynchronous oscillations(SSOs) synchrophasor parameter identification matrix pencil method bad data

来源：评论

学校读者我要写书评

暂无评论

Attention-optimized vision-enhanced prompt learning for few-shot multi-modal sentiment analysis

引用

Neural Computing and Applications 2024年第33期36卷 21091-21105页

作者： Zhou, Zikai Qiao, Baiyou Feng, Haisong Han, Donghong Wu, Gang School of Computer Science and Engineering Northeastern University Shenyang110819 China School of Informatics Xiamen University Xiamen361105 China

To fulfill the explosion of multi-modal data, multi-modal sentiment analysis (MSA) emerged and attracted widespread attention. Unfortunately, conventional multi-modal research relies on large-scale datasets. On the one hand, collecting and annotating large-scale datasets is challenging and resource-intensive. On the other hand, the training on large-scale datasets also increases the research cost. However, the few-shot MSA (FMSA), which is proposed recently, requires only few samples for training. Therefore, in comparison, it is more practical and realistic. There have been approaches to investigating the prompt-based method in the field of FMSA, but they have not sufficiently considered or leveraged the information specificity of visual modality. Thus, we propose a vision-enhanced prompt-based model based on graph structure to better utilize vision information for fusion and collaboration in encoding and optimizing prompt representations. Specifically, we first design an aggregation-based multi-modal attention module. Then, based on this module and the biaffine attention, we construct a syntax–semantic dual-channel graph convolutional network to optimize the encoding of learnable prompts by understanding the vision-enhanced information in semantic and syntactic knowledge. Finally, we propose a collaboration-based optimization module based on the collaborative attention mechanism, which employs visual information to collaboratively optimize prompt representations. Extensive experiments conducted on both coarse-grained and fine-grained MSA datasets have demonstrated that our model significantly outperforms the baseline models. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Efficient Heuristic Replication Techniques for High Data Availability in Cloud

引用

computer Systems Science & engineering 2023年第6期45卷 3151-3164页

作者： H.L.Chandrakala R.Loganathan Department of Computer Science and Engineering School of EngineeringPresidency UniversityIndia Department of Computer Science and Engineering HKBK College of EngineeringIndia

Most social networks allow connections amongst many people based on shared *** networks have to offer shared data like videos,photos with minimum latency to the group,which could be challenging as the storage cost has to be minimized and hence entire data replication is not a *** replication of data across a network of read-intensive can potentially lead to increased savings in cost and energy and reduce the end-user’s response *** simple and adaptive replication strategies exist,the solution is non-deter-ministic;the replicas of the data need to be optimized to the data usability,perfor-mance,and stability of the application *** resolve the non-deterministic issue of replication,metaheuristics are *** this work,Harmony Search and Tabu Search algorithms are used optimizing the replication process.A novel Har-mony-Tabu search is proposed for effective placement and replication of *** on large datasets show the effectiveness of the proposed *** is seen that the bandwidth saving for proposed harmony-Tabu replication per-forms better in the range of 3.57%to 18.18%for varying number of cloud data-centers when compared to simple replication,Tabu replication and Harmony replication algorithm.

关键词： Cloud computing data replication bandwidth saving Tabu search Harmony search hybrid Harmony-Tabu search

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：