检索结果-内蒙古大学图书馆

2024 IEEE International Conference on robotics and Biomimetics, ROBIO 2024

作者： Slim, Malak Daher, Naseem Elhajj, Imad H. American University of Beirut Vision and Robotics Lab Electrical and Computer Engineering Department Beirut Lebanon

ISBN: (纸本)9781665481090

In this work, we present a novel solution aimed at improving robotic manipulators' performance in contact tasks. Inspired by the human motor control system, which relies on a feedforward mechanism to anticipate and plan movements based on the physical properties of the target environment, our approach plans the robot's motion during the reaching phase, prior to contact. To validate our approach, we conducted experiments using the KUKA youBot arm in two distinct environments, represented by soft and hard materials. Results showed that the robot exhibited compliant behavior, with an average reduction of 71% in overshoot, 60% in rise-time, and 68% steady-state error of the force control response during contact. © 2024 IEEE.

关键词： Motion planning

来源：评论

学校读者我要写书评

暂无评论

On Learning Deep O(n)-Equivariant Hyperspheres 41

On Learning Deep O(n)-Equivariant Hyperspheres

引用

41st International Conference on Machine Learning, ICML 2024

作者： Melnyk, Pavlo Felsberg, Michael Wadenbäck, Mårten Robinson, Andreas Le, Cuong Computer Vision Laboratory Department of Electrical Engineering Linköping University Sweden

In this paper, we utilize hyperspheres and regular n-simplexes and propose an approach to learning deep features equivariant under the transformations of nD reflections and rotations, encompassed by the powerful group of O(n). Namely, we propose O(n)-equivariant neurons with spherical decision surfaces that generalize to any dimension n, which we call Deep Equivariant Hyperspheres. We demonstrate how to combine them in a network that directly operates on the basis of the input points and propose an invariant operator based on the relation between two points and a sphere, which as we show, turns out to be a Gram matrix. Using synthetic and real-world data in nD, we experimentally verify our theoretical contributions and find that our approach is superior to the competing methods for O(n)-equivariant benchmark datasets (classification and regression), demonstrating a favorable speed/performance trade-off. The code is available on GitHub. Copyright 2024 by the author(s)

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

Adaptive Feature-Based Plant Recognition

IEEE Transactions on AgriFood Electronics

引用

IEEE Transactions on AgriFood Electronics 2024年第2期2卷 335-346页

作者： Shirzi, Moteaal Asadi Kermani, Mehrdad R. Western University Advanced Robotics and Mechatronic Systems Laboratory Electrical and Computer Engineering Department LondonONN6A 5B9 Canada Western University Advanced Robotics and Mechatronic Systems Laboratory The Department of Electrical and Computer Engineering LondonONN6A 5B9 Canada

In this article, we propose a new algorithm to improve plant recognition through the use of feature descriptors. The accurate results from this identification method are essential for enabling autonomous tasks, such as stem-stake coupling, in precision agriculture. The proposed method divides the input seedling color image into subimages within the International Commission on Illumination, for three color axes, L for lightness, A for the green-red component, and B for the blue-yellow component, color space and extracts seven key feature descriptors for each subimage. It then uses feature descriptors to create a matrix, which is employed to train an artificial neural network to determine optimized cutoff values. This network suggests cutoff values for a multilevel threshold segmentation for plant recognition. The method provides robust and real-time adaptive segmentation adaptable to various seedlings, backgrounds, and lighting conditions. By enabling accurate segmentation of the plant, morphological image processing can more effectively eliminate leaves to locate the seedling stem. This methodology automates image analysis in seedling propagation facilities and greenhouses and enables a wide range of precision agricultural tasks. © 2023 IEEE.

关键词： Machine vision

来源：评论

学校读者我要写书评

暂无评论

MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity

引用

Science China(Information Sciences) 2024年第12期67卷 36-51页

作者： Yangzhou LIU Yue CAO Zhangwei GAO Weiyun WANG Zhe CHEN Wenhai WANG Hao TIAN Lewei LU Xizhou ZHU Tong LU Yu QIAO Jifeng DAI School of Computer Science Nanjing University School of Electronic Information and Electrical Engineering Shanghai Jiao Tong University Shanghai AI Laboratory School of Computer Science Fudan University Department of Information Engineering The Chinese University of Hong Kong SenseTime Research Department of Electronic Engineering Tsinghua University

Despite the effectiveness of vision-language supervised fine-tuning in enhancing the performance of vision large language models(VLLMs), existing visual instruction tuning datasets include the following limitations.(1) Instruction annotation quality: despite existing VLLMs exhibiting strong performance,instructions generated by those advanced VLLMs may still suffer from inaccuracies, such as hallucinations.(2) Instructions and image diversity: the limited range of instruction types and the lack of diversity in image data may impact the model's ability to generate diversified and closer to real-world scenarios outputs. To address these challenges, we construct a high-quality, diverse visual instruction tuning dataset MMInstruct,which consists of 973k instructions from 24 domains. There are four instruction types: judgment, multiplechoice, long visual question answering, and short visual question answering. To construct MMInstruct, we propose an instruction generation data engine that leverages GPT-4V, GPT-3.5, and manual correction. Our instruction generation engine enables semi-automatic, low-cost, and multi-domain instruction generation at 1/6 the cost of manual construction. Through extensive experiment validation and ablation experiments,we demonstrate that MMInstruct could significantly improve the performance of VLLMs, e.g., the model fine-tuning on MMInstruct achieves new state-of-the-art performance on 10 out of 12 benchmarks. The code and data shall be available at https://***/yuecao0119/MMInstruct.

关键词： instruction tuning multi-modal multi-domain dataset vision large language model

来源：评论

学校读者我要写书评

暂无评论

A Novel Technology for Semiautomatic and Automatic Stem–Stake Coupling of Seedlings and Plants

IEEE Transactions on AgriFood Electronics

引用

IEEE Transactions on AgriFood Electronics 2025年第1期3卷 254-262页

作者： Shirzi, Moteaal Asadi Kermani, Mehrdad R. Western University Advanced Robotics and Mechatronic Systems Laboratory Electrical and Computer Engineering Department LondonN6A 5B9 Canada Western University Department of Electrical and Computer Engineering LondonN6A 3K7 Canada Advanced Robotics and Mechatronic Systems Laboratory LondonN6A 5B9 Canada

This article introduces a novel mechatronic system for coupling the stems of seedlings and plants to wooden stakes or ropes, a crucial process for supporting them during growth, transportation, and fruiting in plant propagation facilities and greenhouses. The stem–stake coupling device utilizes interconnected mechanisms and an impedance control method to adjust motor torque and speed, shaping metallic wire into clips of various shapes and dimensions, effectively securing plant stems to stakes or ropes. In a robotic system, a claw-shaped arm mechanism, a stereo camera, and real-time vision techniques are integrated into the stem–stake coupling device to identify the optimal coupling point and automate the coupling task. This innovation addresses the labor-intensive task of manual coupling, offering a scalable solution for growers through handheld devices or fully automated robotic systems. In the context of increasing labor shortages and rising costs, the technology offers a sustainable and efficient alternative with significant potential to enhance operational efficiency in greenhouses and propagation facilities. © 2023 IEEE.

关键词： Seed

来源：评论

学校读者我要写书评

暂无评论

Adaptive Robust Control Integrated With Gaussian Processes for Quadrotors: Enhanced Accuracy, Fault Tolerance and Anti-Disturbance

引用

IEEE Transactions on Systems, Man, and Cybernetics: Systems 2025年第5期55卷 3235-3248页

作者： Liang, Weisheng Amer, Abdelhakim Mehndiratta, Mohit Chen, Zheng Yao, Bin Kayacan, Erdal Zhejiang University State Key Laboratory of Fluid Power and Mechatronic Systems Hangzhou310027 China Aarhus University Artificial Intelligence in Robotics Laboratory Department of Electrical and Computer Engineering Aarhus8000 Denmark GIM Robotics Espoo02650 Finland Zhejiang University Ocean College Ocean Research Center of Zhoushan Zhoushan316021 China Purdue University School of Mechanical Engineering West Lafayette47907 United States Department of Electrical Engineering and Information Technology Paderborn33098 Germany

With increasingly challenging applications for quadrotors, higher requirements are emerging for tracking accuracy and safety. While high accuracy is a prerequisite for complex tasks, safety is ensured through tolerance to actuator faults and resistance to external disturbances. In this article, adaptive robust control (ARC) integrated with Gaussian processes (GPs), i.e., ARC-GP, is proposed to achieve enhanced accuracy, fault tolerance, and anti-disturbance. These three requirements are interrelated and affected by uncertainties. The primary idea of this article is to categorize uncertainties into parametric and nonparametric types, which are then addressed through parameter adaptation and GP, respectively. First, a detailed dynamic model is established, including actuator models that reflect different types of faults corresponding to changes in different physical parameters. Then, parameter adaptation is designed, with direct and indirect methods adopted for different parameters. In particular, the actuator parameters are effectively estimated to achieve targeted fault compensation. Regarding GP for nonparametric uncertainties, its model parameters are also updated via parameter adaptation. The GP thereby also learns parameter estimation errors along with external disturbances. Accordingly, ARC controllers are designed, for which robust feedback terms are constructed to further mitigate uncertainties on the basis of the covariances predicted by GP. The experiments demonstrate that the proposed ARC-GP can actively tolerate various types of actuator faults and better resist wind disturbances. © 2013 IEEE.

关键词： Robust control

来源：评论

学校读者我要写书评

暂无评论

Drowning Recognition for Ocean Surveillance using computer vision and Drone Control

Drowning Recognition for Ocean Surveillance using Computer V...

引用

2023 IEEE International Conference on Electro Information Technology, eIT 2023

作者： Rakotondraibe, Maureen Fang, Tianyang Saniie, Jafar Research Laboratory Department of Electrical and Computer Engineering ChicagoIL United States

ISBN: (纸本)9781665493765

According to WHO's report from 2021, Drowning is the 3rd leading cause of unintentional death worldwide. The use of autonomous drones for drowning recognition can increase the survival rate and help lifeguards and rescuers with their life saving mission. This paper presents a real-time drowning recognition model and algorithm for ocean surveillance that can be implemented on a drone. The presented model has been trained using two different approaches and has 88% accuracy. Compared to the contemporary models of drowning recognition designed for swimming pools, the model presented is better suited for outdoor applications in the ocean. © 2023 IEEE.

关键词： Drones

来源：评论

学校读者我要写书评

暂无评论

Utilizing computer vision Algorithms to Detect and Classify Cyberattacks in IoT Environments in Real-Time

Utilizing Computer Vision Algorithms to Detect and Classify ...

引用

2023 IEEE International Conference on Electro Information Technology, eIT 2023

作者： Gromov, Mikhail Arnold, David Saniie, Jafar Research Laboratory Department of Electrical and Computer Engineering ChicagoIL United States

ISBN: (纸本)9781665493765

computer vision has proven itself capable of accurately detecting and classifying objects within images. This also works in cases where images are used as a way of representing data, without being actual photographs. In cybersecurity, computer vision is rarely used, however it has been used to detect botnets successfully. We applied computer vision to determine how well it would be able to detect and classify a large number of attacks and determined that it would be able to run at a decent rate on a Jetson Nano. This was accomplished by training a convolutional neural network using data publicly available in the IoT-23 database, which contains packet captures of IoT devices with and without different malware infections. The neural network was evaluated on an RTX 3050 and a Jetson Nano to see if it could be used in IoT. © 2023 IEEE.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

IoT-Enabled Smart Bike Helmet with an AI-Driven Collision Avoidance System

IoT-Enabled Smart Bike Helmet with an AI-Driven Collision Av...

引用

2023 IEEE International Conference on Electro Information Technology, eIT 2023

作者： Solus, Jacob Rakotondraibe, Maureen Yu, Xinrui Yi, Won-Jae Gromov, Mikhail Saniie, Jafar Research Laboratory Department of Electrical and Computer Engineering ChicagoIL United States

ISBN: (纸本)9781665493765

This paper presents a system design for a smart bike helmet with multiple safety features that are intended to empower bicycle riders to proactively avoid potential sources of danger or injury. A Smart Sensor/Actuator Node (SSAN), driven by an Arduino Uno single-board microcontroller, contains input sensors and actuators to provide riders the ability to send and receive warnings promptly on their helmet. A vision Node, driven by an NVIDIA Jetson Nano and a cable pin-connected camera, executes AI object detection algorithms for any dangerous objects that are out of sight of the rider and sends alerts to the SSAN as needed. By combining safety features of the SSAN and vision Node while continuously sending data to an IoT-enabled backend web server, the safety operation of a typical bike ride can be substantially improved. © 2023 IEEE.

关键词： Edge computing

来源：评论

学校读者我要写书评

暂无评论

Cybersecurity advancements for medical image transmission: a hybrid optical-based cryptosystem harnessing chaos, DNA sequences, and mandelbrot keys

引用

Multimedia Tools and Applications 2025年 1-41页

作者： Alalwan, Nasser El-Shafai, Walid Amoon, Mohammed Benjdira, Bilel Department of Computer and Communication Engineering King Saud University Riyadh Saudi Arabia Prince Sultan University Riyadh11586 Saudi Arabia Robotics and Internet-of-Things Laboratory Prince Sultan University Riyadh12435 Saudi Arabia Department of Electronics and Electrical Communications Engineering Faculty of Electronic Engineering Menoufia University Menouf32952 Egypt

In today's advanced technological age, characterized by innovations like big data processing, cloud computing, and the Internet of Things (IoT), there is a rising utilization of medical multimedia data, especially medical images. These images, integral to the Internet of Healthcare Things (IoHT), necessitate secure transmission due to the increasing risks of unauthorized breaches and tampering. Current security methods, especially for cloud and mobile platforms, often struggle with challenges related to processing capacity, memory use, data size, and energy, making them ill-suited for extensive medical data or resource-limited environments. To address these challenges, this study introduces a novel hybrid cryptosystem, drawing on the unique qualities of the optical Arnold chaotic map, DNA (DeoxyriboNucleic Acid) sequences, and Mandelbrot keys, providing a fortified approach to the secure streaming of medical images. The proposed framework operates via a precise and structured procedure. It begins by applying the optical Arnold chaotic map cipher to each of the three-color channels (R, G, and B) within a medical image. This is followed by overlaying DNA encoding sequences on the resultant encrypted image from the earlier ciphering phase. Leveraging this groundwork, we incorporate an advanced Mandelbrot set-driven shift mechanism specifically designed to create complex confusion patterns within the R, G, and B segments of the encrypted medical imagery. The efficacy of the proposed cryptosystem is rigorously substantiated through an extensive array of simulations supported by a comprehensive security analysis. The results highlight its unparalleled resilience and security capabilities in the realm of medical image encryption, marking a significant leap over previous systems in the literature. Essentially, our work pioneers a solution to a pressing challenge in medical image security, ensuring enhanced protection of delicate health data among the rapidly evolving advanc

关键词： Cybersecurity

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：