The rise of precision agriculture has promoted the development of picking-robot technology, and the visual recognition system at its core is crucial for improving the level of agricultural production. This paper reviews the progress of visual recognition technology for picking robots, including image capture technology, target detection algorithms, spatial positioning strategies, and scene understanding. The article begins with a description of the basic structure and function of the vision system of the picking robot and emphasizes the importance of achieving high-efficiency and high-accuracy recognition in the natural agricultural environment. Subsequently, various image processing techniques and vision algorithms are examined, including color image analysis, three-dimensional depth perception, and automatic object recognition technology that integrates machine learning and deep learning algorithms. At the same time, the paper highlights the challenges existing technologies face with dynamic lighting, occlusion, diversity of fruit maturity, and real-time processing requirements. The paper further discusses multisensor information fusion technology and methods for combining visual recognition with the robot control system to improve the accuracy and working rate of picking. It also introduces innovative research, such as the application of convolutional neural networks (CNNs) for accurate fruit detection and the development of event-based vision systems to improve the response speed of the system. Finally, the future development of visual recognition technology for picking robots is predicted, and new research trends are proposed, including the refinement of algorithms, hardware innovation, and the adaptability of technology to different agricultural environments. The purpose of this paper is to provide a comprehensive analysis of visual recognition technology for researchers and practitioners in the field of agricultural robotics.
This paper presents a data-driven predictive control method for optimizing the energy consumption of air-cooled data centers with unknown system model parameters. First, based on the measurable data of the studied sys...
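The data-driven step this abstract describes, building a predictive model from measurable operating data when the physical system parameters are unknown, can be sketched as least-squares identification of a linear state-space model. This is only an illustrative assumption, not the paper's actual formulation; the function name and the linear model structure are hypothetical.

```python
import numpy as np

def identify_linear_model(X, U, X_next):
    """Least-squares fit of x_{k+1} ~ A x_k + B u_k from logged operating data.

    X, X_next: (T, n) arrays of successive states; U: (T, m) array of inputs.
    """
    Z = np.hstack([X, U])  # stacked regressors [x_k, u_k]
    # Solve Z @ Theta = X_next in the least-squares sense
    Theta, *_ = np.linalg.lstsq(Z, X_next, rcond=None)
    n = X.shape[1]
    A = Theta[:n].T  # state transition estimate
    B = Theta[n:].T  # input matrix estimate
    return A, B
```

With such an identified (A, B) pair, a predictive controller can roll the model forward and search over candidate inputs for the lowest predicted energy cost.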
To address the issues of excessive and untimely maintenance of bearings, this paper proposes a performance evaluation method for bearing condition monitoring based on the combination of Principal Component...
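The abstract is truncated, so the exact combination used in the paper is unknown; below is only a generic sketch of the PCA portion of such a monitoring scheme: fit principal components on healthy-condition features, then score new samples with the Hotelling T² statistic. All names are illustrative assumptions.

```python
import numpy as np

def fit_pca_monitor(X_healthy, n_components=2):
    """Fit a PCA model on healthy-condition feature vectors (one row per sample)."""
    mean = X_healthy.mean(axis=0)
    Xc = X_healthy - mean
    cov = np.cov(Xc, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)       # ascending order
    order = np.argsort(eigvals)[::-1]            # sort descending by variance
    P = eigvecs[:, order[:n_components]]         # loading matrix
    lam = eigvals[order[:n_components]]          # retained variances
    return mean, P, lam

def t2_statistic(x, mean, P, lam):
    """Hotelling T^2: variance-weighted distance inside the principal subspace."""
    t = P.T @ (x - mean)                         # scores of the new sample
    return float(np.sum(t**2 / lam))
```

A sample whose T² exceeds a control limit estimated from the healthy data would be flagged as a degraded bearing condition.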
Domain adaptive semantic segmentation enables robust pixel-wise understanding in real-world driving scenes. Source-free domain adaptation, as a more practical technique, addresses the concerns of data privacy and storage limitations in typical unsupervised domain adaptation methods, making it especially relevant in the context of intelligent vehicles. It utilizes a well-trained source model and unlabeled target data to achieve adaptation in the target domain. However, in the absence of source data and target labels, current solutions cannot sufficiently reduce the impact of domain shift and fully leverage the information from the target data. In this paper, we propose an end-to-end source-free domain adaptation semantic segmentation method via Importance-Aware and Prototype-Contrast (IAPC) learning. The proposed IAPC framework effectively extracts domain-invariant knowledge from the well-trained source model and learns domain-specific knowledge from the unlabeled target domain. Specifically, considering the problem of domain shift in the prediction of the target domain by the source model, we put forward an importance-aware mechanism for the biased target prediction probability distribution to extract domain-invariant knowledge from the source model. We further introduce a prototype-contrast strategy, which includes a prototype-symmetric cross-entropy loss and a prototype-enhanced cross-entropy loss, to learn target intra-domain knowledge without relying on labels. A comprehensive variety of experiments on two domain adaptive semantic segmentation benchmarks demonstrates that the proposed end-to-end IAPC solution outperforms existing state-of-the-art methods. The source code is publicly available at https://***/yihong-97/Source-free-IAPC. IEEE
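The prototype-contrast idea can be illustrated with a toy symmetric cross-entropy between the network's class prediction and a prototype-based soft assignment of features. This does not reproduce the actual IAPC losses; the cosine-similarity assignment, the temperature `tau`, and the function names are assumptions made for illustration only.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def prototype_soft_labels(feats, prototypes, tau=0.1):
    """Soft assignment of features to class prototypes via cosine similarity."""
    f = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    p = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    return softmax(f @ p.T / tau, axis=1)

def symmetric_ce(pred, soft, eps=1e-8):
    """Symmetric cross-entropy between prediction and prototype assignment."""
    ce = -(soft * np.log(pred + eps)).sum(axis=1)   # forward CE
    rce = -(pred * np.log(soft + eps)).sum(axis=1)  # reverse CE
    return float((ce + rce).mean())
```

The symmetric form penalizes disagreement in both directions, which is one common way to make pseudo-label training more tolerant of noisy assignments.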
Multi-view stereo aims to recover the 3D model of a scene from a set of images. However, low-textured areas in the scene have always been a challenge in 3D reconstruction. In this work, we propose a segmentation-guide...
Recognition and early warning of plant diseases is one of the keys to agricultural disaster prevention and mitigation. Deep learning-based image recognition methods offer a new approach to plant disease identification....
In this paper, we propose LF-PGVIO, a Visual-Inertial-Odometry (VIO) framework for large Field-of-View (FoV) cameras with a negative plane, using points and geodesic segments. The purpose of our research is to unleash the potential of point-line odometry with large-FoV omnidirectional cameras, even for cameras with a negative-plane FoV. To achieve this, we propose an Omnidirectional Curve Segment Detection (OCSD) method combined with a camera model that is applicable to images with large distortions, such as panoramic annular images, fisheye images, and various panoramic images. Each geodesic segment is sliced into multiple straight-line segments based on its radian span, and descriptors are extracted and recombined. Descriptor matching establishes the constraint relationship between 3D line segments across multiple frames. In our VIO system, the line feature residual is also extended to support large-FoV cameras. Extensive evaluations on public datasets demonstrate the superior accuracy and robustness of LF-PGVIO compared to state-of-the-art methods. The source code will be made publicly available at https://***/flysoaryun/LF-PGVIO. IEEE
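Slicing a curve segment into straight-line pieces by radian can be sketched for the simple case of a circular arc. The actual OCSD detector operates on geodesics under an omnidirectional camera model, so this is only a simplified geometric illustration with hypothetical names and a hypothetical step size.

```python
import numpy as np

def slice_arc_into_chords(center, radius, theta0, theta1, step_rad=0.2):
    """Approximate a circular arc by chord endpoints sampled every step_rad radians.

    Returns an (n+1, 2) array of points; consecutive rows are the endpoints
    of the straight-line segments approximating the arc.
    """
    n = max(1, int(np.ceil(abs(theta1 - theta0) / step_rad)))
    thetas = np.linspace(theta0, theta1, n + 1)
    return np.stack([center[0] + radius * np.cos(thetas),
                     center[1] + radius * np.sin(thetas)], axis=1)
```

Descriptors could then be extracted along each chord and recombined into a single descriptor for the whole curved segment, mirroring the recombination step the abstract mentions.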
Chinese herbal oral liquid can leach a variety of effective ingredients from herbs and has become a major drug for clinical application. However, it is easy to produce or introduce foreign matter that is very faint ...
Vision sensors are widely applied in vehicles, robots, and roadside infrastructure. However, due to limitations in hardware cost and system size, camera Field-of-View (FoV) is often restricted and may not provide sufficient coverage. Nevertheless, from a spatiotemporal perspective, it is possible to obtain information beyond the camera's physical FoV from past video streams. In this paper, we propose the concept of online video inpainting for autonomous vehicles to expand the field of view, thereby enhancing scene visibility, perception, and system safety. To achieve this, we introduce the FlowLens architecture, which explicitly employs optical flow and implicitly incorporates a novel clip-recurrent transformer for feature propagation. FlowLens offers two key features: 1) FlowLens includes a newly designed Clip-Recurrent Hub with 3D-Decoupled Cross Attention (DDCA) to progressively process global information accumulated over time. 2) It integrates a multi-branch Mix Fusion Feed Forward Network (MixF3N) to enhance the precise spatial flow of local features. To facilitate training and evaluation, we derive the KITTI360 dataset with various FoV masks, which cover both outer- and inner-FoV expansion scenarios. We also conduct quantitative assessments of beyond-FoV semantics across different models and perform qualitative comparisons of beyond-FoV object detection. We illustrate that employing FlowLens to reconstruct unseen scenes even enhances perception within the field of view by providing reliable semantic context. Extensive experiments and user studies involving offline and online video inpainting, as well as beyond-FoV perception tasks, demonstrate that FlowLens achieves state-of-the-art performance. The source code and dataset are made publicly available at https://***/MasterHow/FlowLens. IEEE
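The outer- and inner-FoV expansion scenarios correspond to two kinds of inpainting masks laid over a frame. A minimal sketch of how such masks might be generated follows; the actual KITTI360-derived masks may differ, and the function names and mask layout here are assumptions.

```python
import numpy as np

def outer_fov_mask(h, w, margin):
    """1 marks pixels beyond the camera FoV (a border band) to be inpainted."""
    m = np.ones((h, w), dtype=np.uint8)
    m[margin:h - margin, margin:w - margin] = 0  # visible interior
    return m

def inner_fov_mask(h, w, box):
    """1 marks an occluded inner rectangle (y0, x0, y1, x1) to be inpainted."""
    m = np.zeros((h, w), dtype=np.uint8)
    y0, x0, y1, x1 = box
    m[y0:y1, x0:x1] = 1
    return m
```

An inpainting model is then trained to reconstruct the masked region, with the outer-band variant standing in for content beyond the physical FoV.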
The deep convolution method based on MSDP signal imaging has been proven to be an effective means of monitoring the robot grinding process. This method imposes very high requirements on imaging quality and requires...