Recent advancements in deep learning require a large amount of the annotated training data containing various terms and conditions of the environment. Thus, developing and testing algorithms for the navigation of mobi...
详细信息
Recent advancements in deep learning require a large amount of the annotated training data containing various terms and conditions of the environment. Thus, developing and testing algorithms for the navigation of mobile robots can be expensive and time-consuming. Motivated by the aforementioned problems, this article presents a photorealistic simulator for the computervision community working with omnidirectional vision systems. Built using unity, the simulator integrates sensors, mobile robots, and elements of the indoor environment and allows one to generate synthetic photorealistic data sets with automatic ground truth annotations. With the aid of the proposed simulator, two practical applications are studied, namely extrinsic calibration of the vision system and three-dimensional reconstruction of the indoor environment. For the proposed calibration and reconstruction techniques, the processes themselves are simple, robust, and accurate. Proposed methods are evaluated experimentally with data generated by the simulator. The proposed simulator and supporting materials are available online: http://***.
In this paper we’ve conducted multiple experiments with modern object detection system YOLO. Object detection systems are fundamental to many robotics tasks. Recognition algorithms involving object detection are ofte...
详细信息
vision tracking is a key component of a video sequence. It is the process of locating single or multiple moving objects over time using one or many cameras. The latter's function consists of detecting, categorizin...
详细信息
vision tracking is a key component of a video sequence. It is the process of locating single or multiple moving objects over time using one or many cameras. The latter's function consists of detecting, categorizing, and tracking. The development of the trustworthy solution for video sequence analysis opens up new horizons for a variety of applications, including intelligent transportation systems, biomedical, agriculture, human-machine interaction, augmented reality, video surveillance, robots, and many crucial research areas. To make efficient models, there are challenges in video observation to deal with, such as problems with the environment, light variation, pose variation, motion blur, clutter, occlusion, and so on. In this paper, we present several techniques that addressed the issues of detecting and tracking multiple targets on video sequences. The proposed comparative study relied on different methodologies. This paper's purpose is to list various approaches, classify them, and compare them, using the Weighted Scoring Model (WSM) comparison method. This includes studying these algorithms, selecting relevant comparison criteria, assigning weights for each criterion, and lastly computing scores. The obtained results of this study will reveal the strong and weak points of each algorithm mentioned and discussed.
In recent years, advancement of the industry and increased human activities created significant pollution in the marine environment and coastal regions of the Persian Gulf. These pollutions cause various diseases and ...
详细信息
ISBN:
(纸本)9781665409391
In recent years, advancement of the industry and increased human activities created significant pollution in the marine environment and coastal regions of the Persian Gulf. These pollutions cause various diseases and serious damages to the human health and animal species. Early identification of various pollutions helps the coastal management to organize their resources and rapidly respond to the problems. Due to the large scale of the coastal regions, manual investigation of the pollutions is a very time-consuming task. Unmanned robots can be used as autonomous agents for rapid large-scale detection and classification of pollutions in the coastal regions. In this paper, an artificial intelligence-based vision system for autonomous marine pollution detection is proposed. A combination of computervision and machine learning methods are used for autonomous detection of various pollutions in the coastal and marine environment. In this study, 3000 images of Persian Gulf coastal pollutions is collected and used for training an artificial intelligence system for coastal conservation. The experimental results shows that the proposed framework has a 98% accuracy for identifying and classifying coastal and marine pollutions. The proposed system can be used as the vision system of an autonomous coastal conservation robot and increase the speed of coastal conservation and management significantly.
基于智能机器人代替人到各种复杂环境完成探测、防疫等大量应用的需求,场景的识别引起了研究者的广泛关注。场景识别的目的是通过提取和分析场景中的特征,获得场景的高层语义信息,从而推理出所处的具体位置,它是同步定位与建图系统(Simultaneous Localization and Mapping,SLAM)、自动驾驶、机器人导航、闭环检测的基础。三维扫描技术的快速发展使得人们能够利用各种扫描仪快速获取各类场景的点云数据。不论扫描时间、光照环境如何变化,点云场景所获取的几何信息都具有较好的不变性,因此,基于点云的场景识别成为计算机视觉领域的研究热点。本文首先对近年来面向点云数据的场景识别方法进行了归纳和总结;然后介绍用于场景识别的大规模室内/室外场景的数据集,以及用于算法评测的评价指标,同时总结了各类算法的识别率。最后指出面向点云的场景识别中所面临的问题和挑战,对未来的研究趋势进行展望。研究结果有助于相关领域学者快速全面地了解基于点云数据场景识别的研究现状,为进一步提升场景识别精度奠定基础。
The integration of computervisiontechniques for the accomplishment of autonomous interaction tasks represents a challenging research direction in the context of aerial robotics. In this paper, we consider the proble...
详细信息
ISBN:
(数字)9781728162126
ISBN:
(纸本)9781728162133
The integration of computervisiontechniques for the accomplishment of autonomous interaction tasks represents a challenging research direction in the context of aerial robotics. In this paper, we consider the problem of contact-based inspection of a textured target of unknown geometry and pose. Exploiting state of the art techniques in computer graphics, tuned and improved for the task at hand, we designed a framework for the projection of a desired trajectory for the robot end-effector on a generically-shaped surface to be inspected. Combining these results with previous work on energy-based interaction control, we are laying the basis of what we call vision-based impedance control paradigm. To demonstrate the feasibility and the effectiveness of our methodology, we present the results of both realistic ROS/Gazebo simulations and preliminary experiments with a fully-actuated hexarotor interacting with heterogeneous curved surfaces whose geometric description is not available a priori, provided that enough visual features on the target are naturally or artificially available to allow the integration of localization and mapping algorithms.
3D vision systems will play an important role in next-generation dairy farming due to the sensing capabilities they provide in the automation of animal husbandry tasks such as the monitoring, herding, feeding, milking...
详细信息
3D vision systems will play an important role in next-generation dairy farming due to the sensing capabilities they provide in the automation of animal husbandry tasks such as the monitoring, herding, feeding, milking and bedding of animals This paper will review 3D computervision systems and techniques that are and may be implemented in Precision Dairy Farming. This review will include evaluations of the applicability of Time of Flight and Streoscopic vision systems to agricultural applications as well as a breakdown of the categories of computervisionalgorithms which are being explored in a variety of use cases. These use cases range from robotic platforms such as milking robots and autonomous vehicles which must interact closely and safely with animals to intelligent systems which can identify dairy cattle and detect deviations in health indicators such as Body Condition Score and Locomotion Score. Upon analysis of each use case, it is apparent that systems which can operate in unconstrained environments and adapt to variations in herd characteristics, weather conditions, farmyard layout and different scenarios in animal-robot interaction are required. Considering this requirement, this paper proposes the application of techniques arising from the emerging field of research in Artificial Intelligence that is Geometric Deep Learning. (C) 2019, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
Scene understanding represents one of the most primary problems in computervision. It implies the full knowledge of all the elements of the environment and the comprehension of the relationships between them. One of ...
详细信息
ISBN:
(纸本)9781728136059
Scene understanding represents one of the most primary problems in computervision. It implies the full knowledge of all the elements of the environment and the comprehension of the relationships between them. One of the major tasks in this process is the scene recognition, on which we focus in this work. Scene recognition is a relevant and helpful task in many robotic fields such as navigation, localization, manipulation, among others. The knowledge of the place (e.g. "office", "classroom" or "kitchen") can improve the performance of robots in indoor environments. This task can be difficult because of the variability, ambiguity, illumination changes, occlusions and scale variability present in this type of spaces. Commonly, this problem has been approached through the development of models based on local and global characteristics, incorporating context information and, more recently, using deep learning techniques. In this paper, we propose a multi-classifier model for scene recognition considering as priors the outcomes of independent base classifiers. We implement a weighted voting scheme based on genetic algorithms for the combination of different classifiers in order to improve the recognition performance. The results have proved the validity of our approach and how the proper combination of independent classifier models makes it possible to find a better and more efficient solution for the scene recognition problem.
暂无评论