检索结果-内蒙古大学图书馆

17th International Conference on machine vision applications (MVA)

作者： Wang, Hao Luo, Dingli Ikenaga, Takeshi Waseda Univ Grad Sch Informat Prod & Syst Kitakyushu Fukuoka 8080135 Japan

ISBN: (纸本)9784901122207

3D pose estimation based on a monocular camera can be applied to various fields such as human-computer interaction and human action recognition. As a two-stage 3D pose estimator, VideoPose3D achieves state-of-the-art accuracy. However, because of the limitation of two-stage processing, image information is partially lost in the process of mapping 2D poses to 3D space, which results in limited final accuracy. This paper proposes an image-assisting pose estimation model and a back-projection based offset generating module. The image-assisting pose estimation model consists of a 2D pose processing branch and an image processing branch. image information is processed to generate an offset to refine the intermediate 3D pose produced by the 2D pose processing network. The back-projection based offset generating module projects the intermediate 3D poses to 2D space and calculates the error between the projection and input 2D pose. With the error combining with extracted image feature, the neural network generates an offset to decrease the error. By evaluation, the accuracy on each action of Human3.6M dataset gets an average improvement of 0.9 mm over the VideoPose3D baseline.

关键词： Human computer interaction Three-dimensional displays machine vision Pose estimation Neural networks Feature extraction Cameras

来源：评论

学校读者我要写书评

暂无评论

Novel Sensing Approaches for Structural Deformation Monitoring and 3D Measurements

引用

IEEE SENSORS JOURNAL 2021年第10期21卷 11318-11328页

作者： Castro-Toscano, Moises J. Rodriguez-Quinonez, Julio C. Sergiyenko, Oleg Flores-Fuentes, Wendy Ramirez-Hernandez, Luis Roberto Hernandez-Balbuena, Daniel Lindner, Lars Rascon, Raul Univ Autonoma Baja California Fac Ingn Mexicali 21280 Baja California Mexico Univ Autonoma Baja California Inst Ingn Mexicali 21280 Baja California Mexico

Nowadays, laser vision systems have allowed the development of different applications such as reverse engineering, manufacturing, navigation systems and, structural health monitoring (SHM). However, most of the machine vision systems for structural behavior analysis have restricted field of view, consume high levels of computational resources for image processing and require special illumination conditions to achieve lower error rates. Therefore, the purpose of this paper is to present a technical vision system (TVS) for structural behavior analysis using dynamic laser triangulation and k-Nearest Neighbor (k-NN) machine learning regression algorithm. The proposed vision system was tested in order to demonstrate the practicality of it, different deformations and displacements were analyzed over real structures in controlled laboratory conditions to assure the reproducibility of the experimentation. The TVS prototype proved to be a reliable option on SHM tasks, presenting balance between precision and operating ranges, without the issues aforementioned.

关键词： Dynamic triangulation vision sensors laser scanner vision systems machine learning SHM

来源：评论

学校读者我要写书评

暂无评论

Rate-Distortion in image Coding for machines

Rate-Distortion in Image Coding for Machines

引用

Picture Coding Symposium (PCS)

作者： Harell, Alon De Andrade, Anderson Bajic, Ivan, V Simon Fraser Univ Sch Engn Sci Burnaby BC Canada

ISBN: (纸本)9781665492577

In recent years, there has been a sharp increase in transmission of images to remote servers specifically for the purpose of computer vision. In many applications, such as surveillance, images are mostly transmitted for automated analysis, and rarely seen by humans. Using traditional compression for this scenario has been shown to be inefficient in terms of bit-rate, likely due to the focus on human based distortion metrics. Thus, it is important to create specific image coding methods for joint use by humans and machines. One way to create the machine side of such a codec is to perform feature matching of some intermediate layer in a Deep Neural Network performing the machine task. In this work, we explore the effects of the layer choice used in training a learnable codec for humans and machines. We prove, using the data processing inequality, that matching features from deeper layers is preferable in the sense of rate-distortion. Next, we confirm our findings empirically by re-training an existing model for scalable human-machine coding. In our experiments we show the trade-off between the human and machine sides of such a scalable model, and discuss the benefit of using deeper layers for training in that regard.

关键词： image coding Deep neural networks Collaborative intelligence Object detection

来源：评论

学校读者我要写书评

暂无评论

Development of a Low-Cost Software to Obtain Quantitative Parameters in the Open Field Test for Application in Neuroscience Research 27th

Development of a Low-Cost Software to Obtain Quantitative Pa...

引用

27th Brazilian Congress on Biomedical Engineering (CBEB)

作者： Costalat, T. R. M. Negrao, I. P. R. Gomes-Leal, W. Fed Univ Para Inst Technol Belem Para Brazil Fed Univ Para Inst Biol Sci Lab Expt Neuroprotect & Neuroregenerat Belem Para Brazil

ISBN: (纸本)9783030706012;9783030706005

This paper describes the development of a low-cost software, called Rat Steps, which allows the obtention of quantitative data (total distance traveled and average speed) as well as the graphic trajectory performed by an animal in the open field test. This behavioral test is widely used in neuroscience in order to visualize locomotor impairment following acute brain injury, including stroke, as well as the effect of experimental therapies for these neural disorders. The main tools used for the software development were digital image processing techniques, Python programming, OpenCV library and machine learning algorithms, including the Mean Shift method. The software was successfully developed with effective obtention of quantitative parameters from the Open Field Test, which allows several applications in neuroscience research.

关键词： Computer vision image processing machine learning Neuroscience Open field test

来源：评论

学校读者我要写书评

暂无评论

3D reconstruction of moving object by double sampling based on phase shifting profilometry 9

3D reconstruction of moving object by double sampling based ...

引用

9th Symposium on Novel Photoelectronic Detection Technology and applications

作者： Zhang, Qinghui Li, Hao Lu, Lei Pan, Wei Su, Zhilong Zhang, Mengya Lv, Pengtao Key Laboratory of Grain Information Processing and Control of Ministry of Education Henan University of Technology Ministry of Education Zhengzhou450001 China College of Information Science and Engineering Henan University of Technology Zhengzhou450001 China Department of R & D OPT Machine Vision Tech Co. Ltd Dongguan523860 China Shanghai Institute of Applied Mathematics and Mechanics School of Mechanics and Engineering Science Shanghai University Shanghai200444 China

ISBN: (纸本)9781510664432

When using traditional phase-shift profilometry for 3D measurement, it is necessary to keep the measured object static during the shooting process. When the measured object is moving, errors will occur if the projection and capture of the fringe image is not fast enough. This paper proposes a new method to reconstruct the moving object by double sampling. A trigger control device is applied to the camera and projector, which ensures that after each projection, two consecutive images are captured before the next projection. Then, the phase information is retrieved by analyzing the relationship between the motion and fringe patterns. Finally, the moving object is retrieved successfully. The proposed method increased the frame rate of the moving object reconstruction. © 2023 SPIE.

关键词： Profilometry

来源：评论

学校读者我要写书评

暂无评论

A Review on Agriculture Monitoring: Plant Disease Detection Using Computer vision and machine Learning

SSRN

引用

SSRN 2024年

作者： Balderas-Silva, David Lopez-Bernal, Diego Díaz-Lara, Alfredo Rojas, Mario Ponce, Pedro Molina, Arturo Tecnologico de Monterrey Institute of Advanced Materials for Sustainable Manufacturing Anillo Perif. 6666 Mexico City14380 Mexico Tecnologico de Monterrey School of Engineering and Sciences Queretaro Santiago de Queretaro76130 Mexico

In recent years, the global population has shown substantial growth, leading to an increase in its food security needs. In response, greenhouse cultivation has emerged as a strategy to ensure controlled conditions for optimal crop growth. However, plant diseases continue to pose a threat to crops, and growers often lack the resources and expertise for effective crop monitoring and disease detection. This challenge has caused the exploration of new technologies, such as computer vision and machine learning, as alternative approaches to support plant disease control or management. Consequently, this work provides an overview of the current methods in computer vision and machine learning employed in the literature to address the issue of plant diseases, as well as the image acquisition, image pre-processing, and image classification techniques utilized by diverse authors. Hence, this review aims to provide valuable insights into the current landscape of computer vision and machine learning applications in plant disease detection. © 2024, The Authors. All rights reserved.

关键词： machine learning

来源：评论

学校读者我要写书评

暂无评论

A Semantics-Driven Methodology for High-Quality image Annotation 26

A Semantics-Driven Methodology for High-Quality Image Annota...

引用

26th European Conference on Artificial Intelligence, ECAI 2023

作者： Giunchiglia, Fausto Bagchi, Mayukh Diao, Xiaolei University of Trento Italy

ISBN: (纸本)9781643684369

Recent work in machine Learning and Computer vision has highlighted the presence of various types of systematic flaws inside ground truth object recognition benchmark datasets. Our basic tenet is that these flaws are rooted in the many-to-many mappings which exist between the visual information encoded in images and the intended semantics of the labels annotating them. The net consequence is that the current annotation process is largely under-specified, thus leaving too much freedom to the subjective judgment of annotators. In this paper, we propose vTelos, an integrated Natural Language processing, Knowledge Representation, and Computer vision methodology whose main goal is to make explicit the (otherwise implicit) intended annotation semantics, thus minimizing the number and role of subjective choices. A key element of vTelos is the exploitation of the WordNet lexico-semantic hierarchy as the main means for providing the meaning of natural language labels and, as a consequence, for driving the annotation of images based on the objects and the visual properties they depict. The methodology is validated on images populating a subset of the imageNet hierarchy. © 2023 The Authors.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Alps: An Adaptive Load Partitioning Scaling Solution for Stream processing System on Skewed Stream 33rd

Alps: An Adaptive Load Partitioning Scaling Solution for Str...

引用

33rd International Conference on Database and Expert Systems applications (DEXA)

作者： Zou, Beiji Zhang, Tao Zhu, Chengzhang Xiao, Ling Zeng, Meng Chen, Zhi Cent South Univ Sch Comp Sci & Engn Changsha 410083 Peoples R China Cent South Univ Coll Literature & Journalism Changsha 410083 Peoples R China Mobile Hlth Minist Educ China Mobile Joint Lab Changsha 410008 Peoples R China

ISBN: (纸本)9783031124266;9783031124259

The distributed stream processing system suffers from the rate variation and skewed distribution of input stream. The scaling policy is used to reduce the impact of rate variation, but cannot maintain high performance with a low overhead when input stream is skewed. To solve this issue, we propose Alps, an Adaptive Load Partitioning Scaling system. Alps exploits adaptive partitioning scaling algorithm based on the willingness function to determine whether to use a partitioning policy. To our knowledge, this is the first approach integrates scaling policy and partitioning policy in an adaptive manner. In addition, Alps achieves the outstanding performance of distributed stream processing system with the least overhead. Compared with state-of-the-art scaling approach DS2, Alps reduces the end-to-end latency by 2 orders of magnitude on high-speed skewed stream and avoids the waste of resources on low-speed or balanced stream.

关键词： Data streams Stream processing system Adaptive scaling policy

来源：评论

学校读者我要写书评

暂无评论

Exploring Google Earth Engine Platform for Satellite image Classification Using machine Learning Algorithms 8th

Exploring Google Earth Engine Platform for Satellite Image C...

引用

8th International Conference on Smart City applications, SCA 2023

作者： Ouchra, Hafsa Belangour, Abdessamad Erraissi, Allae Laboratory of Information Technology and Modeling LTIM Faculty of Sciences Ben M’sik Hassan II University Casablanca Morocco Polydisciplinary Faculty of Sidi Bennour Chouaib Doukkali University El Jadida Morocco

ISBN: (纸本)9783031543753

Google Earth Engine is a geospatial data processing platform that runs in the cloud. It offers free access to massive amounts of satellite data as well as unlimited computing power to monitor, visualize, and analyze environmental features on petabyte scale. The capability of this platform to support diverse approaches for land use and land cover (LULC) classification using both pixel based and object-oriented methods has been made possible through the provision of a variety of machine learning algorithms. Earth observation data have proven to be a valuable resource of quantitative information more consistent in time and space than traditional ground surveys. They offer numerous opportunities for mapping and monitoring urban areas, as well as a variety of physical, climatic, and socioeconomic data to support urban planning and decision-making. We used Landsat 8 satellite data to perform supervised classification in this paper, and we used three advanced machine learning methods Support Vector machine (SVM), Random Forest (RF), and Minimum Distance (MD) to classify water areas, built-up areas, cultivated areas, sandy areas, barren areas, and forest areas on Moroccan territory. The classification results are displayed using a set of accuracy indicators that includes overall accuracy (OA) and the Kappa coefficient. We obtained 0.93 as a better accuracy for the MD algorithm, however, the worst accuracy result is 0.74 for the SVM algorithm. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

关键词： Engines

来源：评论

学校读者我要写书评

暂无评论

Recurrent RLCN-Guided Attention Network for Single image Deraining 17

Recurrent RLCN-Guided Attention Network for Single Image Der...

引用

17th International Conference on machine vision applications (MVA)

作者： Li, Yizhou Monno, Yusuke Okutomi, Masatoshi Tokyo Inst Technol Tokyo Japan

ISBN: (纸本)9784901122207

Single image deraining is an important yet challenging task due to the ill-posed nature of the problem to derive the rain-free clean image from a rainy image. In this paper, we propose Recurrent RLCN-Guided Attention Network (RRANet) for single image deraining. Our main technical contributions lie in threefold: (i) We propose rectified local contrast normalization (RLCN) to apply to the input rainy image to effectively mark candidates of rain regions. (ii) We propose RLCN-guided attention module (RLCN-GAM) to learn an effective attention map for the deraining without the necessity of ground-truth rain masks. (iii) We incorporate RLCN-GAM into a recurrent neural network to progressively derive the rainy-to-clean image mapping. The quantitative and qualitative evaluations using representative deraining benchmark datasets demonstrate that our proposed RRANet outperforms existing state-of-the-art deraining methods, where it is particularly noteworthy that our method clearly achieves the best performance on a real-world dataset.

关键词： Training Rain Recurrent neural networks machine vision Benchmark testing Task analysis

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：