检索结果-内蒙古大学图书馆

Optoelectronic Imaging and Multimedia Technology XI 2024

作者： Zelensky, A. Gapon, N. Zhdanova, M. voronin, v. Ilukhin, Y. Gribkov, A. Scientific-Manufacturing Complex «Technological Centre» Zelenograd Russia Don State Technical University Rostov-on-Don Russia Center for Cognitive Technology and Machine Vision Moscow State University of Technology «STANKIN» Moscow Russia

ISBN: (纸本)9781510682061

The goal of image enhancement is to improve specific features or details of an image and enhance its overall visual quality. We introduce a novel image enhancement algorithm based on block-rooting processing combined with multi-scale exposure image fusion. The proposed method integrates both local and global transform domain-based feedback mechanisms for imaging applications. The core concept of the local alpha-rooting method involves applying it to disjoint blocks of varying sizes, followed by the decomposition of the weight map and multi-scale enhanced images into Gaussian and Laplacian pyramids. Fusion is achieved by multiplying the multi-scale images and their corresponding weights. A new stage is introduced to obtain a local-global estimate of high-contrast images, which is also employed in the general artificial fusion model. Computer simulations conducted on image datasets demonstrate that the new enhancement algorithm outperforms state-of-the-art techniques. © 2024 SPIE.

关键词： image fusion

来源：评论

学校读者我要写书评

暂无评论

Efficient Approximate vedic Multiplier: Design, Analysis, and Application in image Blending 2

Efficient Approximate Vedic Multiplier: Design, Analysis, an...

引用

2nd IEEE International Conference on Computer vision and machine Intelligence, CvMI 2023

作者： Gupta, Sanjiv Kumar Yadav, Nilesh Kumar Dhawan, Amit Tiwari, Manish Jha, Sumit Kumar Motilal Nehru National Institute of Technology Department of Electronics and Communication Engineering Uttar Pradesh Allahabad211004 India

ISBN: (纸本)9798350305142

Approximate computing has become a widely recognized method for designing energy-efficient arithmetic architectures in the context of error-tolerant applications. This paper presents the design and analysis of a 4-bit approximate vedic multiplier (AvMT) using the Urdhva Tiryagbhyam method. This vedic approach, involving vertical and crosswise steps, outperforms traditional multiplication in terms of efficiency. An approximate 2-bit multiplier (AvM2) is designed, and an AvMT is proposed using AvM2. The proposed architecture has better propagation delay and less area utilization compared to other conventional multipliers. AvMT has an 11% reduction in area consumption and a 12% increase in processing speed compared to the exact vedic multiplier. To assess its practicality in real-world scenarios, the proposed multiplier is integrated into an image-blending application. The results indicate that the system achieves a Structural Similarity Index (SSIM) average value of 0.91, which proves to be suitable for error-resilient image processing applications. © 2023 IEEE.

关键词： Energy efficiency

来源：评论

学校读者我要写书评

暂无评论

Rlm-tracking: online multi-pedestrian tracking supported by relative location mapping

引用

INTERNATIONAL JOURNAL OF machine LEARNING AND CYBERNETICS 2024年第7期15卷 2881-2897页

作者： Ren, Kai Hu, Chuanping Xi, Hao Univ Zhengzhou Sch Elect & Informat Engn Zhengzhou Henan Peoples R China

The challenge of multi-object tracking stands as a fundamental focus in computer vision research, finding widespread applications in areas such as public safety, transportation, autonomous vehicles, robotics, and other domains involving artificial intelligence. Given the intricate nature of natural scenes, the occurrence of object occlusion and semi-occlusion is commonplace in basic tracking tasks. These factors often result in challenges such as ID switching, object loss, detection errors, and misaligned bounding boxes, thereby significantly impacting the precision of multi-object *** paper aims to address the aforementioned issues and proposes a novel multi-object tracker, incorporating Relative location mapping (RLM) and Target region density (TRD) modeling. The new tracker is more sensitive to differences in the spatial relationships between targets, allowing it to dynamically introduce low-scoring detection boxes into different regions based on the density of target regions in the image. This improves the accuracy of target tracking while avoiding the consumption of a significant amount of computational *** research results indicate that when applying this method to state-of-the-art multi-object tracking approaches, the proposed model achieves improvements of 0.4 to 0.8 points in the HOTA and IDF1 metrics on the MOT17 and MOT20 datasets. This demonstrates the effectiveness of the proposed method in enhancing multi-object tracking performance.

关键词： Relative location mapping Multi-target tracking video processing Kalman filtering Artificial intelligence

来源：评论

学校读者我要写书评

暂无评论

Currency Detector For visually Impaired

Currency Detector For Visually Impaired

引用

2023 Intelligent Computing and Control for Engineering and Business Systems, ICCEBS 2023

作者： MacRiga, G. Adiline Agustin Jebakumar, v. Ariharan, S. Sri Sairam Engineering College Dept of Information Technology Chennai India

ISBN: (纸本)9798350394580

The currency has a great meaning in everyday *** each of us using the Currency notes in our day to day lives through Cash or online Payment. Thus currency recognisation has gained a great interest for many researchers across the varoius part of the globe. Currency Identification System is an important area of processing of the image. This is used worldwide for various purposes. Our project aims to develop an intelligent system for recognizing Indian paper currency, which has diverse applications in electronic banking, currency monitoring systems, money exchange machines, and other related fields. With the rise of modern currency automation systems, the ability to accurately identify and process paper currency is becoming increasingly important. By leveraging advanced computer vision and machine learning techniques, we hope to create a system that can quickly and accurately recognize Indian paper currency, enabling it to be used in a range of applications. This project is essential to addressing the current needs of currency automation systems and has the potential to make a significant impact on various industries. In this project We are using Opencv(Open Computer vision) which is a image processing libraray, the Approach is correlation technique is applied to extract Gandhiji images and Thin Strip, and forming clusters of featuresIt is a very helpful system for blind people toknow the denomination on the paper currency © 2023 IEEE.

关键词： Currency Recognition RGB image,Correlation

来源：评论

学校读者我要写书评

暂无评论

The Normalization of vaping on TikTok Using Computer vision, Natural Language processing, and Qualitative Thematic Analysis: Mixed Methods Study

引用

JOURNAL OF MEDICAL INTERNET RESEARCH 2024年 26卷 e55591页

作者： Jung, Sungwon Murthy, Dhiraj Bateineh, Bara S. Loukas, Alexandra Wilkinson, Anna, v Univ Texas Austin Sch Journalism & Media 300 W Dean Keeton St Austin TX 78712 USA Univ Texas Hlth Sci Ctr Houston Sch Publ Hlth Houston TX USA Univ Texas Austin Dept Kinesiol & Hlth Educ Austin TX USA

Background: Social media posts that portray vaping in positive social contexts shape people's perceptions and serve to normalizevaping. Despite restrictions on depicting or promoting controlled substances, vape-related content is easily accessible on *** is a need to understand strategies used in promoting vaping on TikTok, especially among susceptible youth audiences. Objective: This study seeks to comprehensively describe direct (ie, explicit promotional efforts) and indirect (ie, subtlerstrategies) themes promoting vaping on TikTok using a mixture of computational and qualitative thematic analyses of socialmedia posts. In addition, we aim to describe how these themes might play a role in normalizing vaping behavior on TikTok foryouth audiences, thereby informing public health communication and regulatory policies regarding vaping endorsements onTikTok. Methods: We collected 14,002 unique TikTok posts using 50 vape-related hashtags (eg, #vapetokand #boxmod). Using thek-means unsupervised machine learning algorithm, we identified clusters and then categorized posts qualitatively based on ***, we organized all videos from the posts thematically and extracted the visual features of each theme using 3 machinelearning-based model architectures: residual network (ResNet) with 50 layers (ResNet50), visual Geometry Group model with16 layers, and vision transformer. We chose the best-performing model, ResNet50, to thoroughly analyze the image clusteringoutput. To assess clustering accuracy, we examined 4.01% (441/10,990) of the samples from each video cluster. Finally, werandomly selected 50 videos (5% of the total videos) from each theme, which were qualitatively coded and compared with the machine-derived classification for validation. Results: We successfully identified 5 major themes from the TikTok posts. vape product marketing(1160/10,990, 8.28%)reflected direct marketing, while the other 4 themes reflected indirect marketing: TikTok influencer(3775/

关键词： electronic cigarettes vaping social media natural language processing computer vision

来源：评论

学校读者我要写书评

暂无评论

Enhancing Scene Text Segmentation through Subtask Decomposition 2

Enhancing Scene Text Segmentation through Subtask Decomposit...

引用

2nd International Conference on image processing, Computer vision and machine Learning, ICICML 2023

作者： Wang, Yong Chen, Youguang East China Normal University School of Data Science and Engineering Shanghai China

ISBN: (纸本)9798350331417

The field of image processing widely utilizes scene text segmentation technology, with applications extending to image editing and font style transfer. These applications enhance image understanding quality and aid in boosting the performance of numerous computer vision tasks. The advent and progression of deep learning have led to substantial advancements in scene text segmentation technology. However, the limited size of existing scene text segmentation datasets constrains the performance of models. Therefore, we propose an algorithm for synthetic segmentation data. We first pretrain the model using large-scale synthetic data, then fine-tune it on the target dataset to address the issue of limited dataset size. Existing models employ end-to-end segmentation, which presents challenges in segmentation. We propose a scene text segmentation method. By decomposing the segmentation task into subtasks and solving them one by one, the complexity of the task can be reduced compared to direct segmentation of the entire image significantly improving the segmentation effect. The proposed method consists of three modules: a fragment crop module, a fragment segmentation module, and a fragment combination module. The fragment crop module is composed of an additional corp layer added after DBnet. The fragment segmentation module can be embedded with various segmentation methods. The fragment combination module uses the maximum pixel value pasting algorithm to combine the segmented fragments. We call this method Crop-Segmentation-Combination Framework (CSCF). We conducted experiments on the ICDAR 2013 and TextSeg datasets. The CSCF, embedded in Unet within the segment segmentation module, enhanced the text segmentation IoU by 5.80% on the ICDAR 2013 test dataset. Our suggested approach has been shown to notably enhance the efficiency of scene text segmentation. © 2023 IEEE.

关键词： image segmentation Scene text segmentation Synthetic data synthetic image generations Text segmentation

来源：评论

学校读者我要写书评

暂无评论

FastBeltNet: a dual-branch light-weight network for real-time conveyor belt edge detection

引用

JOURNAL OF REAL-TIME image processing 2024年第4期21卷 123-123页

作者： Zhao, Xing Zeng, Minhao Dong, Yanglin Rao, Gang Huang, Xianshan Mo, Xutao Anhui Univ Technol Sch Microelect & Data Sci Maanshan 243032 Anhui Peoples R China Shanghai Meishan Iron & Steel Co Ltd Steelmaking Plant Nanjing 210039 Jiangsu Peoples R China Anhui Univ Technol Sch Innovat & Entrepreneurship Maanshan 243032 Anhui Peoples R China

Belt conveyors are widely used in multiple industries, including coal, steel, port, power, metallurgy, and chemical, etc. One major challenge faced by these industries is belt deviation, which can negatively impact production efficiency and safety. Despite previous research on improving belt edge detection accuracy, there is still a need to prioritize system efficiency and light-weight models for practical industrial applications. To meet this need, a new semantic segmentation network called FastBeltNet has been developed specifically for real-time and highly accurate conveyor belt edge line segmentation while maintaining a light-weight design. This network uses a dual-branch structure that combines a shallow spatial branch for extracting high-resolution spatial information with a context branch for deep contextual semantic information. It also incorporates the Ghost blocks, Downsample blocks, and Input Injection blocks to reduce computational load, increase processing frame rate, and enhance feature representation. Experimental results have shown that FastBeltNet has performed comparatively better than some existing methods in different real-world production settings, achieving promising performance metrics. Specifically, FastBeltNet achieves 80.49% mIoU accuracy, 99.89 FPS processing speed, 895 k parameters, 8.23 GFLOPs, and 430.95 MB peak CUDA memory use, effectively balancing accuracy and speed for industrial production.

关键词： Belt deviation machine vision Deep learning Edge detection

来源：评论

学校读者我要写书评

暂无评论

Robust and constrained tracking of PSv interface using convolutional neural networks and optimistic horizon estimation

引用

JOURNAL OF PROCESS CONTROL 2025年 151卷

作者： Xie, Junyao Liang, Huiping Tatlici, Mahmut Berat Huang, Biao Univ Alberta Dept Chem & Mat Engn Edmonton AB T6G 2V4 Canada Cent South Univ Sch Automat Changsha 410083 Peoples R China

This manuscript proposes a novel video-based robust and constrained estimation framework using the convolutional neural network and optimistic moving horizon estimation, with applications in interface estimation of oil sand primary separation vessels (PSv). Although convolutional neural networks have achieved notable success across various computer vision and image analysis tasks, image outliers (such as blocking, blurriness, and lighting variations) would inevitably affect recognition/tracking performance. To address this issue, this manuscript proposes a robust estimation approach by leveraging a convolutional neural network and moving horizon estimation. Along this line, the interface recognition results by the convolutional neural network can be modeled as the measurements corrupted by disturbances and outliers, and the internal states can be modeled through a discrete-time finite-dimensional state space model. More importantly, the ubiquitously present constraints in the estimation task can be explicitly and readily handled by the moving horizon estimation. The stability analysis of the proposed method is provided in the presence of disturbances and model-plant mismatch. The effectiveness of the proposed method is validated through a pilot-scale laboratory study and an industrial primary separation vessel case study.

关键词： image processing Primary separation vessels Moving horizon estimation Interface level estimation Robust and constrained estimation Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Nanomaterial-Based Synaptic Optoelectronic Devices for In-Sensor Preprocessing of image Data

引用

ACS OMEGA 2023年第6期8卷 5209-5224页

作者： Lee, Minkyung Seung, Hyojin Kwon, Jong Ik Choi, Moon Kee Kim, Dae-Hyeong Choi, Changsoon Korea Inst Sci & Technol KIST Post Silicon Semicond Inst Ctr Optoelect Mat & Devices Seoul 02792 South Korea Inst Basic Sci IBS Ctr Nanoparticle Res Seoul 08826 South Korea Seoul Natl Univ Inst Chem Proc Sch Chem & Biol Engn Seoul 08826 South Korea Ulsan Natl Inst Sci & Technol UNIST Sch Mat Sci & Engn Ulsan 44919 South Korea Seoul Natl Univ Dept Mat Sci & Engn Seoul 08826 South Korea

With the advance in information technologies involving machine vision applications, the demand for energyand time-efficient acquisition, transfer, and processing of a large amount of image data has rapidly increased. However, current architectures of the machine vision system have inherent limitations in terms of power consumption and data latency owing to the physical isolation of image sensors and processors. Meanwhile, synaptic optoelectronic devices that exhibit photoresponse similar to the behaviors of the human synapse enable insensor preprocessing, which makes the front-end part of the image recognition process more efficient. Herein, we review recent progress in the development of synaptic optoelectronic devices using functional nanomaterials and their unique interfacial characteristics. First, we provide an overview of representative functional nanomaterials and device configurations for the synaptic optoelectronic devices. Then, we discuss the underlying physics of each nanomaterial in the synaptic optoelectronic device and explain related device characteristics that allow for the in-sensor preprocessing. We also discuss advantages achieved by the application of the synaptic optoelectronic devices to image preprocessing, such as contrast enhancement and image filtering. Finally, we conclude this review and present a short prospect.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Automatic Piped and Micro Irrigation Network 2

Automatic Piped and Micro Irrigation Network

引用

2nd IEEE International Conference on Networking and Communications, ICNWC 2024

作者： Ponsudha, P. Shalin, Elton NirmalRaj, D. Rahul, v. Velammal Engineering College Department Of Electronics and Communication Engineering Chennai India

ISBN: (纸本)9798350365269

This research paper presents cutting-edge technologies and methodologies to enhance precision agriculture and support sustainable farming practices. The study incorporates Satellite image processing for land classification, achieving a remarkable accuracy of 99.7%. The Ultrasonic Pest Repellent system showcases effective pest control with remote capabilities. The C-shaped unit integrates computer vision and machine learning, reducing chemical usage by up to 95%. An IoT-based plant disease detection system achieves superior accuracy in disease classification. The C-shaped Ground Unit addresses challenges faced by Indian farmers, optimizing plant care, nutrient supply, and pest repellence. Together, these innovations contribute to a more sustainable and efficient future for agriculture. © 2024 IEEE.

关键词： Ultrasonic applications

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：