检索结果-内蒙古大学图书馆

Optoelectronic Imaging and Multimedia Technology XI 2024

作者： Zelensky, A. Gapon, N. Zhdanova, M. Voronin, V. Ilukhin, Y. Gribkov, A. Scientific-Manufacturing Complex «Technological Centre» Zelenograd Russia Don State Technical University Rostov-on-Don Russia Center for Cognitive Technology and Machine Vision Moscow State University of Technology «STANKIN» Moscow Russia

ISBN: (纸本)9781510682061

The goal of image enhancement is to improve specific features or details of an image and enhance its overall visual quality. We introduce a novel image enhancement algorithm based on block-rooting processing combined with multi-scale exposure image fusion. The proposed method integrates both local and global transform domain-based feedback mechanisms for imaging applications. The core concept of the local alpha-rooting method involves applying it to disjoint blocks of varying sizes, followed by the decomposition of the weight map and multi-scale enhanced images into Gaussian and Laplacian pyramids. Fusion is achieved by multiplying the multi-scale images and their corresponding weights. A new stage is introduced to obtain a local-global estimate of high-contrast images, which is also employed in the general artificial fusion model. Computer simulations conducted on image datasets demonstrate that the new enhancement algorithm outperforms state-of-the-art techniques. © 2024 SPIE.

关键词： image fusion

来源：评论

学校读者我要写书评

暂无评论

Automated Detection of Diabetic Retinopathy Segmented images using ResNet50 and VGG16 Deep Learning Algorithms 2

Automated Detection of Diabetic Retinopathy Segmented Images...

引用

2nd International Conference on Inventive Computing and Informatics (ICICI)

作者： Betha, Sashi Kanth Seventline, J. B. GITAM Deemed Be Univ Visakhapatnam Andhra Pradesh India Vignans Inst Engn Women Dept ECE Visakhapatnam Andhra Pradesh India GITAM Deemed Be Univ Dept EECE Visakhapatnam Andhra Pradesh India

ISBN: (纸本)9798350373301;9798350373295

Diabetic retinopathy (DR), a severe complication arising from diabetes, make a significant threat to vision due to the deterioration of retinal blood vessels. This research work proposes a comprehensive methodology for the automated detection, grading, and segmentation of DR, leveraging advanced image processing, deep learning techniques and machine learning. The study utilizes the Indian Diabetic Retinopathy image dataset (IDRID), comprising 81 fundus images and labels, to rigorously evaluates the proposed methodology. Key steps include detailed image preprocessing, VGG16-based feature extraction, Random Forest classifier-based grading, and innovative segmentation techniques for lesion localization. The evaluation demonstrates exceptional performance, with both VGG16 and ResNet50 architectures achieving over 99% accuracy. The process of semantic segmentation enhances interpretability, supporting clinical decision-making in retinopathy diagnosis. While the results are promising, future validation on diverse datasets and careful consideration of ethical implications are essential for responsible deployment in clinical settings. The proposed methodology signifies a significant step toward precise diagnostics and improved patient outcomes in diabetic retinopathy and holds potential for broader applications in retinal disease diagnosis.

关键词： Diabetic retinopathy VGG16 Feature extraction image Preprocessing Segmentation

来源：评论

学校读者我要写书评

暂无评论

Research on verification framework of image processing IP core based on real-time reconfiguration 18

Research on verification framework of image processing IP co...

引用

Colloidal Nanoparticles for Biomedical applications XVIII 2023

作者： Mo, Wei Zhao, Lu Wen, Jianping Xi’an Xwzn Technology Co. Ltd. Shaanxi Science and Technology Holding Group Co. Ltd. Xi’an China Xi’an University of Science and Technology Xi’an China

ISBN: (纸本)9781510658950

The verification of IP core with image processing algorithm is important for SoC and FPGA application in the field of machine vision. This paper proposes a verification framework with general purpose, real-time performance and agility for IP core with image processing algorithm by using heterogeneous platform composed of ARM and FPGA. In the verification framework, the Gigabit Ethernet communication between PC and ARM is established. The FPGA is used to build the data bus to be compatible with multiple types of images, and combine with a partial reconfiguration to achieve fast iteration of IP cores of the algorithm to be verified. The validation framework is reusable for the algorithm IP core, and the deployment speed of the IP cores to be verified is 25 times faster than global reconfiguration. Compared with the existing FPGA verification technology, it has better reusability, shorter verification cycle, more targeted test stimulus, and faster deployment of IP cores to be verified. © 2023 SPIE.

关键词： image processing

来源：评论

学校读者我要写书评

暂无评论

image processing using Cloud for Surveillance 5

Image Processing using Cloud for Surveillance

引用

5th International Conference on Information Management and machine Intelligence, ICIMMI 2023

作者： Tiwari, Shivam Phukan, Aarhee Vedhavathy, T.R. Department of Networking and Communications School of Computing SRM Institute of Science and Technology Chengalpattu Tamil Nadu Kattankulathur603203 India

ISBN: (纸本)9798400709418

In the era of digitization and big data, the world is inundated with an ever-growing volume of visual content, be it images or videos. As organizations strive to harness the potential of these multimedia data sources, there is an increasing need for advanced image processing techniques that can automate the analysis and extraction of valuable information. Amazon Web Services (AWS) Rekognition emerges as a powerful solution in this landscape, offering a comprehensive system for image and video analysis through the lens of machine learning and computer vision. This paper delves into the realm of image processing using AWS Rekognition, unveiling the transformative capabilities of this cloud-based service and its applications in various domains. As we embark on this journey, we will explore the principles, methodologies, and real-world implications of leveraging AWS Rekognition for image analysis. © 2023 ACM.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Optical Preprocessing for Low-Latency machine vision

Optical Preprocessing for Low-Latency Machine Vision

引用

作者： Muminov, Baurzhan University of California Riverside

学位级别：Ph.D., Doctor of Philosophy

In recent years there has been an increased interest towards edge computing, i.e., computing performed on distributed devices as opposed to centralized high-power hubs. Examples of edge computing would be the local image processing performed on Unmanned Autonomous Vehicles (UAV's) or the specialized machine vision systems on drones. These edge computing applications require schemes that are efficient with power and memory and typically must operate real-time. Many state-of-the-art image processing solutions that employ advanced optimization and deep neural networks (NNs) achieve impressive benchmark results, but are computationally demanding and thus on many occasions, impractical. The additional requirement for a range of applications is noise robustness or the ability to work in (extreme) low-light conditions; reasonable quality image or accurate object classification may be critical when there is low light flux or when the environment is over-saturated with other signals. Here, we approach edge computing with a combination of optical preprocessing and shallow NN and we show that this hybrid approach greatly reduces the computational requirements. For low-SNR imaging, we develop a technique that reconstructs objects and scenes from their Fourier-plane images. The optical preprocessing is performed via encoded diffraction with optical vortex singularities. The optical vortex encoder achieves differentiation of the already-compressed Fourier-plane patterns and enables facile inverse inference of the original object scene. We demonstrate that our method is robust to noise. And for a simple NN architecture (one or two layers), leads to generalization, i.e., reconstruction of objects from classes that are greatly different from the ones the NN was trained on. Our research identifies strong potential for swift hybrid imaging systems with edge computing applications and highlights the valuable function of the vortex encoder for spectral differentiation.

关键词： Low-latency machine learning machine vision Noise robustness Topological optics Vortices

来源：评论

学校读者我要写书评

暂无评论

An AI pipeline for garment price projection using computer vision

引用

Neural Computing and applications 2024年第25期36卷 15631-15651页

作者： Rico Gómez, Rodrigo Lorentz, Joe Hartmann, Thomas Goknil, Arda Pal Singh, Inder Halaç, Tayfun Gökmen Boruzanlı Ekinci, Gülnaz DataThings 5 rue de l’industrie Luxembourg1811 Luxembourg SnT University of Luxembourg Campus Kirchberg Luxembourg1359 Luxembourg SINTEF Digital Oslo Norway Galaksiya Information Technologies Izmir Turkey Department of Mathematics Ege University Izmir Turkey

The fashion industry’s traditional price-setting methods, based on historical sales and Fashion Week trends, are inadequate in the digital era. Rapid changes in collections and consumer preferences necessitate advanced Artificial Intelligence (AI) techniques. These AI methods should analyze data from various sources, including social media and e-commerce, to predict future fashion trends and prices. In this paper, we propose, apply, and assess a data analytics approach, i.e., FashionXpert, employing several image processing and machine learning techniques in an AI pipeline for garment price prediction. It integrates various heterogeneous data sources (e.g., textual and image data from e-stores, brand websites, and social media) to obtain more consistent, accurate, and beneficial information. We evaluated its effectiveness with an industrial data set obtained by a fashion search tool from the electronic commerce sites of clothing brands. FashionXpert predicted garment prices with an average Mean Absolute Error (MAE) of 15.31 EUR on a data set that has a standard deviation of 72.99 EUR. © The Author(s) 2024.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

Padding-Free Convolution Based on Preservation of Differential Characteristics of Kernels 22

Padding-Free Convolution Based on Preservation of Differenti...

引用

22nd IEEE International Conference on machine Learning and applications, ICMLA 2023

作者： Leng, Kuangdai Thiyagalingam, Jeyan Science and Technology Facilities Council Scientific Computing Department Didcot United Kingdom

ISBN: (纸本)9798350345346

Convolution is a fundamental operation in image processing and machine learning. Aimed primarily at maintaining image size, padding is a key ingredient of convolution, which, however, can introduce undesirable boundary effects. We present a non-padding-based method for size-keeping convolution based on the preservation of differential characteristics of kernels. The main idea is to make convolution over an incomplete sliding window 'collapse' to a linear differential operator evaluated locally at its central pixel, which no longer requires information from the neighbouring missing pixels. While the underlying theory is rigorous, our final formula turns out to be simple: the convolution over an incomplete window is achieved by convolving its nearest complete window with a transformed kernel. This formula is computationally lightweight, involving neither interpolation or extrapolation nor restrictions on image and kernel sizes. Our method favours data with smooth boundaries, such as high-resolution images and fields from physics. Our experiments include: i) filtering analytical and non-analytical fields from computational physics and, ii) training convolutional neural networks (CNNs) for the tasks of image classification, semantic segmentation and super-resolution reconstruction. In all these experiments, our method has exhibited visible superiority over the compared ones. © 2023 IEEE.

关键词： computer vision convolutional neural network differential operator machine learning padding

来源：评论

学校读者我要写书评

暂无评论

Mold Steel Grinding Process Application in Furniture Design Based on machine vision and Wireless Sensor Network Equipment

引用

MOBILE NETWORKS & applications 2025年第SUPPL1期30卷 18-18页

作者： Xu, Jinling Wang, Guodong Hexi Univ Acad Fine Arts Zhangye 734000 Gansu Peoples R China City Univ Macau Fac Innovat & Design Macau 999078 Macau Peoples R China

With the continuous development of furniture design, the machining accuracy and surface quality of die steel have been paid more and more attention. The traditional grinding process has problems such as low efficiency and unstable quality, so it is urgent to introduce advanced technical means to improve the intelligent level of the processing process. This study aims to explore the application of the die steel grinding process based on machine vision and wireless sensor network equipment in furniture design, and improve the efficiency and quality of the grinding process through real-time monitoring and data analysis. A grinding monitoring platform integrating machine vision system and wireless sensor network was developed. A machine vision system is used to capture critical image data during the grinding process in real time, while a wireless sensor network is used to collect and transmit grinding parameters, including temperature, vibration and acoustic emission signals. By analyzing the acquired data, the optimized grinding parameters and control strategy are worked out. The experimental results show that the grinding process using machine vision and wireless sensor network has improved the relevant parameters compared with the traditional methods. The real-time monitoring capability of the system significantly reduces the failure rate during grinding and provides a more stable and reliable die steel processing solution for furniture design.

关键词： machine vision Wireless sensor network equipment Die steel grinding process Furniture design

来源：评论

学校读者我要写书评

暂无评论

Towards an image Utility Assessment Framework for machine Perception 30

Towards an Image Utility Assessment Framework for Machine Pe...

引用

30th European Signal processing Conference (EUSIPCO)

作者： Khan, Zohaib Amjad Valenzise, Giuseppe Chetouani, Aladine Dufaux, Frederic Univ Paris Saclay Lab Signaux & Syst Cent Supelec CNRS Gif Sur Yvette France Univ Orleans Lab PRISME Orleans France

ISBN: (纸本)9789082797091

In real-world applications, images and videos used in computer vision algorithms are often distorted due, e.g., to compression and transmission. As a result, they may lose relevant information content, or they may deviate significantly from the original data distribution used to train the machine task, rendering the visual content practically useless with respect to its initial purpose. Evaluating the utility of an image for machine tasks has received little attention so far in the literature. This concept of utility is substantially different from the visual quality typically used in image/video compression, as the latter is related to the perception of the human visual system. In this paper, we propose a definition of utility as the degree of confidence by which a machine task is able to take a decision. In this context, we propose a full-reference utility loss measure: we assume that the decision on the pristine image is correct (reference), and we measure the utility loss as the confidence reduction in the decision due to a noisy input with respect to this reference. We apply this general definition on two specific tasks, classification and object detection, and we study practical solutions to predict utility, as well as the ability of our utility measure to generalize across tasks.

关键词： image utility for machines machine perception task-based assessment image utility assessment

来源：评论

学校读者我要写书评

暂无评论

Omnidirectional Gradient and Its Application in Stylized Edge Extraction of Infrared image

Omnidirectional Gradient and Its Application in Stylized Edg...

引用

International Conference on image processing, Computer vision and machine Learning (ICICML)

作者： Wu, Jun Wei, Xingzhan Chongqing Univ Chongqing 400044 Peoples R China Chinese Acad Sci Chongqing Inst Green & Intelligent Technol Chongqing 400714 Peoples R China Univ Chinese Acad Sci Chongqing Sch Chongqing 400714 Peoples R China

ISBN: (纸本)9781665464680

Gradient computing is a low-level technology widely used in image processing. For large gradient magnitude, the pixel value in the field changes a lot, and for small gradient magnitude the pixel in the domain changes little. This is the basis of classical edge extraction algorithms, but it is often necessary to manually set thresholds to differentiate. This paper innovatively brings out the concept of omnidirectional gradient, which uses flexible convolution kernel radius and special law to calculate, and omnidirectional gradient pays more attention to gradient direction and analyzes the relationship and change of the gradient direction with different kernel radius. We present here an algorithm for stylized edge extraction based on omnidirectional gradient, overcoming the drawback of classical edge extraction algorithms that require manual thresholding. Experimental results show that the proposed method outperforms the classical edge extraction methods in terms of adaptive, consistent, and visually friendlier features for infrared imaging. In addition, the algorithm is fast and efficient, its result can be used as real-time input for subsequent applications.

关键词： omnidirectional gradient stylized edge extraction infrared image

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：