检索结果-内蒙古大学图书馆

IEEE 8th International Conference on Signal and image processing applications (IEEE ICSIPA)

作者： Xian, Chear Li Sheikh, Usman Ullah Abu Bakar, Syed Abdul Rahman Syed Univ Teknol Malaysia Fac Elect Engn Johor Baharu Malaysia

ISBN: (纸本)9798350352368

Traffic sign recognition is crucial for the safe and efficient operation of autonomous vehicles. While previous research has primarily focused on traffic sign recognition in foreign countries, these studies often face limitations such as differing traffic sign designs, language barriers in textual information, and varying environmental conditions. In this paper, we propose a traffic sign detection and recognition system tailored for Malaysia, utilizing Convolutional Neural Networks (CNNs) and Optical Character Recognition (OCR). In this paper, we propose a traffic sign detection and recognition system utilizing You Only Look Once (YOLO) v8 for object detection and EasyOCR to process textual information on selected traffic signs. Our system achieves a mean Average Precision (mAP) of 0.824 and an average processing time of 1.2 seconds per frame, which is comparable to existing literature. Furthermore, the complexity of our method is significantly reduced, enhancing its potential for real-time processing applications, as evidenced by its efficient processing time.

关键词： Road Sign Detection Autonomous vehicles Intelligent Transportation Systems (ITS) Convolutional neural network (CNN) Traffic Sign Recognition (TSR)

来源：评论

学校读者我要写书评

暂无评论

Research on Industrial Production Defect Detection Method Based on machine vision Technology in Industrial Internet of Things

引用

TRAITEMENT DU SIGNAL 2022年第6期39卷 2061-2068页

作者： Jia, Limin Wang, Yang Shenzhen Decard Smartcard Tech Co Ltd Shenzhen 518055 Peoples R China Shenzhen Polytech Sch Elect & Commun Engn Shenzhen 518055 Peoples R China

The realization of automatic operation of production by the industrial Internet of Things needs the functional assistance of machine vision technology. Different from the recognition and detection of some known features, it is difficult to realize defect detection in machine vision applications. Therefore, this article studies the industrial production defect detection method based on machine vision technology in industrial Internet of Things. Firstly, in the second chapter, the images of industrial products collected by machine vision system are preprocessed and thinned to obtain more ideal detection accuracy and measurement accuracy. The methods of image binarization, morphological processing, thinning and burr elimination are given in detail. In the third chapter, product defect detection model is constructed based on U-Net network, and residual structure, hole convolution module, strip pooling module and attention mechanism module are introduced to optimize the network model. Experimental results verify the effectiveness of the model for product defect detection.

关键词： industrial internet of things machine vision industrial production product defect detection

来源：评论

学校读者我要写书评

暂无评论

A machine learning method to quantitatively predict alpha phase morphology in additively manufactured Ti-6Al-4v

引用

npj Computational Materials 2023年第1期9卷 338-352页

作者： Zhuohan Cao Qian Liu Qianchu Liu Xiaobo Yu Jamie J.Kruzic Xiaopeng Li School of Mechanical and Manufacturing Engineering University of New South Wales(UNSW Sydney)SydneyNSW 2052Australia Platforms Division Defence Science and Technology GroupMelbourneVIC 3207Australia

Quantitatively defining the relationship between laser powder bed fusion(LPBF)process parameters and the resultant microstructures for LPBF fabricated alloys is one of main research *** date,achieving the desired microstructures and mechanical properties for LPBF alloys is generally done by time-consuming and costly trial-and-error experiments that are guided by human ***,we develop an approach whereby an image-driven conditional generative adversarial network(cGAN)machine learning model is used to reconstruct and quantitatively predict the key microstructural features(e.g.,the morphology of martensite and the size of primary and secondary martensite)for LPBF fabricated *** results demonstrate that the developed image-driven machine learning model can effectively and efficiently reconstruct micrographs of the microstructures within the training dataset and predict the microstructural features beyond the training dataset fabricated by different LPBF parameters(i.e.,laser power and laser scan speed).This study opens an opportunity to establish and quantify the relationship between processing parameters and microstructure in LPBF Ti-6Al-4v using a GAN machine learning-based model,which can be readily extended to other metal alloy systems,thus offering great potential in applications related to process optimisation,material design,and microstructure control in the additive manufacturing field.

关键词： martensite microstructure alloy

来源：评论

学校读者我要写书评

暂无评论

Latin Square and machine Learning Techniques Combined Algorithm for image Encryption

引用

CIRCUITS SYSTEMS AND SIGNAL processing 2023年第11期42卷 6829-6853页

作者： Patel, Sakshi Thanikaiselvan, v. Vellore Inst Technol VIT Sch Elect Engn Vellore 632014 India

Multimedia data is crucial in the military, medical, forensics, social, etc., to transmit a large amount of data. Security of this sensitive information is the primary issue. This paper uses Latin square and machine learning techniques such as neural networks and genetic algorithm to design an image encryption algorithm. A new neural network-based pseudorandom number generator is proposed to generate a chaotic sequence for various applications. Encryption key images are designed using Latin squares in the finite field. Further, the Latin squares are XOR with the input matrix to get the encrypted images. The proposed algorithm is iterated a finite number of times to generate a cipher image population. Randomly two parents are chosen from the generated population, and row and column arrangements produce offspring. A genetic algorithm is the optimization technique used for the best encrypted image search. The pixel correlation value serves as a fitness function. Finally, the least correlated cipher image is obtained from the genetic algorithm applied to the parent and offspring of the population generated from the encryption algorithm. The simulation results from the proposed image encryption model surpass many communication channel attacks and perform better when compared to existing image security algorithms.

关键词： image encryption Neural network Pseudorandom number generator Latin square Genetic algorithm machine learning techniques

来源：评论

学校读者我要写书评

暂无评论

Single image Dehazing Using CNN 2nd

Single Image Dehazing Using CNN

引用

2nd International Conference on Computational Intelligence in machine Learning, ICCIML 2022

作者： Bhadane, Samarth Bidwe, Ranjeet vasant Zope, Bhushan Pune Institute of Computer Technology Maharashtra Pune India Maharashtra Lavale Pune India

ISBN: (纸本)9789819979530

Particulate matter in the atmosphere obscures the visibility of the atmosphere, causing a condition known as haze. Other natural phenomena like mist, fog, and dust also obscure the vision;this is because of scattering of light which attenuates the light intensity. All these instances are responsible for the degradation of image quality. Hazy images are problematic because these images cannot be used for computer vision and image processing applications like pattern and object recognition. Dehazing images improve the clarity and contrast of the images making them more suitable for computer vision and image processing. This paper presents a method of dehazing images using CNN. The proposed model is trained on D-HAZY (Ancuti et al. in 2016 IEEE international conference on image processing (ICIP), 2016) and SOTS (Li et al. in IEEE Trans image Process 28:492–505, 2019) datasets which contain a mix of natural and synthesized hazy images. To assess the model’s performance, we employ PSNR and SSIM metrics. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

Optimizing zero-shot text-based segmentation of remote sensing imagery using SAM and Grounding DINO

ARTIFICIAL INTELLIGENCE IN GEOSCIENCES

引用

ARTIFICIAL INTELLIGENCE IN GEOSCIENCES 2025年第1期6卷

作者： Diab, Mohanad Kolokoussis, Polychronis Brovelli, Maria Antonia Politecn Milan Dept Civil & Environm Engn I-20133 Milan Italy Natl Tech Univ Athens Sch Rural Surveying & Geoinformat Engn Athens 15780 Greece

The use of AI technologies in remote sensing (RS) tasks has been the focus of many individuals in both the professional and academic domains. Having more accessible interfaces and tools that allow people of little or no experience to intuitively interact with RS data of multiple formats is a potential provided by this integration. However, the use of AI and AI agents to help automate RS-related tasks is still in its infancy stage, with some frameworks and interfaces built on top of well-known vision language models (vLM) such as GPT-4, segment anything model (SAM), and grounding DINO. These tools do promise and draw guidelines on the potentials and limitations of existing solutions concerning the use of said models. In this work, the state of the art AI foundation models (FM) are reviewed and used in a multi-modal manner to ingest RS imagery input and perform zero-shot object detection using natural language. The natural language input is then used to define the classes or labels the model should look for, then, both inputs are fed to the pipeline. The pipeline presented in this work makes up for the shortcomings of the general knowledge FMs by stacking pre-processing and post-processing applications on top of the FMs;these applications include tiling to produce uniform patches of the original image for faster detection, outlier rejection of redundant bounding boxes using statistical and machine learning methods. The pipeline was tested with UAv, aerial and satellite images taken over multiple areas. The accuracy for the semantic segmentation showed improvement from the original 64% to approximately 80%-99% by utilizing the pipeline and techniques proposed in this work. GitHub Repository: MohanadDiab/LangRS.

关键词： Foundation models Multi-modal models vision language models Semantic segmentation Segment anything model Earth observation Remote sensing

来源：评论

学校读者我要写书评

暂无评论

Automatic visual recognition, detection and classification of weeds in cotton fields based on machine vision

引用

CROP PROTECTION 2025年 187卷

作者： Memon, Muhammad Sohail Chen, Shuren Shen, Baoguo Liang, Runzhi Tang, Zhong Wang, Shuai Zhou, Weiwei Memon, Noreena Jiangsu Univ Key Lab Modern Agr Equipment & Technol Minist Educ Zhenjiang 212013 Jiangsu Peoples R China Jiangsu Univ Sch Agr Engn Zhenjiang 212013 Jiangsu Peoples R China Sindh Agr Univ Fac Agr Engn Dept Farm Power & Machinery Tandojam 70060 Pakistan Jiangsu Aviat Tech Coll Zhenjiang Key Lab UAV Applicat Technol Zhenjiang 212134 Peoples R China

Crops and weeds are involved in a continuous competition for equal resources, which may result in a potential decrease in crop yields by up to 31% and an increase in the costs of agricultural inputs by up to 22% of cultivation. Weeds further impact crop production, and their detection is crucial for effective crop management. In this research, we targeted common weeds of cotton field, specifically i) Digitaria sanguinalis (L.) Scop, ii) Amaranthus retroflexus L., iii) Acalypha australis, L., iv) Cephalanoplos segetum, and v) Chenopodium album L. Additionally, image processing techniques such as grayscale conversion, binarization, and Gaussian and morphological filters were also utilized. These methods are based on machine vision and facilitate rapid and straightforward weed detection by segmenting, scrutinizing, and comparing input images. The plant height and area were obtained during cotton planting within 32 days and fitted to develop the growth law concerning planting days for achieving the function of distinguishing cotton from weeds. We conducted recognition experiments by dividing images into four quadrants and categorizing weeds as either inter-row or intra-row. Meanwhile, the inter-row planting information was used to identify weeds, and the leaf pixel area and circularity were used as the identification methods for intra-row weeds, which reduced the algorithm's running time and improved real-time performance. The experimental results indicated that the inter-row weed recognition rate was 89.4%, with an average processing time of 102ms. Whereas in the case of intra-row weeds, the recognition rate was measured at 84.6%, and the overall recognition rate for cotton was 85.0%, with a mean time consumption of 437ms. Furthermore, the present research underscores recent advancements such as machine vision and high-resolution imaging, which have significantly improved the accuracy of automated weed identification in cotton fields while acknowledging ongoing challen

关键词： Weed detection Inter-row weeds Intra-row weeds machine vision algorithms Weed segmentation Cotton crop Precision farming

来源：评论

学校读者我要写书评

暂无评论

Local Geometric Indexing of High Resolution Data for Facial Reconstruction From Sparse Markers

引用

IEEE TRANSACTIONS ON vISUALIZATION AND COMPUTER GRAPHICS 2024年第8期30卷 5289-5298页

作者： Cong, Matthew Lan, Lana Fedkiw, Ronald Ind Light & Mag San Francisco CA 94129 USA Stanford Univ Dept Comp Sci Stanford CA 94305 USA

When considering sparse motion capture marker data, one typically struggles to balance its overfitting via a high dimensional blendshape system versus underfitting caused by smoothness constraints. With the current trend towards using more and more data, our aim is not to fit the motion capture markers with a parameterized (blendshape) model or to smoothly interpolate a surface through the marker positions, but rather to find an instance in the high resolution dataset that contains local geometry to fit each marker. Just as is true for typical machine learning applications, this approach benefits from a plethora of data, and thus we also consider augmenting the dataset via specially designed physical simulations that target the high resolution dataset such that the simulation output lies on the same so-called manifold as the data targeted.

关键词： Shape Faces Geometry Surface reconstruction Cameras Point cloud compression Deformation Computer graphics image processing and computer vision interpolation

来源：评论

学校读者我要写书评

暂无评论

No-reference image Quality Metric for NeRF (Neural Radiance Fields) Rendering in Automotive applications 26

No-reference Image Quality Metric for NeRF (Neural Radiance ...

引用

26th Irish machine vision and image processing Conference, IMvIP 2024

作者： Raymond, Mary Sistu, Ganesh Gallagher, Louis Valeo Vision Systems Ireland Maynooth University Ireland

ISBN: (纸本)9781837242672

Neural Radiance Fields (NeRF) rendering is a promising Artificial intelligence (AI) technology for generating photorealistic views, with significant potential for automotive applications. However, traditional metrics such as Structural Similarity Index Measure (SSIM), Peak Signal-to-Noise Ratio (PSNR), and Learned Perceptual image Patch Similarity (LPIPS) often fail to evaluate the model's quality for novel viewpoints outside the dataset range, which is crucial for real-life use. This study introduces the Fréchet Inception Distance (FID) as a no-reference image quality metric for novel viewpoints. Our experiments demonstrate that FID aligns well with human quality assessments and is effective in automotive scenarios with fisheye images. The need for further research on FID normalization, the sample sizes of generated viewpoints used to calculate FID, and measures of viewpoint difficulty is highlighted. Adopting FID advances NeRF evaluation, enhancing assessments in real life scenarios within automotive and robotics, and improving autonomous system performance and safety. More results from our experiments are available here https://***/watch?v=Lb8azH79EI0. © This is an open access article published by the IET under the Creative Commons Attribution License (http://***/licenses/by/3.0/)

关键词： Rendering (computer graphics)

来源：评论

学校读者我要写书评

暂无评论

Imaging and vision Development Platform with Algorithm Library for Intelligent vision Systems 7th

Imaging and Vision Development Platform with Algorithm Libra...

引用

7th International Symposium on Intelligent Informatics, ISI 2022

作者： Sreedhanya, L.R. Daniel, J. Jerry Nithin, P.v. Saivam, Murugan Kerala Thiruvananthapuram India

ISBN: (纸本)9789811980930

machine vision applications for intelligent vision systems in manufacturing industries were reported based on image processing and artificial intelligence technology. We propose the imaging and vision development platform in this research for creating vision applications using image processing, machine learning, and a deep learning algorithm library. An algorithm library, vision configurator, execution logic, display manager and deploy manager modules are all included in the proposed platform. This platform is based on an open-source software stack for machine learning and deep learning computer vision technologies including OpenCv, TensorFlow, CUDA, Keras, YOLO and PyTorch. To assess the performance of the suggested platform, real-time applications like vehicle identification, person detection, code scanner, and OCR vision application were developed, validated, and deployed in an embedded system utilizing this platform. The results of the experiments show that the suggested platform can be utilized to evaluate high resolution real-time images and construct vision applications. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：