检索结果-内蒙古大学图书馆

25th IEEE Conference on Signal processing - Algorithms, Architectures, Arrangements, and applications (IEEE SPA)

作者： Suder, Jakub Podbucki, Kacper Marciniak, Tomasz Dabrowski, Adam Poznan Univ Tech Fac Automat Control Robot & Elect Engn Inst Automat Control & Robot Div Elect Syst & Signal Proc Jana Pawla II 24 PL-60965 Poznan Poland

ISBN: (数字)9788362065424

ISBN: (纸本)9788362065424

The article presents a concept of the analysis of mechanical wear of prisms in the in-pavement airport lamps. The solution is based on image processing technique that requires an appropriate selection of parameters due to the specificity of the objects. During the experimental tests, a database consisting of 316 photos of IDM airport lamps mounted in the airport areas was used. The proposed solution using an artificial neural network allows for the classification of lamps with an efficiency of 81.4%.

关键词： system airport lighting control neural networks artificial intelligence Hough transform

来源：评论

学校读者我要写书评

暂无评论

Application of Computer 3D image vision Algorithm in Intelligent image Recognition System

Application of Computer 3D Image Vision Algorithm in Intelli...

引用

IEEE International Conference on Artificial Intelligence and Computer applications (ICAICA)

作者： Yuan Li Xin Yu Modern Finance Industry School Shandong Institute of Commerce and Technology Jinan Shandong China

In this paper, the 3D space imaging model of machine vision is constructed. Starting from the traditional machine vision image processing algorithm flow, the image denoising process and target tracking process are optimized. The method uses the camera to collect the image and video information of the measured object, and transmits it to the controller. The controller corrects the signal obtained by the wireless sensor in the database to reproduce the position of the measured object and the 3D image. A real-time tracking method of motion trajectory based on computer vision is presented. The object autonomous capture, 3D position and motion trajectory tracking. Simulation experiments show that this method is quite different from conventional image processing methods. This method has the advantages of small computation, fast running speed and good real-time performance. It meets the needs of embedded image processing.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Artificial intelligence in pediatric surgery

引用

SEMINARS IN PEDIATRIC SURGERY 2024年第1期33卷 151390-151390页

作者： Tsai, Anthony Y. Carter, Stewart R. Greene, Alicia C. Penn State Hlth Childrens Hosp Div Pediat Surg 500 Univ Dr Hershey PA 17033 USA Univ Louisville Sch Med Div Pediat Surg Louisville KY 40292 USA

Artificial intelligence (AI) is rapidly changing the landscape of medicine and is already being utilized in conjunction with medical diagnostics and imaging analysis. We hereby explore AI applications in surgery and examine its relevance to pediatric surgery, covering its evolution, current state, and promising future. The various fields of AI are explored including machine learning and applications to predictive analytics and decision support in surgery, computer vision and image analysis in preoperative planning, image segmentation, surgical navigation, and finally, natural language processing assist in expediting clinical documentation, identification of clinical indications, quality improvement, outcome research, and other types of automated data extraction. The purpose of this review is to familiarize the pediatric surgical community with the rise of AI and highlight the ongoing advancements and challenges in its adoption, including data privacy, regulatory considerations, and the imperative for interdisciplinary collaboration. We hope this review serves as a comprehensive guide to AI's transformative influence on surgery, demonstrating its potential to enhance pediatric surgical patient outcomes, improve precision, and usher in a new era of surgical excellence.

关键词： Artificial intelligence machine learning Natural language processing Computer vision Pediatric surgery Predictive analysis

来源：评论

学校读者我要写书评

暂无评论

Drone Small Target Detection Based on YOLOV8-RAWM

Drone Small Target Detection Based on YOLOV8-RAWM

引用

image processing, Computer vision and machine Learning (ICICML), International Conference on

作者： Xianpeng Cheng Liu Feng School of Information Engineering Shanghai Maritime University Shanghai China

ISBN: (数字)9798350355413

ISBN: (纸本)9798350355420

With the widespread use of drones, detecting small targets in aerial images captured by drones poses significant challenges. Issues such as small size and low resolution make these targets difficult to detect. In response to this, this paper proposes an improved object detection algorithm based on YOLOv8. Firstly, a lightweight convolution and Manhattan self-attention mechanism were introduced into the backbone network, along with a feature fusion module in the neck, and the loss function was optimized to enhance the model's performance and robustness in small target detection. Experiments show that the model achieved a 6.2% improvement in mAP@0.5 and a 5% increase in recall on the VisDrone2019 dataset, demonstrating its effectiveness in detecting small targets in drone applications.

关键词： Deep learning Computer vision image resolution Convolution Computational modeling Object detection Feature extraction Robustness Neck Drones

来源：评论

学校读者我要写书评

暂无评论

Research on Badminton Movement machine Learning Model Based on Computer vision Technology

Research on Badminton Movement Machine Learning Model Based ...

引用

image processing and Computer applications (ICIPCA), IEEE International Conference on

作者： Cheng Zong Hohhot Vocational College Hohhot China

ISBN: (数字)9798350360240

ISBN: (纸本)9798350384161

This paper aims to explore an innovative method combining computer vision and machine learning to accurately identify and analyze various movements in badminton. This paper first summarizes the application prospect of computer vision in the field of sports analysis, and introduces its specific application scenarios in badminton in detail. By constructing a complete technical framework of image preprocessing module, feature extraction algorithm and deep learning model, the complex movements of badminton players such as swing, stroke and moving pace are captured and analyzed. In the research process, we used multi-view image fusion and key point detection technology to accurately extract action features in badminton, combined with convolutional neural network (CNN), recurrent neural network (RNN), long term memory network (LSTM) and other deep learning models to efficiently learn and model these features. Thus, the automatic classification and recognition of badminton movement can be realized. The experimental results show that the model has significant accuracy in badminton action recognition, good generalization ability and practicability, and can be effectively applied in the badminton teaching and training process of athlete performance evaluation, competition data analysis and other aspects. This research result not only expands the practical application of computer vision technology in the field of badminton, but also provides new ideas and tools for further promoting the development of sports intelligence and digitalization.

关键词： Training Deep learning Computer vision Analytical models Recurrent neural networks Computational modeling Feature extraction Data models Convolutional neural networks Sports

来源：评论

学校读者我要写书评

暂无评论

Enhanced image Segmentation by a Novel Test Time Augmentation and Super-Resolution 9th

Enhanced Image Segmentation by a Novel Test Time Augmentatio...

引用

9th International Work-Conference on the Interplay Between Natural and Artificial Computation (IWINAC)

作者： Garcia-Aguilar, Ivan Garcia-Gonzalez, Jorge Marcos Luque-Baena, Rafael Lopez-Rubio, Ezequiel Dominguez-Merino, Enrique Univ Malaga Dept Comp Languages & Comp Sci Bulevar Louis Pasteur 35 Malaga 29071 Spain Biomed Res Inst Malaga IBIMA C Doctor Miguel Diaz Recio 28 Malaga 29010 Spain

ISBN: (纸本)9783031065279;9783031065262

image segmentation in computer vision applications plays a critical role in the video processing workflow. In real applications, where interesting elements are moving in the presence of moving objects in the background, complex models are required in the segmentation process to obtain better results. In this paper, a methodology based on super-resolution and test time augmentation is proposed to improve the precision and effectiveness of the segmentation process. Our proposal avoids both modification and retraining of the model. Experiments show that our approach can increase the mean average precision of images segmentation in sequences from well-known benchmark datasets with a significant improvement.

关键词： image segmentation Convolutional neural networks (CNN) Test Time Augmentation (TTA) Super-resolution (SR)

来源：评论

学校读者我要写书评

暂无评论

Robust and Faster Zeroth-Order Minimax Optimization: Complexity and applications 38

Robust and Faster Zeroth-Order Minimax Optimization: Complex...

引用

38th Conference on Neural Information processing Systems, NeurIPS 2024

作者： An, Weixin Liu, Yuanyuan Shang, Fanhua Liu, Hongying Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education School of Artificial Intelligence Xidian University China College of Intelligence and Computing Tianjin University China Medical School Tianjin University China Peng Cheng Lab Shenzhen China

Many zeroth-order (ZO) optimization algorithms have been developed to solve nonconvex minimax problems in machine learning and computer vision areas. However, existing ZO minimax algorithms have high complexity and rely on some strict restrictive conditions for ZO estimations. To address these issues, we design a new unified ZO gradient descent extragradient ascent (ZO-GDEGA) algorithm, which reduces the overall complexity to O(dϵ-6) to find an ϵ-stationary point of the function ψ for nonconvex-concave (NC-C) problems, where d is the variable dimension. To the best of our knowledge, ZO-GDEGA is the first ZO algorithm with complexity guarantees to solve stochastic NC-C problems. Moreover, ZO-GDEGA requires weaker conditions on the ZO estimations and achieves more robust theoretical results. As a by-product, ZO-GDEGA has advantages on the condition number for the NC-strongly concave case. Experimentally, ZO-GDEGA can generate more effective poisoning attack data with an average accuracy reduction of 5%. The improved AUC performance also verifies the robustness of gradient estimations. © 2024 Neural information processing systems foundation. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Spectral-Spatial Anomaly Detection in Hyperspectral imagery Based on Dual-Domain Autoencoders

Spectral-Spatial Anomaly Detection in Hyperspectral Imagery ...

引用

Iranian machine vision and image processing (MVIP)

作者： Mohamad Ebrahim Aghili Hassan Ghassemian Maryam Imani Arani Image Processing and Information Analysis Lab Faculty of Electrical and Computer Engineering Tarbiat Modares University Tehran Iran

Hyperspectral anomaly detection is crucial for applications like aerial surveillance in remote sensing images. However, robust identification of anomalous pixels remains challenging. A novel spectral-spatial anomaly detection technique called Dual-Domain Autoencoders (DDA) is proposed to address these challenges. First, Nonnegative Matrix Factorization (NMF) is applied to decompose the hyperspectral data into anomaly and background components. Refinement of the designation is then done using intersection masking. Next, a spectral autoencoder is trained on identified background signature pixels and used to reconstruct the image. The reconstruction error highlights spectral anomalies. Furthermore, a spatial autoencoder is trained on principal component patches from likely background areas. Fused reconstruction error from the spectral and spatial autoencoders is finally used to give enhanced anomaly detection. Experiments demonstrate higher AUC for DDA over individual autoencoders and benchmark methods. The integration of matrix factorization and dual-domain, fused autoencoders thus provides superior anomaly identification. Spatial modeling further constrains the background, enabling accurate flagging of unusual local hyperspectral patterns. This study provides the effectiveness of employing autoencoders trained on intelligently sampled hyperspectral pixel signatures and spatial features for improved spectral-spatial anomaly detection.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Formation of an algorithm for increasing the accuracy of restoring the boundaries of objects obtained to create 3D structures 4

Formation of an algorithm for increasing the accuracy of res...

引用

Conference on Artificial Intelligence and machine Learning in Defense applications IV

作者： Semenishchev, Evgenii Zelensky, Aleksandr Zhdanova, Marina Ilyukhin, Yury Sizyakin, Roman Voronin, Viacheslav Tula State Univ TulSU Lab Cognit Technol & Simulat Syst 92 Sq Lenina Tula 300012 Tula Region Russia Moscow State Tech Univ STANKIN Ctr Cognit Technol & Machine Vis 1a Vadkovsky Moscow 127055 Russia

ISBN: (纸本)9781510655560

The article proposes an approach to improve the accuracy of restoring the boundaries of objects obtained to create 3D structures by analyzing data obtained by a machine vision system. At the first stage, the operation of reducing the number of color gradients is performed, the technique allows you to combine similar values into common enlarged structures. This operation allows you to simplify the analyzed objects, since small details are not important. In parallel with the first operation of denoising is performed. The paper proposes the application of the multicriteria processing method with the possibility of smoothing locally stationary sections and preserving the boundaries of objects. As an algorithm for strengthening the boundaries of objects, a modification of the combined multi-criteria method is used, which makes it possible to reduce the effect of salt/pepper noise and impulse failures, as well as to strengthen the detected boundaries of objects. The resulting images with enhanced boundaries are fed to the input of the block for constructing three-dimensional objects. The data obtained by both a stereo pair and a camera based on 3D construction using structured light were used in the work. On a set of synthetic data simulating the work in real conditions, the increase in the efficiency of the system using the proposed approach is shown. Based on field data under conditions of interfering factors in the form of dust/fog, the applicability of the proposed approach for solving problems of increasing the accuracy of restoring the boundaries of objects obtained to create three-dimensional structures is shown. images of simple shapes are used as analyzed objects.

关键词： boundaries preprocessing machine vision 3D structures image processing

来源：评论

学校读者我要写书评

暂无评论

Decoding machine Learning Algorithms for DR Detection

Decoding Machine Learning Algorithms for DR Detection

引用

IT Innovation and Knowledge Discovery (ITIKD), International Conference on

作者： Jagrit Ahuja Varun Sharma Raghavv Gupta Preeti Kapoor Shaveta Arora Bharati Vidyapeeth Institute of Management and Research New Delhi India Department of Computer Science and Engineering The NorthCap University Gurugram India

ISBN: (数字)9798350355468

ISBN: (纸本)9798350355475

Diabetic retinopathy is a serious eye disease which can lead to vision defects in diabetic patients. Early detection is important for preventing vision loss. Automating the detection process makes it less laborious and helps in achieving more accurate results. Therefore, this research aims to study various machine learning methods and develop an effective model for diabetic retinopathy detection. The proposed model operates through four stages: pre-processing, feature extraction, feature optimization, and classification. image processing techniques are thoroughly utilized to enhance the fundus images. The textual and spatial features are then extracted and most relevant features are selected using particle swarm optimization. These features are then fed into machine learning classifiers for disease diagnosis. The model is trained and validated on the MESSIDOR I and MESSIDOR ii datasets. The model achieves an accuracy (85.9%), F1 score (0.8585), Sensitivity (0.8049) and Specificity (0.8958) on MESSIDOR ii data with Support Vector machine (SVM) classifier. The results obtained show the effectiveness of the work in early detection.

关键词： Support vector machines Deep learning Diabetic retinopathy Accuracy machine learning algorithms Nearest neighbor methods Feature extraction Particle swarm optimization Optimization vision defects

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：