Search Results - Inner Mongolia University Library

A court line extraction algorithm for badminton tournament videos with horizontal line projection learning

IET Image Processing, 2023, vol. 17, no. 10, pp. 2907-2924

Authors: Wei, Chun-Ta; Weng, Shiuh-Ku. Affiliations: Natl Def Univ, Chung Cheng Inst Technol, Sch Def Sci, Taoyuan, Taiwan; Chien Hsin Univ Sci & Technol, Dept Elect Engn, Taoyuan, Taiwan; Chien Hsin Univ Sci & Technol, Dept Elect Engn, Taoyuan 320678, Taiwan

Court line extraction is an important step in the analysis of sports videos and the foundation of badminton video analysis. This paper proposes an efficient method that uses horizontal line projection with a K-means machine learning algorithm to extract court lines from different broadcast badminton tournament videos. The horizontal lines are projected into a 1-D histogram signal; the signal is then used for training, so that the intensity of the histogram signal can be learned to locate the horizontal court lines. After the equations of the horizontal court lines and of the court lines in the vertical direction have been formulated, the intersection points of the court lines can be calculated and the court lines extracted. The experimental results show that the proposed method extracts the court lines more efficiently than Hough transform-related algorithms, which are widely applied in computer vision and self-driving car applications.

Keywords: court lines; horizontal line projections; Hough transform; k-means machine learning algorithm; self-driving car applications
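
The projection step described in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: the binary mask input, the function names, the two-cluster setting and the toy data are all assumptions made for illustration. Each image row is summed into a 1-D histogram, and a two-cluster 1-D k-means separates high-intensity rows (candidate horizontal court lines) from background rows.

```python
def horizontal_projection(mask):
    """Collapse a binary edge mask (list of rows of 0/1) into one count per row."""
    return [sum(row) for row in mask]


def two_means(values, iterations=20):
    """1-D k-means with k=2 (an assumed setting); returns (low, high) centres."""
    low_c, high_c = float(min(values)), float(max(values))
    for _ in range(iterations):
        low = [v for v in values if abs(v - low_c) <= abs(v - high_c)]
        high = [v for v in values if abs(v - low_c) > abs(v - high_c)]
        if not high:  # all rows look alike, so there is nothing to separate
            break
        low_c, high_c = sum(low) / len(low), sum(high) / len(high)
    return low_c, high_c


def candidate_line_rows(mask):
    """Indices of rows whose projection falls in the high-intensity cluster."""
    hist = horizontal_projection(mask)
    low_c, high_c = two_means(hist)
    return [y for y, v in enumerate(hist) if abs(v - high_c) < abs(v - low_c)]


# Hypothetical 6x8 mask: rows 1 and 4 are full lines, row 2 has one noise pixel.
toy_mask = [
    [0] * 8,
    [1] * 8,
    [0, 0, 0, 1, 0, 0, 0, 0],
    [0] * 8,
    [1] * 8,
    [0] * 8,
]
print(candidate_line_rows(toy_mask))
```

Real broadcast frames show the court in perspective, so a practical version would still need the line equations and intersection points that the abstract mentions; the sketch covers only the projection and clustering idea.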

An Efficient Deep Learning based Hybrid Model for Image Caption Generation

International Journal of Advanced Computer Science and Applications, 2023, vol. 14, no. 3, pp. 231-237

Authors: Kaur, Mehzabeen; Kaur, Harpreet. Affiliation: Department of Computer Science and Engineering, Punjabi University, Patiala, India

In recent years, with the increasing use of social media platforms, image captioning has come to play a major role in automatically describing a whole image in a natural language sentence. Image captioning plays a significant role in a computer-based society. Image captioning is the process of automatically generating a natural language textual description of an image using artificial intelligence techniques. Computer vision and natural language processing are the key aspects of such a system. A Convolutional Neural Network (CNN) is a computer vision component used for object detection and feature extraction, while Natural Language Processing (NLP) techniques help generate the textual caption of the image. Generating a suitable image description by machine is a challenging task, as it depends on object detection, object location and the semantic relationships between objects, expressed in a human-understandable language such as English. This paper aims to develop an encoder-decoder based hybrid image captioning approach using VGG16, ResNet50 and YOLO. VGG16 and ResNet50 are pre-trained feature extraction models trained on millions of images. YOLO is used for real-time object detection. The approach first extracts the image features using VGG16, ResNet50 and YOLO and concatenates the results into a single file. Finally, LSTM and BiGRU are used to produce the textual description of the image. The proposed model is evaluated using BLEU, METEOR and ROUGE scores. © 2023, International Journal of Advanced Computer Science and *** Rights Reserved.

Keywords: Object detection

Semantic Document Layout Analysis of Handwritten Manuscripts

Computers, Materials & Continua, 2023, vol. 75, no. 5, pp. 2805-2831

Authors: Emad Sami Jaha. Affiliation: Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia

A document layout can be more informative than merely a document’s visual and structural ***, document layout analysis (DLA) is considered a necessary prerequisite for advanced processing and detailed document image analysis to be further used in several applications and different *** research extends the traditional approaches of DLA and introduces the concept of semantic document layout analysis (SDLA) by proposing a novel framework for semantic layout analysis and characterization of handwritten *** proposed SDLA approach enables the derivation of implicit information and semantic characteristics, which can be effectively utilized in dozens of practical applications for various purposes, in a way bridging the semantic gap and providing more understandable high-level document image analysis and more invariant characterization via absolute and relative *** approach is validated and evaluated on a large dataset of Arabic handwritten manuscripts comprising complex *** experimental work shows promising results in terms of accurate and effective semantic characteristic-based clustering and retrieval of handwritten *** also indicates the expected efficacy of using the capabilities of the proposed approach in automating and facilitating many functional, real-life tasks such as effort estimation and pricing of transcription or typing of such complex manuscripts.

Keywords: Semantic characteristics; semantic labeling; document layout analysis; semantic document layout analysis; handwritten manuscripts; clustering; retrieval; image processing; computer vision; machine learning

Systematic Review of Retinal Blood Vessels Segmentation Based on AI-driven Technique

JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, vol. 37, no. 4, pp. 1783-1799

Authors: Verma, Prem Kumari; Kaur, Jagdeep. Affiliation: Dr B R Ambedkar Natl Inst Technol, Dept Comp Sci & Engn, Jalandhar 144008, Punjab, India

Image segmentation is a crucial task in computer vision and image processing, and numerous segmentation algorithms can be found in the literature. It has important applications in scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality and image compression, among others. The widespread popularity of deep learning (DL) and machine learning (ML) has inspired new image segmentation methods based on DL and ML models. We offer a thorough analysis of this recent literature, encompassing the range of ground-breaking initiatives in semantic and instance segmentation, including convolutional pixel-labeling networks, encoder-decoder architectures, multi-scale and pyramid-based methods, recurrent networks, visual attention models, and generative models in adversarial settings. We study the connections, benefits, and importance of various DL- and ML-based segmentation models; look at the most popular datasets; and evaluate results in this literature.

Keywords: Retinal image segmentation; Machine learning; Deep learning

3D Information in Robot Vision System Based on Artificial Neural Network

14th International Conference on Frontier Computing, FC 2024

Authors: Liu, Xiaoxiao. Affiliation: Department of Mechanical and Electrical Engineering, Jinan Engineering Vocational Technical College, Shandong Jinan, China

ISBN: (print) 9789819627974

In the exploration of robot vision systems based on artificial neural networks, the research mainly focuses on their applications in 3D information recognition and processing. By simulating the processing of the human visual system, this technology enables robots to interpret and understand the three-dimensional spatial information of their environment more effectively, which has a revolutionary role in robot navigation, object recognition, obstacle avoidance and the execution of complex tasks. The technology also shows great potential in many fields such as industrial automation, autonomous vehicles, drone monitoring, and service robots, and it plays a very important role in providing in-depth information, accurate positioning and efficient decision-making applications. According to the experimental data, among the 800 experimental subjects, more than 754 people were satisfied with the recognition accuracy, recognition speed, machine efficiency improvement, image recognition clarity, and overall satisfaction with the system. These findings indicate that with the continuous advancement of technology, 3D vision systems based on artificial neural networks will show more significant performance and value in future applications. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Machine vision

Machine vision inspection systems

2020

Authors: edited by Muthukumaran Malarvel, Soumya Ranjan Nayak, Sury Narayan Panda, Prasant Kumar Pattnaik...

A Review on Machine Learning Styles in Computer Vision-Techniques and Future Directions

IEEE ACCESS, 2022, vol. 10, pp. 107293-107329

Authors: Mahadevkar, Supriya, V; Khemani, Bharti; Patil, Shruti; Kotecha, Ketan; Vora, Deepali R.; Abraham, Ajith; Gabralla, Lubna Abdelkareim. Affiliations: Symbiosis Int Deemed Univ, Symbiosis Inst Technol, Pune 412115, Maharashtra, India; Symbiosis Int Deemed Univ, Symbiosis Ctr Appl Artificial Intelligence, Symbiosis Inst Technol, Pune 412115, Maharashtra, India; Machine Intelligence Res Labs MIR Labs, Auburn, WA 98071, USA; Princess Nourah Bint Abdulrahman Univ, Coll Appl, Dept Comp Sci & Informat Technol, Riyadh 11671, Saudi Arabia

Computer applications have shifted considerably from single data processing to machine learning in recent years, owing to the accessibility and availability of massive volumes of data obtained through the internet and various other sources. Machine learning automates human assistance by training an algorithm on relevant data. Supervised, unsupervised, and reinforcement learning are the three fundamental categories of machine learning techniques. In this paper, we discuss the different learning styles used in the fields of computer vision, deep learning, neural networks, and machine learning. Some of the most recent applications of machine learning in computer vision include object identification, object classification, and extracting usable information from images, graphic documents, and videos. Machine learning techniques used in computer vision to date frequently include zero-shot learning, active learning, contrastive learning, self-supervised learning, life-long learning, semi-supervised learning, ensemble learning, sequential learning, and multi-view learning. There is a lack of systematic reviews covering all learning styles. This paper presents a literature analysis of how different machine learning styles have evolved in the field of Artificial Intelligence (AI) for computer vision. This research examines and evaluates machine learning applications in computer vision and future forecasting. This paper will be helpful for researchers working with learning styles as it gives a deep insight into future directions.

Keywords: Machine learning; Computer vision; Object detection; Artificial intelligence; Machine learning algorithms; Image segmentation; Feature extraction; machine learning techniques; computer vision; supervised learning; multi-task learning; object detection; artificial intelligence; image categorization; zero-shot learning

Word to Sentence Visual Semantic Similarity for Caption Generation: Lessons Learned

18th International Conference on Machine Vision and Applications (MVA)

Authors: Sabir, Ahmed. Affiliation: Univ Politecn Cataluna, TALP Res Ctr, Barcelona, Spain

ISBN: (print) 9784885523434

This paper focuses on enhancing the captions generated by image captioning systems. We propose an approach for improving caption generation systems by choosing the most closely related output to the image rather than the most likely output produced by the model. Our model revises the language generation output beam search from a visual context perspective. We employ a visual semantic measure in a word and sentence level manner to match the proper caption to the related information in the image. This approach can be applied to any caption system as a post-processing method.

Keywords: Image enhancement
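
The post-processing idea in the abstract above, choosing the beam candidate most related to the image rather than the most likely one, can be sketched as follows. This is only an illustration under stated assumptions: the paper's visual semantic measure is not given in this record, so a simple word-overlap (Jaccard) score stands in for it, and the function names, candidate captions and visual context words are all hypothetical.

```python
def overlap_score(caption, visual_words):
    """Jaccard overlap between caption words and visual context words.

    A stand-in similarity, assumed for illustration; it is not the
    word- and sentence-level measure used in the paper.
    """
    words = set(caption.lower().split())
    context = set(w.lower() for w in visual_words)
    union = words | context
    return len(words & context) / len(union) if union else 0.0


def rerank(beam, visual_words):
    """Pick the candidate most related to the visual context.

    `beam` is a list of (caption, log_probability) pairs ordered from most
    to least likely; ties keep the more likely caption because max()
    returns the first maximal item.
    """
    return max(beam, key=lambda item: overlap_score(item[0], visual_words))[0]


# Hypothetical beam-search output and detected objects for one image.
beam = [
    ("a man riding a wave", -1.2),
    ("a man on a surfboard in the ocean", -1.5),
]
visual_context = ["surfboard", "ocean"]
print(rerank(beam, visual_context))
```

Because the function only consumes candidate captions and context words, it can sit behind any captioning model, which matches the abstract's description of the approach as a post-processing step.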

Robot Automatic Wire Welding Based on Machine Vision

5th International Conference on Computer Engineering and Application (ICCEA)

Authors: Liu, Xinsheng; Li, Lidan; Xing, Hongmei; Wang, Shaohua; Zhang, Chuan; Diao, Xiyao; Zhuo, Zhang. Affiliation: China North Vehicle Res Inst, Beijing, Peoples R China

ISBN: (print) 9798350386783; 9798350386776

Currently, the welding process between electrical connectors and multi-core wires mainly relies on manual operation. This traditional method not only consumes a lot of time and manpower, but long-term operation may also cause a physical burden and health hazards to the operator. Therefore, researching and implementing automated welding between electrical connectors and multi-core wires has become an urgent problem. On the basis of a summary of the current research status at home and abroad, the software and hardware parts of the system were designed to meet the requirements of identifying and positioning circular electrical connectors for welding. By introducing image processing and machine vision technology and adopting a dual machine collaboration approach, automatic wire welding of electrical connectors has been achieved, improving welding efficiency and reducing the labor intensity of operators. In addition, it is also conducive to promoting the development of industrial automation.

Keywords: Industrial robot; Automatic welding; Visual image; Aviation plug; Wires

A survey on deep learning in UAV imagery for precision agriculture and wild flora monitoring: Datasets, models and challenges

SMART AGRICULTURAL TECHNOLOGY, 2024, vol. 9

Authors: Epifani, Lorenzo; Caruso, Antonio. Affiliation: Palazzo Fiorini, Dept Math & Phys Ennio Giorgi, Campus Ecotekne, I-73100 Lecce, Italy

Machine learning is the state of the art for many recurring tasks in several heterogeneous domains. In the last decade, it has also been widely used in Precision Agriculture (PA) and Wild Flora Monitoring (WFM) to address a set of problems with a big impact on the economy, society and academia, heralding a paradigm shift across industry and academia. Many applications in those fields involve image processing and computer vision stages. Remote sensing devices are a very popular choice for image acquisition in this context, and in particular, Unmanned Aerial Vehicles (UAVs) offer a good tradeoff between cost and area coverage. For these reasons, the research literature is rich in works that tackle problems in the Precision Agriculture and Wild Flora Monitoring domains with machine learning and computer vision methods applied to UAV imagery. In this work, we review this literature, with a special focus on algorithms, model sizing, dataset characteristics and innovative technical solutions presented in many domain-specific models, providing the reader with an overview of the research trend in recent years.

Keywords: Machine learning; Deep neural networks; Image analysis; Unmanned aerial vehicles; Agritech
