检索结果-内蒙古大学图书馆

Improving image Clarity with Artificial Intelligence-Powered Super-Resolution Methods 1

3rd International Conference on Mechanical and Energy Technologies, ICMET 2023

作者： Malathy, v. Poornima, M. varun, v.L. venugopal, Padmaja Acharjee, Purnendu Bikash Krishnaveni, S. School of Engineering SR University Telangana Warangal India Department of Mathematics SJB Institute of Technology Bengaluru India Computer Science CHRIST University Pune Lavasa Campus Lavasa India Department of Management Studies Adithya Institute of Technology Coimbatore Kovilpalayam India

ISBN: (数字)9789819727162

ISBN: (纸本)9789819727155

Super-resolution has advanced significantly in the last 20 years, particularly with the application of deep learning methods. One of the most important image processing methods for boosting an image's resolution in computer vision is image super-resolution besides providing an extensive overview of the most recent developments in artificial intelligence and deep learning for single-image super-resolution. This study delves into the subject of image enhancement by investigating sophisticated AI-based super-resolution techniques. High-quality photographs have become more and more in demand in a variety of industries recently, including medical imaging, satellite imaging, entertainment, and surveillance. Pixilation reduction and detail preservation are two areas where traditional image enhancing techniques fall short. Artificial intelligence has demonstrated amazing promise in addressing these issues, especially with regard to Deep Learning models. The applications, benefits, and difficulties of modern super-resolution techniques are thoroughly examined in this work. We also suggest new approaches and push the limits of image enhancement by experimenting with state-of-the-art artificial intelligence algorithms. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

Deep internally connected transformer hashing for image retrieval

引用

KNOWLEDGE-BASED systems 2023年 279卷

作者： Chao, Zijian Cheng, Shuli Li, Yongming Xinjiang Univ Sch Informat Sci & Engn 777 Huarui St Urumqi 830017 Peoples R China

Transformer based on self-attention mechanism has made remarkable achievements in natural language processing, which inspired the application research of Transformer in computer vision. The current deep hashing algorithms extract image features through the convolutional neural network (CNN). CNN concentrates on local information, and features lack global dependency information, which has an impact on image retrieval accuracy. To remedy the above defects, this paper proposes deep internally connected Transformer hashing for image retrieval (DICTH). DICTH has designed an improved Transformer block: internally connected Transformer block (ICT). ICT performs an embedded transformation on the feature maps, splices the generated Keys and Queries, to explore the rich context information between Query-Key pairs, and then dynamically encodes through multi-layer convolution to learn the context multi-head self-attention matrix. By combining ICT and ResNet18 to achieve selfattention injection, a long-distance dependency is established in the feature space to make up for the shortcomings of pure CNN in the feature extraction process and guide the algorithm to learn more accurate hash codes. At the same time, in the face of complex label information in big data sets, this paper uses an improved cross-entropy loss function: T-cross-entropy loss, to promote network learning of hash codes with more ability to distinguish between classes. In this paper, a lot of experiments have been conducted on CIFAR10, NUS-WIDE and MS-COCO datasets to verify the performance of DICTH. (c) 2023 Elsevier B.v. All rights reserved.

关键词： Deep hashing image retrieval Transformer Self-attention

来源：评论

学校读者我要写书评

暂无评论

Hybrid Facial Expression Analysis Model using Quantum Distance-based Classifier and Classical Support vector Machine 11

Hybrid Facial Expression Analysis Model using Quantum Distan...

引用

11th International Symposium on Electronic systems Devices and Computing, ESDC 2023

作者： Rengasamy, Karthikeyan Joshi, Piyush Raveendra, v.v.S. Indian Institute of Information Technology Department of Computer Science Chittoor India Tata Consultancy Services Bfsi Chennai India

ISBN: (纸本)9781665455725

Rapid advancements in image and video processing technologies are poised to create remarkable impacts on a wide range of industries. A significant challenge in these processing technologies resides in identifying the features fed for image classification algorithms. Though all classification algorithms could identify, extract and classify the features of a given image, their accuracy is directly proportional to the number of sample points taken from the image using a sampling technique. As the accuracy improves with a substantial number of sample points, the time consumed to process them looms large. These challenges beseech enormous computing power. Quantum computers avowed exceptional computing power is expected to bridge the growing demands. To address these challenges effectively, we have chosen a specific problem, Facial Expression Analysis, to explore in-depth and arrive at a purposeful approach to deliver the desired outcome. The purpose of this paper is two-pronged. Perform a comparative study of accuracy and performance of classical and quantum image processing algorithms in classical and quantum computers, respectively. Secondly, devise a novel hybrid model using a quantum distance-based classifier augmented with a classical linear support vector machine to overcome the limitations observed. Sample image features derived from the quantum classifier were used to train the linear classifier. The results were observed to be better relative to results from the classical distance-based classifier. Holistically, the novel hybrid model is observed as a promising solution for all image classification problems. Our future work will focus on sophisticated usage of a linear classification algorithm in quantum computing. © 2023 IEEE.

关键词： image classification

来源：评论

学校读者我要写书评

暂无评论

Cloud Architectural Design for image-based vehicle Positioning in Traffic Management 28

Cloud Architectural Design for Image-based Vehicle Positioni...

引用

28th International Conference on System Theory Control and Computing

作者： Beti, Iosif-Alin Herghelegiu, Paul-Corneliu Amarandei, Cristian-Mihai Caruntu, Constantin-Florin Gheorghe Asachi Tech Univ Iasi Dept Automat Control & Appl Informat Iasi Romania Gheorghe Asachi Tech Univ Iasi Dept Comp Sci & Engn Iasi Romania

ISBN: (纸本)9798350364309;9798350364293

vehicle positioning algorithms are essential for improving traffic management and safety by accurately locating vehicles in real-time, and, thus, minimizing congestion and accidents. They also support the development of advanced driver assistance systems and autonomous vehicles, relying on precise positioning data for safe navigation. One of the solutions involves using image processing algorithms, which can have two approaches. One approach is decentralized, in which each vehicle performs its own computing steps and determines its position concerning the other nearby vehicles. The second approach, proposed in this paper, is centralized, where each vehicle sends data to a server that uses cloud computing to process all the data in real-time. As such, vehicles can create a more comprehensive view of the driving conditions in the area by using either of these two approaches, which can help them anticipate potential hazards and make more informed decisions.

关键词： computer vision traffic optimization vehicle platooning cloud computing 5G networks vehicle positioning v2v communication

来源：评论

学校读者我要写书评

暂无评论

Low-Light image Enhancement Based on Retinex Reflectance Compensation 36

Low-Light Image Enhancement Based on Retinex Reflectance Com...

引用

36th International Conference on Software Engineering and Knowledge Engineering, SEKE 2024

作者： Zhang, Zhaozheng Wang, Wujun Yang, Weiran Ma, Ningtao Sun, Mingyang Wang, Xinxin Ruan, Liangyu Yi, Ru College of Computer Science Inner Mongolia University Hohhot China

ISBN: (纸本)1891706594

Enhancement of low-light images is a low-level visual task aimed at improving the quality of images captured under low-light conditions. In this study, a low-light image enhancement algorithm is proposed by compensating for the reflection loss of illumination components obtained through object reflection. Specifically, the algorithm first utilizes Gaussian filtering to process the value component (v) of the image, separating the illumination component from the reflection component. Then, through a illumination compensation strategy, the illumination component is processed and combined with the reflectance component to synthesize the enhanced value component (v). Finally, an adaptive global balance strategy is applied to optimize the enhanced value component (v) to ensure that the resulting image appears more natural and conforms to human visual perception habits. Experimental results demonstrate the effectiveness and superiority of our method compared to existing traditional processing algorithms and deep learning methods, showing excellent performance in enhancing dark details of images and maintaining natural colors. © 2024 Knowledge systems Institute Graduate School. All rights reserved.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

A wandb based corrosion detection of metals through live video analytics 2

A wandb based corrosion detection of metals through live vid...

引用

2nd IEEE International Conference on IoT, Communication and Automation Technology, ICICAT 2024

作者： John Deva Prasanna, D.S. Sai Neeraj, Kuruvella Jaya Krishna, G. SRM Institute of Science and Technology Faculty of Engineering and Technology Department of DSBS India SRM Institute of Science and Technology Department of DSBS India

ISBN: (纸本)9798350368109

Detection of corrosion in moving objects like ships is challenging due to the dynamic nature of the input image. Existing machine learning techniques are suitable for static images and the algorithms suffer in performance when is a live video. In this paper, image processing for detecting corrosion using YOLOv8 which more suitable for processing live videos as speed and accuracy is better. This makes YOLO v8 for corrosion detection in live videos. In addition, Weights and Biases (W&B). is used in the algorithm as it is pivotal in establishing the connections between neurons and biases helps in circumventing flexible inputs. By combining YOLOv8's and W&B approach the accuracy and efficiency of corrosion detection systems is improved. This can ultimately assist in better maintenance and preservation of essential infrastructure resources. © 2024 IEEE.

关键词： video signal processing

来源：评论

学校读者我要写书评

暂无评论

Deep Learning-Driven Black Box Approach For image Segmentation 1

Deep Learning-Driven Black Box Approach For Image Segmentati...

引用

1st IEEE International Conference on Advances in Computing, Communication and Networking, ICAC2N 2024

作者： Sudha, Nuthalapati Kasetty, Sai Bhargav Madhuri, K. Sai Nikitha, Jajala Margret, Issac Neha Rajakumar, K. Mahindra University School of Engineering Telangana Hyderabad India V.I.T - University School of Computer Science and Engineering Tamilnadu Vellore India BVRIT Narsapur Dept. of Computer Science and Data Science Telangana Hyderabad India VNR VIET Dept. of CSE - AIML and IoT Telangana Hyderabad India

ISBN: (纸本)9798350356816

The diagnosis of a range of eye disorders needs to categorize the retinal vessels. Computerized implementation of this process is becoming increasingly essential for automated screening systems for retinal diseases. To achieve a more accurate extraction of the retinal vessels, a new pre-processing step is proposed. These proposed pre-processes are also compared to other algorithms to assess their impact. The proposed pre-processing process consists of two phases. The first phase is the implementation and validation of the pre-processing modules, and the second phase is the implementation of these pre-processes onto the retinal vessels that were to be extracted. To achieve a significantly improved segmented vessel image, the proposed pre-process phase employs a common image-processing technique. In recent years, there has been a great deal of focus on retinal vessel identification studies, and the importance of assessing and confirming the findings of retinal vessel segmentation. © 2024 IEEE.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

Interactive Sign Language Learning System using Deep Learning 2

Interactive Sign Language Learning System using Deep Learnin...

引用

2nd International Conference on Self Sustainable Artificial Intelligence systems, ICSSAS 2024

作者： Ravikumar, S. Praharsha, P. Dimple Priya, J. Sri Harsha Sai, v. Naga Rohit Vel Tech Rangarajan Dr. Sagunthala R & D Institute of Science and Technology Department of Computer Science and Engineering Avadi Tamil Nadu Chennai India

ISBN: (纸本)9798350368413

The 'Interactive Sign Language Learning System' is a sophisticated application designed to facilitate the learning process of sign language learners. This comprehensive system encompasses several key features, including sign language alphabet and word recognition, text-to-action conversion for learners, multi-language support, and integrated voice output functionality. The system utilizes advanced algorithms for sign language recognition, employing techniques such as image processing and machine learning to accurately interpret hand gestures and movements. For sign language alphabet and word recognition, a combination of computer vision algorithms, possibly including convolutional neural networks (CNNs) or recurrent neural networks (RNNs), may be employed to analyze input images or video streams and classify them into corresponding sign language symbols or words. Text-to-action conversion involves mapping textual input to corresponding sign language actions or gestures, possibly using natural language processing (NLP) algorithms to understand the semantics of the text and generate appropriate sign language representations. The system's accuracy, measured in terms of correctly recognized sign language symbols or words, would depend on the effectiveness of the algorithms employed and the quality of the training data. With rigorous development and training, the system aims to achieve high accuracy levels, in sign language recognition and text-to-action conversion tasks. © 2024 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

YOLO-based anomaly activity detection system for human behavior analysis and crime mitigation

引用

SIGNAL image AND vIDEO processing 2024年第SUPPL 1期18卷 417-427页

作者： Ganagavalli, K. Santhi, v. Bannari Amman Inst Technol Sathyamangalam India PSG Coll Technol Coimbatore India

Detecting the objects and tracking them is a wonder with the help of the recent technologies. In today's life, video surveillance is commonly present at maximum places. As the population increases, the crime rates also keep on increasing at different cases. The crime identification, dealing with the crime scenes and tracking down the criminal is a major task which involves numerous man powers. If there is a methodology for detecting and preventing the crime, it would be very much helpful to the public and also to the authorities. So the main objective of the proposed system is to automate the crime identification and crime tracking process by using the video surveillance with deep learning algorithms. Also, to alert the public before a crime takes place in public areas. Because of the timely alert, further crime activities may be avoided or the loss incurred may be reduced. The system also helps in identifying the criminals and tracking the information about the crime scenes, thus providing more useful information to the authorities in solving the cases. According to recent research works, the YOLO algorithm achieves higher accuracy with multi-object detection. In this paper, a framework with improvised YOLO algorithm is illustrated. The algorithm is fine-tuned with different hyperparameters for achieving the AUC with 0.91 for detecting vandalism behavior and 0.8299 over all the 14 classes of crime activities. The result of the proposed system is compared with the existing systems with parameters like training loss, testing loss, precision and F1 score.

关键词： Anomaly detection Abnormal behavior Anomaly activities Yolo Hyperparameters Multi object detection

来源：评论

学校读者我要写书评

暂无评论

image Understanding Through visual Question Answering: A Review from Past Research 23th

Image Understanding Through Visual Question Answering: A Rev...

引用

23rd International Conference on Intelligent systems Design and Applications, ISDA 2023

作者： Yanda, Nagamani Tagore Babu, J. Aswin Kumar, K. Taraka Rama Rao, M. Ranjith varma, K.v. Rahul Babu, N. GMR Institute of Technology Rajam532127 India

ISBN: (纸本)9783031648465

visual Question Answering (vQA) lies at the crossroads of computer vision, natural language processing, and deep learning, captivating researchers across various AI domains. This dynamic field involves processing an image alongside a corresponding textual question, generating, or selecting an answer from provided options. The past five years have witnessed substantial advancements in vQA and visual reasoning, fueled by deep learning and extensive annotated datasets. This study presents a comprehensive literature review, delving into the current state-of-the-art from four perspectives: problem definition, existing datasets, literature review, and evaluation metrics. Through a critical analysis, we address dataset limitations and scrutinize contemporary algorithms. Here we use multimodal Fusion, which achieves the state of the art compared to existing methodologies. Moreover, we explore potential future research directions to inspire innovative solutions and applications in this evolving domain, aiming to propel vQA into new realms of exploration and practical utility. This project will allow users to input an image and image-related text so that it will aid the question-answering system. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：