With the rapid development of big data and the Internet of Things technology, people have easier access to data, which leads to different perspectives when observing data. As a result, multi-view data has emerged, whi...
详细信息
The proceedings contain 85 papers. The topics discussed include: audio processing for tones validation using deep learning;effective generation of visual questions;elicitation of intracerebral hemorrhage using deep le...
ISBN:
(纸本)9781728168852
The proceedings contain 85 papers. The topics discussed include: audio processing for tones validation using deep learning;effective generation of visual questions;elicitation of intracerebral hemorrhage using deep learning;underwater image enhancement using color constancy via homomorphic filtering and depth estimation;fusion based underwater image enhancement and detail preserving;FPGA based feature extraction in real time computervision - a comprehensive survey;enhancement of liver ultrasoundimages by guided image filtering technique;exploration of horizontal and vertical components in polarimetric decomposition based on volume scattering;audio source count estimation using deep learning;detection of pneumonia from x-ray images using eigen decomposition and machine learning techniques;audio based detection of saw blade sharpness using machine learning;performance analysis and evaluation of estimator for RF cavity detuning measurement;speaker identification and verification using deep learning;and motor imagery based EEG signal classification using multi-scale CNN architecture.
The similarity measure plays the key role in the self-learning framework for single image super-resolution. This paper involves matrix regression with properties of robustness and two-dimensional structure to measure ...
详细信息
The similarity measure plays the key role in the self-learning framework for single image super-resolution. This paper involves matrix regression with properties of robustness and two-dimensional structure to measure the similarity between image blocks and enhance the effect of super-resolution. Specifically, we use the minimal nuclear norm of representation error as a criterion, and the alternating direction method of multipliers (ADMM) to calculate the similarity between high- and low-resolution image blocks. Evaluation on several images with different interference and experimental results of super-resolution images clearly demonstrate the advantages of our proposed method in visual robustness and super-resolution effects.
In today's digital world, most information is shared through images or videos. images and videos are more infor-mative than textual data. During the image or video capturing process, addition of noise is one of th...
详细信息
image captioning is a challenging task that lies at the intersection of computervision and Natural Language processing. There exists a legion of works that generate meaningful and realistic descriptions of images. Re...
image captioning is a challenging task that lies at the intersection of computervision and Natural Language processing. There exists a legion of works that generate meaningful and realistic descriptions of images. Recently, with the advent of attention mechanisms and transformers, there has been a drastic shift in modelling both language andvision tasks. However, there are very few extensive studies that review these approaches based on their progression, advantages and disadvantages. This paper presents a detailed summary of transformer-based models employed for tackling image captioning. In addition to this, we provide an overview of various pre-training tasks, datasets and metrics used for image captioning. Finally, the performance of all the reviewed approaches are compared on the COCO Captions dataset.
Wildfires can cause significant damage to forests and endanger wildlife. Detecting these forest fires at the initial stages helps the authorities in preventing them from spreading further. In this paper, we first prop...
详细信息
ISBN:
(纸本)9781450398473
Wildfires can cause significant damage to forests and endanger wildlife. Detecting these forest fires at the initial stages helps the authorities in preventing them from spreading further. In this paper, we first propose a novel technique, termed CIELAB-color technique, which detects fire based on the color of the fire in CIELAB color space. We train state-of-art CNNs to detect fire. Since deep learning (CNNs) andimageprocessing have complementary strengths, we combine their strengths to propose an ensemble architecture. It uses two CNNs and the CIELAB-color technique and then performs majority voting to decide the final fire/no-fire prediction output. We finally propose a chain-of-classifiers technique which first tests an image using the CIELAB-color technique. If an image is flagged as no-fire, then it further checks the image using a CNN. This technique has lower model size than ensemble technique. On FLAME dataset, the ensemble technique provides 93.32% accuracy, outperforming both previous works (88.01% accuracy) and individually using either CNNs or CIELAB-color technique. The source code can be obtained from https://***/CandleLabAI/FireDetection.
Today's industrial production is developing in the direction of intelligence, in order to improve production efficiency and supervise production. Nowadays, traditional manufacturing industry is gradually paying at...
详细信息
In medical applications, the boundless potential of imageprocessing utilizing Deep Neural Networks has grabbed the interest of researchers. Brain tumor segmentation, which is a crucial piece of task, determines the l...
详细信息
During the pandemic time government took many safety measures to protect the public at common gathering places. People are insisted on wearing a face mask to protect themselves from COVID. Even then many people were r...
详细信息
This paper aims to explore an innovative method combining computervision and machine learning to accurately identify and analyze various movements in badminton. This paper first summarizes the application prospect of...
详细信息
ISBN:
(数字)9798350360240
ISBN:
(纸本)9798350384161
This paper aims to explore an innovative method combining computervision and machine learning to accurately identify and analyze various movements in badminton. This paper first summarizes the application prospect of computervision in the field of sports analysis, and introduces its specific application scenarios in badminton in detail. By constructing a complete technical framework of image preprocessing module, feature extraction algorithm and deep learning model, the complex movements of badminton players such as swing, stroke and moving pace are captured and analyzed. In the research process, we used multi-view image fusion and key point detection technology to accurately extract action features in badminton, combined with convolutional neural network (CNN), recurrent neural network (RNN), long term memory network (LSTM) and other deep learning models to efficiently learn and model these features. Thus, the automatic classification and recognition of badminton movement can be realized. The experimental results show that the model has significant accuracy in badminton action recognition, good generalization ability and practicability, and can be effectively applied in the badminton teaching and training process of athlete performance evaluation, competition data analysis and other aspects. This research result not only expands the practical application of computervision technology in the field of badminton, but also provides new ideas and tools for further promoting the development of sports intelligence and digitalization.
暂无评论