Deep learning advancements have significantly enhanced computer visionapplications in precision agriculture. While RGB cameras operating in visible light are affordable, they provide limited information compared to m...
详细信息
In the era of digitization and big data, the world is inundated with an ever-growing volume of visual content, be it images or videos. As organizations strive to harness the potential of these multimedia data sources,...
详细信息
The face is a critical perspective in predicting human feelings and moods. More frequently than not human senti-ments are extricated with the utilization of the camera. Various applications are being made based on the...
详细信息
The proceedings contain 134 papers. The topics discussed include: an adaptive storage switching algorithm for fault-tolerant network attached storage systems;Covid-19 prediction using machine learning algorithms;energ...
ISBN:
(纸本)9798350387933
The proceedings contain 134 papers. The topics discussed include: an adaptive storage switching algorithm for fault-tolerant network attached storage systems;Covid-19 prediction using machine learning algorithms;energy management of hybrid electric vehicles using cascaded fuzzy logic controller;dynamic lane management with IoT for real-time lane configuration and traffic flow;a closer look at sclera: emerging trends in biometric security;cognitive vision companion: an ai-enhanced support system for the visually impaired;advances in medical imageprocessing for liver tumor recognition: a comprehensive survey;a gradient boosting algorithm to predict energy consumption for home applications;and review on text classification using improved deep learning models.
An image fusion is a kind of single process which combines the necessary or efficient information from a set of different or similar input images into a single output image where the resulting image is more accurate, ...
详细信息
Gradient computing is a low-level technology widely used in imageprocessing. For large gradient magnitude, the pixel value in the field changes a lot, and for small gradient magnitude the pixel in the domain changes ...
详细信息
ISBN:
(纸本)9781665464680
Gradient computing is a low-level technology widely used in imageprocessing. For large gradient magnitude, the pixel value in the field changes a lot, and for small gradient magnitude the pixel in the domain changes little. This is the basis of classical edge extraction algorithms, but it is often necessary to manually set thresholds to differentiate. This paper innovatively brings out the concept of omnidirectional gradient, which uses flexible convolution kernel radius and special law to calculate, and omnidirectional gradient pays more attention to gradient direction and analyzes the relationship and change of the gradient direction with different kernel radius. We present here an algorithm for stylized edge extraction based on omnidirectional gradient, overcoming the drawback of classical edge extraction algorithms that require manual thresholding. Experimental results show that the proposed method outperforms the classical edge extraction methods in terms of adaptive, consistent, and visually friendlier features for infrared imaging. In addition, the algorithm is fast and efficient, its result can be used as real-time input for subsequent applications.
Humans outperform object recognizers despite the fact that models perform well on current datasets, including those explicitly designed to challenge machines with debiased images or distribution shift. This problem pe...
详细信息
ISBN:
(纸本)9781713899921
Humans outperform object recognizers despite the fact that models perform well on current datasets, including those explicitly designed to challenge machines with debiased images or distribution shift. This problem persists, in part, because we have no guidance on the absolute difficulty of an image or dataset making it hard to objectively assess progress toward human-level performance, to cover the range of human abilities, and to increase the challenge posed by a dataset. We develop a dataset difficulty metric MVT, Minimum Viewing Time, that addresses these three problems. Subjects view an image that flashes on screen and then classify the object in the image. images that require brief flashes to recognize are easy, those which require seconds of viewing are hard. We compute the imageNet and ObjectNet image difficulty distribution, which we find significantly undersamples hard images. Nearly 90% of current benchmark performance is derived from images that are easy for humans. Rather than hoping that we will make harder datasets, we can for the first time objectively guide dataset difficulty during development. We can also subset recognition performance as a function of difficulty: model performance drops precipitously while human performance remains stable. Difficulty provides a new lens through which to view model performance, one which uncovers new scaling laws: vision-language models stand out as being the most robust and human-like while all other techniques scale poorly. We release tools to automatically compute MVT, along with image sets which are tagged by difficulty. Objective image difficulty has practical applications - one can measure how hard a test set is before deploying a real-world system - and scientific applications such as discovering the neural correlates of image difficulty and enabling new object recognition techniques that eliminate the benchmark-vsreal-world performance gap.
The proceedings contain 22 papers. The topics discussed include: biometric-based unique identification for bovine animals — comparative study of various machine and deep learning computer vision methods;review of gro...
ISBN:
(纸本)9798350333299
The proceedings contain 22 papers. The topics discussed include: biometric-based unique identification for bovine animals — comparative study of various machine and deep learning computer vision methods;review of groundwater potential storage and recharge zone map delineation using statistics based hydrological and machine learning based artificial intelligent models;significance of cyber security of IoT devices in the healthcare sector;cryptocurrency with blockchain technology — a literature review;federated learning to preserve the privacy of user data;machine learning techniques for heart disease prediction;Alzheimer prediction using machine learning algorithm;design and control of omnidirectional conveyor model using imageprocessing;and leveraging IoT technologies in retail industry to improve customer experience: current applications and future potential.
In view of the demand for cigarette case appearance quality detection in the production process of cigarette enterprises, a machinevision-based method for detecting cigarette case appearance defects is proposed, and ...
详细信息
This review article about Few-Shot Learning techniques is focused on Computer visionapplications based on Deep Convolutional Neural Networks. A general discussion about Few-Shot Learning is given, featuring a context...
详细信息
ISBN:
(纸本)9783031133244;9783031133237
This review article about Few-Shot Learning techniques is focused on Computer visionapplications based on Deep Convolutional Neural Networks. A general discussion about Few-Shot Learning is given, featuring a context-constrained description, a short list of applications, a description of a couple of commonly used techniques and a discussion of the most used benchmarks for FSL computer visionapplications. In addition, the paper features a few examples of recent publications in which FSL techniques are used for training models in the context of Human Behaviour Analysis and Smart City Environment Safety. These examples give some insight about the performance of state-of-the-art FSL algorithms, what metrics do they achieve, and how many samples are needed for accomplishing that.
暂无评论