Recently, near-infrared to visible light facial image matching is gaining popularity, especially for low-light and night-time surveillance scenarios. Unlike most of the work in literature, we assume that the near-infr...
详细信息
Contextual information plays a critical role in object recognition models within computervision, where changes in context can significantly affect accuracy, underscoring models' dependence on contextual cues. thi...
详细信息
ISBN:
(纸本)9798400710759
Contextual information plays a critical role in object recognition models within computervision, where changes in context can significantly affect accuracy, underscoring models' dependence on contextual cues. this study investigates how context manipulation influences both model accuracy and feature attribution, providing insights into the reliance of object recognition models on contextual information as understood through the lens of feature attribution methods. We employ a range of feature attribution techniques to decipher the reliance of deep neural networks on context in object recognition tasks. Using the imageNet-9 and our curated imageNet-CS datasets, we conduct experiments to evaluate the impact of contextual variations, analyzed through feature attribution methods. Our findings reveal several key insights: (a) Correctly classified images predominantly emphasize object volume attribution over context volume attribution. (b) the dependence on context remains relatively stable across different context modifications, irrespective of classification accuracy. (c) Context change exerts a more pronounced effect on model performance than Context perturbations. (d) Surprisingly, context attribution in 'no-information' scenarios is non-trivial. Our research moves beyond traditional methods by assessing the implications of broad-level modifications on object recognition, either in the object or its context. Code available at https://***/nineRishav/Lost-In-Context
In computer numerical control machine tools, using machining simulation to prevent collision have become more popular due to higher machining speed. the 3D reconstruction of objects is one of the main goals for 3D mac...
详细信息
ISBN:
(纸本)9781728108513
In computer numerical control machine tools, using machining simulation to prevent collision have become more popular due to higher machining speed. the 3D reconstruction of objects is one of the main goals for 3D machine vision is presented in this paper. In the stereo vision, a 2D edge feature detection algorithm designed by integration of vision base imageprocessing which can generate a new straight line on the edges of the object. the matching comparison process is solved withthe Normalized Cross Correlation (NCC) to obtain the results of a new straight line matching to find disparity from distance between feature points on plane. the disparity can be obtained from two cameras by overlapped two objects that show the different positions of two objects in horizontal using the principles of geometry of two cameras. the different position results can find the coordinates of 3D (x, y, z) to be generating the straight lines on edge of the 3D objects. this method is simply and easier in the computing process for the reconstruction of the object. the results obtain are presented with an effective and accurate model and can take results input into the 3D recognition process.
Recognition of human actions is one of the important tasks in various computervision applications including video surveillance, human computer interaction etc. Traditionally RGB or depth cameras are utilized for this...
详细信息
Patch-based techniques are proven to generate promising results and outperform many of the existing state-of-art techniques for most of the applications in digital imageprocessing. In this work we develop a patch bas...
详细信息
Face recognition has attained a greater importance in biometric authentication due to its non-intrusive property of identifying individuals at varying stand-off distance. Face recognition based on multi-spectral imagi...
详细信息
ISBN:
(纸本)9781450347532
Face recognition has attained a greater importance in biometric authentication due to its non-intrusive property of identifying individuals at varying stand-off distance. Face recognition based on multi-spectral imaging has recently gained prime importance due to its ability to capture spatial and spectral information across the spectrum. Our first contribution in this paper is to use extended multi-spectral face recognition in two different age groups. the second contribution is to show empirically the performance of face recognition for two age groups. thus, in this paper, we developed a multi-spectral imaging sensor to capture facial database for two different age groups (<= 15years and >= 20years) at nine different spectral bands covering 530nm to 1000nm range. We then collected a new facial images corresponding to two different age groups comprises of 168 individuals. Extensive experimental evaluation is performed independently on two different age group databases using four different state-of-the-art face recognition algorithms. We evaluate the verification and identification rate across individual spectral bands and fused spectral band for two age groups. the obtained evaluation results shows higher recognition rate for age groups >= 20years than <= 15years, which indicates the variation in face recognition across the different age groups.
the problem of automatically extracting anomalous events from any given video is a problem that has been researched from the early days of computervision. It has still not been fully solved, showing that it is indeed...
详细信息
Recently, the interest in Micro Aerial Vehicles (MAVs) and their autonomous flights has increased tremendously and significant advances have been made. the monocular camera has turned out to be most popular sensing mo...
详细信息
ISBN:
(纸本)9781450347532
Recently, the interest in Micro Aerial Vehicles (MAVs) and their autonomous flights has increased tremendously and significant advances have been made. the monocular camera has turned out to be most popular sensing modality for MAVs as it is light-weight, does not consume more power, and encodes rich information about the environment around. In this paper, we present DeepFly, our framework for autonomous navigation of a quadcopter equipped with monocular camera. the navigable space detection and waypoint selection are fundamental components of autonomous navigation system. they have broader meaning than just detecting and avoiding immediate obstacles. Finding the navigable space emphasizes equally on avoiding obstacles and detecting ideal regions to move next to. the ideal region can be defined by two properties: 1) All the points in the region have approximately same high depth value and 2) the area covered by the points of the region in the disparity map is considerably large. the waypoints selected from these navigable spaces assure collision-free path which is safer than path obtained from other waypoint selection methods which do not consider neighboring information. In our approach, we obtain a dense disparity map by performing a translation maneuver. this disparity map is input to a deep neural network which predicts bounding boxes for multiple navigable regions. Our deep convolutional neural network with shortcut connections regresses variable number of outputs without any complex architectural add on. Our autonomous navigation approach has been successfully tested in both indoors and outdoors environment and in range of lighting conditions.
In the field of computervision and imageprocessing, image similarity has been a central concern for decades. If you compare two pictures, image Similarity returns a value that tells you how physically they are close...
详细信息
ISBN:
(纸本)9781728185019
In the field of computervision and imageprocessing, image similarity has been a central concern for decades. If you compare two pictures, image Similarity returns a value that tells you how physically they are close. A quantitative measure of the degree of correspondence between the images concerned is given by this test. the score of the similarity between images varies from 0 to 1. In this paper, ORB (Oriented Fast Rotated Brief) algorithm is used to measure the similarity and other types of similarity measures like Structural Similarity Index (SSIM), pixel similarity, Earth mover's Distance are used to obtain the score. When two images are compared, it shows how much identical (common) objects are there in the two images. So, the accuracy or similarity score is about 87 percent when the two images are compared.
Within this framework we describe a novel technique for removing noise from digital noisy images, based on the modeling of wavelet coefficient with bivariate normal distribution and statistical calculation. A method f...
详细信息
暂无评论