The proceedings contain 54 papers. The special focus in this conference is on Computer Vision and imageprocessing. The topics include: CT image Synthesis from MR image Using Edge-Aware Generative Adversarial Network;...
ISBN:
(纸本)9783031314063
The proceedings contain 54 papers. The special focus in this conference is on Computer Vision and imageprocessing. The topics include: CT image Synthesis from MR image Using Edge-Aware Generative Adversarial Network;Modified Scaled-YOLOv4: Soccer Player and Ball Detection for realtime Implementation;candidNet: A Novel Framework for Candid Moments Detection;cost Efficient Defect Detection in Bangle Industry Using Transfer Learning;single image Dehazing Using Multipath Networks Based on Chain of U-Nets;leveraging Tri-Planar Views and Weighted Average Fusion Technique to Classify Lung Nodule Malignancy;a Bayesian Approach to Gaussian-Impulse Noise Removal Using Hessian Norm Regularization;deepTemplates: Object Segmentation Using Shape Templates;Data-Centric Approach to SAR-Optical image Translation;MIS-Net: A Deep Residual Network Based on Memorised Pooling Indices for Medical image Segmentation;linear and Non-Linear Filter-based Counter-Forensics Against image Splicing Detection;Ischemic Stroke Lesion Segmentation in CT Perfusion images Using U-Net with Group Convolutions;Multi-generator MD-GAN with Reset Discriminator: A Framework to Handle Non-IID Data;video Colorization Using Modified Autoencoder Generative Adversarial Networks;real-time Violence Detection Using Deep Neural Networks and DTW;skin Disease Detection Using Saliency Maps and Segmentation Techniques;an Alternate Approach for Single image Haze Removal Using Path Prediction;Detecting Tropical Cyclones in INSAT-3D Satellite images Using CNN-Based Model;Two Stream RGB-LBP Based Transfer Learning Model for Face Anti-spoofing;Logarithmic Progressive-SMOTE: Oversampling Minorities in Retinal Fundus Multi-disease image Dataset;HD-VAE-GAN: Hiding Data with Variational Autoencoder Generative Adversarial Networks;sequence Recognition in Bharatnatyam Dance;Multi-modality Fusion for Siamese Network Based RGB-T Tracking (mfSiamTrack).
In contemporary times, with the increasing demand for rail transportation, the preservation of the integrity and overall condition of railways holds paramount importance. This is because any damage to these routes can...
详细信息
The proceedings contain 92 papers. The special focus in this conference is on image Analysis and processing. The topics include: An Effective CNN-Based Super Resolution Method for video Coding;medical Transformers for...
ISBN:
(纸本)9783031431470
The proceedings contain 92 papers. The special focus in this conference is on image Analysis and processing. The topics include: An Effective CNN-Based Super Resolution Method for video Coding;medical Transformers for Boosting Automatic Grading of Colon Carcinoma in Histological images;FERMOUTH: Facial Emotion Recognition from the MOUTH Region;consensus Ranking for Efficient Face image Retrieval: A Novel Method for Maximising Precision and Recall;towards Explainable Navigation and Recounting;towards Facial Expression Robustness in Multi-scale Wild Environments;depth Camera Face Recognition by Normalized Fractal Encodings;automatic Generation of Semantic Parts for Face image Synthesis;improved Bilinear Pooling for real-time Pose Event Camera Relocalisation;continual Source-Free Unsupervised Domain Adaptation;End-to-End Asbestos Roof Detection on Orthophotos Using Transformer-Based YOLO Deep Neural Network;OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data;UAV Multi-object Tracking by Combining Two Deep Neural Architectures;GLR: Gradient-Based Learning Rate Scheduler;a Large-scale Analysis of Athletes’ Cumulative Race time in Running Events;uncovering Lies: Deception Detection in a Rolling-Dice Experiment;active Class Selection for Dataset Acquisition in Sign Language Recognition;MC-GTA: A Synthetic Benchmark for Multi-Camera Vehicle Tracking;a Differentiable Entropy Model for Learned image Compression;learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation;self-Similarity Block for Deep image Denoising;SCENE-pathy: Capturing the Visual Selective Attention of People Towards Scene Elements;not with My Name! Inferring Artists’ Names of Input Strings Employed by Diffusion Models;benchmarking of Blind video Deblurring Methods on Long Exposure and Resource Poor Settings;LieToMe: An LSTM-Based Method for Deception Detection by Hand Movements;spatial Transformer Generative Adversarial Network for image Super
The proceedings contain 54 papers. The special focus in this conference is on Computer Vision and imageprocessing. The topics include: CT image Synthesis from MR image Using Edge-Aware Generative Adversarial Network;...
ISBN:
(纸本)9783031314162
The proceedings contain 54 papers. The special focus in this conference is on Computer Vision and imageprocessing. The topics include: CT image Synthesis from MR image Using Edge-Aware Generative Adversarial Network;Modified Scaled-YOLOv4: Soccer Player and Ball Detection for realtime Implementation;candidNet: A Novel Framework for Candid Moments Detection;cost Efficient Defect Detection in Bangle Industry Using Transfer Learning;single image Dehazing Using Multipath Networks Based on Chain of U-Nets;leveraging Tri-Planar Views and Weighted Average Fusion Technique to Classify Lung Nodule Malignancy;a Bayesian Approach to Gaussian-Impulse Noise Removal Using Hessian Norm Regularization;deepTemplates: Object Segmentation Using Shape Templates;Data-Centric Approach to SAR-Optical image Translation;MIS-Net: A Deep Residual Network Based on Memorised Pooling Indices for Medical image Segmentation;linear and Non-Linear Filter-based Counter-Forensics Against image Splicing Detection;Ischemic Stroke Lesion Segmentation in CT Perfusion images Using U-Net with Group Convolutions;Multi-generator MD-GAN with Reset Discriminator: A Framework to Handle Non-IID Data;video Colorization Using Modified Autoencoder Generative Adversarial Networks;real-time Violence Detection Using Deep Neural Networks and DTW;skin Disease Detection Using Saliency Maps and Segmentation Techniques;an Alternate Approach for Single image Haze Removal Using Path Prediction;Detecting Tropical Cyclones in INSAT-3D Satellite images Using CNN-Based Model;Two Stream RGB-LBP Based Transfer Learning Model for Face Anti-spoofing;Logarithmic Progressive-SMOTE: Oversampling Minorities in Retinal Fundus Multi-disease image Dataset;HD-VAE-GAN: Hiding Data with Variational Autoencoder Generative Adversarial Networks;sequence Recognition in Bharatnatyam Dance;Multi-modality Fusion for Siamese Network Based RGB-T Tracking (mfSiamTrack).
Pedestrian detection and tracking, particularly in low-light conditions, is critical for ensuring road safety, especially for vulnerable populations like senior citizens. Traditional methods often struggle with multip...
详细信息
We present an approach to automatic visible light/infrared (VL/IR) image registration that leverages multiple visible light apertures for fast computation on resource-constrained systems. VL/IR registration is computa...
详细信息
ISBN:
(纸本)9781510650916;9781510650909
We present an approach to automatic visible light/infrared (VL/IR) image registration that leverages multiple visible light apertures for fast computation on resource-constrained systems. VL/IR registration is computationally challenging due to the different modalities of image generation. Although feature-based algorithms for direct registration exist, these methods proved too complex to reliably perform registration on low-cost, embedded processors in realtime. We instead employed a second VL camera to dynamically estimate 2D translations aligning the brightest (warmest) objects in the IR video stream with their counterparts in the first VL video stream. Regions of interest are first selected based on the brightest areas in the IR image, as our application is primarily concerned with detecting objects warmer than background. The same broad region - e.g. the lower-left quadrant of the frame - is then selected in the VL1 and VL2 images. The translation that best registers the first VL ROI to the second is estimated through template matching. Because all apertures in our camera system are fixed and coplanar relative to one another, the translation that best aligns the IR ROI to the VL1 ROI can be estimated from the translation from the VL2 ROI to the VL1 ROI. This approach provides dynamic registration of 1080P video at upwards of 10Hz on an ODROID-XU4 single-board computer, while also allowing the processor time to render the IR-augmented video stream at 20Hz. imagery collected using Deep Analytics' IR Boom Camera will be presented to demonstrate the approach.
The proceedings contain 9 papers. The special focus in this conference is on Design and Architectures for Signal and imageprocessing. The topics include: Brain Blood Vessel Segmentation in Hyperspectral images T...
ISBN:
(纸本)9783031299698
The proceedings contain 9 papers. The special focus in this conference is on Design and Architectures for Signal and imageprocessing. The topics include: Brain Blood Vessel Segmentation in Hyperspectral images Through Linear Operators;Neural Network Predictor for Fast Channel Change on DVB Set-Top-Boxes;AINoC: New Interconnect for Future Deep Neural Network Accelerators;real-time FPGA Implementation of the Semi-global Matching Stereo Vision Algorithm for a 4K/UHD video Stream;TaPaFuzz - An FPGA-Accelerated Framework for RISC-V IoT Graybox Fuzzing;Adaptive Inference for FPGA-Based 5G Automatic Modulation Classification;High-Level Online Power Monitoring of FPGA IP Based on Machine Learning.
This paper proposes an efficient moving objects detection pipeline focusing on dynamic object detection on video streams captured by traffic monitoring cameras. While developing autonomous vehicle systems, we found th...
详细信息
ISBN:
(纸本)9798350320565
This paper proposes an efficient moving objects detection pipeline focusing on dynamic object detection on video streams captured by traffic monitoring cameras. While developing autonomous vehicle systems, we found that views from self-driving vehicles can be occluded by dynamic or static objects on the street. Whereas infrastructure nodes such as traffic monitoring cameras having broader fieldof-views and better perspectives can be used as auxiliary sensors to share traffic information with nearby self-driving cars in real-time. However, these infrastructure cameras usually have constrained computation resources, and detecting hundreds of static background objects in consecutive video frames is wasteful. In our detection pipeline, we leverage the image eccentricity analysis as a pre-processing step to fast generate moving objects segmentation maps. These maps are used to mask the original images to get images that only contain the moving objects in the scene. These sparse images are then passed to an object detection model built with a sparse convolution backbone network, resulting in significant reduction in computational costs. Our quantitative experiments illustrate that the proposed detection pipeline can achieve up to 50% inference speedup with negligible detection accuracy drop in images obtained from traffic monitoring cameras.
Lane-line detection is one of the key technologies in the field of intelligent driving. This paper changes the anchor box regression loss function of the YOLOv5s to the EIoU function to construct a kind of lane-line d...
详细信息
Identifying and minimizing physiological artifacts in EEG is challenging because these artifacts may corrupt the underlying brain activity severely. In this work, we proposed a hybrid approach to detect/reduce EEG art...
详细信息
暂无评论