ISBN: 9798331510831 (print)
The proceedings contain 929 papers. The topics discussed include: image adaptation for color vision deficient viewers using vision transformers; a regional-level resource-saving model for winter road surface snow detection in extreme weathers; beyond grids: exploring elastic input sampling for vision transformers; loose social-interaction recognition in real-world therapy scenarios; adversarial attention deficit: fooling deformable vision transformers with collaborative adversarial patches; enhancing scene graph generation with hierarchical relationships and commonsense knowledge; bandit-based attention mechanism in vision transformers; pre-capture privacy via adaptive single-pixel imaging; and context-aware outlier rejection for robust multi-view 3D tracking of similar small birds in an outdoor aviary.
ISBN: 9798350318920 (print)
The proceedings contain 845 papers. The topics discussed include: partial binarization of neural networks for budget-aware efficient learning; GC-MVSNet: multi-view, multi-scale, geometrically-consistent multi-view stereo; contrastive viewpoint-aware shape learning for long-term person re-identification; ARNIQA: learning distortion manifold for image quality assessment; improving fairness using vision-language driven image augmentation; VCISR: blind single image super-resolution with video compression synthetic data; PIDiffu: pixel-aligned diffusion model for high-fidelity clothed human reconstruction; continuous adaptation for interactive segmentation using teacher-student architecture; learning to recognize occluded and small objects with partial inputs; and reverse knowledge distillation: training a large model using a small one for retinal image matching on limited data.
ISBN: 9798350370287 (print)
The proceedings contain 123 papers. The topics discussed include: the SARFish dataset and challenge; NORPPA: NOvel ringed seal re-identification by pelage pattern aggregation; multiple toddler tracking in indoor videos; challenges in video-based infant action recognition: a critical examination of the state of the art; KABR: in-situ dataset for Kenyan animal behavior recognition from drone videos; the hitchhiker's guide to endangered species pose estimation; efficient domain adaptation via generative prior for 3D infant pose estimation; dynamic Gaussian splatting from markerless motion capture can reconstruct infants movements; neural texture puppeteer: a framework for neural geometry and texture rendering of articulated shapes, enabling re-identification at interactive speed; and DigiDogs: single-view 3D pose estimation of dogs using synthetic training data.
ISBN: 9781665493468 (print)
The proceedings contain 633 papers. The topics discussed include: token pooling in vision transformers for image classification; D2F2WOD: learning object proposals for weakly-supervised object detection via progressive domain adaptation; composite relationship fields with transformers for scene graph generation; towards few-annotation learning for object detection: are transformer-based models more efficient?; scaling novel object detection with weakly supervised detection transformers; dense prediction with attentive feature aggregation; boosting vision transformers for image retrieval; two-level data augmentation for calibrated multi-view detection; TCAM: temporal class activation maps for object localization in weakly-labeled unconstrained videos; dynamic mixture of counter network for location-agnostic crowd counting; and simultaneous acquisition of high quality RGB image and polarization information using a sparse polarization sensor.
ISBN: 9781665419673 (print)
The proceedings contain 23 papers. The topics discussed include: per-frame mAP prediction for continuous performance monitoring of object detection during deployment; geeks and guests: estimating player's level of experience from board game behaviors; domain adaptive knowledge distillation for driving scene semantic segmentation; DriveGuard: robustification of automated driving systems with deep spatio-temporal convolutional autoencoder; facial expression neutralization with StoicNet; neural vision-based semantic 3D world modeling; 2020 sequestered data evaluation for known activities in extended video: summary and results; reliability of GAN generated data to train and validate perception systems for autonomous vehicles; and explainable fingerprint ROI segmentation using Monte Carlo dropout.
ISBN: 9780738142661 (print)
The proceedings contain 406 papers. The topics discussed include: TB-Net: a three-stream boundary-aware network for fine-grained pavement disease segmentation; learning to generate dense point clouds with textures on multiple categories; detecting human-object interaction with mixed supervision; deep preset: blending and retouching photos with color style transfer; how to make a BLT sandwich? learning VQA towards understanding web instructional videos; exploration of spatial and temporal modeling alternatives for HOI; do we really need gold samples for sample weighting under label noise?; multi-frame recurrent adversarial network for moving object segmentation; unsupervised meta-domain adaptation for fashion retrieval; and a unified framework for compressive video recovery from coded exposure techniques.
ISBN: 9781728171623 (print)
The proceedings contain 24 papers. The topics discussed include: mitigating algorithmic bias: evolving an augmentation policy that is non-biasing; bumblebee re-identification dataset; activity detection in untrimmed videos using chunk-based classifiers; exploring techniques to improve activity recognition using human pose skeletons; Syn2Real: forgery classification via unsupervised domain adaptation; re-identification of zebrafish using metric learning; similarity learning networks for animal individual re-identification - beyond the capabilities of a human observer; summary of the 2019 activity detection in extended videos prize challenge; and impact of ImageNet model selection on domain adaptation.
ISBN: 9781728165530 (print)
The proceedings contain 378 papers. The topics discussed include: weakly supervised Gaussian networks for action detection; multi receptive field network for semantic segmentation; two-grid preconditioned solver for bundle adjustment; extracting identifying contours for African elephants and humpback whales using a learned appearance model; microbatchGAN: stimulating diversity with multi-adversarial discrimination; appearance and shape from water reflection; LEAF-QA: locate, encode & attend for figure question answering; multiparty visual co-occurrences for estimating personality traits in group meetings; an extended exposure fusion and its application to single image contrast enhancement; plugin networks for inference under partial evidence; and image to video domain adaptation using web supervision.