Volume 2 of 2 of the conference proceedings contains 114 papers. Topics discussed include statistical methods and learning, matching, pattern recognition, tracking, sensors, feature extraction, shape, correspondence and reconstruction, structure and motion, and systems and applications.
ISBN: 9781728171685 (print)
The proceedings contain 2 papers. The topics discussed include: attention mechanism exploits temporal contexts: real-time 3D human pose reconstruction; and cascaded deep monocular 3D human pose estimation with evolutionary training data.
The proceedings contain 471 papers. The topics discussed include: depth acquisition from density modulated binary patterns; rolling Riemannian manifolds to solve the multi-class classification problem; exploring compositional high order pattern potentials for structured output learning; kernel methods on the Riemannian manifold of symmetric positive definite matrices; discovering the structure of a planar mirror system from multiple observations of a single point; tensor-based human body modeling; improving the visual comprehension of point sets; detecting changes in 3D structure of a scene from multi-view images captured by a vehicle-mounted camera; templateless quasi-rigid shape modeling with implicit loop-closure; shape from silhouette probability maps: reconstruction of thin objects in the presence of silhouette extraction and calibration error; and joint geodesic upsampling of depth images.
ISBN: 9798350353006 (print)
The proceedings contain 2715 papers. The topics discussed include: revisiting adversarial training at scale; SPIDeRS: structured polarization for invisible depth and reflectance sensing; MA-LMM: memory-augmented large multimodal model for long-term video understanding; geometrically-driven aggregation for zero-shot 3D point cloud understanding; TextCraftor: your text encoder can be image quality controller; ViLa-MIL: dual-scale vision-language multiple instance learning for whole slide image classification; HumanNorm: learning normal diffusion model for high-quality and realistic 3D human generation; an empirical study of scaling law for scene text recognition; improving image restoration through removing degradations in textual representations; and steganographic passport: an owner and user verifiable credential for deep model IP protection without retraining.
ISBN: 9780769549903 (print)
The proceedings contain 156 papers. The topics discussed include: real-time mobile food recognition system; style finder: fine-grained clothing style detection and retrieval; stereo camera tracking for mobile devices; towards auto-calibration of smart phones using orientation sensors; detection of moving objects with non-stationary cameras in 5.8ms: bringing motion detection to your mobile device; mobile video capture of multi-page documents; collision detection for visually impaired from a body-mounted camera; video demo: an egocentric vision based assistive co-robot; mobile exergames - burn calories while playing games on a smartphone; a mobile vision system for fast and accurate ellipse detection; stabilization of magnified videos on a mobile device for visually impaired; and an augmented linear discriminant analysis approach for identifying identical twins with the aid of facial asymmetry features.
ISBN: 9781665445092 (print)
The proceedings contain 1658 papers. The topics discussed include: single-stage instance shadow detection with bidirectional relation learning; learning Delaunay surface elements for mesh reconstruction; fusing the old with the new: learning relative camera pose with geometry-guided uncertainty; uncertainty guided collaborative training for weakly supervised temporal action detection; privacy-preserving collaborative learning with automatic transformation search; rethinking and improving the robustness of image style transfer; style-aware normalized loss for improving arbitrary style transfer; faster meta update strategy for noise-robust deep learning; a hyperbolic-to-hyperbolic graph convolutional network; training networks in null space of feature covariance for continual learning; and exponential moving average normalization for self-supervised and semi-supervised learning.
ISBN: 9781457703942 (print)
The proceedings contain 437 papers. The topics discussed include: efficient marginal likelihood optimization in blind deconvolution; natural image denoising: optimality and inherent bounds; a Sobolev-type metric for polar active contours; multi-target tracking by continuous energy minimization; towards cross-category knowledge propagation for learning visual concepts; are sparse representations really relevant for image classification?; smoothly varying affine stitching; noise resistant graph ranking for improved web image search; real-time human pose recognition in parts from single depth images; online domain adaptation of a pre-trained cascade of classifiers; learning effective human pose estimation from inaccurate annotation; shape grammar parsing via reinforcement learning; parameter learning with truncated message-passing; and structured light 3D scanning in the presence of global illumination.
ISBN: 9781728132938 (print)
The proceedings contain 1294 papers. The topics discussed include: finding task-relevant features for few-shot learning by category traversal; edge-labeling graph neural network for few-shot learning; generating classification weights with GNN denoising autoencoders for few-shot learning; kervolutional neural networks; why ReLU networks yield high-confidence predictions far away from the training data and how to mitigate the problem; on the structural sensitivity of deep convolutional networks to the directions of Fourier basis functions; hardness-aware deep metric learning; Auto-DeepLab: hierarchical neural architecture search for semantic image segmentation; striking the right balance with uncertainty; and SDRSAC: semidefinite-based randomized approach for robust point cloud registration without correspondences.
ISBN: 9798350365474 (print)
The proceedings contain 802 papers. The topics discussed include: X-VARS: introducing explainability in football refereeing with multi-modal large language models; a hybrid ANN-SNN architecture for low-power and low-latency visual perception; pseudo-label based unsupervised fine-tuning of a monocular 3D pose estimation model for sports motions; towards efficient audio-visual learners via empowering pre-trained vision transformers with cross-modal adaptation; a dual-mode approach for vision-based navigation in a lunar landing scenario; class similarity transition: decoupling class similarities and imbalance from generalized few-shot segmentation; ReweightOOD: loss reweighting for distance-based OOD detection; Hinge-Wasserstein: estimating multimodal aleatoric uncertainty in regression tasks; and ConPro: learning severity representation for medical images using contrastive learning and preference optimization.
ISBN: 9798350302493 (print)
The proceedings contain 698 papers. The topics discussed include: learning unbiased classifiers from biased data with meta-learning; robustness against gradient based attacks through cost effective network fine-tuning; gradient attention balance network: mitigating face recognition racial bias via gradient attention; estimating and maximizing mutual information for knowledge distillation; synthetic sample selection for generalized zero-shot learning; training strategies for vision transformers for object detection; does image anonymization impact computer vision training?; ultra-sonic sensor based object detection for autonomous vehicles; improvements to image reconstruction-based performance prediction for semantic segmentation in highly automated driving; zero-shot classification at different levels of granularity; difficulty estimation with action scores for computer vision tasks; detail-preserving self-supervised monocular depth with self-supervised structural sharpening; isolated sign language recognition based on tree structure skeleton images; deep prototypical-parts ease morphological kidney stone identification and are competitively robust to photometric perturbations; wildlife image generation from scene graphs; towards characterizing the semantic robustness of face recognition; high-level context representation for emotion recognition in images; and mitigating catastrophic interference using unsupervised multi-part attention for RGB-IR face recognition.