ISBN: 0769523722 (print)
The proceedings contain 353 papers. The topics discussed include: random subwindows for robust image classification; Bayesian object detection in dynamic scenes; pixels that sound; a rational function lens distortion model for general cameras; robust boosting for learning from few examples; optimization design of cascaded classifiers; pruning training sets for learning of object categories; computer vision for music identification; restoration and recognition in a loop; robust face detection with multi-class boosting; learning with constrained and unlabelled data; real-time non-rigid surface detection; evaluating image retrieval; towards complete generic camera calibration; and unsupervised learning of object features from video sequences.
ISBN: 0818628553 (print)
The proceedings contain 159 papers. The topics discussed include: geometric image primitives by complex moments in Gabor space and the application to texture segmentation; geometric primitive extraction using a genetic algorithm; model based region segmentation using co-occurrence matrices; morphological grayscale reconstruction: definition, efficient algorithm and applications in image analysis; edge detection in range images through morphological residue analysis; morphological decomposition of restricted domains: a vector space solution; spatial reasoning based on multivariate belief functions; analysis of the least median of squares estimator for computer vision applications; and image segmentation via edge contour finding: a graph theoretic approach.
ISBN: 1424411807 (print)
The proceedings contain 525 papers. The topics discussed include: learning visual similarity measures for comparing never seen objects; a contextual dissimilarity measure for accurate and efficient image search; learning local image descriptors; principal curvature-based region detector for object recognition; a benchmark for the comparison of 3-D motion segmentation algorithms; a nonparametric treatment for location/segmentation based visual tracking; learning Gaussian conditional random fields for low-level vision; hierarchical structuring of data on manifolds; learning GMRF structures for spatial priors; element rearrangement for tensor-based subspace learning; unsupervised clustering using multi-resolution perceptual grouping; object tracking by asymmetric kernel mean shift with automatic scale and orientation selection; and free-form nonrigid image registration using generalized elastic nets.
ISBN: 9798350368741 (digital)
ISBN: 9798350368758 (print)
Fine-grained 3D shape classification (FGSC) remains challenging due to the difficulty of adaptively capturing global structure differences and subtle inter-class distinctions. This paper directly extends the Vision Transformer (ViT) to FGSC, proposing a pure Transformer network, FG3DFormer, that fully leverages ViT's global correlation and local attention abilities. FG3DFormer comprises Hierarchical Feature Extraction (HFE) and Hierarchical Feature Refinement (HFR), interconnected through Adaptive View Region Selection (AVRS). First, the HFE comprehensively evaluates the significance of intra-view patches and views, driven by inter-view and intra-view attention. Then, the AVRS adaptively selects crucial patch tokens from different views to serve as sources of subtle local features. Finally, the HFR refines the 3D shape descriptor, capturing more discriminative global and subtle local features by leveraging both the view tokens and the selected crucial patch tokens. Extensive experiments on FG3D and ModelNet40 demonstrate the superiority of FG3DFormer in FGSC and meta-category 3D shape classification tasks.
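The abstract does not state how the AVRS picks its crucial patch tokens. As a rough illustration only, the sketch below assumes a plain top-k rule over per-patch significance scores within each view. The function name `select_crucial_patches`, the `attention` layout, and the parameter `k` are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def select_crucial_patches(
    attention: Sequence[Sequence[float]],
    k: int,
) -> List[List[int]]:
    """For each view, return the indices of the k highest-scoring patches.

    attention[v][p] is an assumed significance score for patch p of view v.
    This is a generic top-k selection, not the paper's actual AVRS rule.
    """
    selected = []
    for view_scores in attention:
        # Rank patch indices by descending score; sorted() is stable,
        # so tied scores keep the lower patch index first.
        ranked = sorted(range(len(view_scores)), key=lambda p: -view_scores[p])
        # Keep the top k and return them in ascending patch order.
        selected.append(sorted(ranked[:k]))
    return selected


# Toy example: 2 views with 4 patches each (scores are made up).
scores = [
    [0.1, 0.7, 0.2, 0.9],
    [0.5, 0.4, 0.8, 0.3],
]
print(select_crucial_patches(scores, k=2))  # [[1, 3], [0, 2]]
```

In a real multi-view model the scores would come from the network's attention maps, and the selected indices would be used to gather the corresponding token embeddings for the refinement stage.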
ISBN: 0769525970 (print)
The proceedings contain 167 papers. The topics discussed include: incremental learning of object detectors using a visual shape alphabet; multiclass object recognition with sparse, localized features; unsupervised learning of categories from sets of partially matching image features; the layout consistent random field for recognizing and segmenting partially occluded objects; ultrasound-specific segmentation via correlation and statistical region-based active contours; principled hybrids of generative and discriminative models; a comic section classifier and its application to image datasets; learning non-metric partial similarity based on maximal margin criterion; distributed cost boosting on mis-classification cost; equivalence of non-iterative algorithms for simultaneous low rank approximations of matrices; and semi-supervised classification using linear neighborhood propagation.
The proceedings contain 471 papers. The topics discussed include: depth acquisition from density modulated binary patterns; rolling Riemannian manifolds to solve the multi-class classification problem; exploring compositional high order pattern potentials for structured output learning; kernel methods on the Riemannian manifold of symmetric positive definite matrices; discovering the structure of a planar mirror system from multiple observations of a single point; tensor-based human body modeling; improving the visual comprehension of point sets; detecting changes in 3D structure of a scene from multi-view images captured by a vehicle-mounted camera; templateless quasi-rigid shape modeling with implicit loop-closure; shape from silhouette probability maps: reconstruction of thin objects in the presence of silhouette extraction and calibration error; and joint geodesic upsampling of depth images.
ISBN: 9798350353006 (print)
The proceedings contain 2715 papers. The topics discussed include: revisiting adversarial training at scale; SPIDeRS: structured polarization for invisible depth and reflectance sensing; MA-LMM: memory-augmented large multimodal model for long-term video understanding; geometrically-driven aggregation for zero-shot 3D point cloud understanding; TextCraftor: your text encoder can be image quality controller; ViLa-MIL: dual-scale vision-language multiple instance learning for whole slide image classification; HumanNorm: learning normal diffusion model for high-quality and realistic 3D human generation; an empirical study of scaling law for scene text recognition; improving image restoration through removing degradations in textual representations; and steganographic passport: an owner and user verifiable credential for deep model IP protection without retraining.
ISBN: 9781479951178 (print)
The proceedings contain 539 papers. The topics discussed include: fast and accurate image matching with cascade hashing for 3D reconstruction; minimal solvers for relative pose with a single unknown radial distortion; spectral graph reduction for efficient image and streaming video segmentation; video motion segmentation using new adaptive manifold denoising model; event detection using multi-level relevance labels and multiple features; full-angle quaternions for robustly matching vectors of 3D rotations; semi-supervised spectral clustering for image set classification; learning mid-level filters for person re-identification; DeepReID: deep filter pairing neural network for person re-identification; NMF-KNN: image annotation using weighted multi-view non-negative matrix factorization; beyond comparing image pairs: setwise active learning for relative attributes; and histograms of pattern sets for image classification and object recognition.
ISBN: 0769525970 (print)
The proceedings contain 151 papers. The topics discussed include: clustering appearance for scene analysis; fast compact city modeling for navigation pre-visualization; fusion of summation invariants in 3D human face recognition; deformation modeling for robust 3D face matching; locally linear models on face appearance manifolds with application to dual-subspace based classification; learning exemplar-based categorization for the detection of multi-view multi-pose objects; aligning ASL for statistical translation using discriminative word model; a graph based approach for naming faces in news photos; fast human detection using a cascade of histograms of oriented gradients; real-time hand pose recognition using low resolution depth images; automatic cast listing in feature-length films with anisotropic manifold space; and body localization in still images using hierarchical models and hybrid search.