The proceedings contain 802 papers. The topics discussed include: X-VARS: introducing explainability in football refereeing with multi-modal large language models;a hybrid ANN-SNN architecture for low-power and low-la...
ISBN:
(纸本)9798350365474
The proceedings contain 802 papers. The topics discussed include: X-VARS: introducing explainability in football refereeing with multi-modal large language models;a hybrid ANN-SNN architecture for low-power and low-latency visual perception;pseudo-label based unsupervised fine-tuning of a monocular 3D pose estimation model for sports motions;towards efficient audio-visual learners via empowering pre-trained vision transformers with cross-modal adaptation;a dual-mode approach for vision-based navigation in a lunar landing scenario;class similarity transition: decoupling class similarities and imbalance from generalized few-shot segmentation;ReweightOOD: loss reweighting for distance-based OOD detection;Hinge-Wasserstein: estimating multimodal aleatoric uncertainty in regression tasks;and ConPro: learning severity representation for medical images using contrastive learning and preference optimization.
The proceedings contain 16 papers. The special focus in this conference is on Segment Anything in Medical Images on Laptop. The topics include: Filters, Thresholds, and Geodesic Distances for Scribble-Based ...
ISBN:
(纸本)9783031818530
The proceedings contain 16 papers. The special focus in this conference is on Segment Anything in Medical Images on Laptop. The topics include: Filters, Thresholds, and Geodesic Distances for Scribble-Based Interactive Segmentation of Medical Images;Rep-MedSAM: Towards Real-Time and Universal Medical Image Segmentation;Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets;a Light-Weight Universal Medical Segmentation Network for Laptops Based on Knowledge Distillation;taking a Step Back: Revisiting Classical Approaches for Efficient Interactive Segmentation of Medical Images;ExpertsMedSAM: Faster Medical Image Segment Anything with Mixture-of-Experts;efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment;Lite Class-Prompt Tiny-VIT for Multi-modality Medical Image Segmentation;Segment Anything in Medical Images with nnUNet;SwiftMedSAM: An Ultra-lightweight Prompt-Based Universal Medical Image Segmentation Model for Highly Constrained Environments;RepViT-MedSAM: Efficient Segment Anything in the Medical Images;U-MedSAM: Uncertainty-Aware MedSAM for Medical Image Segmentation;Modality-Specific Strategies for Medical Image Segmentation Using Lightweight SAM Architectures;gray’s Anatomy for Segment Anything Model: Optimizing Grayscale Medical Images for Fast and Lightweight Segmentation.
The proceedings contain 355 papers. The topics discussed include: MultiNet++: multi-stream feature aggregation and geometric loss strategy for multi-task learning;privacy-preserving action recognition using coded aper...
ISBN:
(纸本)9781728125060
The proceedings contain 355 papers. The topics discussed include: MultiNet++: multi-stream feature aggregation and geometric loss strategy for multi-task learning;privacy-preserving action recognition using coded aperture videos;evading face recognition via partial tampering of faces;privacy-preserving annotation of face images through attribute-preserving face synthesis;towards deep neural network training on encrypted data;fooling automated surveillance cameras: adversarial patches to attack person detection;anonymousnet: natural face de-identification with measurable privacy;regularizer to mitigate gradient masking effect during single-step adversarial training;privacy preserving group membership verification and identification;defending against adversarial attacks using random forest;intersection to overpass: instance segmentation on filamentous structures with an orientation-aware neural network and terminus pairing algorithm;and surface parameterization and registration for statistical multiscale atlasing of organ development.
The proceedings contain 516 papers. The topics discussed include: OmniLayout: room layout reconstruction from indoor spherical panoramas;boosting adversarial robustness using feature level stochastic smoothing;beyond ...
ISBN:
(纸本)9781665448994
The proceedings contain 516 papers. The topics discussed include: OmniLayout: room layout reconstruction from indoor spherical panoramas;boosting adversarial robustness using feature level stochastic smoothing;beyond joint demosaicking and denoising: an image processing pipeline for a pixel-bin image sensor;assessment of deep learning based blood pressure prediction from PPG and rPPG signals;towards domain-specific explainable AI: model interpretation of a skin image classifier using a human approach;DAMSL: domain agnostic meta score-based learning;deep learning based spatial-temporal in-loop filtering for versatile video coding;automated tackle injury risk assessment in contact-based sports - a rugby union example;two-stage network for single image super-resolution;and ***: dataset for automatic mapping of buildings, woodlands, water and roads from aerial imagery.
The proceedings contain 523 papers. The topics discussed include: latent fingerprint image enhancement based on progressive generative adversarial network;zero-shot learning in the presence of hierarchically coarsened...
ISBN:
(纸本)9781728193601
The proceedings contain 523 papers. The topics discussed include: latent fingerprint image enhancement based on progressive generative adversarial network;zero-shot learning in the presence of hierarchically coarsened labels;multivariate confidence calibration for object detection;context-guided super-class inference for zero-shot detection;learning sparse ternary neural networks with entropy-constrained trained ternarization (EC2T);now that i can see, i can improve: enabling data-driven finetuning of CNNs on the edge;enhancing facial data diversity with style-based face aging;a simplified framework for zero-shot cross-modal sketch data retrieval;unsupervised single image super-resolution network (USISResNet) for real-world data using generative adversarial network;cross-regional oil palm tree detection;and leaf spot attention network for apple leaf disease identification.
The proceedings contain 561 papers. The topics discussed include: CORE: consistent representation learning for face forgery detection;aria: adversarially robust image attribution for content provenance;the reliability...
ISBN:
(纸本)9781665487399
The proceedings contain 561 papers. The topics discussed include: CORE: consistent representation learning for face forgery detection;aria: adversarially robust image attribution for content provenance;the reliability of forensic body-shape identification;detecting real-time deep-fake videos using active illumination;on the exploitation of deepfake model recognition;is synthetic voice detection research going into the right direction?;on improving cross-dataset generalization of deepfake detectors;rethinking adversarial examples in wargames;privacy leakage of adversarial training models in federated learning systems;towards comprehensive testing on the robustness of cooperative multi-agent reinforcement learning;robustness and adaptation to hidden factors of variation;adversarial robustness through the lens of convolutional filters;RODD: a self-supervised approach for robust out-of-distribution detection;an empirical study of data-free quantization’s tuning robustness;exploring robustness connection between artificial and natural adversarial examples;and adversarial machine learning attacks against video anomaly detection systems.
The proceedings contain 698 papers. The topics discussed include: learning unbiased classifiers from biased data with meta-learning;robustness against gradient based attacks through cost effective network fine-tuning;...
ISBN:
(纸本)9798350302493
The proceedings contain 698 papers. The topics discussed include: learning unbiased classifiers from biased data with meta-learning;robustness against gradient based attacks through cost effective network fine-tuning;gradient attention balance network: mitigating face recognition racial bias via gradient attention;estimating and maximizing mutual information for knowledge distillation;synthetic sample selection for generalized zero-shot learning;training strategies for vision transformers for object detection;does image anonymization impact computervision training?;ultra-sonic sensor based object detection for autonomous vehicles;improvements to image reconstruction-based performance prediction for semantic segmentation in highly automated driving;zero-shot classification at different levels of granularity;difficulty estimation with action scores for computervision tasks;detail-preserving self-supervised monocular depth with self-supervised structural sharpening;isolated sign language recognition based on tree structure skeleton images;deep prototypical-parts ease morphological kidney stone identification and are competitively robust to photometric perturbations;wildlife image generation from scene graphs;towards characterizing the semantic robustness of face recognition;high-level context representation for emotion recognition in images;and mitigating catastrophic interference using unsupervised multi-part attention for RGB-IR face recognition.
The proceedings contain 2715 papers. The topics discussed include: revisiting adversarial training at scale;SPIDeRS: structured polarization for invisible depth and reflectance sensing;MA-LMM: memory-augmented large m...
ISBN:
(纸本)9798350353006
The proceedings contain 2715 papers. The topics discussed include: revisiting adversarial training at scale;SPIDeRS: structured polarization for invisible depth and reflectance sensing;MA-LMM: memory-augmented large multimodal model for long-term video understanding;geometrically-driven aggregation for zero-shot 3D point cloud understanding;TextCraftor: your text encoder can be image quality controller;ViLa-MIL: dual-scale vision-language multiple instance learning for whole slide image classification;HumanNorm: learning normal diffusion model for high-quality and realistic 3D human generation;AnEmpirical study of scaling law for scene text recognition;improving image restoration through removing degradations in textual representations;and steganographic passport: an owner and user verifiable credential for deep model ip protection without retraining.
Medical Image Foundation Models have proven to be powerful tools for mask prediction across various datasets. However, accurately assessing the uncertainty of their predictions remains a significant challenge. To addr...
详细信息
Medical image segmentation tasks are often intricate and require medical domain expertise. Recent advancements in deep learning have expedited these demanding tasks, transitioning from specialized models tailored to e...
详细信息
暂无评论