Incorporating prior knowledge into a segmentation task, whether in the form of geometrical constraints (area/volume penalisation, convexity enforcement, etc.) or of topological constraints (to preserve the contextual relations between objects, to monitor the number of connected components), proves to increase accuracy in medical image segmentation. In particular, it makes it possible to compensate for weak boundary definition and imbalanced classes, and to be more in line with anatomical consistency even when the data do not explicitly exhibit those features. This observation underpins the introduced contribution, which aims, in a hybrid setting, to leverage the best of both worlds that variational methods and supervised deep learning approaches embody: (a) versatility and adaptability in the mathematical formulation of the problem to encode geometrical/topological constraints and (b) interpretability of the results for the former formalism; (c) more efficient and effective processing models and (d) greater proficiency at learning intricate features and executing more computationally intensive tasks for the latter. More precisely, a unified variational framework is provided that involves topological prescriptions in the training of convolutional neural networks through the design of a suitable penalty in the loss function. These topological constraints are implicitly enforced by viewing the segmentation procedure as a registration task between the processed image and its associated ground truth under incompressibility conditions, thus making them homeomorphic. A very preliminary version (Lambert et al., in Calatroni, Donatelli, Morigi, Prato, Santacesaria (eds) Scale Space and Variational Methods in Computer Vision, Springer, Berlin, 2023, pp. 363-375) of this work was published in the proceedings of the Ninth International Conference on Scale Space and Variational Methods in Computer Vision, 2023. It contained neither all the theo…
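The incompressibility idea behind this abstract can be illustrated with a small sketch (hypothetical helper names, not the authors' implementation): for a deformation φ(x) = x + u(x), near-incompressibility means det J_φ ≈ 1 everywhere, which can be turned into a quadratic penalty on a discrete displacement field.

```python
import numpy as np

def jacobian_determinant(disp):
    """det J of phi(x) = x + u(x) for a 2-D displacement field u of
    shape (2, H, W), via central finite differences (np.gradient)."""
    du_dy, du_dx = np.gradient(disp[0])   # gradients of the x-component of u
    dv_dy, dv_dx = np.gradient(disp[1])   # gradients of the y-component of u
    # J = I + grad u, so det J = (1 + du/dx)(1 + dv/dy) - (du/dy)(dv/dx)
    return (1.0 + du_dx) * (1.0 + dv_dy) - du_dy * dv_dx

def incompressibility_penalty(disp):
    """Quadratic penalty pushing det J towards 1 (volume preservation);
    an illustrative stand-in for the loss term described in the abstract."""
    return np.mean((jacobian_determinant(disp) - 1.0) ** 2)

# A rigid translation preserves volume, so the penalty vanishes:
print(incompressibility_penalty(np.full((2, 8, 8), 0.3)))  # 0.0
```

A uniform scaling u(x) = 0.1·x, by contrast, gives det J = 1.21 and a strictly positive penalty, so minimising it discourages volume change.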
ISBN (digital): 9783031687389
ISBN (print): 9783031687372; 9783031687389
Understanding the decision-making and trusting the reliability of deep machine learning models is crucial for adopting such methods in safety-relevant applications, which play an important role in the digitization of the railway system. We extend self-explainable prototypical variational models with autoencoder-based Out-of-Distribution (OOD) detection: a Variational Autoencoder is applied to learn a meaningful latent space which can be used for distance-based classification, likelihood estimation for OOD detection, and reconstruction. The In-Distribution (ID) region is defined by a Gaussian mixture distribution with learned prototypes representing the center of each mode. Furthermore, a novel restriction loss is introduced that promotes a compact ID region in the latent space without collapsing it into single points. The reconstructive capabilities of the Autoencoder ensure the explainability of the prototypes and the ID region of the classifier, further aiding the discrimination of OOD samples. Extensive evaluations on common OOD detection benchmarks as well as a large-scale dataset from a real-world railway application demonstrate the usefulness of the approach, outperforming previous methods.
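The latent-space machinery this abstract describes can be sketched in a few lines; all names, the isotropic-Gaussian assumption, and the hinge form of the restriction loss are illustrative choices, not the paper's actual implementation. Distance to the nearest prototype classifies a code, mixture likelihood flags OOD samples, and the hinge keeps ID codes inside a ball without collapsing them onto the centre.

```python
import numpy as np

def gaussian_mixture_logpdf(z, prototypes, sigma=1.0):
    """Log-likelihood of latent codes z (N, D) under an equally weighted
    isotropic Gaussian mixture centred at the prototypes (K, D)."""
    d2 = ((z[:, None, :] - prototypes[None, :, :]) ** 2).sum(-1)  # (N, K)
    D = z.shape[1]
    log_comp = (-0.5 * d2 / sigma**2
                - 0.5 * D * np.log(2 * np.pi * sigma**2)
                - np.log(len(prototypes)))
    m = log_comp.max(axis=1, keepdims=True)                        # log-sum-exp
    return (m + np.log(np.exp(log_comp - m).sum(axis=1, keepdims=True))).ravel()

def classify(z, prototypes):
    """Distance-based classification: index of the nearest prototype."""
    d2 = ((z[:, None, :] - prototypes[None, :, :]) ** 2).sum(-1)
    return d2.argmin(axis=1)

def restriction_loss(z, prototypes, margin=2.0):
    """Hinge on the distance to the nearest prototype: pulls codes inside
    a ball of radius `margin` without collapsing them onto the centre."""
    d = np.sqrt(((z[:, None, :] - prototypes[None, :, :]) ** 2).sum(-1)).min(axis=1)
    return np.maximum(d - margin, 0.0).mean()
```

A code far from every prototype then receives both a low mixture likelihood (OOD evidence) and a positive restriction loss during training.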
We propose a model-driven neural fields approach for solving variational problems. The approach can be applied to a variety of problems with a convex, 1-homogeneous regularizer and arbitrary, possibly non-convex, data t...
Probabilistic diffusion models enjoy increasing popularity in the deep learning community. They generate convincing samples from a learned distribution of input images, with a wide range of practical applications. Orig...
ISBN (print): 9783031723520; 9783031723537
Medical Visual Question Answering (Med-VQA) is pivotal for interpreting medical queries via corresponding images. While multi-modal fusion stages in Med-VQA benefit from attention mechanisms and Transformer-based methods, the latter's computational demands limit scalability. Emerging as a robust alternative, State Space Models (SSMs), particularly the Mamba model, have shown promise in sequence modeling and in building deep learning networks. However, their limitation to single-modality data processing curtails their direct application to the complex vision-language tasks inherent in Med-VQA. Additionally, we identify and address the underutilization of multi-scale visual information in existing Med-VQA frameworks, incorporating it into our fusion process for enhanced context comprehension. Our approach also features an innovative asymmetric fusion structure tailored to bridge the gap between open-ended and close-ended questions, optimizing question-answering accuracy. Comparative analyses on the benchmark datasets VQA-RAD and SLAKE underscore our method's efficiency, outperforming state-of-the-art Med-VQA models in accuracy while operating with significantly fewer parameters than Transformer-based counterparts. This study not only proposes a powerful Med-VQA model but also broadens the scope of SSMs in tackling complex multi-modal challenges.
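The multi-scale fusion idea can be sketched generically; the text-conditioned gate below is a minimal stand-in chosen for illustration, and the function names, projection matrices `W_g`/`W_v`, and gating form are all assumptions, not the paper's SSM-based fusion module. Each visual scale is injected into the question embedding through a channel gate computed from the question itself.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def multiscale_gated_fusion(vis_scales, txt, W_g, W_v):
    """Fuse multi-scale visual features with a question embedding.
    vis_scales: list of (D,) visual features at different scales;
    txt: (D,) question embedding; W_g, W_v: (D, D) learned projections.
    A generic text-conditioned gate, not the paper's Mamba-based fusion."""
    gate = sigmoid(W_g @ txt)              # question-dependent channel gate, (D,)
    fused = txt.copy()
    for v in vis_scales:                   # inject each scale, gated by the question
        fused = fused + gate * (W_v @ v)
    return fused / (1 + len(vis_scales))   # simple averaging normalisation
```

An asymmetric design, as in the abstract, would then apply differently parameterised fusion paths for open-ended and close-ended questions.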
The total generalized variation extends the total variation by incorporating higher-order smoothness. Thus, it can also suffer from similar discretization issues related to isotropy. Inspired by the success of novel d...
Faces must be present in images for intelligent vision-based human-computer interaction to work. Face recognition research has been ongoing and significant for many years. This procedure entails facial tracking...
We propose a learning paradigm for the numerical approximation of differential invariants of planar curves. Deep neural networks' (DNNs) universal approximation properties are utilized to estimate geometric measures....
ISBN (print): 9783031923654
The proceedings contain 63 papers. The special focus in this conference is on Scale Space and Variational Methods in Computer Vision. The topics include: Fast Inexact Bilevel Optimization for Analytical Deep Image Priors; Why Do We Regularise in Every Iteration for Imaging Inverse Problems?; On an Analytical Inversion Formula for the Modulo Radon Transform; MultiResolution Low-Rank Regularization of Dynamic Imaging Problems; A Fractional Graph Laplacian Approach to Image Reconstruction; A Fractional-Order Telegraph Diffusion Model for Multiplicative Noise Removal; Self-supervised Conformal Prediction for Uncertainty Quantification in Imaging Problems; Real-Time Scene Recovery from Image Scale Space and Perceptual Hue Similarity; A Novel Interpretation of the Radon Transform's Ray and Pixel-Driven Discretizations Under Balanced Resolutions; Product of Gaussian Mixture Diffusion Model for Non-linear MRI Inversion; Whiteness-Based Bilevel Estimation of Weighted TV Parameter Maps for Image Denoising; Direct Atomistic Reconstruction in Homogeneous Cryo-EM Using Protein Geometry Regularization; Enhanced Denoising and Convergent Regularisation Using Tweedie Scaling; Max-Sparsity Atomic Autoencoders with Application to Inverse Problems; Equivariant Bootstrap for Uncertainty Quantification in Image Classification; Equivariant Denoisers for Image Restoration; Max-Normalized Radon Cumulative Distribution Transform for Limited Data Classification; Prediction of Parametric Surfaces for Multi-object Segmentation in 3D Biological Imaging; Plug-and-Play Half-Quadratic Splitting for Ptychography; Deep Unrolling for Learning Optimal Spatially Varying Regularisation Parameters for Total Generalised Variation; Identifying Memorization of Diffusion Models Through p-Laplace Analysis; Learning Anisotropic Metrics for Geodesic Distances via the Heat Equation for Image Segmentation; Deceptive Diffusion: Generating Synthetic Adversarial Examples; TomoSelfDEQ: Self-supervised Deep Equilibrium Learning for Sparse-Angle C…
ISBN (print): 9783031923685
The proceedings contain 63 papers. The special focus in this conference is on Scale Space and Variational Methods in Computer Vision. The topics include: Fast Inexact Bilevel Optimization for Analytical Deep Image Priors; Why Do We Regularise in Every Iteration for Imaging Inverse Problems?; On an Analytical Inversion Formula for the Modulo Radon Transform; MultiResolution Low-Rank Regularization of Dynamic Imaging Problems; A Fractional Graph Laplacian Approach to Image Reconstruction; A Fractional-Order Telegraph Diffusion Model for Multiplicative Noise Removal; Self-supervised Conformal Prediction for Uncertainty Quantification in Imaging Problems; Real-Time Scene Recovery from Image Scale Space and Perceptual Hue Similarity; A Novel Interpretation of the Radon Transform's Ray and Pixel-Driven Discretizations Under Balanced Resolutions; Product of Gaussian Mixture Diffusion Model for Non-linear MRI Inversion; Whiteness-Based Bilevel Estimation of Weighted TV Parameter Maps for Image Denoising; Direct Atomistic Reconstruction in Homogeneous Cryo-EM Using Protein Geometry Regularization; Enhanced Denoising and Convergent Regularisation Using Tweedie Scaling; Max-Sparsity Atomic Autoencoders with Application to Inverse Problems; Equivariant Bootstrap for Uncertainty Quantification in Image Classification; Equivariant Denoisers for Image Restoration; Max-Normalized Radon Cumulative Distribution Transform for Limited Data Classification; Prediction of Parametric Surfaces for Multi-object Segmentation in 3D Biological Imaging; Plug-and-Play Half-Quadratic Splitting for Ptychography; Deep Unrolling for Learning Optimal Spatially Varying Regularisation Parameters for Total Generalised Variation; Identifying Memorization of Diffusion Models Through p-Laplace Analysis; Learning Anisotropic Metrics for Geodesic Distances via the Heat Equation for Image Segmentation; Deceptive Diffusion: Generating Synthetic Adversarial Examples; TomoSelfDEQ: Self-supervised Deep Equilibrium Learning for Sparse-Angle C…