Advances in medical imaging technologies have enabled the collection of longitudinal images, which involve repeated scanning of the same patients over time, to monitor disease progression. However, predictive modeling...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Advances in medical imaging technologies have enabled the collection of longitudinal images, which involve repeated scanning of the same patients over time, to monitor disease progression. However, predictive modeling of such data remains challenging due to high dimensionality, irregular sampling, and data sparsity. To address these issues, we propose imageFlowNet, a novel model designed to forecast disease trajectories from initial images while preserving spatial details. imageFlowNet first learns multiscale joint representation spaces across patients and time points, then optimizes deterministic or stochastic flow fields within these spaces using a position-parameterized neural ODE/SDE framework. The model leverages a UNet architecture to create robust multiscale representations and mitigates data scarcity by combining knowledge from all patients. We provide theoretical insights that support our formulation of ODEs, and motivate our regularizations involving high-level visual features, latent space organization, and trajectory smoothness. We validate imageFlowNet on three longitudinal medical image datasets depicting progression in geographic atrophy, multiple sclerosis, and glioblastoma, demonstrating its ability to effectively forecast disease progression and outperform existing methods. Our contributions include the development of imageFlowNet, its theoretical underpinnings, and empirical validation on real-world datasets.
In Virtual Reality, it is important to make the rendering of images displayed on an HMD headset as realistic as possible. In this case, it is a question of taking into account the multiple reflections, refractions and...
In Virtual Reality, it is important to make the rendering of images displayed on an HMD headset as realistic as possible. In this case, it is a question of taking into account the multiple reflections, refractions and diffusions using stochasticmethods (Monte Carlo, Gaussian processes, etc.) or AI (CNN neural networks). These methods can be used in other fields such as vision, crowd simulation, etc. This Keynote Lecture begins by introducing the different processing necessary for obtaining a computer-generated image such as: geometric modeling, camera parameters, color, photometric quantities, etc. Then it presents in detail the various stochastic rendering methods allowing the generation of synthetic images with a high level of realism. Among these methods we can cite: the Monte Carlo method, Bayesian Monte Carlo, Metropolis, spectral Analysis of Quadrature Rules and Fourier Truncation-based methods Applied to the Shading Integral, etc. This lecture will be illustrated with realistic images generated with different rendering methods.
Scientific and methodological foundations for the optimal identification of non-stationary objects based on the use of neural networks have been developed. Models and algorithms for detection, extraction of hidden rel...
详细信息
Scientific and methodological foundations for the optimal identification of non-stationary objects based on the use of neural networks have been developed. Models and algorithms for detection, extraction of hidden relationships, useful properties and patterns in data, formation of a database and knowledge bases are proposed. Mechanisms have been developed for using the statistical, dynamic and specific characteristics of images, unique features of three, five-layer neural networks and combined models for setting variables with typical recognition and classification tools. Have been developed computational schemes for determining and adjusting the weights of neurons, choosing a suitable activation function, coefficients of synaptic and interneuronal connections, rational neural network architecture, the number of layers and neurons in the layers of the network, a set of functions of nonlinear dependencies "inputs - outputs". Data pre-processing algorithms are implemented that perform the functions of informative features selection, segmentation, object image contour extraction, search based on methods with annealing, prohibition, and stochastic search. Tested neural networks of Hopfield, Hamming, Hebb, Kohonen, bidirectional associative memory were tested. Schemes for two and three-dimensional image reconstruction based on the synthesis of tools for calculating Mellin transform functions, initial values of centroids, and the formation of a suboptimal set of variables are proposed. The identification software package in C++ was developed and implemented in the CUDA parallel computing environment.
This paper presents a fast heuristic for comparing Bayesian models to solve inverse problems related to signal processing. We focus on problems that are convex w.r.t. the unknown signal and where no ground truth is av...
详细信息
ISBN:
(纸本)9781728157672
This paper presents a fast heuristic for comparing Bayesian models to solve inverse problems related to signal processing. We focus on problems that are convex w.r.t. the unknown signal and where no ground truth is available. The proposed heuristic is very computationally efficient and does not require the estimation of the model evidence. Instead, the model evidence is used indirectly to set the regularisation parameters that define each competing model by maximum marginal likelihood estimation, followed by a simple likelihood-based or residual-based comparison of the models based on their empirical Bayesian maximum-a-posteriori solutions. The proposed methodology is illustrated with a total-variation image deblurring experiment, where it performs remarkably well.
Aim: Identifying and classifying traffic signs board images is accomplished through image recognition and determine whether the image is a speed sign or stop sign using Convolutional Neural Network (CNN) comparison wi...
详细信息
ISBN:
(纸本)9781665460712
Aim: Identifying and classifying traffic signs board images is accomplished through image recognition and determine whether the image is a speed sign or stop sign using Convolutional Neural Network (CNN) comparison with Radial Basis Function (RBF). Materials and methods: We can determine the accuracy of stop and speed signs based on the size dataset collected for each from 4,470 image files. Convolutional Neural Networks (CNN) provide an optimal starting point for detection where the sample size taken is 23 and with comparison algorithm Radial basis function (RBF) the sample size is 23, CNN is regarded as more powerful than RBF since RBF offers fewer features than CNN. Results: Using the dataset Traffic sign board images, CNN detects when inputs are introduced and produces accurate results with accuracy rate 96%. where in RBF the image detecting pression is 73% and accuracy given is 82%. It shows a statistical significance of (p<0.002) from the Independent Sample T-test. Conclusion: CNN in terms of accuracy and precision, performs better than RBF.
Canonical correlation analysis (CCA) is a widely used mutivariate statistical technique for exploring the relationship between two multivariable datasets. It extracts existing relationship information by finding pairs...
详细信息
ISBN:
(数字)9798350349399
ISBN:
(纸本)9798350349405
Canonical correlation analysis (CCA) is a widely used mutivariate statistical technique for exploring the relationship between two multivariable datasets. It extracts existing relationship information by finding pairs of linear combinations from the two sets of variables with maximum correlation. In some applications however, the observed datasets may be contaminated by outliers and the standard CCA methods are sensitive to the presence of outliers in the datasets. In this paper, a robust CCA (RCCA) algorithm is presented. It is obtained using the interpretation of CCA as a latent variable model with two Gaussian random vectors and a robust loss function derived from the $\alpha$-divergence as an alternative to maximum likelihood. Compared to existing robust CCA approaches, the proposed loss has the advantage of belonging to class of redescending M-estimators, guaranteeing inference stability for large deviation from the Gaussian nominal noise model. Experimental results on simulated and real datasets show that the proposed RCCA outperforms some existing robust and standard CCA methods.
There continues to be a trade-off between the biological realism and performance of neural networks. Contemporary deep learning techniques allow neural networks to be trained to perform challenging computations at (ne...
详细信息
ISBN:
(纸本)9781713871088
There continues to be a trade-off between the biological realism and performance of neural networks. Contemporary deep learning techniques allow neural networks to be trained to perform challenging computations at (near) human-level, but these networks typically violate key biological constraints. More detailed models of biological neural networks can incorporate many of these constraints but typically suffer from subpar performance and trainability. Here, we narrow this gap by developing an effective method for training a canonical model of cortical neural circuits, the stabilized supralinear network (SSN), that in previous work had to be constructed manually or trained with undue constraints. SSNs are particularly challenging to train for the same reasons that make them biologically realistic: they are characterized by strongly-connected excitatory cells and expansive firing rate non-linearities that together make them prone to dynamical instabilities unless stabilized by appropriately tuned recurrent inhibition. Our method avoids such instabilities by initializing a small network and gradually increasing network size via the dynamics-neutral addition of neurons during training. We first show how SSNs can be trained to perform typical machine learning tasks by training an SSN on MNIST classification. We then demonstrate the effectiveness of our method by training an SSN on the challenging task of performing amortized Markov chain Monte Carlo-based inference under a Gaussian scale mixture generative model of natural image patches with a rich and diverse set of basis functions - something that was not possible with previous methods. These results open the way to training realistic cortical-like neural networks on challenging tasks at scale.
With the increasing popularity of digital media along with the fact that massive image data has already gained popularity, the automatic image recognition and classification have become a very critical and urgent prob...
详细信息
Conventional phrase grounding aims to localize noun phrases mentioned in a given caption to their corresponding image regions, which has achieved great success recently. Apparently, sole noun phrase grounding is not e...
详细信息
Traditional CT data processingmethods often employ the inherent window to clip the data and store the image. For some DICOM-type data processing software, although the window can be adjusted dynamically, it still req...
详细信息
ISBN:
(纸本)9781450390057
Traditional CT data processingmethods often employ the inherent window to clip the data and store the image. For some DICOM-type data processing software, although the window can be adjusted dynamically, it still requires users' the medical knowledge and familiarity in medical images. In this work, we propose a derivative windowing process method (multi-tissue derived windowing technology) for multiple tissues and construct organs based on the statistical features of CT images. Then, we design a transformation function to convert the density value of CT image into gray value. Finally, we construct the automatic colorization network using Dense U-Net as the generator of Generation Adversarial Network(GAN) model, making the multiple organs of the human body colored in a globally optimized way to highlight the color texture features of them. In this way the quality of color images can be improved significantly. In the test study, we evaluate the proposed method using many image quality evaluation methods and subjective visual evaluation on the 3Dircadb data set. The experimental results demonstrate the effectiveness of the proposed method, which indeed improves the quality of the image.
暂无评论