statistical shape models (SSMs) play important roles in the analysis of medical images. SSMs of particular organs are built by learning statistical shape patterns of the target organs from medical images. As a prerequ...
ISBN:
(数字)9781728123455
ISBN:
(纸本)9781728123462
statistical shape models (SSMs) play important roles in the analysis of medical images. SSMs of particular organs are built by learning statistical shape patterns of the target organs from medical images. As a prerequisite step of SSM construction, the anatomical correspondences between the training subjects have to be obtained, requiring nontrivial and tedious organ segmentation from the training images, as well as difficult registration of the segmented organ shapes. To tackle these problems, we design a novel strategy called merged intensity and landmark registration (MILR) which does not require segmentation of the target organ. The MILR method directly registers an organ shape template to the training images by simultaneously optimizing the template-to-image intensity similarity and the key anatomical landmark alignment. Compared to conventional correspondence calculation methods, the MILR method not only saves the time of organ segmentation but also ensures accurate alignment of important anatomical feature points, improving both the efficiency and accuracy of SSM construction. To verify this method, we constructed the human vertebral column SSM from volumetric CT images. We show results of correspondences by three validation metrics anatomical landmark distance (ALD), Dice similarity coefficient (DSC) and average surface distance (ASD)and the corresponding construction of SSMs by generalization ability and specificity of the human vertebral column from the CT images of 17 different subjects. The novel merged strategy, with merged deformation field reconstruction, shows competitive results against the conventional methods.
Face hallucination is a specific super-resolution problem which aims to generate high-resolution(HR) faces from low-resolution(LR) input. Recently, deep learning methods have been widely applied in single-image super ...
详细信息
ISBN:
(纸本)9781538644584
Face hallucination is a specific super-resolution problem which aims to generate high-resolution(HR) faces from low-resolution(LR) input. Recently, deep learning methods have been widely applied in single-image super resolution. Considering face images have great similarities in both pixel value and global structure, we propose a wavelet-based deep learning method with loop architecture for face hallucination. In contrast to existing wavelet-based methods that generate wavelet coefficients independently without considering relationships between them, we propose a three-stage method with loop architecture. This alternately updated loop structure explores the statistical relationships among wavelet coefficients and has a maximum use of information flow with a small number of parameters. Because of multi-resolution property of wavelet transform, we adopt a mixed input strategy to train images with different sizes to realize multi-scale face hallucination without retraining and adding extra subnetworks. Experiments demonstrate that our method can get a robust performance with multi-scale face hallucination.
The research on the image pattern recognition has always been a hot topic. In this paper, the automatic identification technology of image is studied, the research contents include image preprocessing, image feature e...
详细信息
Document image binarization, especially old handwritten documents, is a very important yet challenging task. There are various bottlenecks for binarizing historical documents due to different types of degradation pres...
详细信息
Document image binarization, especially old handwritten documents, is a very important yet challenging task. There are various bottlenecks for binarizing historical documents due to different types of degradation present imultaneously such as back impression, ink bleed through, faded colours, and wear and tear of the writing media. We consider these degradation as various types of noise in the document image. Here we have proposed a 2D morphological network which consists of basic morphological operation like dilation and erosion to perform our targeted task. The network also includes linear combination of output from dilation and erosion operations. The aforementioned 2D morphological network is applied for image binarization, where the structuring elements (SEs) and the weights of the linear combination layer are learned through back-propagation. The proposed network has been evaluated on DIBCO 2017 and H-DIBCO 2018 and ISI-Letter dataset. Our results show more convincing as compared to the results of other state-of-the-art methods. Though the network is developed for old handwritten documents, it may be tuned to work for imageprocessing task. The source code can be found here https://***/ranjanZ/ICDAR_Binarization.
Acute Lymphoblastic Leukemia (ALL) is the most prevalent acute leukemia in adults after Acute Myeloid Leukemia, with a diffusion of over 6500 persons per year just in the United States. In this research, we propose a ...
详细信息
ISBN:
(纸本)9781538611227
Acute Lymphoblastic Leukemia (ALL) is the most prevalent acute leukemia in adults after Acute Myeloid Leukemia, with a diffusion of over 6500 persons per year just in the United States. In this research, we propose a smart assistant determination method for ALL diagnosis using microscopic images. In this regard, K-means is employed to extract cell images after that wavelet transform is hired on cell images then statistical moments of the transformed image are computed to extract features. Afterward, a Chain Tabu search algorithm is proposed for feature selection of normal and abnormal cells to enable classifiers classifying ALL efficiently. Finally, Multi-Layer Perceptron (MLP) is used for classification. The proposed method is evaluated on ALL-IDB2. The proposed method achieved the accuracy of 98.88% and outperforms existed ALL diagnosis methods.
Efforts are afoot to design better context-aware human-computer interaction techniques that have knowledge of both their surrounding and the affective state of the user. One of the most important nonverbal behavioural...
详细信息
ISBN:
(纸本)9781538646588
Efforts are afoot to design better context-aware human-computer interaction techniques that have knowledge of both their surrounding and the affective state of the user. One of the most important nonverbal behavioural cues for affective human-machine interaction is laughter. Automatic detection of laughter is an interesting, yet challenging problem, which in recent years has gained increased attention from both the academic and industrial communities. The majority of existing laughter detection systems rely on either audio or video modalities. Humans, however, typically rely on audio-visual cues during conversation and/or interaction, thus it is expected that improved results can be achieved if both modalities are used. In this work, we propose a multimodal framework that analyzes audio and video channels separately, then fuses their decisions. Conventional speech spectral and prosodic features are used, whereas new multi-scale multi-resolution binarized statisticalimage features are proposed due to their improved expressive power. Experiments with the publicly available MAHNOB Laughter database show that decision level fusion based on support vector machine classifiers leads to improved performance over single modality approaches, as well as over previously-proposed methods, all whilst requiring just a fraction of the computational power.
image hash functions find extensive applications in content authentication, database search, and digital forensic. Robust image hash has been widely investigated to authenticate the reliability of images transmitted b...
详细信息
This paper is on Bayesian inference for parametric statistical models that are defined by a stochastic simulator which specifies how data is generated. Exact sampling is then possible but evaluating the likelihood fun...
详细信息
Medical digital images and methods for their processing and automatic analysis have been used for faster and more precise diagnosis. Computer-aided diagnosis systems are widely used by specialists as help for detectin...
ISBN:
(数字)9781728116242
ISBN:
(纸本)9781728116259
Medical digital images and methods for their processing and automatic analysis have been used for faster and more precise diagnosis. Computer-aided diagnosis systems are widely used by specialists as help for detecting and analyzing suspicions regions in medical digital images. Various types of medical digital images and numerous diseases that can be detected on them make this wide research field. One of the diseases that can be detected in lung CT images is chronic obstructive pulmonary disease or emphysema. In this paper we analyzed the capabilities of texture descriptors, local binary pattern, for detecting and classification of emphysema. Three different types of local binary pattern are used. Instead of using a whole local binary pattern operator output, statistical measurements have been used. Support vector machine optimized by elephant herding optimization algorithm was used for classification. Based on the obtained results, it can be concluded that six statistical information of uniform local binary pattern achieve the best classification accuracy.
Classification is an important and difficult problem in Polarimetric SAR (POLSAR) imageprocessing. Most existing classification methods combine multiple features (scattering parameters or statistical distribution) to...
详细信息
ISBN:
(纸本)9781538694961
Classification is an important and difficult problem in Polarimetric SAR (POLSAR) imageprocessing. Most existing classification methods combine multiple features (scattering parameters or statistical distribution) to improve the performance. However, based on the observation that various regions have different characteristics due to the different scattering mechanism, which implies that different features should be used for certain pixels rather than using the combination of various features for the whole image, so that simple combinations will result in numerous error classifications. In this paper, a novel POLSAR classification method based on multitask learning with multiple features is proposed. Firstly, different types of features are extracted, and then POLSAR classification problem is formulated as a multitask joint sparse representation learning problem. The strength of different features are employed by using of a joint sparse norm. Finally, experimental results on real POLSAR data show that our method outperforms several state-of-the-art algorithms.
暂无评论