A deep learning approach is proposed to restore old photographs that have suffered severe degradation. Unlike conventional restoration tasks that can be handled well by supervised learning, real-world photo degradation is complex, and a network trained purely on synthetic data fails to generalize because of the domain gap between synthetic images and real old photographs. Therefore, a novel triplet domain translation network is proposed that leverages large amounts of synthetic image pairs together with real photographs. Two variational autoencoders (VAEs) are trained to map old photos and clean photos into two latent spaces, respectively. The translation between these two latent spaces is then learned from the synthetically paired data, and it generalizes well to real photographs because the domain gap is closed in the compact latent space. To handle the mixed degradations found throughout an old photograph, a global branch with a partial nonlocal block targets structured defects such as scratches and blemishes, while a local branch addresses unstructured defects such as noise and poor contrast. Fusing the two branches in the latent space improves the ability to correct multiple flaws in old photos. Convolutional neural networks (CNNs), which apply their filters across every position of the image, outperform plain multilayer networks at identifying distinctive marks, shapes, and patterns, which makes them an efficient choice for this task. In terms of visual quality, the proposed method for restoring old photographs performs better than state-of-the-art techniques.
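As a hedged illustration of the pipeline described in this abstract (two VAEs plus a translation network between their latent spaces), the sketch below shows only the data flow in PyTorch. The module sizes, layer choices, and names (`TinyVAE`, `translate`) are invented for demonstration and are not the authors' actual architecture or training procedure.

```python
# Minimal sketch of the triplet-domain translation idea: encode a degraded photo
# with the "old" VAE, translate its latent code, decode with the "clean" VAE.
import torch
import torch.nn as nn

class TinyVAE(nn.Module):
    """Toy encoder/decoder pair producing a compact latent space."""
    def __init__(self, channels=3, latent=64):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(channels, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, latent, 4, stride=2, padding=1),
        )
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(latent, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, channels, 4, stride=2, padding=1), nn.Sigmoid(),
        )

    def encode(self, x):
        return self.enc(x)

    def decode(self, z):
        return self.dec(z)

# One VAE for (real or synthetically degraded) old photos, one for clean photos.
vae_old, vae_clean = TinyVAE(), TinyVAE()

# The translation network maps the "old" latent space to the "clean" one. It is
# trained on synthetic pairs but, because both latents are compact, it is
# expected to transfer to real old photographs.
translate = nn.Sequential(
    nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(),
    nn.Conv2d(64, 64, 3, padding=1),
)

old_photo = torch.rand(1, 3, 64, 64)     # stand-in for a degraded input
restored = vae_clean.decode(translate(vae_old.encode(old_photo)))
print(restored.shape)                    # torch.Size([1, 3, 64, 64])
```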
Reconstructed PET images exhibit high noise levels and low spatial resolution when shorter scan times and reduced injected doses are used. Regularisation methods such as post-reconstruction smoothing can help to impro...
Transformer architectures have become state-of-the-art models in computer vision and natural language processing. To a significant degree, their success can be attributed to self-supervised pre-training on large-scale unlabeled datasets. This work investigates the use of self-supervised masked image reconstruction to advance transformer models for hyperspectral remote sensing imagery. To facilitate self-supervised pre-training, we build a large dataset of unlabeled hyperspectral observations from the EnMAP satellite and systematically investigate modifications of the vision transformer architecture to optimally leverage the characteristics of hyperspectral data. We find significant improvements in accuracy on different land cover classification tasks over both standard vision and sequence transformers using (i) blockwise patch embeddings, (ii) spatial-spectral self-attention, (iii) spectral positional embeddings, and (iv) masked self-supervised pre-training. The resulting model outperforms standard transformer architectures by +5% accuracy on a labeled subset of our EnMAP data and by +15% on the Houston2018 hyperspectral dataset, making it competitive with a strong 3D convolutional neural network baseline. In an ablation study on label efficiency based on the Houston2018 dataset, self-supervised pre-training significantly improves transformer accuracy when little labeled training data is available. The self-supervised model outperforms both randomly initialized transformers and the 3D convolutional neural network by +7-8% when only 0.1-10% of the training labels are available.
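To make the blockwise-patch-embedding and spectral-positional-embedding ideas concrete, here is a small PyTorch sketch under stated assumptions: the band count, block size, patch size, embedding width, and tensor layout are all illustrative choices, not the configuration used in the paper.

```python
# Hedged sketch: tokenize a hyperspectral cube block by block along the spectral
# axis and add a learned positional embedding per spectral block.
import torch
import torch.nn as nn

bands, block, patch, dim = 224, 16, 4, 128      # EnMAP-like band count assumed
n_blocks = bands // block

# One linear projection shared across spectral blocks: each token covers a
# (block x patch x patch) sub-cube instead of all bands at once.
to_token = nn.Linear(block * patch * patch, dim)

# Learned embedding per spectral block (spatial positions would be handled
# separately in a full model; omitted here for brevity).
spectral_pos = nn.Parameter(torch.zeros(n_blocks, dim))

cube = torch.rand(1, bands, 32, 32)             # (batch, bands, H, W)

# Cut the cube into spatial patches and spectral blocks, flatten each sub-cube.
x = cube.unfold(2, patch, patch).unfold(3, patch, patch)   # (1, bands, 8, 8, p, p)
x = x.reshape(1, n_blocks, block, 8, 8, patch, patch)
x = x.permute(0, 3, 4, 1, 2, 5, 6).reshape(1, 64, n_blocks, -1)

tokens = to_token(x) + spectral_pos             # (batch, spatial patches, blocks, dim)
print(tokens.shape)                             # torch.Size([1, 64, 14, 128])
```

The resulting token grid would then be flattened and fed to a transformer encoder with spatial-spectral self-attention; masked pre-training would drop a fraction of these tokens and train the model to reconstruct them.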
ISBN (Print): 9781510633926
Medical image reconstruction is often an ill-posed inverse problem. In order to address such ill-posed inverse problems, prior knowledge of the sought-after object property is usually incorporated by means of regularization. For example, sparsity-promoting regularization in a suitable transform domain is widely used to reconstruct images with diagnostic quality from noisy and/or incomplete medical data. However, sparsity-promoting regularization may not be able to comprehensively describe the actual prior information of the objects being imaged. Deep generative models, such as generative adversarial networks (GANs), have shown great promise in learning the underlying distribution of images. Prior distributions for images estimated using GANs have been employed as a means of regularization with impressive results in several linear inverse problems in computer vision that are also relevant to medical imaging. However, in practice, it can be difficult for a GAN to comprehensively describe prior distributions, which can potentially lead to a lack of fidelity between the reconstructed image and the observed data. Recently, an image-adaptive GAN-based reconstruction method (IAGAN) was proposed to guarantee stronger data consistency by adapting the trained generative model parameters to the observed measurements. In this work, for the first time, we apply the IAGAN method to reconstruct images from undersampled magnetic resonance imaging (MRI) measurements. A state-of-the-art GAN model called Progressive Growing of GANs (ProGAN) was trained on a large number of ground truth images from the NYU fastMRI dataset, and the learned generator was subsequently employed in the IAGAN framework to reconstruct high-fidelity images from retrospectively undersampled experimental k-space data in the validation dataset. It is demonstrated that by use of the GAN-based reconstruction method with noisy and/or incomplete measurements, we can potentially recover fine structures in the object th
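The image-adaptive step can be pictured with the toy loop below: the latent code (and, optionally, the generator weights) are tuned so that the generated image agrees with the observed undersampled k-space data. The tiny generator, the random undersampling mask, and the optimizer settings are all assumptions for illustration; they stand in for a trained ProGAN and real fastMRI data.

```python
# Rough sketch of an IAGAN-style adaptation loop with a data-consistency loss
# evaluated only at the sampled k-space locations.
import torch
import torch.nn as nn

G = nn.Sequential(nn.Linear(64, 128 * 128), nn.Tanh())   # stand-in for a trained GAN generator
mask = (torch.rand(128, 128) < 0.3).float()               # retrospective undersampling mask

x_true = torch.rand(128, 128)                              # ground-truth image (unknown in practice)
y = mask * torch.fft.fft2(x_true)                          # observed undersampled k-space data

z = torch.randn(1, 64, requires_grad=True)
opt = torch.optim.Adam([z] + list(G.parameters()), lr=1e-3)

for step in range(200):
    x_hat = G(z).view(128, 128)
    # Compare the candidate image to the measurements only where data exist.
    loss = (mask * (torch.fft.fft2(x_hat) - y)).abs().pow(2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

recon = G(z).view(128, 128).detach()   # adapted reconstruction
```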
Background Fingerprint biometrics play an essential role in authentication. It remains a challenge to match fingerprints when minutiae or ridges are missing, and many fingerprints fail to match their targets due to this incompleteness. Result In this work, we modeled fingerprints with Bezier curves and proposed a novel algorithm to detect and restore fragmented ridges in incomplete fingerprints. In the proposed model, the Bezier curves' control points represent the fingerprint fragments, reducing the data size by 89% compared to image representations. The representation is lossless, as restoration from the control points fully recovers the image. Our algorithm can effectively restore incomplete fingerprints. On the SFinGe synthetic dataset, the fingerprint image matching score increased by an average of 39.54%, the EER (equal error rate) is 4.59%, and the FMR1000 (false match rate) is 2.83%, down from 6.56% (EER) and 5.93% (FMR1000) before restoration. On the FVC2004 DB1 real fingerprint dataset, the average matching score increased by 13.22%; the EER was reduced from 8.46% before restoration to 7.23%, and the FMR1000 from 20.58% to 18.01%. Moreover, we assessed the proposed algorithm against FDP-M-net and U-finger, both convolutional neural network models, on the SFinGe synthetic dataset. The results show that the average match-score improvement is 1.39% for FDP-M-net and 14.62% for U-finger, both lower than the 39.54% yielded by our algorithm. Conclusions Experimental results show that the proposed algorithm can successfully repair and reconstruct ridges in single or multiple damaged regions of incomplete fingerprint images, and hence improve the accuracy of fingerprint matching.
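The compactness claim rests on the fact that a ridge stored as Bezier control points can be rendered back to pixel coordinates exactly. The short sketch below evaluates a Bezier curve from its control points via the Bernstein basis; the control-point values are made up for demonstration and the paper's detection and restoration steps are not shown.

```python
# Evaluate a Bezier curve of arbitrary degree from its control points.
import numpy as np
from math import comb

def bezier(control_points, n_samples=100):
    """Return n_samples (x, y) points on the curve defined by the control points."""
    P = np.asarray(control_points, dtype=float)        # (degree + 1, 2) control points
    n = len(P) - 1
    t = np.linspace(0.0, 1.0, n_samples)
    basis = np.stack([comb(n, k) * t**k * (1 - t)**(n - k)
                      for k in range(n + 1)], axis=1)   # Bernstein basis, (n_samples, degree + 1)
    return basis @ P                                     # (n_samples, 2) ridge coordinates

# A hypothetical ridge fragment stored as four control points (cubic curve).
ridge = bezier([(10, 12), (40, 5), (70, 30), (95, 28)])
print(ridge[0], ridge[-1])   # the curve starts/ends exactly at the end control points
```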
ISBN (Print): 9783030542153; 9783030542146
An approach to solving inverse problems of source identification in acoustics is proposed based on fuzzy relational calculus. The compositional rule of inference connects the real and observed fuzzy acoustic images via a relational matrix that reflects the degree of completeness of the microphone-array measurement data. The fuzzy model of the acoustic field is based on 3D membership functions, for which the degree of membership decreases in proportion to the square of the distance to the source. The problem of reconstructing the acoustic field is formulated as a problem of inverse logical inference. A method for reconstructing the acoustic field from incomplete data is proposed based on solving fuzzy relational equations. The problem consists in finding the number of sound sources, their locations, and their powers that minimize the difference between the modelled and observed fuzzy acoustic images. The solutions of the equation system represent variants of the acoustic field reconstruction in the form of a main acoustic surface and a set of secondary acoustic surfaces. The main acoustic surface is generated by the least number of sources, while the set of secondary acoustic surfaces represents variants of the sound field reconstruction generated by the upper solutions for the number of sources. Since the source distribution is completely determined by the properties of the solution set, the proposed approach avoids the generation and selection of candidate sources, which simplifies the reconstruction process and reduces time costs. A combined genetic and neural algorithm provides accurate and fast reconstruction of the acoustic field for an unknown number of sources and their configuration.
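The compositional rule of inference at the heart of this formulation can be shown with a tiny numeric sketch: the observed fuzzy acoustic image is the max-min composition of the source membership vector with the relational matrix. All numbers below are invented purely to show the mechanics, not taken from the paper, and the actual search over candidate source vectors (genetic/neural) is not implemented here.

```python
# Max-min composition B_j = max_i min(A_i, R_ij), and the mismatch a solver
# would minimize when reconstructing the source vector A.
import numpy as np

def max_min(A, R):
    """Max-min composition of a membership vector A with a relation matrix R."""
    return np.max(np.minimum(A[:, None], R), axis=0)

A = np.array([0.9, 0.0, 0.4])            # candidate source memberships (powers)
R = np.array([[1.0, 0.6, 0.2],           # completeness of the microphone-array
              [0.5, 1.0, 0.5],           #   measurements between grid points
              [0.2, 0.6, 1.0]])

B_model = max_min(A, R)                   # modelled fuzzy acoustic image
B_obs = np.array([0.9, 0.6, 0.4])         # observed fuzzy acoustic image

# Reconstruction amounts to searching for A that minimizes this mismatch.
print(B_model, np.abs(B_model - B_obs).max())   # [0.9 0.6 0.4] 0.0
```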
ISBN (Print): 9781510633926
Dynamic imaging (such as computed tomography (CT) perfusion, dynamic CT angiography, dynamic positron emission tomography, four-dimensional CT, etc.) is widely used in the clinic. The multiple-scan mechanism of dynamic imaging greatly increases radiation dose and prolongs acquisition time. To deal with these problems, low-mAs or sparse-view protocols are usually adopted, which lead to noisy or incomplete data for each frame. To obtain high-quality images from the corrupted data, a popular strategy is to incorporate a composite image, reconstructed from the full dataset, into the iterative reconstruction procedure. Previous studies have tried to enforce each frame to approach the composite image in each iteration, which, however, introduces mixed temporal information into each frame. In this paper, we propose an average consistency (AC) model for dynamic CT image reconstruction. The core idea of AC is to enforce the average of all frames to approach the composite image in each iteration, which preserves image edges and noise characteristics while avoiding the intrusion of mixed temporal information. Experiments on a dynamic phantom and a patient CT perfusion study show that the proposed method obtains the best qualitative and quantitative results. We conclude that the AC model is a general framework and a superior way of using the composite image for dynamic CT reconstruction.
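The distinction between the AC model and the earlier per-frame strategy can be sketched in a few lines: only the mean over frames is pulled toward the composite image, so individual frames keep their own temporal content. The quadratic penalty, step size, and zeroed data term below are illustrative stand-ins for a full iterative CT reconstruction, not the paper's actual algorithm.

```python
# Schematic average-consistency (AC) update: regularize mean(frames) toward the
# composite image instead of regularizing every frame individually.
import numpy as np

rng = np.random.default_rng(0)
n_frames, shape = 4, (64, 64)
frames = [rng.random(shape) for _ in range(n_frames)]   # current frame estimates
composite = rng.random(shape)                            # reconstruction from the full dataset
beta, step = 0.5, 0.1

for it in range(200):
    mean_frame = np.mean(frames, axis=0)
    for k in range(n_frames):
        # data_grad would come from each frame's own noisy/sparse projections;
        # it is set to zero here so that only the AC term is visible.
        data_grad = np.zeros(shape)
        ac_grad = beta * (mean_frame - composite) / n_frames   # gradient of beta/2 * ||mean - composite||^2
        frames[k] -= step * (data_grad + ac_grad)

# The average drifts toward the composite while per-frame differences persist.
print(np.abs(np.mean(frames, axis=0) - composite).mean())
```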
Dual-panel PET scanners have many advantages in dedicated breast imaging and on-board imaging applications since the compact scanners can be combined with other imaging and treatment modalities. The major challenges o...
ISBN (Print): 9781665490085
With the emergence of Computed Tomography (CT) and Magnetic Resonance Imaging (MRI), three-dimensional images facilitate the generation of 3D models of a patient, providing new practical and accurate assistance, particularly for surgical planning. These images can be manipulated to produce an accurate 3D representation of an organ, and the reconstructed mesh can then be used to generate and visualize a deformable model during surgical intervention using Augmented Reality (AR) technology. To obtain an efficient reconstruction, segmentation of these medical images with a deep learning architecture can be used to extract the target organ's properties. Many methods have been proposed based on captured pre-operative CT scans of the patient; in general, however, the segmentation is performed manually with image processing software, and such approaches are inefficient and require human interaction to select the segmentation area correctly. This work aims to develop a deep learning method using a Convolutional Neural Network (CNN) that extracts the liver from a set of CT scans. Given preoperative patient-specific data (CT scans), the U-net architecture is implemented to detect the liver, and the resulting segmented 2D images are used to generate a patient-specific 3D liver model.
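One common way to go from stacked per-slice segmentations to a surface mesh is marching cubes; the sketch below uses scikit-image for that step, which is an assumption on my part since the abstract does not name the mesh-extraction method. The spherical dummy "segmentation" simply stands in for U-net output probabilities.

```python
# Stack per-slice liver masks into a volume and extract a triangle mesh that a
# surgical-planning or AR viewer could consume.
import numpy as np
from skimage import measure

# Pretend these probabilities came from a trained U-net, one mask per CT slice.
z, y, x = np.mgrid[-1:1:64j, -1:1:64j, -1:1:64j]
liver_prob = (np.sqrt(x**2 + y**2 + z**2) < 0.6).astype(float)   # dummy organ volume

# Volume -> surface mesh at the 0.5 probability isosurface.
verts, faces, normals, _ = measure.marching_cubes(liver_prob, level=0.5)
print(verts.shape, faces.shape)   # (N, 3) vertex coordinates, (M, 3) triangle indices
```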
ISBN (Print): 9781728193601
3D object instance reconstruction from a cluttered 2D scene image is an ill-posed problem. The main challenge is posed by the lack of geometric information in color images and by heavy occlusions that lead to incomplete shape details. To deal with this problem, existing works on 3D instance reconstruction directly learn the mapping between the intensity image and the corresponding 3D volume model. Different from these works, we propose to explicitly incorporate 2.5D geometric cues, such as the surface normal, relative depth, and height, while generating full 3D shapes from 2D images. With an intermediate step focused on estimating these 2.5D geometric features, we propose a novel convolutional neural network design that progressively moves from 2D to full 3D estimation. Our model automatically generates instance-specific surface normal maps, relative depth, and height, which are compactly encoded within our network design and consequently used to improve the 3D instance reconstruction. Our experimental results on the large-scale synthetic SUNCG dataset and the real-world NYU depth v2 dataset demonstrate the effectiveness of the proposed approach, where it beats the state-of-the-art Factored3D network [15].
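A loose PyTorch sketch of the progressive 2D → 2.5D → 3D design follows: one head predicts surface normals, relative depth, and height from the RGB crop, and a second head consumes the image together with those 2.5D maps to produce a voxel grid. The single-convolution heads, layer sizes, and output resolutions are toy assumptions, not the paper's network.

```python
# Predict 2.5D cues first, then fuse them with the image to estimate a coarse
# voxel occupancy grid for the object instance.
import torch
import torch.nn as nn

class TwoFiveDToVoxels(nn.Module):
    def __init__(self, voxels=32):
        super().__init__()
        # 2.5D head: 3 normal channels + 1 relative depth + 1 height = 5 maps.
        self.geom_head = nn.Conv2d(3, 5, 3, padding=1)
        # 3D head: image (3) + predicted 2.5D cues (5) -> voxel occupancy logits.
        self.voxel_head = nn.Sequential(
            nn.Conv2d(8, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4),
            nn.Flatten(),
            nn.Linear(32 * 4 * 4, voxels ** 3),
        )
        self.voxels = voxels

    def forward(self, rgb):
        cues = self.geom_head(rgb)                        # normals, depth, height
        occ = self.voxel_head(torch.cat([rgb, cues], 1))  # fuse image with 2.5D cues
        return cues, occ.view(-1, self.voxels, self.voxels, self.voxels)

cues, voxels = TwoFiveDToVoxels()(torch.rand(2, 3, 64, 64))
print(cues.shape, voxels.shape)   # (2, 5, 64, 64) and (2, 32, 32, 32)
```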