Image stitching (or mosaicing) is an active research topic with numerous use cases in computer vision, AR/VR, and computer graphics, but maintaining homogeneity among the input image sequences during the stitching/mosaicing process remains a primary limitation and major disadvantage. To tackle this limitation, this article introduces a robust and reliable image-stitching methodology (l,r-Stitch Unit), which takes multiple non-homogeneous image sequences as input and generates a reliable, panoramically stitched wide view as the final output. The l,r-Stitch Unit consists of pre-processing and post-processing sub-modules and an l,r-PanoED network, where each sub-module is a robust ensemble of several deep-learning, computer-vision, and image-handling techniques. The article also introduces a novel convolutional encoder-decoder deep neural network (l,r-PanoED network) with a unique split-encoding-network methodology to stitch non-coherent left-right stereo image pairs. The encoder network of the proposed l,r-PanoED extracts semantically rich deep feature maps from the input and maps them into a wide panoramic domain; feature extraction and feature mapping are performed simultaneously in the encoder network according to the split-encoding-network methodology. The decoder network of l,r-PanoED adaptively reconstructs the output panoramic view from the encoder network's bottleneck feature maps. The proposed l,r-Stitch Unit was rigorously benchmarked against alternative image-stitching methodologies on our custom-built traffic dataset and several other public datasets. Multiple evaluation metrics (SSIM, PSNR, MSE, L-alpha, L-beta, L-gamma, FM-rate, average latency time) and wild conditions (rotational/color/intensity variances, noise, etc.) were considered during the benchmarking analysis, and the results show that the proposed method outperformed the other image-stitching methodologies.
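The abstract's split-encoding idea (separate encoder branches for the left and right views whose features are fused into a single panoramic bottleneck) can be illustrated with a minimal PyTorch sketch. The layer sizes, depths, and width-wise fusion below are assumptions for illustration, not the authors' exact l,r-PanoED architecture.

```python
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    # 3x3 strided convolution halves the spatial resolution at each stage
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, stride=2, padding=1),
        nn.BatchNorm2d(c_out),
        nn.ReLU(inplace=True),
    )

class SplitEncoderPanoED(nn.Module):
    """Sketch of a split-encoding encoder-decoder for stereo-to-panorama stitching."""
    def __init__(self):
        super().__init__()
        # Separate encoder branches for the left and right views (split encoding).
        self.enc_left = nn.Sequential(conv_block(3, 32), conv_block(32, 64), conv_block(64, 128))
        self.enc_right = nn.Sequential(conv_block(3, 32), conv_block(32, 64), conv_block(64, 128))
        # Decoder reconstructs a wide panoramic view from the fused bottleneck.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 3, 4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, left, right):
        f_left = self.enc_left(left)     # deep features of the left view
        f_right = self.enc_right(right)  # deep features of the right view
        # Hypothetical fusion: concatenate along the width axis so the bottleneck
        # already spans the panoramic domain; the decoded output is twice as wide.
        bottleneck = torch.cat([f_left, f_right], dim=3)
        return self.decoder(bottleneck)

# Usage: a 256x256 stereo pair yields a 256x512 panoramic output in this sketch.
# pano = SplitEncoderPanoED()(torch.rand(1, 3, 256, 256), torch.rand(1, 3, 256, 256))
```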
Background: Acute ischemic stroke is one of the leading causes of death. Delineating the stroke infarct core in medical images plays a critical role in selecting the optimal stroke treatment. However, accurate estimation of the infarct core remains challenging because of 1) the large variation in the shape and location of infarct cores and 2) the complex relationships between perfusion parameters and the final tissue outcome. Methods: We develop an encoder-decoder based semantic model, the Ischemic Stroke Prediction Network (ISP-Net), to predict the infarct core after thrombolysis treatment on CT perfusion (CTP) maps. Features of native CTP, CBF (cerebral blood flow), CBV (cerebral blood volume), MTT (mean transit time), and Tmax are generated and fused with five-path convolutions for comprehensive analysis. A multi-scale atrous convolution (MSAC) block is put forward as an enriched high-level feature extractor in ISP-Net to improve prediction accuracy. A retrospective dataset collected from multiple stroke centers is used to evaluate the performance of ISP-Net. The gold-standard infarct cores are delineated on the follow-up scans, i.e., non-contrast CT (NCCT) or MRI diffusion-weighted imaging (DWI). Results: In cross-validation on the clinical dataset, we achieve a mean Dice similarity coefficient (DSC) of 0.801, precision of 81.3%, sensitivity of 79.5%, specificity of 99.5%, and area under the curve (AUC) of 0.721. Our approach yields better outcomes than several advanced deep learning methods, i.e., DeepLab V3, U-Net++, CE-Net, X-Net, and Non-local U-Net, demonstrating promising performance in infarct core prediction. No significant difference in prediction error is shown between patients with follow-up NCCT and follow-up DWI (P > 0.05). Conclusion: This study provides an approach for fast and accurate stroke infarct core estimation. We anticipate that the prediction results of ISP-Net could assist physicians in selecting thrombolysis or thrombectomy therapy.
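The two building blocks the abstract names, five-path input fusion over the perfusion maps and a multi-scale atrous convolution (MSAC) block, can be sketched as below. Dilation rates, channel counts, and the fusion operator are assumptions for illustration, not the published ISP-Net configuration.

```python
import torch
import torch.nn as nn

class MSACBlock(nn.Module):
    """Parallel dilated (atrous) convolutions capture context at several scales."""
    def __init__(self, c_in, c_out, rates=(1, 2, 4, 8)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(c_in, c_out, 3, padding=r, dilation=r),
                nn.BatchNorm2d(c_out),
                nn.ReLU(inplace=True),
            )
            for r in rates
        ])
        # 1x1 projection fuses the concatenated multi-scale responses
        self.project = nn.Conv2d(c_out * len(rates), c_out, 1)

    def forward(self, x):
        return self.project(torch.cat([b(x) for b in self.branches], dim=1))

class FivePathFusion(nn.Module):
    """One shallow convolutional path per input map (CTP, CBF, CBV, MTT, Tmax)."""
    def __init__(self, c_path=16):
        super().__init__()
        self.paths = nn.ModuleList([
            nn.Sequential(nn.Conv2d(1, c_path, 3, padding=1), nn.ReLU(inplace=True))
            for _ in range(5)
        ])

    def forward(self, maps):  # maps: list of five (B, 1, H, W) tensors
        # Concatenate the per-map features for comprehensive downstream analysis
        return torch.cat([p(m) for p, m in zip(self.paths, maps)], dim=1)
```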
ISBN (Print): 9788993215212
In Korea, strawberry producers lack efficient and precise yield forecasts, which would allow them to deploy optimal manpower, equipment, and other resources for harvesting, shipping, and marketing. Reliable estimation of the quantity of strawberry fruit with respect to ripeness level is critical for forecasting upcoming strawberry production. Typically, the quantity and ripeness of fruits are estimated manually, which is labor-intensive and time-consuming. In this case, automated yield prediction based on robotic agriculture is a realistic option. In this study, we provide an automated strawberry fruit recognition and counting system for accurate and reliable yield prediction. This paper proposes a unique neural network training approach for strawberry fruit counting and ripeness detection that combines semantic graphics for data annotation with a fully convolutional neural network (FCN). Semantic graphics, the proposed data annotation approach, is straightforward to apply, and the desired targets can be quickly tagged with little effort. Moreover, the proposed FCN is an enhanced encoder-decoder network designed specifically for efficient learning of semantic graphics. Quantitative analysis of the proposed algorithm showed a 4.47% increase in detection accuracy over prior techniques while running at a higher frame rate.
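One way to read the "semantic graphics" annotation idea is that each fruit is tagged with a single point plus a ripeness class, and a small labelled mark is rasterized per point to form the target map the FCN learns. The sketch below illustrates that interpretation; the disk radius, class encoding, and the hypothetical helper `render_semantic_graphics` are assumptions, not the authors' tool.

```python
import numpy as np

def render_semantic_graphics(points, classes, shape, radius=4):
    """Rasterize one small labelled disk per annotated fruit center.

    points:  list of (row, col) fruit centers
    classes: ripeness label per point (e.g., 1 = unripe, 2 = ripe; assumed encoding)
    shape:   (H, W) of the target map
    """
    target = np.zeros(shape, dtype=np.uint8)
    rr, cc = np.mgrid[0:shape[0], 0:shape[1]]
    for (r, c), cls in zip(points, classes):
        mask = (rr - r) ** 2 + (cc - c) ** 2 <= radius ** 2
        target[mask] = cls  # a small labelled disk stands in for a full object mask
    return target

# At inference, counting connected components per class in the predicted map
# would give a fruit count per ripeness level.
```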
MRI reconstruction is one of the critical processes of MRI machines, along with acquisition. Because signal acquisition is slow, parallel imaging and reconstruction techniques are applied for acceleration. To accelerate the acquisition process, fewer raw data are sampled simultaneously across all RF coils. The reconstruction then uses the under-sampled data from all RF coils to restore a final MR image that resembles the fully sampled MR image. These processes have been a traditional procedure inside the MRI system since the invention of the multi-coil MRI machine. This paper proposes a deep learning technique with a lightweight network. The deep neural network is capable of generating a high-quality reconstructed MR image with a high peak signal-to-noise ratio (PSNR), which also enables a high acceleration factor for MR data acquisition. The lightweight network is called the Multi-Level Pooling Encoder-Decoder Net (MLPED Net). The proposed network outperforms traditional encoder-decoder networks at 4-fold acceleration by a significant margin on every evaluation metric. The network can be trained end-to-end, and its lightweight structure reduces training time significantly. Experimental results are based on the publicly available MRI knee dataset from the fastMRI competition.
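A minimal sketch of a lightweight multi-level pooling encoder-decoder, in the spirit of the MLPED name: features from every encoder level are pooled to the bottleneck resolution and fused before decoding. All channel counts, depths, and the single-channel magnitude input are assumptions, not the published MLPED Net design.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MLPoolEncDec(nn.Module):
    def __init__(self, c=16):
        super().__init__()
        self.e1 = nn.Sequential(nn.Conv2d(1, c, 3, padding=1), nn.ReLU(inplace=True))
        self.e2 = nn.Sequential(nn.Conv2d(c, 2 * c, 3, stride=2, padding=1), nn.ReLU(inplace=True))
        self.e3 = nn.Sequential(nn.Conv2d(2 * c, 4 * c, 3, stride=2, padding=1), nn.ReLU(inplace=True))
        # 1x1 fusion of the pooled features from all encoder levels
        self.fuse = nn.Conv2d(c + 2 * c + 4 * c, 4 * c, 1)
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(4 * c, 2 * c, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(2 * c, c, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(c, 1, 3, padding=1),
        )

    def forward(self, zero_filled):  # (B, 1, H, W) magnitude of the under-sampled recon
        f1 = self.e1(zero_filled)
        f2 = self.e2(f1)
        f3 = self.e3(f2)
        size = f3.shape[-2:]
        # Multi-level pooling: bring every encoder level down to the bottleneck scale
        pooled = torch.cat([
            F.adaptive_avg_pool2d(f1, size),
            F.adaptive_avg_pool2d(f2, size),
            f3,
        ], dim=1)
        return self.decoder(self.fuse(pooled))
```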
Identification of burn depth with sufficient accuracy is a challenging problem. This paper presents a deep convolutional neural network to classify burn depth based on the altered tissue morphology of burned skin, which manifests as texture patterns in ultrasound images. The network first learns a low-dimensional manifold of unburned skin images using an encoder-decoder architecture that reconstructs them from ultrasound images of burned skin. The encoder is then re-trained to classify burn depths. The encoder-decoder network is trained using a dataset comprising B-mode ultrasound images of unburned and burned ex vivo porcine skin samples. The classifier is developed using B-mode images of burned in situ skin samples obtained from freshly euthanized postmortem pigs. The performance metrics obtained from 20-fold cross-validation show that the model can identify deep partial-thickness burns, which are the most difficult to diagnose clinically, with 99% accuracy, 98% sensitivity, and 100% specificity. The diagnostic accuracy of the classifier is further illustrated by the high area-under-the-curve values of 0.99 and 0.95 for the receiver operating characteristic and precision-recall curves, respectively. A post hoc explanation indicates that the classifier activates the discriminative textural features in the B-mode images for burn classification. The proposed model has the potential for clinical utility in assisting the clinical assessment of burn depths using a widely available clinical imaging device.
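The two-stage idea described above, reconstruction pre-training of an encoder-decoder followed by re-training the encoder with a classification head, can be sketched as follows. Layer sizes, the number of burn-depth classes, and the pooling head are assumptions for illustration only.

```python
import torch
import torch.nn as nn

class Encoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
    def forward(self, x):
        return self.net(x)

class Decoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(16, 1, 4, stride=2, padding=1),
        )
    def forward(self, z):
        return self.net(z)

# Stage 1: train Encoder+Decoder to reconstruct unburned-skin B-mode images (e.g., MSE loss).
# Stage 2: reuse and re-train the encoder with a small classification head for burn depth.
class BurnDepthClassifier(nn.Module):
    def __init__(self, encoder, n_classes=3):  # number of depth classes is an assumption
        super().__init__()
        self.encoder = encoder
        self.head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, n_classes))
    def forward(self, x):
        return self.head(self.encoder(x))
```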
ISBN (Print): 9781728151632
Background subtraction, or change detection, aims to segment moving objects from the background, and it has become a vital task in many video analysis and surveillance systems. Different approaches have been proposed to solve this problem effectively. In the past, background subtraction methods used low-level handcrafted features such as raw color spaces or local patterns. Recently, many background subtraction methods based on convolutional neural networks have demonstrated astonishing results. Therefore, in this paper, we introduce an end-to-end convolutional neural network for background subtraction. The network is based on the idea of an encoder-decoder deep neural network. In the encoder part, deep features from the target frame and the reference frame are extracted and compared to measure their difference. The decoder then converts these encoder features into a segmentation map with fine detail. Experimental results on the CDNet 2014 dataset show that the proposed structure achieves state-of-the-art performance.
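A minimal sketch of the encoder-compare-decode idea: a shared encoder embeds the target and reference frames, their features are compared, and a decoder turns the comparison into a foreground segmentation map. The shared-weight encoder and the absolute-difference comparison are assumptions for illustration, not necessarily the paper's exact design.

```python
import torch
import torch.nn as nn

class ChangeDetectionNet(nn.Module):
    def __init__(self):
        super().__init__()
        # Shared encoder applied to both the target and the reference frame
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        # Decoder converts the compared features into a per-pixel foreground map
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1),  # foreground logits
        )

    def forward(self, target, reference):
        f_t = self.encoder(target)
        f_r = self.encoder(reference)
        diff = torch.abs(f_t - f_r)  # measure the difference between the two frames
        return self.decoder(diff)    # segmentation map (apply sigmoid for probabilities)
```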