Search results - Inner Mongolia University Library

Convolution-deconvolution architecture with the pyramid pooling module for semantic segmentation

MULTIMEDIA TOOLS AND APPLICATIONS, 2019, Vol. 78, No. 22, pp. 32379-32392

Authors: Malekijoo, Amirhossein; Fadaeieslam, Mohammad Javad (Semnan Univ, Elect & Comp Engn Dept, Semnan, Iran)

Recognizing the content of an image is an important challenge in machine vision. Semantic segmentation is one of the most important ways to overcome this challenge. It is utilized in different applications such as autonomous driving, indoor navigation, virtual or augmented reality systems, and recognition tasks. In this paper, a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation, termed P-DecovNet, is introduced. The proposed architecture combines the Convolution-Deconvolution Neural Network architecture with the Pyramid Pooling Module. High-level features are extracted from the image using the Convolutional Neural Network, and the Pooling module is added to the architecture to reinforce the local information. The CamVid road scene dataset was used to evaluate the performance of P-DecovNet. With respect to different criteria (including, but not limited to, accuracy and mIoU), the experimental results demonstrated that P-DecovNet performs well among Convolution-Deconvolution networks. It achieves this performance with a smaller number of training images and fewer iterations than the existing methods.

Keywords: Convolution neural network; Machine vision; Semantic pixel-wise segmentation; Convolution-deconvolution network; Road scene dataset
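The abstract above describes adding a pyramid pooling module to convolutional features. As a minimal sketch of that general idea, and not the authors' implementation, the Python below average-pools a single-channel feature map at several grid sizes, upsamples each pooled grid back to the input size by nearest-neighbour replication, and stacks the results as extra channels. The bin sizes, the function names, and the nearest-neighbour upsampling are illustrative assumptions; the abstract does not state the configuration used in P-DecovNet.

```python
def adaptive_avg_pool(fmap, bins):
    """Average-pool a 2D feature map (list of rows) into a bins x bins grid."""
    h, w = len(fmap), len(fmap[0])
    pooled = []
    for i in range(bins):
        # Floor for the start and ceiling for the end, so no window is empty.
        r0, r1 = (i * h) // bins, ((i + 1) * h + bins - 1) // bins
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, ((j + 1) * w + bins - 1) // bins
            cells = [fmap[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        pooled.append(row)
    return pooled


def upsample_nearest(grid, h, w):
    """Stretch a bins x bins grid back to h x w by replicating cells."""
    bins = len(grid)
    return [[grid[(r * bins) // h][(c * bins) // w] for c in range(w)]
            for r in range(h)]


def pyramid_pool(fmap, bin_sizes=(1, 2)):
    """Return the input map plus one context channel per pyramid level.

    bin_sizes is a hypothetical choice made for this illustration.
    """
    h, w = len(fmap), len(fmap[0])
    channels = [fmap]
    for bins in bin_sizes:
        channels.append(upsample_nearest(adaptive_avg_pool(fmap, bins), h, w))
    return channels


# A 4 x 4 map holding the values 0..15.
feature_map = [[4 * r + c for c in range(4)] for r in range(4)]
context = pyramid_pool(feature_map)
```

Each extra channel carries the same spatial size as the input, so a later convolution can mix pixel-level features with region-level averages at every position.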

Semantic segmentation of JPEG blocks using a deep CNN for non-aligned JPEG forgery detection and localization

MULTIMEDIA TOOLS AND APPLICATIONS, 2020, Vol. 79, No. 11-12, pp. 8249-8265

Authors: Alipour, Neda; Behrad, Alireza (Shahed Univ, Dept Elect Engn, Tehran, Iran)

In this paper, a new approach is proposed for non-aligned JPEG forgery detection and localization. Our method is based on the semantic pixel-wise segmentation of JPEG blocks using a deep neural network. Semantic segmentation is the process of assigning each pixel of an image to a class label. We train a deep Convolutional Neural Network (CNN) to segment the boundaries of JPEG blocks. The trained deep CNN can accurately detect block boundaries related to various JPEG compressions. Therefore, non-aligned JPEG forgeries can be detected and localized by finding irregularities in the segmented block boundaries. The proposed approach can detect and localize JPEG forgeries with the same and different quantization matrices, as well as image forgeries with several compression stages. We tested the proposed algorithm with various forged and authentic images and compared the results with state-of-the-art approaches. Experimental results showed that the proposed CNN-based algorithm performs well for non-aligned JPEG forgery detection and localization.

Keywords: JPEG forgery detection and localization; Image forensics; Deep convolutional neural network; Semantic pixel-wise segmentation
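The abstract above localizes forgeries by looking for irregularities in the segmented block boundaries. One way this could look in practice is sketched below, as an assumption-laden illustration rather than the authors' procedure: given a binary boundary mask (1 where a pixel was labelled as a block boundary), each tile votes for its dominant column offset modulo 8, and tiles whose offset disagrees with the image-wide offset are flagged. The tile width, the function names, and the restriction to vertical boundaries are all hypothetical simplifications.

```python
from collections import Counter

BLOCK = 8  # JPEG codes images in 8 x 8 blocks


def column_offset(mask, c0, c1):
    """Most common (column mod 8) among boundary pixels in columns [c0, c1)."""
    votes = Counter(c % BLOCK
                    for row in mask
                    for c in range(c0, c1)
                    if row[c])
    return votes.most_common(1)[0][0] if votes else None


def misaligned_tiles(mask, tile=16):
    """Return the image-wide grid offset and the tiles that disagree with it.

    tile is a hypothetical width chosen for this illustration.
    """
    width = len(mask[0])
    reference = column_offset(mask, 0, width)
    flagged = []
    for c0 in range(0, width, tile):
        offset = column_offset(mask, c0, min(c0 + tile, width))
        if offset is not None and offset != reference:
            flagged.append((c0, offset))
    return reference, flagged


# Synthetic mask: grid aligned at offset 0, except a pasted region
# (columns 32..47) whose grid is shifted by 3 pixels.
mask = [[1 if (c % BLOCK == 0 if c < 32 else c % BLOCK == 3) else 0
         for c in range(48)]
        for _ in range(4)]
reference, flagged = misaligned_tiles(mask)
```

A full detector would also vote on row offsets and would need to tolerate noisy boundary predictions; this sketch only shows why a shifted grid stands out once boundaries are labelled per pixel.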

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, Vol. 39, No. 12, pp. 2481-2495

Authors: Badrinarayanan, Vijay; Kendall, Alex; Cipolla, Roberto (Univ Cambridge, Dept Engn, Machine Intelligence Lab, Cambridge CB2 1TN, England)

We present a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine consists of an encoder network and a corresponding decoder network, followed by a pixel-wise classification layer. The architecture of the encoder network is topologically identical to the 13 convolutional layers in the VGG16 network [1]. The role of the decoder network is to map the low resolution encoder feature maps to full input resolution feature maps for pixel-wise classification. The novelty of SegNet lies in the manner in which the decoder upsamples its lower resolution input feature map(s). Specifically, the decoder uses pooling indices computed in the max-pooling step of the corresponding encoder to perform non-linear upsampling. This eliminates the need for learning to upsample. The upsampled maps are sparse and are then convolved with trainable filters to produce dense feature maps. We compare our proposed architecture with the widely adopted FCN [2] and also with the well-known DeepLab-LargeFOV [3] and DeconvNet [4] architectures. This comparison reveals the memory versus accuracy trade-off involved in achieving good segmentation performance. SegNet was primarily motivated by scene understanding applications. Hence, it is designed to be efficient both in terms of memory and computational time during inference. It is also significantly smaller in the number of trainable parameters than other competing architectures and can be trained end-to-end using stochastic gradient descent. We also performed a controlled benchmark of SegNet and other architectures on both road scenes and SUN RGB-D indoor scene segmentation tasks. These quantitative assessments show that SegNet provides good performance with competitive inference time and the most efficient inference memory-wise as compared to other architectures. We also provide a Caffe implementation of SegNet and a web demo at http://***

Keywords: Deep convolutional neural networks; Semantic pixel-wise segmentation; Indoor scenes; Road scenes; Encoder; Decoder; Pooling; Upsampling
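The abstract above states that the decoder upsamples with the pooling indices recorded during encoder max-pooling, which yields sparse maps and needs no learned upsampling. The plain-Python sketch below illustrates that mechanism on a single-channel map; it is not the released Caffe implementation, and the function names and the 2 x 2 window are assumptions made for the example.

```python
def max_pool_with_indices(fmap, k=2):
    """k x k max-pooling that also records where each maximum came from."""
    h, w = len(fmap), len(fmap[0])
    pooled, indices = [], []
    for r0 in range(0, h, k):
        pooled_row, index_row = [], []
        for c0 in range(0, w, k):
            window = [(fmap[r][c], (r, c))
                      for r in range(r0, min(r0 + k, h))
                      for c in range(c0, min(c0 + k, w))]
            value, position = max(window, key=lambda item: item[0])
            pooled_row.append(value)
            index_row.append(position)
        pooled.append(pooled_row)
        indices.append(index_row)
    return pooled, indices


def max_unpool(pooled, indices, h, w):
    """Place each pooled value back at its recorded position; zeros elsewhere."""
    out = [[0] * w for _ in range(h)]
    for pooled_row, index_row in zip(pooled, indices):
        for value, (r, c) in zip(pooled_row, index_row):
            out[r][c] = value
    return out


encoder_map = [[1, 3, 2, 0],
               [4, 2, 1, 5],
               [0, 1, 9, 2],
               [7, 6, 3, 4]]
pooled, indices = max_pool_with_indices(encoder_map)
sparse = max_unpool(pooled, indices, 4, 4)
```

The unpooled map is mostly zeros, which is why the abstract follows this step with trainable convolutions that densify it. Only the index positions need to be kept from the encoder, not the full feature maps.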

Convolutional neural network for automated mass segmentation in mammography

BMC BIOINFORMATICS, 2020, Vol. 21, Suppl. 1, p. 192

Authors: Abdelhafiz, Dina; Bi, Jinbo; Ammar, Reda; Yang, Clifford; Nabavi, Sheida (Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269, USA; City Sci Res & Technol Applicat SRTA City, Informat Res Inst IRI, Alexandria, Egypt; Univ Connecticut, Dept Diagnost Imaging, Hlth Ctr, Farmington, CT 06030, USA)

Background: Automatic segmentation and localization of lesions in mammogram (MG) images are challenging even when employing advanced methods such as deep learning (DL). We developed a new model based on the architecture of the semantic segmentation U-Net model to precisely segment mass lesions in MG images. The proposed end-to-end convolutional neural network (CNN) based model extracts contextual information by combining low-level and high-level features. We trained the proposed model using large publicly available databases (CBIS-DDSM, BCDR-01, and INbreast) and a private database from the University of Connecticut Health Center (UCHC). Results: We compared the performance of the proposed model with those of state-of-the-art DL models, including the fully convolutional network (FCN), SegNet, Dilated-Net, original U-Net, and Faster R-CNN models, and with the conventional region growing (RG) method. The proposed Vanilla U-Net model outperforms the Faster R-CNN model significantly in terms of runtime and the Intersection over Union (IOU) metric. Trained with digitized film-based and fully digitized MG images, the proposed Vanilla U-Net model achieves a mean test accuracy of 92.6%. The proposed model achieves a mean Dice coefficient index (DI) of 0.951 and a mean IOU of 0.909, which show how close the output segments are to the corresponding lesions in the ground truth maps. Data augmentation was very effective in our experiments, increasing the mean DI from 0.922 to 0.951 and the mean IOU from 0.856 to 0.909, *** proposed Vanilla U-Net based model can be used for precise segmentation of masses in MG images. This is because the segmentation process incorporates more multi-scale spatial context, and captures more local and global context to predict a precise pixel-wise segmentation map of an input full MG image. These detected maps can help radiologists in differentiating benign and malignant lesions depend on the lesion s

Keywords: Mammograms (MGs); Breast cancer; Deep learning (DL); Convolutional neural networks (CNNs); Machine learning (ML); Computer-aided detection (CAD); U-Net; Vanilla U-Net; SegNet; Ground truth maps (GTMs); Detection; Semantic pixel-wise segmentation; Localization; Pre-processing; Region growing
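The abstract above reports a Dice coefficient index (DI) and an Intersection over Union (IOU) as measures of how closely predicted segments match the ground truth maps. The sketch below shows the standard definitions of both metrics for a pair of binary masks; the function name and the convention of returning 1.0 when both masks are empty are assumptions for this example, and the paper's exact evaluation protocol is not given in the abstract.

```python
def dice_and_iou(pred, truth):
    """Dice coefficient and IoU for two binary masks (lists of 0/1 rows)."""
    intersection = sum(1
                       for pred_row, truth_row in zip(pred, truth)
                       for p, t in zip(pred_row, truth_row)
                       if p and t)
    pred_area = sum(sum(row) for row in pred)
    truth_area = sum(sum(row) for row in truth)
    union = pred_area + truth_area - intersection
    if union == 0:
        # Both masks empty: treated here as a perfect match (a convention).
        return 1.0, 1.0
    dice = 2 * intersection / (pred_area + truth_area)
    iou = intersection / union
    return dice, iou


predicted = [[1, 1, 0],
             [0, 1, 0]]
ground_truth = [[1, 0, 0],
                [0, 1, 1]]
dice, iou = dice_and_iou(predicted, ground_truth)
```

For a single pair of masks the two metrics are linked by DI = 2 * IOU / (1 + IOU), so Dice is never lower than IoU. Averages taken over many images need not satisfy that identity exactly.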
