In comparison with conventional machine learning algorithms, deep learning can effectively express the deep features of remote sensing images. Considering the rich spectral and spatial information contained in hyperspectral images (HSIs), a combination method was proposed for HSI classification based on a stacked autoencoder (SAE) and a 3D deep residual network (3DDRN). Specifically, an SAE neural network was first built to reduce the dimensionality of the original HSIs. A 3D convolutional neural network (3DCNN) was then designed, and a residual network module was introduced to build the 3DDRN. The dimension-reduced 3D HSI cubes were input into the 3DDRN to extract discriminative joint spectral-spatial features. Finally, the deep features extracted by the 3DDRN were fed into a softmax classification layer to realize the classification. In addition, batch normalization (BN) and dropout were used during the learning process to avoid overfitting on the training data. The training and test sets of the Indian Pines (IP), Pavia University (PU), and Salinas (SA) hyperspectral data sets were selected as the modeling and verification data sources. Six classical classification algorithms were adopted for comparison with the proposed method: the conventional machine learning algorithms Radial Basis Function Support Vector Machine (RBF-SVM), Kernel Simultaneous Orthogonal Matching Pursuit (KSOMP), and Local Binary Pattern K-Nearest Neighbor (LBP-KNN), and the mainstream deep learning algorithms Variational Autoencoder (VAE), Convolutional Neural Network (CNN), and Spectral-Spatial Residual Network (SSRN). The results showed that the overall accuracy (OA) reached 98.97%, 99.69%, and 99.24% for IP, PU, and SA, respectively, using only 10%, 5%, and 1% of the samples for training. Consequently, the proposed method shows better classification performance, even in the case of limited samples.
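The pipeline this abstract describes, SAE dimensionality reduction followed by a 3D residual network over small HSI cubes and a softmax head, could be sketched as follows. This is a minimal, hypothetical PyTorch sketch: the layer widths, reduced band count, patch size, and dropout rate are illustrative assumptions, not the authors' reported configuration, and the SAE stage is assumed to have run beforehand.

```python
import torch
import torch.nn as nn

class Residual3DBlock(nn.Module):
    """3D convolutional residual block with batch normalization."""
    def __init__(self, channels):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv3d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm3d(channels),
            nn.ReLU(inplace=True),
            nn.Conv3d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm3d(channels),
        )

    def forward(self, x):
        return torch.relu(self.body(x) + x)  # identity shortcut

class HSIResNet3D(nn.Module):
    """Classifier over SAE-reduced HSI cubes shaped (B, 1, bands, H, W)."""
    def __init__(self, n_classes, width=24):
        super().__init__()
        self.stem = nn.Sequential(
            nn.Conv3d(1, width, kernel_size=3, padding=1),
            nn.BatchNorm3d(width), nn.ReLU(inplace=True))
        self.blocks = nn.Sequential(Residual3DBlock(width), Residual3DBlock(width))
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool3d(1), nn.Flatten(),
            nn.Dropout(0.5),                 # dropout against overfitting, as in the paper
            nn.Linear(width, n_classes))     # softmax applied via CrossEntropyLoss

    def forward(self, x):
        return self.head(self.blocks(self.stem(x)))

# e.g. 9 classes (Pavia University), 10 SAE-reduced bands, 7x7 spatial patches (all assumed)
model = HSIResNet3D(n_classes=9)
logits = model(torch.randn(2, 1, 10, 7, 7))   # -> (2, 9)
```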
Background: Sleep apnea is a respiratory disorder characterized by frequent breathing cessation during sleep. Sleep apnea severity is determined by the apnea-hypopnea index (AHI), which is the hourly rate of respiratory events. In positional sleep apnea, the AHI is higher in the supine sleeping position than in other sleeping positions. Positional therapy is a behavioral strategy (e.g., wearing an item to encourage sleeping in the lateral position) to treat positional apnea. The gold standard for diagnosing sleep apnea, and for determining whether it is positional, is polysomnography; however, this test is inconvenient, expensive, and has a long waiting list. Objective: The objective of this study was to develop and evaluate a noncontact method to estimate sleep apnea severity and to distinguish positional from nonpositional sleep apnea. Methods: A noncontact deep-learning algorithm was developed to analyze infrared video of sleep, estimating the AHI and distinguishing patients with positional vs nonpositional sleep apnea. Specifically, a 3D convolutional neural network (CNN) architecture was used to process movements extracted by optical flow to detect respiratory events. Patients with positional sleep apnea were subsequently identified by combining the AHI information provided by the 3D-CNN model with the sleeping position (supine vs lateral) detected via a previously developed CNN model. Results: The algorithm was validated on data from 41 participants, including 26 men and 15 women, with a mean age of 53 (SD 13) years, BMI of 30 (SD 7), AHI of 27 (SD 31) events/hour, and sleep duration of 5 (SD 1) hours; 20 participants had positional sleep apnea, 15 had nonpositional sleep apnea, and the positional status could not be discriminated for the remaining 6 participants. AHI values estimated by the 3D-CNN model correlated strongly and significantly with the gold standard (Spearman correlation coefficient 0.79, P<.001). Individuals with positional sleep apnea (based o
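The core of this method, dense optical flow computed from infrared frames and stacked into clips for a 3D-CNN, might look like the sketch below. It is a hedged illustration assuming OpenCV's Farnebäck implementation and PyTorch; the clip length, frame size, layer widths, and two-class output are assumptions for demonstration, not the authors' architecture.

```python
import cv2
import numpy as np
import torch
import torch.nn as nn

def flow_clip(gray_frames):
    """Stack dense Farneback optical flow fields (dx, dy) over consecutive
    grayscale infrared frames -> tensor of shape (2, T-1, H, W)."""
    flows = []
    for prev, nxt in zip(gray_frames, gray_frames[1:]):
        # positional args: pyr_scale, levels, winsize, iterations, poly_n, poly_sigma, flags
        f = cv2.calcOpticalFlowFarneback(prev, nxt, None, 0.5, 3, 15, 3, 5, 1.2, 0)
        flows.append(f.transpose(2, 0, 1))            # (H, W, 2) -> (2, H, W)
    return torch.from_numpy(np.stack(flows, axis=1).astype(np.float32))

class RespiratoryEvent3DCNN(nn.Module):
    """Toy clip classifier: respiratory event vs. normal breathing."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv3d(2, 16, 3, padding=1), nn.ReLU(), nn.MaxPool3d(2),
            nn.Conv3d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool3d(2),
            nn.AdaptiveAvgPool3d(1), nn.Flatten(), nn.Linear(32, 2))

    def forward(self, clips):                          # clips: (B, 2, T, H, W)
        return self.net(clips)

# 17 synthetic infrared frames -> one 16-step flow clip -> event logits
frames = [np.random.randint(0, 255, (64, 64), np.uint8) for _ in range(17)]
logits = RespiratoryEvent3DCNN()(flow_clip(frames).unsqueeze(0))   # -> (1, 2)
```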
The reduction of visibility adversely affects land, marine, and air transportation. Thus, the ability to skillfully predict fog would provide utility. We predict fog visibility categories below 1600 m, 3200 m, and 6400 m by post-processing numerical weather prediction model output and satellite-based sea surface temperature (SST) using a 3D convolutional neural network (3D-CNN). The target is an airport located on a barrier island adjacent to a major US port; measured visibility from this airport serves as a proxy for fog that develops over the port. The features chosen to calibrate and test the model originate from the North American Mesoscale Forecast System, with values of each feature organized on a 32 x 32 horizontal grid; the SSTs were obtained from the NASA Multiscale Ultra Resolution dataset. The input to the model is organized as a high-dimensional cube containing 288 to 384 layers of 2D horizontal fields of meteorological variables (predictor maps). In this 3D-CNN (hereafter, FogNet), two parallel branches of feature extraction have been designed: one for spatially auto-correlated features (spatial-wise dense block and attention module), and the other for correlations between input variables (variable-wise dense block and attention mechanism). To extract features representing processes occurring at different scales, a 3D multiscale dilated convolution is used. Data from 2009 to 2017 (2018 to 2020) are used to calibrate (test) the model. FogNet performance results for 6-, 12-, and 24-h lead times are compared to results from the High-Resolution Ensemble Forecast (HREF) system. FogNet outperformed HREF on 8 standard evaluation metrics.
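FogNet's 3D multiscale dilated convolution, parallel 3D convolutions at several dilation rates whose outputs are concatenated so a single layer sees processes at different spatial scales, could be sketched as follows. This is a minimal PyTorch sketch under assumed channel counts and dilation rates; the dense blocks and the spatial-wise/variable-wise attention modules of the actual FogNet are not reproduced here.

```python
import torch
import torch.nn as nn

class MultiscaleDilated3D(nn.Module):
    """Parallel 3D convolutions with increasing dilation rates; outputs are
    concatenated along the channel axis to mix scales in one layer."""
    def __init__(self, in_ch, out_ch, dilations=(1, 2, 4)):
        super().__init__()
        # padding = dilation keeps the output the same size for a 3x3x3 kernel
        self.paths = nn.ModuleList([
            nn.Conv3d(in_ch, out_ch, kernel_size=3, padding=d, dilation=d)
            for d in dilations])

    def forward(self, x):
        return torch.cat([p(x) for p in self.paths], dim=1)

# Input cube: batch x 1 channel x 288 predictor maps x 32 x 32 horizontal grid
x = torch.randn(1, 1, 288, 32, 32)
y = MultiscaleDilated3D(in_ch=1, out_ch=8)(x)   # -> (1, 24, 288, 32, 32)
```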
ISBN (print): 9781538670804; 9788993215168
In recent years, the proportion of deaths from cancer has tended to increase in Japan; in particular, the number of deaths from lung cancer is rising. CT is effective for the early detection of lung cancer. However, there is concern that the improving performance of CT devices will increase the burden on doctors. Presenting a "second opinion" via a computer-aided detection (CAD) system can therefore reduce that burden. In this paper, we develop a CAD system for the automatic detection of lesion candidate regions, such as lung nodules or ground-glass opacity (GGO), from 3D CT images. Our proposed method consists of three steps. In the first step, lesion candidate regions are extracted using a temporal subtraction technique. In the second step, the image is reconstructed by sparse coding for the extracted region. In the final step, 3D convolutional neural network (3D-CNN) identification is performed using the reconstructed images. We applied our method to 51 cases and obtained a True Positive rate (TP) of 79.81% and a False Positive rate (FP) of 37.65%.
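The second step, reconstructing each candidate region from a learned sparse code before classification, could be prototyped with scikit-learn's dictionary learning. This is a hypothetical sketch: the patch size, dictionary size, and sparsity level are illustrative assumptions, and the paper's actual sparse coding formulation may differ.

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

# Candidate ROI patches extracted by temporal subtraction, flattened to vectors
# (500 hypothetical 8x8x8 voxel regions)
patches = np.random.rand(500, 8 * 8 * 8)

# Learn a dictionary, encode each patch sparsely (OMP with 10 nonzero atoms),
# and reconstruct; the reconstructions would feed the 3D-CNN classifier
dico = MiniBatchDictionaryLearning(n_components=128,
                                   transform_algorithm='omp',
                                   transform_n_nonzero_coefs=10,
                                   random_state=0)
codes = dico.fit(patches).transform(patches)      # (500, 128) sparse codes
reconstructed = codes @ dico.components_          # (500, 512) reconstructed patches
```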
The problem of class imbalance exists in detecting pulmonary nodules from Computed Tomography (CT) by means of convolutional neural networks. A Three-dimensional Detector based on Focal Loss (FLTDD) is designed in this paper so that pulmonary nodules in CT can be identified more accurately. Its framework focuses more on samples that are difficult to classify. Besides, the three-dimensional detector captures richer spatial information and obtains more discriminative features. The experimental results obtained on the LIDC-IDRI data set show that the average sensitivity score of FLTDD reaches 89.62%, a 1.47% improvement over the published CASED method.
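The focal loss the detector builds on (Lin et al., 2017) down-weights well-classified examples so training concentrates on hard candidates, which is how it counters class imbalance. A minimal PyTorch sketch of the binary form is below; the alpha and gamma values are the commonly used defaults, not necessarily those of FLTDD.

```python
import torch
import torch.nn.functional as F

def focal_loss(logits, targets, alpha=0.25, gamma=2.0):
    """Binary focal loss: (1 - p_t)^gamma scales the cross-entropy so easy
    (high-confidence) examples contribute little to the gradient."""
    p = torch.sigmoid(logits)
    ce = F.binary_cross_entropy_with_logits(logits, targets, reduction='none')
    p_t = p * targets + (1 - p) * (1 - targets)          # prob of the true class
    alpha_t = alpha * targets + (1 - alpha) * (1 - targets)
    return (alpha_t * (1 - p_t) ** gamma * ce).mean()

# e.g. anchor-level logits from a 3D detector vs. nodule/background labels
loss = focal_loss(torch.randn(16), torch.randint(0, 2, (16,)).float())
```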
ISBN (print): 9781450372213
This study presents a vision-based human action recognition system using a deep learning technique. The system can recognize human actions successfully when the camera of a robot is moving toward the target person from various directions. Therefore, the proposed method is useful for the vision systems of indoor mobile robots. The system uses three types of information to recognize human actions: color videos, optical flow videos, and depth videos. First, Kinect 2.0 captures color videos and depth videos simultaneously using its RGB camera and depth sensor. Second, histogram of oriented gradients features are extracted from the color videos, and a support vector machine is used to detect the human region. Based on the detected human region, the frames of the color video are cropped, and the corresponding frames of the optical flow video are obtained using the Farnebäck method (https://***=.org/3.4/d4/dee/tutorial_optical_***). The number of frames in these videos is then unified using a frame sampling technique. Subsequently, these three types of videos are input into three modified 3D convolutional neural networks (3D CNNs) separately. The modified 3D CNNs extract the spatiotemporal features of human actions and recognize them. Finally, these recognition results are integrated to output the final recognition result. The proposed system can recognize 13 types of human actions, namely, drink (sit), drink (stand), eat (sit), eat (stand), read, sit down, stand up, use a computer, walk (horizontal), walk (straight), play with a phone/tablet, walk away from each other, and walk toward each other. The average human action recognition rate on 369 test human action videos was 96.4%, indicating that the proposed system is robust and efficient.
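The frame sampling and three-stream fusion steps might be implemented as follows. This is a hedged sketch: the clip length of 16 frames and the probability-averaging fusion rule are assumptions for illustration, since the abstract does not specify how the three recognition results are integrated.

```python
import numpy as np
import torch

def sample_frames(frames, n=16):
    """Uniformly sample a fixed number of frames so clips of different
    lengths share a common temporal size before entering the 3D CNNs."""
    idx = np.linspace(0, len(frames) - 1, n).round().astype(int)
    return [frames[i] for i in idx]

def fuse_predictions(color_logits, flow_logits, depth_logits):
    """Late fusion: average the class probabilities of the color, optical
    flow, and depth streams, then take the most likely action."""
    probs = [torch.softmax(l, dim=-1)
             for l in (color_logits, flow_logits, depth_logits)]
    return torch.stack(probs).mean(dim=0).argmax(dim=-1)

# three hypothetical stream outputs: batch of 4 clips, 13 action classes
c, f, d = (torch.randn(4, 13) for _ in range(3))
pred = fuse_predictions(c, f, d)   # -> (4,) predicted action indices
```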
The novel COVID-19 is a global pandemic disease growing rapidly worldwide. Computer-aided screening tools with greater sensitivity are imperative for disease diagnosis and prognosis as early as possible. It also can be a ...