检索结果-内蒙古大学图书馆

16th ASME International Manufacturing Science and Engineering Conference (MSEC)

作者： Qu, Yongzhi Vogl, Gregory W. Wang, Zechao Univ Minnesota Duluth MN 55812 USA NIST Gaithersburg MD 20899 USA Wuhan Univ Technol Wuhan Peoples R China

ISBN: (纸本)9780791885079

The frequency response function (FRF), defined as the ratio between the Fourier transform of the time-domain output and the Fourier transform of the time-domain input, is a common tool to analyze the relationships between inputs and outputs of a mechanical system. Learning the FRF for mechanical systems can facilitate system identification, condition-based health monitoring, and improve performance metrics, by providing an input-output model that describes the system dynamics. Existing FRF identification assumes there is a one-to-one mapping between each input frequency component and output frequency component. However, during dynamic operations, the FRF can present complex dependencies with frequency cross-correlations due to modulation effects, nonlinearities, and mechanical noise. Furthermore, existing FRFs assume linearity between inputoutput spectrums with varying mechanical loads, while in practice FRFs can depend on the operating conditions and show high nonlinearities. Outputs of existing neural networks are typically low-dimensional labels rather than real-time high-dimensional measurements. This paper proposes a vector regression method based on deep neural networks for the learning of runtime FRFs from measurement data under different operating conditions. More specifically, a neural network based on an encoder-decoder with a symmetric compression structure is proposed. The deep encoder-decoder network features simultaneous learning of the regression relationship between input and output embeddings, as well as a discriminative model for output spectrum classification under different operating conditions. The learning model is validated using experimental data from a high-pressure hydraulic test rig. The results show that the proposed model can learn the FRF between sensor measurements under different operating conditions with high accuracy and denoising capability. The learned FRF model provides an estimation for sensor measurements when a physical sensor

关键词： Frequency response function encoder-decoder Neural network Deep learning

来源：评论

学校读者我要写书评

暂无评论

NON-PARALLEL MANY-TO-MANY VOICE CONVERSION BY KNOWLEDGE TRANSFER FROM A TEXT-TO-SPEECH MODEL

NON-PARALLEL MANY-TO-MANY VOICE CONVERSION BY KNOWLEDGE TRAN...

引用

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

作者： Yu, Xinyuan Mak, Brian Hong Kong Univ Sci & Technol Dept Comp Sci & Engn Hong Kong Peoples R China

ISBN: (纸本)9781728176055

In this paper, we present a simple but novel framework to train a non-parallel many-to-many voice conversion (VC) model based on the encoder-decoder architecture. It is observed that an encoder-decoder text-to-speech (TTS) model and an encoder-decoder VC model have the same structure. Thus, we propose to pre-train a multi-speaker encoder-decoder TTS model and transfer knowledge from the TTS model to a VC model by (1) adopting the TTS acoustic decoder as the VC acoustic decoder, and (2) forcing the VC speech encoder to learn the same speaker-agnostic linguistic features from the TTS text encoder so as to achieve speaker disentanglement in the VC encoder output. We further control the conversion of the pitch contour from source speech to target speech, and condition the VC decoder on the converted pitch contour during inference. Subjective evaluation shows that our proposed model is able to handle VC between any speaker pairs in the training speech corpus of over 200 speakers with high naturalness and speaker similarity.

关键词： text-to-speech generation voice conversion encoder-decoder knowledge transfer

来源：评论

学校读者我要写书评

暂无评论

Edge Intelligence with Deep Learning in Greenhouse Management 10

Edge Intelligence with Deep Learning in Greenhouse Managemen...

引用

10th International Conference on Smart Cities and Green ICT Systems (SMARTGREENS)

作者： Proietti, Massimiliano Bianchi, Federico Marini, Andrea Menculini, Lorenzo Termite, Loris Francesco Garinei, Alberto Biondi, Lorenzo Marconi, Marcello Idea Re Srl Perugia Italy Guglielmo Marconi Univ Dept Sustainabil Engn Rome Italy K Digitale Srl Perugia Italy

ISBN: (纸本)9789897585128

This paper presents a methodology to control greenhouse operations based on deep learning. The proposed methodology employs Artificial Intelligence algorithms working on edge devices, allowing the detection of anomalies in plants growth and greenhouse control equipment, in view of taking possible corrective actions. Edge Intelligence allows the greenhouse to work independently of the network to which it is connected. It also guarantees privacy to the processed data and contributes to fast and efficient decision-making. In this work, a Long-Short Time Memory encoder-decoder architecture is used for greenhouse anomaly detection. The best performance is achieved when using one LSTM layer and 64 LSTM units.

关键词： Greenhouse Farming Deep Learning Computer Vision Edge Intelligence Anomaly Detection encoder-decoder Smart Local Systems

来源：评论

学校读者我要写书评

暂无评论

基于SBAS-InSAR与神经网络-数据同化机场跑道沉降预测研究

基于SBAS-InSAR与神经网络-数据同化机场跑道沉降预测研究

引用

作者：王志鹏中国民用航空飞行学院

学位级别：硕士

机场跑道沉降将增加飞机起降风险,使飞机在该过程中遭受震动和冲击,进而导致飞机机身受损,甚至是机毁人亡。尽管既有激光测量技术、GPS监测技术、地面震动测试技术、分布式光纤技术等跑道沉降监测预警技术具有高精度优势,但不是存在成... 详细信息

机场跑道沉降将增加飞机起降风险,使飞机在该过程中遭受震动和冲击,进而导致飞机机身受损,甚至是机毁人亡。尽管既有激光测量技术、GPS监测技术、地面震动测试技术、分布式光纤技术等跑道沉降监测预警技术具有高精度优势,但不是存在成本高就是存在干扰跑道正常运行等问题,因而如何建立低成本、非干扰式的跑道沉降监测预警模型,是当前研究的一个热点。针对该问题,本文提出了一种基于SBAS-In SAR和神经网络-数据同化的机场跑道沉降预测模型。(1)通过SBAS-In SAR技术克服现有沉降监测方法检测范围受监测点影响的缺点,充分利用In SAR技术的监测全面性,为沉降预测提供准确的基础数据。(2)通过三次样条插值方法对SBAS-In SAR获取的沉降时间序列数据进行插值处理,符合实际沉降趋势的同时扩充数据数量,为沉降预测模型提供多量数据。(3)构建LSTM和BP人工神经网络数据驱动模型,实现自主学习数据特征,提高预测模型的泛化能力。(4)引入En KF数据同化方法,构建En KF-LSTM和En KF-BP人工神经网络数据驱动模型。通过En KF代替LSTM和BP模型中使用梯度反向传播的优化方法,避免模型梯度优化产生的梯度爆炸或梯度消失问题,融入观测值实现模型的可持续性学习功能。(5)构建En KF-LSTM-ED模型,通过实现encoder-decoder结构,预测方法自循环预测更改为序列到序列的预测,提高En KF-LSTM模型针对时序数据中短期预测能力。结果表明:本文提出模型具有一定的预测准确性。本模型中,SBAS-In SAR技术获取了康定机场和双流机场跑道沉降时序数据,分析主要沉降区域和数据变化趋势,设计实验案例对比说明,En KF-LSTM模型和En KF-BP模型较未优化模型的预测准确性提升约12%,并且En KF-LSTM模型较En KF-BP模型具有更强的预测能力;序列到序列的En KF-LSTM-ED预测方法较循环预测方法可信预测结果提高到30天。

关键词：机场跑道安全沉降预测 SBAS-InSAR 数据同化神经网络 encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

Machine learning and deep learning-based sentiment analysis of IMDB user reviews

Machine learning and deep learning-based sentiment analysis ...

引用

作者： Shengjie Xia School of Information Science and Engineering Lanzhou University

Movie reviews have always been a popular and enduring subject of interest among researchers. Sentiment analysis plays a significant role in this domain. The utilization of machine learning and natural language processing techniques can provide valuable insights into the emotional responses of audiences towards movies, as well as facilitate the appraisal of their reputation and market potential. This is achieved through the analysis of sentiment expressed in movie reviews. Furthermore, this approach is highly valuable in various application domains such as data mining, web mining, and social media analysis. This paper aims to conduct a comparative analysis by utilizing typical models based on machine learning and neural networks,along with the integration of natural language processing techniques. The IMDB database, which contains 50,000 reviews, will be used, and data preprocessing will be performed before applying these models. By comparing the accuracy of each model, insights regarding movie reviews can be derived.

关键词： Machine learning encoder-decoder sentiment analysis movies reviews binary classification

来源：评论

学校读者我要写书评

暂无评论

Split, Embed and Merge: An accurate table structure recognizer

引用

PATTERN RECOGNITION 2022年 126卷 108565-108565页

作者： Zhang, Zhenrong Zhang, Jianshu Du, Jun Wang, Fengren Univ Sci & Technol China Natl Engn Res Ctr Speech & Language Informat Proc 96 JinZhai Rd Hefei Anhui Peoples R China IFLYTEK Res Hefei Anhui Peoples R China

Table structure recognition is an essential part for making machines understand tables. Its main task is to recognize the internal structure of a table. However, due to the complexity and diversity in their structure and style, it is very difficult to parse the tabular data into the structured format which machines can understand, especially for complex tables. In this paper, we introduce Split, Embed and Merge (SEM), an accurate table structure recognizer. SEM is mainly composed of three parts, splitter, embedder and merger. In the first stage, we apply the splitter to predict the potential regions of the table row/column separators, and obtain the fine grid structure of the table. In the second stage, by taking a full consideration of the textual information in the table, we fuse the output features for each table grid from both vision and text modalities. Moreover, we achieve a higher precision in our experiments through providing additional textual features. Finally, we process the merging of these basic table grids in a self-regression manner. The corresponding merging results are learned through the attention mechanism. In our experiments, SEM achieves an average F1-Measure of 97 . 11% on the SciTSR dataset which outperforms other methods by a large margin. We also won the first place of complex tables and third place of all tables in Task-B of ICDAR 2021 Competition on Scientific Literature Parsing. Extensive experiments on other publicly available datasets further demonstrate the effectiveness of our proposed approach. (c) 2022 Elsevier Ltd. All rights reserved.

关键词： Table structure recognition Self-regression Attention mechanism encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

PhyCRNet: Physics-informed convolutional-recurrent network for solving spatiotemporal PDEs

引用

COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING 2022年 389卷 114399-114399页

作者： Ren, Pu Rao, Chengping Liu, Yang Wang, Jian-Xun Sun, Hao Northeastern Univ Dept Civil & Environm Engn Boston MA 02115 USA Northeastern Univ Dept Mech & Ind Engn Boston MA 02115 USA Univ Notre Dame Dept Aerosp & Mech Engn Notre Dame IN 46556 USA Renmin Univ China Gaoling Sch Artificial Intelligence Beijing 100872 Peoples R China Beijing Key Lab Big Data Management & Anal Method Beijing 100872 Peoples R China MIT Dept Civil & Environm Engn 77 Massachusetts Ave Cambridge MA 02139 USA

Partial differential equations (PDEs) play a fundamental role in modeling and simulating problems across a wide range of disciplines. Recent advances in deep learning have shown the great potential of physics-informed neural networks (PINNs) to solve PDEs as a basis for data-driven modeling and inverse analysis. However, the majority of existing PINN methods, based on fully-connected NNs, pose intrinsic limitations to low-dimensional spatiotemporal parameterizations. Moreover, since the initial/boundary conditions (I/BCs) are softly imposed via penalty, the solution quality heavily relies on hyperparameter tuning. To this end, we propose the novel physics-informed convolutional-recurrent learning architectures (PhyCRNet and PhyCRNet-s) for solving PDEs without any labeled data. Specifically, an encoder-decoder convolutional long short-term memory network is proposed for low-dimensional spatial feature extraction and temporal evolution learning. The loss function is defined as the aggregated discretized PDE residuals, while the I/BCs are hard-encoded in the network to ensure forcible satisfaction (e.g., periodic boundary padding). The networks are further enhanced by autoregressive and residual connections that explicitly simulate time marching. The performance of our proposed methods has been assessed by solving three nonlinear PDEs (e.g., 2D Burgers' equations, the lambda-omega and FitzHugh Nagumo reaction-diffusion equations), and compared against the start-of-the-art baseline algorithms. The numerical results demonstrate the superiority of our proposed methodology in the context of solution accuracy, extrapolability and generalizability. (C) 2021 Elsevier B.V. All rights reserved.

关键词： Convolutional-recurrent learning Partial differential equations encoder-decoder Physics-informed deep learning Residual connection Hard-encoding of I/BCs

来源：评论

学校读者我要写书评

暂无评论

Recurrent Attention and Semantic Gate for Remote Sensing Image Captioning

引用

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 2022年 60卷 1页

作者： Li, Yunpeng Zhang, Xiangrong Gu, Jing Li, Chen Wang, Xin Tang, Xu Jiao, Licheng Xidian Univ Sch Artificial Intelligence Xian 710071 Peoples R China Xi An Jiao Tong Univ Sch Elect & Informat Engn Xian 710049 Peoples R China

The remote sensing image captioning has attracted wide spread attention in remote sensing field due to its application potentiality. However, most existing approaches model limited interactions between image content and sentence and fail to exploit special characteristics of the remote sensing images. We introduce a novel recurrent attention and semantic gate (RASG) framework to facilitate the remote sensing image captioning in this article, which integrates competitive visual features and a recurrent attention mechanism to generate a better context vector for the images every time as well as enhances the representations of the current word state. Specifically, we first project each image into competitive visual features by taking the advantage of both static visual features and multiscale features. Then, a novel recurrent attention mechanism is developed to extract the high-level attentive maps from encoded features and nonvisual features, which can help the decoder recognize and focus on the effective information for understanding the complex content of the remote sensing images. Finally, the hidden states from the long short-term memory (LSTM) and other semantic references are incorporated into a semantic gate, which contributes to more comprehensive and precise semantic understanding. Comprehensive experiments on three widely used datasets, Sydney-Captions, UCM-Captions, and Remote Sensing Image Captioning Dataset, have demonstrated the superiority of the proposed RASG over a series of attentive models based on image captioning methods.

关键词： Feature extraction Semantics Visualization Remote sensing Logic gates Decoding Neural networks Attention mechanism encoder-decoder remote sensing image captioning semantic understanding

来源：评论

学校读者我要写书评

暂无评论

Aspect-based sentiment analysis with attention-assisted graph and variational sentence representation

引用

KNOWLEDGE-BASED SYSTEMS 2022年 258卷

作者： Feng, Shi Wang, Bing Yang, Zhiyao Ouyang, Jihong Jilin Univ Coll Comp Sci & Technol Changchun 130022 Peoples R China Jilin Univ Key Lab Symbol Computat & Knowledge Engn Minist Educ Changchun 130022 Peoples R China

Aspect-based sentiment analysis (ABSA) is a fine-grained task that detects the sentiment polarities of particular aspect words in a sentence. With the rise of graph convolution networks (GCNs), current ABSA models mostly use graph-based methods. These methods construct a dependency tree for each sentence, and regard each word as a unique node. To be more specific, they conduct classification using aspect representations instead of sentence representations, and update them with GCNs. However, this kind of method relies too much on the quality of the dependency tree and may lose the global sentence information, which is also helpful for classification. To deal with these, we design a new ABSA model AG-VSR. Two kinds of representations are proposed to perform the final classification, Attention-assisted Graph-based Representation (A2GR) and Variational Sentence Representation (VSR). A2GR is produced by the GCN module, which inputs a dependency tree modified by the attention mechanism. Furthermore, VSR is sampled from a distribution learned by a VAE-like encoder-decoder structure. Extensive experiments show that our model AG-VSR achieves competitive results. Our code and data have been released in https://***/wangbing1416/VAGR.(c) 2022 Elsevier B.V. All rights reserved.

关键词： Aspect-based sentiment analysis Graph neural network encoder-decoder Self-attention

来源：评论

学校读者我要写书评

暂无评论

Sintering Quality Prediction Model Based on Semi-Supervised Dynamic Time Feature Extraction Framework

引用

SENSORS 2022年第15期22卷 5861-5861页

作者： Li, Yuxuan Yang, Chunjie Sun, Youxian Zhejiang Univ Coll Control Sci & Engn State Key Lab Ind Control Technol Hangzhou 310027 Peoples R China

In the sintering process, it is difficult to obtain the key quality variables in real time, so there is lack of real-time information to guide the production process. Furthermore, these labeled data are too few, resulting in poor performance of conventional soft sensor models. Therefore, a novel semi-supervised dynamic feature extraction framework (SS-DTFEE) based on sequence pre-training and fine-tuning is proposed in this paper. Firstly, based on the DTFEE model, the time features of the sequences are extended and extracted. Secondly, a novel weighted bidirectional LSTM unit (BiLSTM) is designed to extract the latent variables of original sequence data. Based on improved BiLSTM, an encoder-decoder model is designed as a pre-training model with unsupervised learning to obtain the hidden information in the process. Next, through model migration and fine-tuning strategy, the prediction performance of labeled datasets is improved. The proposed method is applied in the actual sintering process to estimate the FeO content, which shows a significant improvement of the prediction accuracy, compared to traditional methods.

关键词： LSTM semi-supervised learning FeO content soft sensor encoder-decoder dynamic feature extraction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：