encoder-to-decoder is a newly architecture for Neural Machine Translation(NMT). Convolutional Neural Network(CNN) based on this framework has gained significant success in NMT task. Challenges remain in the practical ...
详细信息
encoder-to-decoder is a newly architecture for Neural Machine Translation(NMT). Convolutional Neural Network(CNN) based on this framework has gained significant success in NMT task. Challenges remain in the practical use of CNN model, which is in need of bilingual sentence pairs for training and each bilingual data is designed for CNN translation model needing retraining. Although some successful performance has been reported, it is an important research direction to avoid model overfitting caused by the scarcity of parallel corpus. The paper introduces a simple and efficient knowledge distillation method for regularization to solve CNN training overfitting problems by transferring the knowledge of source model to adapted model on low-resource languages in NMT task. The experiment on English-Czech dataset result shows that our model solve the over fitting problem, get better generalization, and improve the performance of a low-resource languages translation task.
Currently sketch artists are employed by the police to draw sketches of suspects based on the description given by an eye-witness. These sketches can sometimes be inaccurate due to incorrect drawings of the artist or ...
详细信息
ISBN:
(数字)9781728123394
ISBN:
(纸本)9781728123400
Currently sketch artists are employed by the police to draw sketches of suspects based on the description given by an eye-witness. These sketches can sometimes be inaccurate due to incorrect drawings of the artist or the incorrect description given by the witness. Generative Adversarial Network (GAN) is a way of training a Neural Network to output images which belong to a specific class. This network is trained by using an adversarial process which pits the generator against the discriminator in a minimax game. Traditional GANs are unable to generate high-resolution images hence, StyleGAN is used to resolve this issue. The generated images may still need to be altered to get a close match so TL-GAN is used to alter the generated image by altering the latent-space input of the StyleGAN. TL-GAN offers users the ability to finely tune one or multiple features of the face holistically. The main objective of the proposed work is to develop a Suspect Face Generation System as the sketches made by sketch artists are only 13 out of 160 times (approx. 8%) accurate. This system will help the society in reduction of misidentification of crime suspects and considerably reduce the crime rate.
Phase-coding structured light is an important technique in 3 D ***,a great challenge is the wrapped phase that causes geometry *** unwrapping methods such as spatial and temporal approaches face the problem of error p...
详细信息
Phase-coding structured light is an important technique in 3 D ***,a great challenge is the wrapped phase that causes geometry *** unwrapping methods such as spatial and temporal approaches face the problem of error propagation and low *** this paper,we propose to solve the phase unwrapping problem with a deep neural *** be specific,the phase unwrapping problem is cast to a semantic segmentation task,where the wrapped phase is the input and the fringe index for every pixel is the *** encoderdecoder architecture,which is like U-net,is adopted as the *** further propose a combined loss function by considering cross entropy loss,phase consistency loss and edge consistency *** 10000 artificially synthesized samples,the proposed method converges *** results demonstrate that the trained model well predicts fringe orders on both simulation data and real captured *** addition,it unwrapps every pixel independently and avoids phase error propagation,and further achieves accurate 3 D reconstruction.
As there is an increase in the usage of digital applications, the availability of data generated has increased to a tremendous scale. Data is an important component in almost every domain where research and analysis a...
详细信息
ISBN:
(数字)9781728141428
ISBN:
(纸本)9781728141435
As there is an increase in the usage of digital applications, the availability of data generated has increased to a tremendous scale. Data is an important component in almost every domain where research and analysis are required to solve the problems. It is available in a structured or unstructured format. Therefore, in order to get corresponding data as per the application's purpose, easily and quickly from different sources of data on the internet, an online content summarizer is desired. Summarizers makes it easier for users to understand the content without reading it completely. Abstractive Text Summarizer helps in defining the content by considering the important words and helps in creating summaries that are in a human-readable format. The main aim is to make summaries in such a way that it should not lose its context. Various Neural Network models are employed along with other machine translation models to bring about a concise summary generation. This paper aims to highlight and study the existing contemporary models for abstractive text summarization and also to explore areas for further research.
This paper presents a character-level encoder-decoder mod-eling method for question answering(QA)from large-scale knowledge bases(KB).This method improves the existing approach [9] from three ***,long short-term memor...
详细信息
ISBN:
(纸本)9783319690049
This paper presents a character-level encoder-decoder mod-eling method for question answering(QA)from large-scale knowledge bases(KB).This method improves the existing approach [9] from three ***,long short-term memory(LSTM)structures are adopted to replace the convolutional neural networks(CNN)for encoding the can-didate entities and ***,a new strategy of generating neg-ative samples for model training is ***,a data augmentation strategy is applied to increase the size of the training set by generating factoid questions using another trained encoder-decoder ***-mental results on the SimpleQuestions dataset and the Freebase5M KB demonstrates the effectiveness of the proposed method,which improves the state-of-the-art accuracy from 70.3%to 78.8%when augmenting the training set with 70,000 generated triple-question pairs.
Table-to-text generation involves using natural language to describe a table which has formal structure and valuable information. This paper introduces a two-level encoder-decoder neural model for table-to-text genera...
详细信息
Table-to-text generation involves using natural language to describe a table which has formal structure and valuable information. This paper introduces a two-level encoder-decoder neural model for table-to-text generation. To make the most of the structure which ordinarily is expressed as a set of field-value records and deal with rare words appearing in a table, this study adopts an improved encoder-decoder approach and uses field information to reprocess words in texts as decoding result. In encoder, two LSTM-RNNs used for combining fields and values that one LSTM-RNN gives priority to fields and the other gives first place to values. In decoder, two-level attention mechanism used on states encoded before to get the relation between words in the text and fields in the table and the relation between words in the text and values in the table. At last the decoding result is transformed to real words. The model is experimented on WIKIBIO and WEATHERGOV, and improves the current stateof-the-art BLEU-4 score from 44.89 to 45.77, and from 61.01 to 62.89 respectively.
暂无评论