ISBN (digital): 9781728123394
ISBN (print): 9781728123400
Currently, sketch artists are employed by the police to draw sketches of suspects based on descriptions given by eyewitnesses. These sketches can be inaccurate due to errors in the artist's drawing or in the description given by the witness. A Generative Adversarial Network (GAN) is a way of training a neural network to output images that belong to a specific class; the network is trained through an adversarial process that pits a generator against a discriminator in a minimax game. Because traditional GANs are unable to generate high-resolution images, StyleGAN is used to resolve this issue. The generated images may still need to be altered to obtain a close match, so TL-GAN is used to adjust the generated image by altering the latent-space input of the StyleGAN. TL-GAN offers users the ability to finely tune one or several facial features holistically. The main objective of the proposed work is to develop a Suspect Face Generation System, since sketches made by sketch artists are accurate only 13 out of 160 times (approximately 8%). Such a system would help society by reducing the misidentification of crime suspects and could considerably reduce the crime rate.
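To make the latent-space tuning concrete, here is a minimal sketch (not the authors' code) of the TL-GAN idea: given a pretrained StyleGAN-like generator and a linear "feature axis" learned from labeled generated faces, a latent vector is shifted along that axis to strengthen or weaken one facial attribute. The generator G, the 512-dimensional latent, and the beard_axis vector are assumptions used only for illustration.

```python
# Minimal sketch (illustrative, not the authors' code): TL-GAN-style latent editing.
import numpy as np

def edit_latent(z, feature_axis, strength):
    """Move z along a learned attribute direction to tune one facial feature."""
    axis = feature_axis / np.linalg.norm(feature_axis)  # unit direction in latent space
    return z + strength * axis                          # positive/negative strength adds/removes the attribute

# Hypothetical usage with a 512-dimensional StyleGAN-style latent vector:
z = np.random.randn(512)
beard_axis = np.random.randn(512)   # placeholder; TL-GAN learns this axis from labeled generated faces
z_edited = edit_latent(z, beard_axis, strength=2.0)
# image = G(z_edited)               # G would be the pretrained generator (not defined here)
```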
ISBN (digital): 9781728141428
ISBN (print): 9781728141435
As the usage of digital applications increases, the amount of data being generated has grown to a tremendous scale. Data is an important component in almost every domain where research and analysis are required to solve problems, and it is available in structured or unstructured formats. Therefore, to obtain data relevant to an application's purpose easily and quickly from the many data sources on the internet, an online content summarizer is desired. Summarizers make it easier for users to understand content without reading it completely. An abstractive text summarizer conveys the content by considering the important words and creates summaries in a human-readable format. The main aim is to produce summaries that do not lose the original context. Various neural network models are employed, along with other machine translation models, to generate concise summaries. This paper aims to highlight and study the existing contemporary models for abstractive text summarization and to explore areas for further research.
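As a concrete point of reference, the sketch below shows how an off-the-shelf abstractive summarizer can be run with the Hugging Face transformers library; this library and its default model are assumptions for illustration and are not part of the surveyed work.

```python
# Minimal sketch (illustrative): an off-the-shelf abstractive summarizer.
from transformers import pipeline

summarizer = pipeline("summarization")  # loads a default encoder-decoder summarization model

article = (
    "As digital applications grow, the volume of online text has exploded, "
    "and readers need concise summaries that preserve the original context..."
)
summary = summarizer(article, max_length=60, min_length=15, do_sample=False)
print(summary[0]["summary_text"])  # abstractive summary in human-readable form
```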
In recent years, deep learning approaches have gained great attention due to their superior performance and the availability of high-speed computing resources. These approaches have also been extended to the real-time processing of multimedia content, exploiting its spatial and temporal structure. In this paper, we propose a deep learning-based video description framework that first extracts visual features from video frames using deep convolutional neural networks (CNNs) and then passes the derived representations to a long short-term memory-based language model. To capture accurate information about human presence, a fine-tuned multi-task CNN is presented. The proposed pipeline is end-to-end trainable and capable of learning dense visual features along with an accurate framework for generating natural language descriptions of video streams. Evaluation is done by calculating Metric for Evaluation of Translation with Explicit ORdering (METEOR) and Recall-Oriented Understudy for Gisting Evaluation (ROUGE) scores between system-generated and human-annotated video descriptions on a carefully designed data set. The video descriptions generated by the traditional feature-learning framework and the proposed deep learning framework are also compared through their ROUGE scores.
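A minimal sketch of the general CNN-plus-LSTM captioning pattern described above, assuming PyTorch and torchvision; the VideoCaptioner class, the ResNet-18 backbone, the mean-pooling over frames, and all dimensions are illustrative simplifications rather than the paper's exact pipeline.

```python
# Minimal sketch (assumed PyTorch/torchvision): per-frame CNN features feed an LSTM language model.
import torch
import torch.nn as nn
import torchvision.models as models

class VideoCaptioner(nn.Module):
    def __init__(self, vocab_size, feat_dim=512, hidden_dim=512):
        super().__init__()
        cnn = models.resnet18(weights=None)                     # frame-level feature extractor
        self.cnn = nn.Sequential(*list(cnn.children())[:-1])    # drop the classifier head
        self.embed = nn.Embedding(vocab_size, hidden_dim)
        self.lstm = nn.LSTM(feat_dim + hidden_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, frames, captions):
        # frames: (B, T, 3, H, W) video clips; captions: (B, L) token ids
        B, T = frames.shape[:2]
        feats = self.cnn(frames.flatten(0, 1)).flatten(1)        # (B*T, 512) per-frame features
        video_feat = feats.view(B, T, -1).mean(dim=1)            # mean-pool frames -> (B, 512)
        words = self.embed(captions)                              # (B, L, hidden)
        video = video_feat.unsqueeze(1).expand(-1, words.size(1), -1)
        h, _ = self.lstm(torch.cat([video, words], dim=-1))       # condition each step on the video feature
        return self.out(h)                                        # (B, L, vocab) next-word logits

logits = VideoCaptioner(vocab_size=1000)(torch.randn(2, 4, 3, 224, 224),
                                          torch.randint(0, 1000, (2, 6)))
```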
ISBN (print): 9781450360142
In previous work on text summarization, encoder-decoder architectures and attention mechanisms have both been widely used. Attention-based encoder-decoder approaches typically take only the sentences preceding a given sentence into account for document representation, failing to capture, in the encoder, the relationships between a sentence and the sentences that follow it in the document. We propose an attentive encoder-based summarization (AES) model to generate article summaries. AES can produce a rich document representation by considering both the global information of a document and the relationships among its sentences. A unidirectional recurrent neural network (RNN) and a bidirectional RNN are considered for constructing the encoders, giving rise to unidirectional attentive encoder-based summarization (Uni-AES) and bidirectional attentive encoder-based summarization (Bi-AES), respectively. Our experimental results show that Bi-AES outperforms Uni-AES, and we obtain substantial improvements over a relevant state-of-the-art baseline.
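The sketch below illustrates the Bi-AES idea under stated assumptions (PyTorch; precomputed sentence embeddings): a bidirectional RNN runs over the sentence sequence so each state also reflects the sentences that follow, and an attention layer pools the states into a global document representation. The class name and dimensions are hypothetical.

```python
# Minimal sketch (assumed PyTorch): bidirectional attentive document encoder.
import torch
import torch.nn as nn
import torch.nn.functional as F

class BiAttentiveEncoder(nn.Module):
    def __init__(self, sent_dim=256, hidden=128):
        super().__init__()
        self.rnn = nn.GRU(sent_dim, hidden, batch_first=True, bidirectional=True)
        self.score = nn.Linear(2 * hidden, 1)

    def forward(self, sent_vecs):
        # sent_vecs: (B, S, sent_dim) precomputed sentence embeddings
        states, _ = self.rnn(sent_vecs)                  # (B, S, 2*hidden); each state sees both directions
        attn = F.softmax(self.score(states), dim=1)      # attention weight per sentence
        doc_vec = (attn * states).sum(dim=1)             # global document representation
        return states, doc_vec                           # per-sentence states + attentive document vector

states, doc = BiAttentiveEncoder()(torch.randn(2, 10, 256))
```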
With the rise in popularity of artificial intelligence, the technology of verbal communication between human and machine has received increasing attention, but generating a good conversation remains a difficult task. The key factor in human-machine conversation is whether the machine can give good responses that are appropriate not only at the content level (relevant and grammatical) but also at the emotion level (consistent emotional expression). In this paper, we propose a new model based on long short-term memory, used to implement an encoder-decoder framework, and we address the emotional factor of conversation generation by changing the model's input through a series of input transformations: a sequence without an emotional category, a sequence with an emotional category for the input sentence, and a sequence with an emotional category for the output response. We compare our work with related work and find that we obtain slightly better results with respect to emotion consistency. Although our result is lower than those of related work in terms of content coherence, at the present stage of research our method can generally generate emotional responses in order to control and improve the user's emotion. Our experiments show that, through the introduction of emotional intelligence, our model can generate responses that are appropriate not only in content but also in emotion.
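A minimal sketch of the input transformations described above, in plain Python; the token format (e.g. '<happy>') and the helper name are illustrative assumptions, not the paper's implementation.

```python
# Minimal sketch (illustrative): prepend an emotion-category token before the
# sequence enters an LSTM encoder-decoder.
def add_emotion_tag(tokens, emotion=None):
    """Prepend an emotion-category token (e.g. '<happy>') when one is given."""
    return ([f"<{emotion}>"] + tokens) if emotion else tokens

# The three input variants described in the abstract:
plain      = add_emotion_tag(["how", "are", "you"])                   # no emotional category
tagged_in  = add_emotion_tag(["how", "are", "you"], emotion="happy")  # category attached to the input sentence
tagged_out = add_emotion_tag(["i", "am", "fine"], emotion="happy")    # category attached to the output response
print(plain, tagged_in, tagged_out)
```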
We propose a repeated-review deep learning model for image captioning in the image evidence review process. It consists of two subnetworks: a convolutional neural network that extracts image features and a recurrent neural network that decodes those features into captions. Our model combines the advantages of the two subnetworks by recalling visual information, unlike the traditional encoder-decoder model, and then introduces a multimodal layer to fuse the image and caption effectively. The proposed model has been validated on benchmark datasets (MSCOCO, Flickr), where it performs well on BLEU-3 and BLEU-4 and, to some extent, even surpasses the best models available today (such as NIC, m-RNN, etc.).
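The sketch below shows one plausible form of such a multimodal fusion layer, assuming PyTorch; the class name, projection sizes, and tanh fusion are illustrative assumptions rather than the authors' exact design.

```python
# Minimal sketch (assumed PyTorch): fuse a CNN image feature with the RNN caption
# state at every decoding step, so visual information is "reviewed" repeatedly.
import torch
import torch.nn as nn

class MultimodalFusion(nn.Module):
    def __init__(self, img_dim=2048, hid_dim=512, fuse_dim=512, vocab=10000):
        super().__init__()
        self.proj_img = nn.Linear(img_dim, fuse_dim)
        self.proj_hid = nn.Linear(hid_dim, fuse_dim)
        self.out = nn.Linear(fuse_dim, vocab)

    def forward(self, img_feat, rnn_state):
        # img_feat: (B, img_dim) CNN feature; rnn_state: (B, L, hid_dim) decoder states
        img = self.proj_img(img_feat).unsqueeze(1)           # (B, 1, fuse_dim)
        fused = torch.tanh(img + self.proj_hid(rnn_state))   # re-inject the image at every step
        return self.out(fused)                               # (B, L, vocab) word logits

logits = MultimodalFusion()(torch.randn(2, 2048), torch.randn(2, 7, 512))
```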
Class imbalance is a key issue in applying deep learning to remote sensing image classification, because a model trained on imbalanced samples has low classification accuracy for minority classes. In this study, an accurate classification approach using a multistage sampling method and deep neural networks is proposed to classify imbalanced data. We first balance the samples by multistage sampling to obtain the training sets. Then a state-of-the-art model is adopted that combines the advantages of atrous spatial pyramid pooling (ASPP) and an encoder-decoder for pixel-wise classification; these two different types of fully convolutional networks (FCNs) obtain contextual information at multiple levels in the encoder stage, and the details and spatial dimensions of targets are restored with this information during the decoder stage. For comparison, we employ four deep learning-based classification algorithms (basic FCN, FCN-8S, ASPP, and the encoder-decoder with ASPP of our approach) on multistage training sets (original, MUS1, and MUS2) of WorldView-3 images of the southeastern Qinghai-Tibet Plateau and GF-2 images of northeastern Beijing. The experiments show that, compared with the existing sets (original, MUS1, and identical) and an existing method (cost weighting), the MUS2 training set produced by multistage sampling significantly enhances the classification performance for minority classes. Our approach shows distinct advantages for imbalanced data.
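As a rough illustration of sample balancing (the paper's multistage sampling involves more than this single step), the sketch below oversamples minority classes up to the size of the largest class; the function name and the simple random-oversampling rule are assumptions for illustration.

```python
# Minimal sketch (illustrative): per-class resampling to balance a training pool
# so minority land-cover classes are not swamped by majority classes.
import random
from collections import defaultdict

def balance_samples(samples, labels, seed=0):
    """Oversample every class up to the size of the largest class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for s, y in zip(samples, labels):
        by_class[y].append(s)
    target = max(len(v) for v in by_class.values())
    balanced = []
    for y, items in by_class.items():
        picks = items + [rng.choice(items) for _ in range(target - len(items))]
        balanced.extend((s, y) for s in picks)
    rng.shuffle(balanced)
    return balanced

data = balance_samples(["a", "b", "c", "d", "e"], [0, 0, 0, 1, 1])
print(data)  # classes 0 and 1 now contribute the same number of samples
```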
Dense semantic labeling is significant in high-resolution remote sensing imagery research, and it has been widely used in land-use analysis and environmental protection. With the recent success of fully convolutional networks (FCNs), various network architectures have largely improved performance. Among them, atrous spatial pyramid pooling (ASPP) and the encoder-decoder are two successful structures: the former extracts multi-scale contextual information with multiple effective fields of view, while the latter recovers spatial information to obtain sharper object boundaries. In this study, we propose a more efficient fully convolutional network that combines the advantages of both structures. Our model uses a deep residual network (ResNet) followed by ASPP as the encoder and, at the upsampling stage, combines two scales of high-level features with the corresponding low-level features as the decoder. We further develop a multi-scale loss function to enhance the learning procedure. In postprocessing, a novel superpixel-based dense conditional random field is employed to refine the predictions. We evaluate the proposed method on the Potsdam and Vaihingen datasets, and the experimental results demonstrate that our method performs better than other machine learning and deep learning methods. Compared with the state-of-the-art DeepLab_v3+, our model gains 0.4% and 0.6% improvements in overall accuracy on these two datasets, respectively.
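For reference, the sketch below shows a generic ASPP block of the kind used in such encoders, assuming PyTorch; the dilation rates and channel sizes follow common DeepLab-style defaults and are not taken from the paper.

```python
# Minimal sketch (assumed PyTorch): atrous spatial pyramid pooling with parallel
# dilated convolutions, capturing multi-scale context before the decoder upsamples.
import torch
import torch.nn as nn

class ASPP(nn.Module):
    def __init__(self, in_ch, out_ch=256, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Conv2d(in_ch, out_ch, kernel_size=3 if r > 1 else 1,
                      padding=r if r > 1 else 0, dilation=r)
            for r in rates
        ])
        self.project = nn.Conv2d(out_ch * len(rates), out_ch, kernel_size=1)

    def forward(self, x):
        # Each branch sees a different effective field of view; outputs share spatial size.
        feats = [branch(x) for branch in self.branches]
        return self.project(torch.cat(feats, dim=1))

out = ASPP(in_ch=512)(torch.randn(1, 512, 32, 32))  # -> (1, 256, 32, 32)
```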