咨询与建议

限定检索结果

文献类型

  • 6 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 11 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 11 篇 工学
    • 10 篇 计算机科学与技术...
    • 1 篇 电气工程
    • 1 篇 软件工程
  • 2 篇 理学
    • 2 篇 物理学
  • 1 篇 医学
    • 1 篇 临床医学
  • 1 篇 管理学
    • 1 篇 图书情报与档案管...

主题

  • 11 篇 text encoder
  • 3 篇 image encoder
  • 2 篇 transformers
  • 1 篇 graph neural net...
  • 1 篇 representation l...
  • 1 篇 unified text att...
  • 1 篇 resnet-152
  • 1 篇 deep learning
  • 1 篇 lstm
  • 1 篇 text-to-image
  • 1 篇 large language m...
  • 1 篇 keyword spotting
  • 1 篇 bidirectional co...
  • 1 篇 natural language...
  • 1 篇 popularity predi...
  • 1 篇 speech enhanceme...
  • 1 篇 acoustics
  • 1 篇 knowledge graph
  • 1 篇 cross-modal pre-...
  • 1 篇 image generation

机构

  • 1 篇 univ sci & techn...
  • 1 篇 yanshan univ key...
  • 1 篇 sharif univ tech...
  • 1 篇 natl inst techno...
  • 1 篇 hubei univ sch c...
  • 1 篇 wuhan univ sch i...
  • 1 篇 microsoft sunnyv...
  • 1 篇 vellore inst tec...
  • 1 篇 vellore inst tec...
  • 1 篇 gandhi inst tech...
  • 1 篇 tsinghua univ de...
  • 1 篇 kashi inst elect...
  • 1 篇 univ elect sci &...
  • 1 篇 oakland univ sch...
  • 1 篇 resbee info tech...
  • 1 篇 ajay kumar garg ...
  • 1 篇 srm inst sci & t...
  • 1 篇 hefei univ techn...
  • 1 篇 koneru lakshmaia...
  • 1 篇 univ elect sci &...

作者

  • 1 篇 li zhifei
  • 1 篇 hou xiaoju
  • 1 篇 hu yajun
  • 1 篇 ling zhenhua
  • 1 篇 naik devang
  • 1 篇 zheng yumin
  • 1 篇 kananian makan
  • 1 篇 li xianshan
  • 1 篇 rajakumar b. r.
  • 1 篇 zhang ruofei
  • 1 篇 sun hao
  • 1 篇 dong yuan
  • 1 篇 bojja giridhar r...
  • 1 篇 xu jiajia
  • 1 篇 gajendran sudhak...
  • 1 篇 zhongjian q.
  • 1 篇 manjula d.
  • 1 篇 zhao huasha
  • 1 篇 li huiping
  • 1 篇 jian yue

语言

  • 11 篇 英文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 意大利文
检索条件"主题词=Text encoder"
11 条 记 录,以下是1-10 订阅
排序:
TEAR: A Cross-Modal Pre-Trained text encoder Enhanced by Acoustic Representations for Speech Synthesis
收藏 引用
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2025年 33卷 1117-1128页
作者: Wang, Shiming Ai, Yang Chen, Liping Hu, Yajun Ling, Zhenhua Univ Sci & Technol China Natl Engn Res Ctr Speech & Language Informat Proc Hefei 230027 Peoples R China
text encoders play an important role in text-to-speech (TTS) by analyzing text input and converting it into linguistic representations. In order to generate expressive speech from text, pre-training text encoders on l... 详细信息
来源: 评论
textGNN: Improving text encoder via Graph Neural Network in Sponsored Search  21
TextGNN: Improving Text Encoder via Graph Neural Network in ...
收藏 引用
30th World Wide Web Conference (WWW)
作者: Zhu, Jason Yue Cui, Yanling Liu, Yuming Sun, Hao Li, Xue Pelger, Markus Yang, Tianqi Zhang, Liangjie Zhang, Ruofei Zhao, Huasha Stanford Univ Stanford CA 94305 USA Microsoft Beijing Peoples R China Microsoft Sunnyvale CA USA Microsoft San Francisco CA USA
text encoders based on C-DSSM or transformers have demonstrated strong performance in many Natural Language Processing (NLP) tasks. Low latency variants of these models have also been developed in recent years in orde... 详细信息
来源: 评论
ADAL-GCN: Action Description Aided Learning Graph Convolution Network for Early Action Prediction  7th
ADAL-GCN: Action Description Aided Learning Graph Convolutio...
收藏 引用
7th Chinese Conference on Pattern Recognition and Computer Vision
作者: Li, Xianshan Dong, Yuan Ning, Xingxing Zhang, Pengwei Zhao, Fengda Yanshan Univ Qinhuangdao Hebei Peoples R China Xinjiang Univ Sci & Technol Sch Informat Sci & Engn Urumqi Xinjiang Peoples R China Yanshan Univ Key Lab Software Engn IIebei Prov Qinhuangdao Hebei Peoples R China
Early human action prediction aims to complete the prediction of complete action sequences based solely on initial action sequences acquired at an initial stage. Considering that the execution of a single action usual... 详细信息
来源: 评论
text-enhanced knowledge graph representation learning with local structure
收藏 引用
INFORMATION PROCESSING & MANAGEMENT 2024年 第5期61卷
作者: Li, Zhifei Jian, Yue Xue, Zengcan Zheng, Yumin Zhang, Miao Zhang, Yan Hou, Xiaoju Wang, Xiaoguang Hubei Univ Sch Comp Sci & Informat Engn Wuhan 430062 Peoples R China Wuhan Univ Intellectual Comp Lab Cultural Heritage Wuhan 430072 Peoples R China Hubei Univ Key Lab Intelligent Sensing Syst & Secur Minist Educ Wuhan 430062 Peoples R China Hubei Univ Hubei Key Lab Big Data Intelligent Anal & Applicat Wuhan 430062 Peoples R China Cent China Normal Univ Fac Artificial Intelligence Educ Wuhan 430079 Hubei Peoples R China Guangdong Ind Polytech Inst Vocat Educ Guangzhou 510300 Peoples R China Wuhan Univ Sch Informat Management Wuhan 430072 Peoples R China
Knowledge graph representation learning entails transforming entities and relationships within a knowledge graph into vectors to enhance downstream tasks. The rise of pre -trained language models has recently promoted... 详细信息
来源: 评论
PCCM-GAN: Photographic text-to-Image Generation with Pyramid Contrastive Consistency Model
收藏 引用
NEUROCOMPUTING 2021年 449卷 330-341页
作者: Zhongjian, Q. Sun, Jun Qian, Jinzhao Xu, Jiajia Zhan, Shu Hefei Univ Technol Key Lab Knowledge Engn Big Data Minist Educ Hefei Peoples R China Hefei Univ Technol Sch Comp & Informat Hefei 230601 Anhui Peoples R China Tsinghua Univ Dept Automat Beijing 100084 Peoples R China iFlytek Co Ltd Hefei 230088 Anhui Peoples R China
Synthesizing photographic images from given text descriptions is a challenging problem. Although previous many studies have made significant progress on the visual quality of the generated images by using the multi-st... 详细信息
来源: 评论
Popularity Prediction Model With Context, Time and User Sentiment Information: An Optimization Assisted Deep Learning Technique
收藏 引用
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS 2023年 第2期31卷 283-302页
作者: Mannepalli, Kasiprasad Singh, Suryabhan Pratap Kolli, Chandra Sekhar Raj, Sundeep Bojja, Giridhar Reddy Rajakumar, B. R. Binu, D. Koneru Lakshmaiah Educ Fdn Dept ECE Vijayawada Andhra Prades India Deen Dayal Upadhyay Gorakhpur Univ Inst Engn & Technol Dept Informat Technol Gorakhpur Uttar Pradesh India Gandhi Inst Technol & Management GITAM Sch Sci Dept Comp Sci Visakhapatnam Andhra Prades India Ajay Kumar Garg Engn Coll Dept Informat Technol Ghaziabad Uttar Pradesh India Dakota State Univ Madison SD USA Resbee Info Technol Private Ltd Thuckalay Tamil Nadu India
In social media, the data-sharing activities have turned out to be more pervasive;individuals and companies have comprehended the significance of promoting info by social media network. However, these individuals and ... 详细信息
来源: 评论
GraMuFeN: graph-based multi-modal fake news detection in social media
收藏 引用
SOCIAL NETWORK ANALYSIS AND MINING 2024年 第1期14卷 1-17页
作者: Kananian, Makan Badiei, Fatemeh Gh. Ghahramani, S. AmirAli Sharif Univ Technol Dept Comp Engn Int Campus Kish Isl Iran
Nowadays media overload is a pretty common scenario all around the world. The prevalence of media overload grants both individuals and governmental entities the ability to shape public opinions, highlighting the need ... 详细信息
来源: 评论
Matching Latent Encoding for Audio-text based Keyword Spotting  24
Matching Latent Encoding for Audio-Text based Keyword Spotti...
收藏 引用
Interspeech Conference
作者: Nishu, Kumari Cho, Minsik Naik, Devang Apple Inc Cupertino CA 95014 USA
Using audio and text embeddings jointly for Keyword Spotting (KWS) has shown high-quality results, but the key challenge of how to semantically align two embeddings for multi-word keywords of different sequence length... 详细信息
来源: 评论
Cross Modal Retrieval Algorithm Based on Iterative Queries  13th
Cross Modal Retrieval Algorithm Based on Iterative Queries
收藏 引用
13th International Conference on Computer Engineering and Networks (CENet)
作者: Cheng, Xiuchuan Yang, Xiaoyu Li, Huiping Wang, Zhiguo Yin, Guangqiang UESTC Shenzhen Inst Adv Study Shenzhen 518110 Peoples R China Univ Elect Sci & Technol China Chengdu 611730 Peoples R China Kashi Inst Elect & Informat Ind Kashi 844199 Peoples R China Univ Elect Sci & Technol China Kashi 844199 Peoples R China
The single-modal information retrieval pattern is gradually unable to meet the growing information processing needs. Cross-modal retrieval based on deep learning, as a new information retrieval scheme, is gradually re... 详细信息
来源: 评论
text to Image Synthesis Using Bridge Generative Adversarial Network and Char CNN Modeltext to Image Synthesis Using Bridge Generative Adversarial Network and Char CNN Model  28th
Text to Image Synthesis Using Bridge Generative Adversarial ...
收藏 引用
28th International Conference on Applications of Natural Language to Information Systems (NLDB)
作者: Gajendran, Sudhakaran Arunarani, Ar. Manjula, D. Sugumaran, Vijayan Vellore Inst Technol Sch Elect Engn Chennai Tamil Nadu India SRM Inst Sci & Technol Sch Comp Dept Computat Intelligence Chennai Tamil Nadu India Vellore Inst Technol Sch Comp Sci & Engn Chennai Tamil Nadu India Oakland Univ Ctr Data Sci & Big Data Analyt Rochester MI 48309 USA Oakland Univ Sch Business Adm Dept Decis & Informat Sci Rochester MI 48063 USA
Acontent to picture production approach seeks to produce photorealistic images that are semantically coherent with the provided descriptions from text descriptions. Applications for creating photorealistic visuals fro... 详细信息
来源: 评论