Search criteria: "Subject term = Image Encoder"
9 records, showing 1-9
Efficient Image Semantic Representation and Visual-Textual Semantic Fusion for Multimodal Relation Extraction and Multimodal-Named Entity Recognition
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2025, vol. 34, no. 8
Authors: Zhang, Qingchuan; Wei, Siwei; Alqahtani, Fayez; Almakhadmeh, Zafer; Cai, Yuanyuan
Affiliations: Beijing Technol & Business Univ Natl Engn Res Ctr Agriprod Qual Traceabil 11 & 33 Fucheng Rd Beijing 100048 Peoples R China; King Saud Univ Coll Comp & Informat Sci Software Engn Dept Riyadh 12372 Saudi Arabia; King Saud Univ Community Coll Comp Sci Dept Riyadh Saudi Arabia
Recently, multimodal relation extraction (MRE) and multimodal-named entity recognition (MNER) have attracted widespread attention. However, prior research works have encountered challenges including inadequate semanti...
PCCM-GAN: Photographic Text-to-Image Generation with Pyramid Contrastive Consistency Model
NEUROCOMPUTING, 2021, vol. 449, pp. 330-341
Authors: Zhongjian, Q.; Sun, Jun; Qian, Jinzhao; Xu, Jiajia; Zhan, Shu
Affiliations: Hefei Univ Technol Key Lab Knowledge Engn Big Data Minist Educ Hefei Peoples R China; Hefei Univ Technol Sch Comp & Informat Hefei 230601 Anhui Peoples R China; Tsinghua Univ Dept Automat Beijing 100084 Peoples R China; iFlytek Co Ltd Hefei 230088 Anhui Peoples R China
Synthesizing photographic images from given text descriptions is a challenging problem. Although many previous studies have made significant progress on the visual quality of the generated images by using the multi-st...
Memory-Efficient Multiplier-Less 2-D DWT Design Using Combined Convolution and Lifting Schemes for Wireless Visual Sensors
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, vol. 32, no. 4, pp. 695-703
Author: Mohanty, Basant Kumar
Affiliation: Sambalpur Univ Inst Informat Technol Dept Elect & Commun Engn Burla 768019 Odisha India
In this article, the combined convolution-lifting scheme is explored to address the design issues of 2-D discrete wavelet transform (DWT) structures. We found that the combined convolution-lifting scheme of type-1 (co...
Using digital twin to enhance Sim2real transfer for reinforcement learning in 3C assembly
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2024, vol. 51, no. 1, pp. 125-133
Authors: Mu, Weiwen; Chen, Wenbai; Zhou, Huaidong; Liu, Naijun; Shi, Haobin; Li, Jingchen
Affiliations: Beijing Informat Sci & Technol Univ Qinghe Xiaoying Campus Beijing Peoples R China; Tsinghua Univ Beijing Peoples R China; Northwestern Polytech Univ Xian Peoples R China
Purpose: This paper aims to solve the problem of low assembly success rate for 3C assembly lines designed based on classical control algorithms due to inevitable random disturbances and other factors, by incorporating int...
Transformer with sparse self-attention mechanism for image captioning
ELECTRONICS LETTERS, 2020, vol. 56, no. 15, pp. 764-+
Authors: Wang, Duofeng; Hu, Haifeng; Chen, Dihu
Affiliation: Sun Yat Sen Univ Sch Elect & Informat Engn Guangzhou 510006 Guangdong Peoples R China
Recently, the transformer has been applied to image captioning models, in which the convolutional neural network and the transformer encoder act as the image encoder of the model, and the transformer decoder acts as the d...
BI-LSTM Based Encoding and GAN for Text-to-Image Synthesis
SENSING AND IMAGING, 2022, vol. 23, no. 1, pp. 1-17
Authors: Talasila, Vamsidhar; Narasingarao, M. R.
Affiliations: Koneru Lakshmaiah Educ Fdn Dept Comp Sci & Engn Vaddeswaram Andhra Pradesh India; GITAM Univ Dept Comp Sci & Engn Visakhapatnam Andhra Pradesh India
Synthesizing images from text, that is, producing images whose content reliably matches a given text description, is an extremely demanding task whose most important problems are content consistency and visual realism. O...
Design and Implementation Issues of Parallel Vector Quantization in FPGA for Real Time Image Compression
International Conference on Intelligent Computing and Information Science
Authors: Rasane, Krupa R.; Kunte, Srinivasa Rao R.
Affiliations: KLESCET Dept Elect & Commun Engn Belgaum Karnataka India; JNNCE Shivamogga Karnataka India
In this paper, a 4-codebook Vector Quantization (VQ) core is implemented on an FPGA (Field Programmable Gate Array). The proposed design has certain advantages over the earlier architecture in the form of design reuse of...
Cross Modal Retrieval Algorithm Based on Iterative Queries
13th International Conference on Computer Engineering and Networks (CENet)
Authors: Cheng, Xiuchuan; Yang, Xiaoyu; Li, Huiping; Wang, Zhiguo; Yin, Guangqiang
Affiliations: UESTC Shenzhen Inst Adv Study Shenzhen 518110 Peoples R China; Univ Elect Sci & Technol China Chengdu 611730 Peoples R China; Kashi Inst Elect & Informat Ind Kashi 844199 Peoples R China; Univ Elect Sci & Technol China Kashi 844199 Peoples R China
Single-modal information retrieval is increasingly unable to meet growing information processing needs. Cross-modal retrieval based on deep learning, as a new information retrieval scheme, is gradually re...
GraMuFeN: graph-based multi-modal fake news detection in social media
SOCIAL NETWORK ANALYSIS AND MINING, 2024, vol. 14, no. 1, pp. 104-104
Authors: Kananian, Makan; Badiei, Fatemeh; Gh. Ghahramani, S. AmirAli
Affiliation: Sharif Univ Technol Dept Comp Engn Int Campus Kish Isl Iran
Nowadays, media overload is common all around the world. The prevalence of media overload grants both individuals and governmental entities the ability to shape public opinions, highlighting the need ...