咨询与建议

限定检索结果

文献类型

  • 3 篇 期刊文献
  • 2 篇 会议

馆藏范围

  • 5 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 5 篇 工学
    • 5 篇 计算机科学与技术...
    • 1 篇 电气工程
    • 1 篇 信息与通信工程
  • 1 篇 医学
    • 1 篇 基础医学(可授医学...
    • 1 篇 临床医学

主题

  • 5 篇 text-based image...
  • 1 篇 language bias
  • 1 篇 textcaps
  • 1 篇 diversity
  • 1 篇 image captioning
  • 1 篇 transformers
  • 1 篇 bottom-up top-do...
  • 1 篇 multimodal trans...
  • 1 篇 scene text
  • 1 篇 optical characte...
  • 1 篇 m4c
  • 1 篇 visualization
  • 1 篇 semantics
  • 1 篇 feature extracti...
  • 1 篇 collaborative at...
  • 1 篇 multimodal infor...
  • 1 篇 lightweight netw...
  • 1 篇 multi-modal alig...
  • 1 篇 scene graph
  • 1 篇 grid feature

机构

  • 1 篇 southeast univ s...
  • 1 篇 south china univ...
  • 1 篇 chinese acad sci...
  • 1 篇 moutai inst dept...
  • 1 篇 hefei univ techn...
  • 1 篇 guizhou univ ctr...
  • 1 篇 guangxi key lab ...
  • 1 篇 guangdong univ t...
  • 1 篇 univ chinese aca...
  • 1 篇 vietnam natl uni...
  • 1 篇 vietnam natl uni...
  • 1 篇 guangxi univ sch...
  • 1 篇 guizhou univ col...
  • 1 篇 moe china key la...

作者

  • 1 篇 li qiang
  • 1 篇 yang zhenguo
  • 1 篇 liu yun
  • 1 篇 li bing
  • 1 篇 zhang yao
  • 1 篇 truc trinh
  • 1 篇 huang qingbao
  • 1 篇 cai yi
  • 1 篇 wang qi
  • 1 篇 bui doanh c.
  • 1 篇 wang yazhou
  • 1 篇 zhao wenye
  • 1 篇 ma can
  • 1 篇 khang nguyen
  • 1 篇 vo nguyen d.
  • 1 篇 hu zhenzhen
  • 1 篇 hao gefei
  • 1 篇 xu dongsheng
  • 1 篇 wu xue
  • 1 篇 song zijie

语言

  • 5 篇 英文
检索条件"主题词=Text-based image captioning"
5 条 记 录,以下是1-10 订阅
排序:
LCM-Captioner: A lightweight text-based image captioning method with collaborative mechanism between vision and text
收藏 引用
NEURAL NETWORKS 2023年 162卷 318-329页
作者: Wang, Qi Deng, Hongyu Wu, Xue Yang, Zhenguo Liu, Yun Wang, Yazhou Hao, Gefei Guizhou Univ Coll Comp Sci & Technol State Key Lab Publ Big Data Guizhou Peoples R China Guizhou Univ Ctr Res & Dev Fine Chem Guiyang Peoples R China Guangdong Univ Technol Sch Comp Guangdong Peoples R China Moutai Inst Dept Automation Nanjing Peoples R China Southeast Univ Sch Microelect Nanjing 210096 Peoples R China
text-based image captioning (textCap) aims to remedy the shortcomings of existing image captioning tasks that ignore text content when describing images. Instead, it requires models to recognize and describe images fr... 详细信息
来源: 评论
EAES: Effective Augmented Embedding Spaces for text-based image captioning
收藏 引用
IEEE ACCESS 2022年 10卷 32443-32452页
作者: Khang Nguyen Bui, Doanh C. Truc Trinh Vo, Nguyen D. Vietnam Natl Univ Ho Chi Minh City VNUHCM Univ Informat Technol Ho Chi Minh City 7000 Vietnam Vietnam Natl Univ Ho Chi Minh City VNUHCM Ho Chi Minh City 700000 Vietnam
text-based image captioning has been a novel problem since 2020. This topic remains challenging because it requires the model to comprehend not only the visual context but also the scene texts that appear in an image.... 详细信息
来源: 评论
Zero-textCap: Zero-shot Framework for text-based image captioning  23
Zero-TextCap: Zero-shot Framework for Text-based Image Capti...
收藏 引用
31st ACM International Conference on Multimedia (MM)
作者: Xu, Dongsheng Zhao, Wenye Cai, Yi Huang, Qingbao Guangxi Univ Sch Elect Engn Nanning Guangxi Peoples R China South China Univ Technol Sch Software Engn Guangzhou Guangdong Peoples R China MOE China Key Lab Big Data & Intelligent Robot SCUT Guangzhou Guangdong Peoples R China Guangxi Key Lab Multimedia Commun & Network Techn Nanning Guangxi Peoples R China
text-based image captioning is a vital but under-explored task, which aims to describe images by captions containing scene text automatically. Recent studies have made encouraging progress, but they are still sufferin... 详细信息
来源: 评论
Exploring coherence from heterogeneous representations for OCR image captioning
收藏 引用
MULTIMEDIA SYSTEMS 2024年 第5期30卷 1-13页
作者: Zhang, Yao Song, Zijie Hu, Zhenzhen Hefei Univ Technol Sch Comp Sci & Informat Engn Hefei Peoples R China
text-based image captioning is an important task, aiming to generate descriptions based on reading and reasoning the scene texts in images. text-based image contains both textual and visual information, which is diffi... 详细信息
来源: 评论
Relation-Aware Global-Augmented Transformer for textCaps  31st
Relation-Aware Global-Augmented Transformer for TextCaps
收藏 引用
31st International Conference on Artificial Neural Networks (ICANN)
作者: Li, Qiang Li, Bing Ma, Can Chinese Acad Sci Inst Informat Engn Beijing Peoples R China Univ Chinese Acad Sci Sch Cyber Secur Beijing Peoples R China
text-based image captioning (textCaps) task aims to describe the given image reasonably based on scene text and visual objects simultaneously. Although previous works have shown great success, they pay too much attent... 详细信息
来源: 评论