咨询与建议

限定检索结果

文献类型

  • 19 篇 期刊文献
  • 13 篇 会议

馆藏范围

  • 32 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 29 篇 工学
    • 22 篇 计算机科学与技术...
    • 6 篇 电气工程
    • 4 篇 信息与通信工程
    • 3 篇 软件工程
    • 2 篇 仪器科学与技术
    • 2 篇 控制科学与工程
    • 2 篇 测绘科学与技术
    • 2 篇 环境科学与工程(可...
    • 1 篇 电子科学与技术(可...
    • 1 篇 土木工程
    • 1 篇 生物医学工程(可授...
  • 5 篇 医学
    • 2 篇 临床医学
    • 1 篇 基础医学(可授医学...
    • 1 篇 公共卫生与预防医...
    • 1 篇 特种医学
  • 4 篇 管理学
    • 4 篇 管理科学与工程(可...
  • 3 篇 理学
    • 2 篇 地球物理学
    • 1 篇 化学
    • 1 篇 生物学
  • 1 篇 文学
    • 1 篇 外国语言文学
  • 1 篇 艺术学
    • 1 篇 设计学(可授艺术学...

主题

  • 32 篇 visual language ...
  • 4 篇 large language m...
  • 2 篇 autonomous drivi...
  • 2 篇 image captioning
  • 2 篇 keyword spotting
  • 2 篇 visualization
  • 2 篇 feature extracti...
  • 2 篇 computer vision
  • 2 篇 query likelihood...
  • 2 篇 generalizable pe...
  • 1 篇 multimodal learn...
  • 1 篇 representation l...
  • 1 篇 clip
  • 1 篇 semantic-related
  • 1 篇 semantic constra...
  • 1 篇 semantic segment...
  • 1 篇 object classific...
  • 1 篇 deep learning
  • 1 篇 video action rec...
  • 1 篇 instruction fine...

机构

  • 1 篇 shandong univ sm...
  • 1 篇 chinese acad sci...
  • 1 篇 shanghai normal ...
  • 1 篇 natl univ def te...
  • 1 篇 inner mongolia u...
  • 1 篇 china acad chine...
  • 1 篇 johns hopkins un...
  • 1 篇 shandong prov de...
  • 1 篇 univ macau dept ...
  • 1 篇 southeast univ m...
  • 1 篇 institute of inf...
  • 1 篇 univ sci & techn...
  • 1 篇 hong kong univ s...
  • 1 篇 zhengzhou univ s...
  • 1 篇 minist educ key ...
  • 1 篇 zhengzhou univ s...
  • 1 篇 southeast univ k...
  • 1 篇 univ polytech ha...
  • 1 篇 univ genoa dibri...
  • 1 篇 natl trusted emb...

作者

  • 2 篇 geng xin
  • 2 篇 chen hao
  • 2 篇 zhao huazhong
  • 2 篇 qi lei
  • 1 篇 guan yanchen
  • 1 篇 lei lin
  • 1 篇 lal jay
  • 1 篇 rivera corban g.
  • 1 篇 wang junyu
  • 1 篇 yin jianing
  • 1 篇 hamidouche wassi...
  • 1 篇 lin jia-rui
  • 1 篇 yonekura haruki
  • 1 篇 zhao rui
  • 1 篇 ramos ana paula ...
  • 1 篇 sun jie
  • 1 篇 deguchi daisuke
  • 1 篇 hua xian-sheng
  • 1 篇 liu jiyuan
  • 1 篇 murase hiroshi

语言

  • 31 篇 英文
  • 1 篇 中文
检索条件"主题词=Visual Language Model"
32 条 记 录,以下是1-10 订阅
排序:
DDC-Chat: Achieving accurate distracted driver classification through instruction tuning of visual language model
收藏 引用
JOURNAL OF SAFETY SCIENCE AND RESILIENCE 2025年 第2期6卷 250-264页
作者: Liao, Chupei Lin, Kuoyi Guilin Univ Elect Technol Sch Business Guilin 541004 Guangxi Peoples R China
Driver behavior is a critical factor in road safety, highlighting the need for advanced methods in Distracted Driving Classification (DDC). In this study, we introduce DDC-Chat, a novel classification method based on ... 详细信息
来源: 评论
RAVL: A Retrieval-Augmented visual language model Framework for Knowledge-Based visual Question Answering  13th
RAVL: A Retrieval-Augmented Visual Language Model Framework ...
收藏 引用
13th International Conference on Natural language Processing and Chinese Computing
作者: Chai, Naiquan Zou, Dongsheng Liu, Jiyuan Wang, Hao Yang, Yuming Song, Xinyi Chongqing Univ Sch Comp Sci Chongqing Peoples R China
Knowledge-based visual question answering (VQA) requires external knowledge in addition to the image content to answer questions. Recent studies convert images to text descriptions and then generate answers or acquire... 详细信息
来源: 评论
visual language model for Keyword Spotting on Historical Mongolian Document Images  29
Visual Language Model for Keyword Spotting on Historical Mon...
收藏 引用
第29届中国控制与决策会议
作者: Hongxi Wei Guanglai Gao School of Computer Science Inner Mongolia University
The Bag-of-visual-Words(BoVW) approach has been attracted some attention in the field of keyword ***,the BoVW approach discards the spatial relations of the visual ***,a visual language model is integrated into the Bo... 详细信息
来源: 评论
Leveraging visual language model and Generative Diffusion model for Zero-Shot SAR Target Recognition
收藏 引用
REMOTE SENSING 2024年 第16期16卷 2927页
作者: Wang, Junyu Sun, Hao Tang, Tao Sun, Yuli He, Qishan Lei, Lin Ji, Kefeng Natl Univ Def Technol Coll Elect Sci & Technol Changsha 410073 Peoples R China
Simulated data play an important role in SAR target recognition, particularly under zero-shot learning (ZSL) conditions caused by the lack of training samples. The traditional SAR simulation method is based on manuall... 详细信息
来源: 评论
Automatic Findings Generation for Distress Images Using In-Context Few-Shot Learning of visual language model Based on Image Similarity and Text Diversity
收藏 引用
JOURNAL OF ROBOTICS AND MECHATRONICS 2024年 第2期36卷 353-364页
作者: Watanabe, Yuto Ogawa, Naoki Maeda, Keisuke Ogawa, Takahiro Haseyama, Miki Hokkaido Univ Grad Sch Informat Sci & Technol Kita 14 Nishi 9Kita-ku Sapporo 0600814 Japan Hokkaido Univ Fac Informat Sci & Technol Kita 14 Nishi 9Kita-ku Sapporo 0600814 Japan
This study proposes an automatic findings generation method that performs in-context few-shot learning of a visual language model. The automatic generation of findings can reduce the burden of creating inspection reco... 详细信息
来源: 评论
OpenECAD: An efficient visual language model for editable 3D-CAD design☆ ☆
收藏 引用
COMPUTERS & GRAPHICS-UK 2024年 124卷
作者: Yuan, Zhe Shi, Jianqi Huang, Yanhong East China Normal Univ Software Engn Inst Shanghai Peoples R China Natl Trusted Embedded Software Engn Technol Res Ct Shanghai Peoples R China
Computer-aided design (CAD) tools are utilized in the manufacturing industry for modeling everything from cups to spacecraft. These programs are complex to use and typically require years of training and experience to... 详细信息
来源: 评论
LATENT TOPIC visual language model FOR OBJECT CATEGORIZATION
LATENT TOPIC VISUAL LANGUAGE MODEL FOR OBJECT CATEGORIZATION
收藏 引用
International Conference on Signal Processing and Multimedia Applications
作者: Lei Wu Nenghai Yu Jing Liu Mingjing Li Department of EEIS University of Science and Technology of China Institute of Automation Chinese Academy of Sciences
This paper presents a latent topic visual language model to handle variation problem in object categorization. Variations including different views, styles, poses, etc., have greatly affected the spatial arrangement a... 详细信息
来源: 评论
KN-VLM: KNowledge-guided Vision-and-language model for visual abductive reasoning
收藏 引用
MULTIMEDIA SYSTEMS 2025年 第2期31卷 1-16页
作者: Tan, Kuo Qi, Zhaobo Zhong, Jianping Xu, Yuanrong Zhang, Weigang Harbin Inst Technol Sch Comp Sci & Technol Weihai 264209 Peoples R China
visual abductive reasoning strives to deduce the most suitable hypothesis that effectively explains the underlying visual context, garnering considerable attention in the academic community. However, recent efforts ar... 详细信息
来源: 评论
Spatiotemporal-Aware visual Captioning using Vision-language Pre-Training model
Spatiotemporal-Aware Visual Captioning using Vision-Language...
收藏 引用
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
作者: Wu, Shuai Yang, Weidong Wu, Shuyan School of Computer Science Fudan University Shanghai China Faculty of Electronic and Information Engineering Xi'an Jiaotong University Xi'an China
Current visual captioning technologies typically transform 3D/2D visual information into one-dimensional sequential data and employ language models to generate corresponding descriptions. This approach, however, compr... 详细信息
来源: 评论
Enabling High-Level Worker-Centric Semantic Understanding of Onsite Images Using visual language models with Attention Mechanism and Beam Search Strategy
收藏 引用
BUILDINGS 2025年 第6期15卷 959-959页
作者: Deng, Hui Fu, Kejie Yu, Binglin Li, Huimin Duan, Rui Deng, Yichuan Lin, Jia-rui South China Univ Technol Sch Civil Engn & Transportat Guangzhou 510641 Peoples R China State Key Lab Subtrop Bldg & Urban Sci Guangzhou 510641 Peoples R China Tsinghua Univ Dept Civil Engn Beijing 100084 Peoples R China
visual information is becoming increasingly essential in construction management. However, a significant portion of this information remains underutilized by construction managers due to the limitations of existing im... 详细信息
来源: 评论