咨询与建议

限定检索结果

文献类型

  • 80 篇 期刊文献
  • 74 篇 会议

馆藏范围

  • 154 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 141 篇 工学
    • 116 篇 计算机科学与技术...
    • 48 篇 电气工程
    • 14 篇 软件工程
    • 12 篇 信息与通信工程
    • 8 篇 控制科学与工程
    • 6 篇 生物医学工程(可授...
    • 4 篇 仪器科学与技术
    • 3 篇 交通运输工程
    • 2 篇 机械工程
    • 2 篇 电子科学与技术(可...
    • 2 篇 土木工程
    • 2 篇 测绘科学与技术
    • 1 篇 材料科学与工程(可...
    • 1 篇 水利工程
    • 1 篇 农业工程
    • 1 篇 环境科学与工程(可...
  • 20 篇 医学
    • 10 篇 临床医学
    • 8 篇 特种医学
    • 5 篇 基础医学(可授医学...
  • 16 篇 理学
    • 7 篇 物理学
    • 4 篇 化学
    • 3 篇 数学
    • 3 篇 生物学
    • 2 篇 地理学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 10 篇 管理学
    • 8 篇 管理科学与工程(可...
    • 2 篇 公共管理
  • 1 篇 农学

主题

  • 154 篇 vision-language ...
  • 13 篇 visualization
  • 11 篇 large language m...
  • 11 篇 prompt learning
  • 10 篇 clip
  • 10 篇 training
  • 10 篇 prompt tuning
  • 9 篇 object detection
  • 8 篇 adaptation model...
  • 7 篇 deep learning
  • 7 篇 semantics
  • 7 篇 few-shot learnin...
  • 6 篇 knowledge distil...
  • 5 篇 task analysis
  • 5 篇 zero-shot learni...
  • 5 篇 feature extracti...
  • 4 篇 multimodal learn...
  • 4 篇 tuning
  • 4 篇 continual learni...
  • 4 篇 contrastive lear...

机构

  • 4 篇 shanghai ai lab ...
  • 4 篇 peng cheng lab p...
  • 3 篇 univ sci & techn...
  • 3 篇 chinese univ hon...
  • 3 篇 sensetime res pe...
  • 2 篇 univ michigan an...
  • 2 篇 hong kong polyte...
  • 2 篇 sun yat sen univ...
  • 2 篇 shanghai univ pe...
  • 2 篇 beijing univ tec...
  • 2 篇 univ chinese aca...
  • 2 篇 wuhan univ sch c...
  • 2 篇 harbin inst tech...
  • 2 篇 tongji univ coll...
  • 2 篇 northeastern uni...
  • 2 篇 chinese acad sci...
  • 2 篇 xidian univ sch ...
  • 2 篇 tianjin univ col...
  • 2 篇 chongqing univ p...
  • 2 篇 tsinghua univ sh...

作者

  • 5 篇 qiao yu
  • 3 篇 gao peng
  • 3 篇 obinata yoshiki
  • 3 篇 inaba masayuki
  • 3 篇 kawaharazuka ken...
  • 3 篇 wang ruixuan
  • 3 篇 dai jifeng
  • 3 篇 okada kei
  • 3 篇 kanazawa naoaki
  • 2 篇 zhou jie
  • 2 篇 wang lei
  • 2 篇 li xin
  • 2 篇 chen zhe
  • 2 篇 guo tao
  • 2 篇 luo ping
  • 2 篇 zhang tong
  • 2 篇 yang xi
  • 2 篇 liu liangchen
  • 2 篇 fang zhen
  • 2 篇 guo song

语言

  • 153 篇 英文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 其他
检索条件"主题词=Vision-Language Model"
154 条 记 录,以下是1-10 订阅
排序:
A vision-language model with multi-granular knowledge fusion in medical imaging
收藏 引用
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS 2025年 第1期28卷 1-21页
作者: Chen, Kai Li, Yunxin Zhu, Xiwen Zhang, Wentai Hu, Baotian Huaqiao Univ Sch Comp Sci & Technol 668 Jimei Ave Xiamen 361021 Fujian Peoples R China Harbin Inst Technol Shenzhen Sch Comp Sci & Technol 6 Pingshan 1st Rd Nanshan Dist Shenzhen 518055 Guangdong Peoples R China Peking Univ First Hosp Dept Thorac Surg 8 Xishiku St Beijing 100034 Peoples R China
The rapid expansion of radiological imaging data has placed a significant burden on radiologists, increasing the risk of diagnostic errors. vision-language models offer a promising solution to alleviate this workload ... 详细信息
来源: 评论
Joint feature extraction and alignment in object tracking with vision-language model
收藏 引用
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE 2025年 152卷
作者: Zhu, Hong Lu, Qingyang Xue, Lei Yuan, Guanglin Zhang, Kaihua Natl Univ Def Technol Coll Elect Engn Hefei 230037 Peoples R China Army Artillery & Air Def Acad PLA Hefei 230031 Peoples R China Anhui Key Lab Polarizat Imaging Detect Technol Hefei 230031 Peoples R China Nanjing Univ Informat Sci & Technol Coll Comp Sci Nanjing 210044 Peoples R China
vision-language tracking is a new rising research topic that focuses on locating the target object in a video sequence using its language description. The main challenge is to model the correspondence between the visi... 详细信息
来源: 评论
Situation classification of living environment by daily life support robot using pre-trained large-scale vision-language model
收藏 引用
ADVANCED ROBOTICS 2025年 第7期39卷 323-337页
作者: Obinata, Yoshiki Kawaharazuka, Kento Kanazawa, Naoaki Yamaguchi, Naoya Tsukamoto, Naoto Yanokura, Iori Kitagawa, Shingo Okada, Kei Inaba, Masayuki Univ Tokyo Grad Sch Informat Sci & Technol Dept Mechanoinformat Bunkyo Ku Tokyo Japan
Various conditions exist in individual daily life environments. It is important for a daily life support robot to observe states in the daily life environment and perform tasks depending on the living environment. Tod... 详细信息
来源: 评论
A Dual-State-Based Surface Anomaly Detection model for Rail Transit Trains Using vision-language model
收藏 引用
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT 2025年 74卷
作者: Lei, Kaiyan Qi, Zhiquan Univ Chinese Acad Sci Sch Comp Sci & Technol Beijing 101408 Peoples R China Chinese Acad Sci Res Ctr Fictitious Econ & Data Sci Beijing 100190 Peoples R China
For the anomaly detection on the surface of rail transit train body (RTTB-AD), due to the scarcity of anomalies, the complexity and variability of the detection environment, and the exceptionally high identification r... 详细信息
来源: 评论
MammoVLM: A generative large vision-language model for mammography-related diagnostic assistance
收藏 引用
INFORMATION FUSION 2025年 118卷
作者: Cao, Zhenjie Deng, Zhuo Ma, Jie Hu, Jintao Ma, Lan Tsinghua Univ Shenzhen Int Grad Sch Shenzhen 518055 Peoples R China AI Lab Pingan Tech Shenzhen Peoples R China Chinese Univ Hong Kong Dept Anat & Cellular Pathol Hong Kong Peoples R China Shenzhen Peoples Hosp Radiol Dept Shenzhen 518020 Peoples R China
Inspired by the recent success of large language models (LLMs) in the general domain, many large multimodal models, such as vision-language models, have been developed to tackle problems across modalities. In the real... 详细信息
来源: 评论
HiE-VL: A Large vision-language model with Hierarchical Adapter for Handwritten Mathematical Expression Recognition
HiE-VL: A Large Vision-Language Model with Hierarchical Adap...
收藏 引用
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
作者: Guo, Hong-Yu Yin, Fei Xu, Jian Liu, Cheng-Lin School of Artificial Intelligence University of Chinese Academy of Sciences Beijing China State Key Laboratory of Multimodal Artificial Intelligence Systems Institute of Automation of Chinese Academy of Sciences Beijing China
Large vision-language models (LVLMs) have shown impressive capabilities across various domains, but existing LVLMs have limited performance in dense perception and structured learning problems, such as Handwritten Mat... 详细信息
来源: 评论
Context-aware prompt learning for test-time vision recognition with frozen vision-language model
收藏 引用
PATTERN RECOGNITION 2025年 162卷
作者: Yin, Junhui Zhang, Xinyu Wu, Lin Wang, Xiaojie Beijing Univ Posts & Telecommun Beijing Peoples R China Univ Adelaide Adelaide Australia Swansea Univ Swansea Wales
Current pre-trained vision-language models, such as CLIP, have demonstrated remarkable zero-shot generalization capabilities across various downstream tasks. However, their performance significantly degrades when test... 详细信息
来源: 评论
Controlling vision-language model for enhancing image restoration
收藏 引用
IMAGE AND vision COMPUTING 2025年 158卷
作者: Shao, Mingwen Liu, Weihan Meng, Lingzhuang Wan, Yecong China Univ Petr East China Qingdao Inst Software Coll Comp Sci & Technol State Key Lab Chem Safety Qingdao 266580 Peoples R China
Restoring low-quality images to their original high-quality state remains a significant challenge due to inherent uncertainties, particularly in blind image restoration scenarios where the nature of degradation is unk... 详细信息
来源: 评论
Multimodal multitask similarity learning for vision language model on radiological images and reports
收藏 引用
NEUROCOMPUTING 2025年 636卷
作者: Yu, Yang Wang, Jiahao Liu, Weide Mien, Ivan Ho Krishnaswamy, Pavitra Yang, Xulei Cheng, Jun ASTAR Machine Intellect Dept Inst Infocomm Res I R 2 1 Fusionopolis Way21-01 Connexis Singapore 138632 Singapore Natl Univ Singapore NUS Mechanobiol Inst MBI 5A Engn Dr 1 Singapore 117411 Singapore ASTAR Inst Infocomm Res I 2 R Healthcare & Medtech Div 1 Fusionopolis Way21-01 Connexis Singapore 138632 Singapore Natl Neurosci Inst NNI Dept Neuroradiol 11 Jln Tan Tock Seng Singapore 308433 Singapore
In recent years, large-scale vision-language models (VLM) have shown promise in learning general representations for various medical image analysis tasks. However, current medical VLM methods typically employ contrast... 详细信息
来源: 评论
Continual Learning of Image Classes With language Guidance From a vision-language model
收藏 引用
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2024年 第12期34卷 13152-13163页
作者: Zhang, Wentao Huang, Yujun Zhang, Weizhuo Zhang, Tong Lao, Qicheng Yu, Yue Zheng, Wei-Shi Wang, Ruixuan Sun Yat Sen Univ Sch Comp Sci & Engn Guangzhou 510275 Peoples R China Minist Educ Key Lab Machine Intelligence & Adv Comp Guangzhou 510275 Peoples R China Peng Cheng Lab Shenzhen 518066 Peoples R China Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing 100876 Peoples R China
Current deep learning models often catastrophically forget the knowledge of old classes when continually learning new ones. State-of-the-art approaches to continual learning of image classes often require retaining a ... 详细信息
来源: 评论