Search query: subject term = "Visual Language Model"
33 records; showing results 11-20

VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, pp. 1-20
Authors: Liang, Jiawei; Liang, Siyuan; Liu, Aishan; Cao, Xiaochun
Affiliations: Sun Yat Sen Univ Shenzhen Campus Shenzhen Peoples R China; Natl Univ Singapore Singapore Singapore; Beihang Univ Beijing Peoples R China; Minist Educ Key Lab Cyberspace Secur Zhengzhou Peoples R China
Autoregressive Visual Language Models (VLMs) demonstrate remarkable few-shot learning capabilities within a multimodal context. Recently, multimodal instruction tuning has emerged as a technique to further refine inst...

CLIP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, vol. 20, pp. 2132-2142
Authors: Zhao, Huazhong; Qi, Lei; Geng, Xin
Affiliations: Southeast Univ Minist Educ Sch Comp Sci & Engn Nanjing 211189 Peoples R China; Southeast Univ Minist Educ Key Lab New Generat Artificial Intelligence Techno Nanjing 211189 Peoples R China
The visual language model, known for its robust cross-modal capabilities, has been extensively applied in various computer vision tasks. In this paper, we explore the use of CLIP (Contrastive Language-Image Pretrainin...

On Large Visual Language Models for Medical Imaging Analysis: An Empirical Study
9th IEEE/ACM International Conference on Connected Health - Applications, Systems and Engineering Technologies (CHASE)
Authors: Minh-Hao Van; Verma, Prateek; Wu, Xintao
Affiliations: Univ Arkansas Fayetteville AR 72701 USA
Recently, large language models (LLMs) have taken the spotlight in natural language processing. Further, integrating LLMs with vision enables the users to explore emergent abilities with multimodal data. Visual langua...

Exploring Vision Language Pretraining with Knowledge Enhancement via Large Language Model
2nd International Workshop on Trustworthy Artificial Intelligence for Healthcare (TAI4H)
Authors: Tung, Chuenyuet; Lin, Yi; Yin, Jianing; Ye, Qiaoyuchen; Chen, Hao
Affiliations: Hong Kong Univ Sci & Technol Hong Kong Peoples R China
The integration of Vision-Language Pretraining (VLP) models in the medical field represents a significant advancement in the development of AI-driven diagnostic tools. These models, which learn to understand and gener...

Semantic matters: A constrained approach for zero-shot video action recognition
PATTERN RECOGNITION, 2025, vol. 162
Authors: Quan, Zhenzhen; Chen, Jialei; Deguchi, Daisuke; Sun, Jie; Zhang, Chenkai; Li, Yujun; Murase, Hiroshi
Affiliations: Shandong Univ Sch Informat Sci & Engn 72 Binhai Rd Qingdao 266237 Shandong Peoples R China; Nagoya Univ Furo ChoChikusa Ku Nagoya Aichi Japan; Shandong Univ Smart State Governance Lab 72 Binhai Rd Qingdao Shandong Peoples R China; Shandong Prov Dept Justice 15743 Jingshi Rd Jinan Shandong Peoples R China
Zero-shot video action recognition has advanced significantly due to the adaptation of visual-language models, such as CLIP, to video domains. However, existing methods attempt to adapt CLIP to video tasks by leveragi...

GCD-Net: Global consciousness-driven open-vocabulary semantic segmentation network
NEUROCOMPUTING, 2025, vol. 636
Authors: Wu, Xing; Xu, Zhenyao; Qian, Quan; Huang, Bin
Affiliations: Shanghai Univ Sch Comp Engn & Sci Shanghai 200444 Peoples R China; Shanghai Univ Shanghai Inst Adv Commun & Data Sci Shanghai 200444 Peoples R China; Shanghai Univ Key Lab Silicate Cultural Rel Conservat Minist Educ Shanghai Peoples R China; Harbin Engn Univ Harbin Peoples R China
Open-vocabulary semantic segmentation aims to achieve accurate classification of different categories of pixels, even if these categories are not explicitly labeled during training. The current research trend in this ...

LATTE: A Real-time Lightweight Attention-based Traffic Accident Anticipation Engine
INFORMATION FUSION, 2025, vol. 122
Authors: Zhang, Jiaxun; Guan, Yanchen; Wang, Chengyue; Liao, Haicheng; Zhang, Guohui; Li, Zhenning
Affiliations: Univ Macau State Key Lab Internet Things Smart City Macau Peoples R China; Univ Macau Dept Civil & Environm Engn Macau Peoples R China; Univ Macau Dept Comp & Informat Sci Macau Peoples R China; Univ Hawaii Manoa Dept Civil Environm & Construct Engn Honolulu HI USA
Accurately predicting traffic accidents in real-time is a critical challenge in autonomous driving, particularly in resource-constrained environments. Existing solutions often suffer from high computational overhead o...

MoSCE-ReID: Mixture of semantic clustering experts for person re-identification
NEUROCOMPUTING, 2025, vol. 626
Authors: Ren, Kai; Hu, Chuanping; Xi, Hao; Li, Yongqiang; Fan, Jinhao; Liu, Lihua
Affiliations: Zhengzhou Univ Sch Elect & Informat Engn Zhengzhou 450000 Henan Peoples R China; Zhengzhou Univ Sch Cyber Sci & Engn 100 Sci Ave Zhengzhou 450000 Henan Peoples R China; Henan Remote Sensing Inst Zhengzhou 450003 Henan Peoples R China
This study advances the utilization of semantic information in person re-identification (ReID) by leveraging pre-trained vision-language models, addressing the current limitations in semantic information processing w...

CLIP-DFGS: A Hard Sample Mining Method for CLIP in Generalizable Person Re-Identification
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, vol. 21, no. 1, pp. 1-20
Authors: Zhao, Huazhong; Qi, Lei; Geng, Xin
Affiliations: Southeast Univ Sch Comp Sci & Engn Nanjing Peoples R China; Southeast Univ Key Lab New Generat Artificial Intelligence Techno Minist Educ Nanjing Peoples R China
Recent advancements in pre-trained vision-language models like CLIP have shown promise in person re-identification (ReID) applications. However, their performance in generalizable person ReID tasks remains suboptimal....

Bi-LORA: A Vision-Language Approach for Synthetic Image Detection
EXPERT SYSTEMS, 2025, vol. 42, no. 2
Authors: Keita, Mamadou; Hamidouche, Wassim; Eutamene, Hessen Bougueffa; Taleb-Ahmed, Abdelmalik; Camacho, David; Hadid, Abdenour
Affiliations: Univ Polytech Hauts Defrance Lab IEMN Valenciennes France; Khalifa Univ Abu Dhabi U Arab Emirates; Tech Univ Madrid Madrid Spain; Sorbonne Univ Sorbonne Ctr Artificial Intelligence Abu Dhabi U Arab Emirates
Advancements in deep image synthesis techniques, such as generative adversarial networks (GANs) and diffusion models (DMs), have ushered in an era of generating highly realistic images. While this technological progre...