咨询与建议

限定检索结果

文献类型

  • 92 篇 会议
  • 73 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 166 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 157 篇 工学
    • 128 篇 计算机科学与技术...
    • 36 篇 电气工程
    • 15 篇 软件工程
    • 12 篇 信息与通信工程
    • 12 篇 控制科学与工程
    • 9 篇 测绘科学与技术
    • 8 篇 电子科学与技术(可...
    • 8 篇 生物医学工程(可授...
    • 4 篇 机械工程
    • 4 篇 仪器科学与技术
    • 4 篇 材料科学与工程(可...
    • 2 篇 交通运输工程
    • 1 篇 航空宇航科学与技...
    • 1 篇 环境科学与工程(可...
    • 1 篇 生物工程
  • 32 篇 医学
    • 22 篇 临床医学
    • 9 篇 特种医学
    • 4 篇 基础医学(可授医学...
    • 1 篇 中西医结合
    • 1 篇 医学技术(可授医学...
  • 23 篇 理学
    • 10 篇 地球物理学
    • 8 篇 物理学
    • 6 篇 化学
    • 5 篇 生物学
    • 3 篇 地理学
    • 1 篇 天文学
    • 1 篇 地质学
  • 7 篇 管理学
    • 6 篇 管理科学与工程(可...
    • 1 篇 图书情报与档案管...
  • 1 篇 哲学
    • 1 篇 哲学
  • 1 篇 农学

主题

  • 166 篇 vision-language ...
  • 17 篇 large language m...
  • 14 篇 prompt learning
  • 11 篇 clip
  • 11 篇 few-shot learnin...
  • 10 篇 visualization
  • 7 篇 contrastive lear...
  • 6 篇 foundation model...
  • 6 篇 remote sensing
  • 6 篇 training
  • 6 篇 adaptation model...
  • 5 篇 object detection
  • 5 篇 deep learning
  • 5 篇 feature extracti...
  • 5 篇 image classifica...
  • 4 篇 long-tailed reco...
  • 4 篇 computational mo...
  • 4 篇 artificial intel...
  • 4 篇 computer vision
  • 4 篇 domain generaliz...

机构

  • 4 篇 chinese acad sci...
  • 4 篇 carnegie mellon ...
  • 4 篇 univ chinese aca...
  • 3 篇 inesc tec porto
  • 3 篇 sichuan univ col...
  • 3 篇 univ chinese aca...
  • 3 篇 zhejiang univ pe...
  • 3 篇 chinese univ hon...
  • 2 篇 shanghai ai lab ...
  • 2 篇 ecole polytech f...
  • 2 篇 tsinghua univ de...
  • 2 篇 harbin inst tech...
  • 2 篇 univ porto fac e...
  • 2 篇 cent south univ ...
  • 2 篇 beijing univ pos...
  • 2 篇 city univ hong k...
  • 2 篇 china univ geosc...
  • 2 篇 sichuan univ col...
  • 2 篇 tech univ munich...
  • 2 篇 westlake univ sc...

作者

  • 4 篇 banerjee biplab
  • 4 篇 zhang yi
  • 4 篇 jha ankit
  • 3 篇 wang donglin
  • 3 篇 singha mainak
  • 3 篇 ding kun
  • 3 篇 zhang ce
  • 3 篇 tuia devis
  • 2 篇 men aidong
  • 2 篇 li haifeng
  • 2 篇 zhang min
  • 2 篇 liu xuyang
  • 2 篇 chen honggang
  • 2 篇 ma chao
  • 2 篇 guo miaotian
  • 2 篇 yang yang
  • 2 篇 ricci elisa
  • 2 篇 ye mao
  • 2 篇 tian liang
  • 2 篇 patricio cristia...

语言

  • 163 篇 英文
  • 2 篇 其他
检索条件"主题词=Vision-language Models"
166 条 记 录,以下是91-100 订阅
排序:
Synth-CLIP: Synthetic data make CLIP generalize better in data-limited scenarios
收藏 引用
NEURAL NETWORKS 2025年 184卷 107083页
作者: Liu, Mushui He, Weijie Lu, Ziqian Dan, Jun Yu, Yunlong Li, Yingming Li, Xi Han, Jungong Zhejiang Univ Coll Informat Sci & Elect Engn Hangzhou Peoples R China Zhejiang Univ Sch Aeronaut & Astronaut Hangzhou Peoples R China Zhejiang Univ Coll Comp Sci & Technol Hangzhou Peoples R China Univ Sheffield Dept Comp Sci Sheffield England
Prompt learning is a powerful technique that enables the transfer of vision-language models (VLMs) like CLIP to downstream tasks. However, when the prompt-based methods are fine-tuned solely on base classes, they ofte... 详细信息
来源: 评论
Open-Vocabulary Action Localization With Iterative Visual Prompting
收藏 引用
IEEE ACCESS 2025年 13卷 56908-56917页
作者: Wake, Naoki Kanehira, Atsushi Sasabuchi, Kazuhiro Takamatsu, Jun Ikeuchi, Katsushi Microsoft Appl Robot Res Redmond WA 98052 USA
Video action localization aims to find the timings of specific actions from a long video. Although existing learning-based approaches have been successful, they require annotating videos, which comes with a considerab... 详细信息
来源: 评论
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
收藏 引用
INTERNATIONAL JOURNAL OF COMPUTER vision 2025年 第7期133卷 4134-4157页
作者: Cui, Shuang Li, Yi Li, Jiangmeng Tang, Xiongxin Su, Bing Xu, Fanjiang Xiong, Hui Chinese Acad Sci Natl Key Lab Space Integrated Informat Syst Inst Software Beijing Peoples R China Univ Chinese Acad Sci Beijing Peoples R China Renmin Univ China Gaoling Sch Artificial Intelligence Beijing Key Lab Big Data Management & Anal Methods Beijing Peoples R China Hong Kong Univ Sci & Technol Guangzhou Thrust Artificial Intelligence Guangzhou Peoples R China Hong Kong Univ Sci & Technol Dept Comp Sci & Engn Guangzhou Peoples R China
Single image defocus deblurring (SIDD) aims to restore an all-in-focus image from a defocused one. Distribution shifts in defocused images generally lead to performance degradation of existing methods during out-of-di... 详细信息
来源: 评论
RS3Lip: Consistency for remote sensing image classification on part embeddings using self-supervised learning and CLIP
收藏 引用
COMPUTER vision AND IMAGE UNDERSTANDING 2025年 251卷
作者: Jhaa, Ankit Singhab, Mainak Bhattacharyab, Avigyan Banerjeeb, Biplab LNM Inst Informat Technol Jaipur 302031 India Indian Inst Technol Mumbai 400076 India
Tackling domain and class generalization challenges remains a significant hurdle in the realm of remote sensing (RS). Recently, large-scale pre-trained vision-language models (VLMs), exemplified by CLIP, have showcase... 详细信息
来源: 评论
LAMARS: Large language Model-Based Anticipation Mechanism Acceleration in Real-Time Robotic Systems
收藏 引用
IEEE ACCESS 2025年 13卷 3864-3880页
作者: Gao, Yifang Luo, Wei Wang, Xuye Zhang, Shunshun Goh, Patrick Univ Sains Malaysia Sch Elect & Elect Engn Nibong Tebal 14300 Penang Malaysia Beijing Jiaotong Univ Sch Elect Engn Beijing 100044 Peoples R China Univ Sains Malaysia Sch Pharmaceut Sci Gelugor 11800 Penang Malaysia Guangxi Univ Sci & Technol Sch Automat Liuzhou 545006 Peoples R China
Large language models (LLMs) have assumed an increasingly crucial role in robotic systems because of their ability to leverage the extensive knowledge they possess in robotic inference and task handling. Although LLMs... 详细信息
来源: 评论
Global-local prompts guided image-text embedding, alignment and aggregation for multi-label zero-shot learning
收藏 引用
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 2025年 106卷
作者: Song, Tiecheng Huang, Yu Yang, Feng Qin, Anyong Zhao, Yue Gao, Chenqiang Chongqing Univ Posts & Telecommun Sch Commun & Informat Engn Chongqing 400065 Peoples R China Sun Yat Sen Univ Sch Intelligent Syst Engn Shenzhen Campus Shenzhen 518107 Guangdong Peoples R China
Multi-label zero-shot learning (MLZSL) aims to classify images into multiple unseen label classes, which is a practical yet challenging task. Recent methods have used vision-language models (VLM) for MLZSL, but they d... 详细信息
来源: 评论
MMTF-DES: A fusion of multimodal transformer models for desire, emotion, and sentiment analysis of social media data
收藏 引用
NEUROCOMPUTING 2025年 623卷
作者: Aziz, Abdul Chowdhury, Nihad Karim Kabir, Muhammad Ashad Chy, Abu Nowshed Siddique, Md. Jawad Univ Chittagong Dept Comp Sci & Engn Chattogram 4331 Bangladesh Charles Sturt Univ Sch Comp Math & Engn Bathurst NSW 2795 Australia Southern Illinois Univ Dept Comp Sci Carbondale IL 62901 USA
Desires, emotions, and sentiments are pivotal in understanding and predicting human behavior, influencing various aspects of decision-making, communication, and social interactions. Their analysis, particularly in the... 详细信息
来源: 评论
A two-step concept-based approach for enhanced interpretability and trust in skin lesion diagnosis
收藏 引用
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL 2025年 28卷 71-79页
作者: Patricio, Cristiano Teixeira, Luis F. Neves, Joao C. Univ Beira Interior Covilha Portugal NOVA LINCS Lisbon Portugal Univ Porto Fac Engn Porto Portugal INESC TEC Porto Portugal
The main challenges hindering the adoption of deep learning-based systems in clinical settings are the scarcity of annotated data and the lack of interpretability and trust in these systems. Concept Bottleneck models ... 详细信息
来源: 评论
CLIP-guided black-box domain adaptation of image classification
收藏 引用
SIGNAL IMAGE AND VIDEO PROCESSING 2024年 第5期18卷 4637-4646页
作者: Tian, Liang Ye, Mao Zhou, Lihua He, Qichen Univ Elect Sci & Technol China Sch Comp Sci & Engn Chengdu 611731 Peoples R China
Recently, the significant success of the large pre-trained models have attracted great attentions. How to sufficiently use these models is a big issue. Black-box domain adaptation is a way which tries to train a targe... 详细信息
来源: 评论
Source bias reduction for source-free domain adaptation
收藏 引用
SIGNAL IMAGE AND VIDEO PROCESSING 2024年 第SUPPL 1期18卷 883-893页
作者: Tian, Liang Ye, Mao Zhou, Lihua Wang, Zhenbin Univ Elect Sci & Technol China Sch Comp Sci & Engn Chengdu 611731 Peoples R China Sichuan Univ Coll Comp Sci Chengdu 610044 Peoples R China
Source-free domain adaptation (SFDA) mainly aims to the problem of not being able to access the source domain data during the model migration process. Although significant breakthroughs have been achieved, the current... 详细信息
来源: 评论