咨询与建议

限定检索结果

文献类型

  • 90 篇 会议
  • 70 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 161 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 153 篇 工学
    • 124 篇 计算机科学与技术...
    • 35 篇 电气工程
    • 15 篇 软件工程
    • 12 篇 信息与通信工程
    • 12 篇 控制科学与工程
    • 9 篇 测绘科学与技术
    • 8 篇 电子科学与技术(可...
    • 7 篇 生物医学工程(可授...
    • 4 篇 机械工程
    • 4 篇 仪器科学与技术
    • 4 篇 材料科学与工程(可...
    • 2 篇 交通运输工程
    • 1 篇 航空宇航科学与技...
    • 1 篇 环境科学与工程(可...
    • 1 篇 生物工程
  • 31 篇 医学
    • 22 篇 临床医学
    • 8 篇 特种医学
    • 4 篇 基础医学(可授医学...
    • 1 篇 中西医结合
    • 1 篇 医学技术(可授医学...
  • 23 篇 理学
    • 10 篇 地球物理学
    • 8 篇 物理学
    • 6 篇 化学
    • 5 篇 生物学
    • 3 篇 地理学
    • 1 篇 天文学
    • 1 篇 地质学
  • 5 篇 管理学
    • 4 篇 管理科学与工程(可...
    • 1 篇 图书情报与档案管...
  • 1 篇 哲学
    • 1 篇 哲学
  • 1 篇 农学

主题

  • 161 篇 vision-language ...
  • 16 篇 large language m...
  • 13 篇 prompt learning
  • 11 篇 clip
  • 11 篇 few-shot learnin...
  • 10 篇 visualization
  • 7 篇 contrastive lear...
  • 6 篇 foundation model...
  • 6 篇 remote sensing
  • 6 篇 training
  • 6 篇 adaptation model...
  • 5 篇 object detection
  • 5 篇 deep learning
  • 5 篇 feature extracti...
  • 5 篇 image classifica...
  • 4 篇 long-tailed reco...
  • 4 篇 computational mo...
  • 4 篇 artificial intel...
  • 4 篇 computer vision
  • 4 篇 domain generaliz...

机构

  • 4 篇 chinese acad sci...
  • 4 篇 carnegie mellon ...
  • 4 篇 univ chinese aca...
  • 3 篇 inesc tec porto
  • 3 篇 sichuan univ col...
  • 3 篇 univ chinese aca...
  • 3 篇 zhejiang univ pe...
  • 3 篇 chinese univ hon...
  • 2 篇 shanghai ai lab ...
  • 2 篇 ecole polytech f...
  • 2 篇 tsinghua univ de...
  • 2 篇 harbin inst tech...
  • 2 篇 univ porto fac e...
  • 2 篇 cent south univ ...
  • 2 篇 beijing univ pos...
  • 2 篇 city univ hong k...
  • 2 篇 china univ geosc...
  • 2 篇 sichuan univ col...
  • 2 篇 tech univ munich...
  • 2 篇 westlake univ sc...

作者

  • 4 篇 banerjee biplab
  • 4 篇 zhang yi
  • 4 篇 jha ankit
  • 3 篇 wang donglin
  • 3 篇 singha mainak
  • 3 篇 ding kun
  • 3 篇 zhang ce
  • 3 篇 tuia devis
  • 2 篇 men aidong
  • 2 篇 li haifeng
  • 2 篇 zhang min
  • 2 篇 liu xuyang
  • 2 篇 chen honggang
  • 2 篇 ma chao
  • 2 篇 guo miaotian
  • 2 篇 yang yang
  • 2 篇 ricci elisa
  • 2 篇 ye mao
  • 2 篇 tian liang
  • 2 篇 patricio cristia...

语言

  • 159 篇 英文
  • 1 篇 其他
检索条件"主题词=Vision-language Models"
161 条 记 录,以下是81-90 订阅
排序:
Recent Advances of Foundation language models-based Continual Learning: A Survey
收藏 引用
ACM COMPUTING SURVEYS 2025年 第5期57卷 1-38页
作者: Yang, Yutao Zhou, Jie Ding, Xuan wen Huai, Tianyu Liu, Shunyu Chen, Qin Xie, Yuan He, Liang East China Normal Univ Sch Comp Sci & Technol Shanghai Peoples R China
Recently, foundation language models (LMs) have marked significant achievements in the domains of natural language processing and computer vision. Unlike traditional neural network models, foundation LMs obtain a grea... 详细信息
来源: 评论
Progressive Visual Prompt Learning with Contrastive Feature Re-formation
收藏 引用
INTERNATIONAL JOURNAL OF COMPUTER vision 2025年 第2期133卷 511-526页
作者: Xu, Chen Zhu, Yuhan Shen, Haocheng Chen, Boheng Liao, Yixuan Chen, Xiaoxin Wang, Limin Nanjing Univ State Key Lab Novel Software Technol Nanjing Peoples R China VIVO AI Lab Shenzhen Peoples R China Shanghai AI Lab Shanghai Peoples R China
Prompt learning has recently emerged as a compelling alternative to the traditional fine-tuning paradigm for adapting the pre-trained vision-language (V-L) models to downstream tasks. Drawing inspiration from the succ... 详细信息
来源: 评论
Image-text aggregation for open-vocabulary semantic segmentation
收藏 引用
NEUROCOMPUTING 2025年 630卷
作者: Cheng, Shengyang Huang, Jianyong Wang, Xiaodong Huang, Lei Wei, Zhiqiang Ocean Univ China Fac Informat Sci & Engn Qingdao 266100 Peoples R China Qingdao Educ Equipment & Informat Technol Ctr Qingdao 266022 Peoples R China
Existing works on open-vocabulary semantic segmentation explore utilizing large-scale vision-language models. Recent methods have relied mostly on visual features while treating text features as supporting components.... 详细信息
来源: 评论
Image-text feature learning for unsupervised visible-infrared person re-identification
收藏 引用
IMAGE AND vision COMPUTING 2025年 158卷
作者: Guo, Jifeng Pang, Zhiqi Guilin Univ Aerosp Technol Coll Comp Sci & Engn Guilin 541000 Guangxi Peoples R China Harbin Inst Technol Fac Comp Harbin 150001 Heilongjiang Peoples R China
Visible-infrared person re-identification (VI-ReID) focuses on matching infrared and visible images of the same person. To reduce labeling costs, unsupervised VI-ReID (UVI-ReID) methods typically use clustering algori... 详细信息
来源: 评论
VideoQA in the Era of LLMs: An Empirical Study
收藏 引用
INTERNATIONAL JOURNAL OF COMPUTER vision 2025年 1-24页
作者: Xiao, Junbin Huang, Nanxin Qin, Hangyu Li, Dongyang Li, Yicong Zhu, Fengbin Tao, Zhulin Yu, Jianxing Lin, Liang Chua, Tat-Seng Yao, Angela Natl Univ Singapore Singapore Singapore Commun Univ China Beijing Peoples R China Sun Yat Sen Univ Guangzhou Peoples R China
Video Large language models (Video-LLMs) are flourishing and has advanced many video-language tasks. As a golden testbed, Video Question Answering (VideoQA) plays pivotal role in Video-LLM developing. This work conduc... 详细信息
来源: 评论
Synth-CLIP: Synthetic data make CLIP generalize better in data-limited scenarios
收藏 引用
NEURAL NETWORKS 2025年 184卷 107083页
作者: Liu, Mushui He, Weijie Lu, Ziqian Dan, Jun Yu, Yunlong Li, Yingming Li, Xi Han, Jungong Zhejiang Univ Coll Informat Sci & Elect Engn Hangzhou Peoples R China Zhejiang Univ Sch Aeronaut & Astronaut Hangzhou Peoples R China Zhejiang Univ Coll Comp Sci & Technol Hangzhou Peoples R China Univ Sheffield Dept Comp Sci Sheffield England
Prompt learning is a powerful technique that enables the transfer of vision-language models (VLMs) like CLIP to downstream tasks. However, when the prompt-based methods are fine-tuned solely on base classes, they ofte... 详细信息
来源: 评论
Open-Vocabulary Action Localization With Iterative Visual Prompting
收藏 引用
IEEE ACCESS 2025年 13卷 56908-56917页
作者: Wake, Naoki Kanehira, Atsushi Sasabuchi, Kazuhiro Takamatsu, Jun Ikeuchi, Katsushi Microsoft Appl Robot Res Redmond WA 98052 USA
Video action localization aims to find the timings of specific actions from a long video. Although existing learning-based approaches have been successful, they require annotating videos, which comes with a considerab... 详细信息
来源: 评论
RS3Lip: Consistency for remote sensing image classification on part embeddings using self-supervised learning and CLIP
收藏 引用
COMPUTER vision AND IMAGE UNDERSTANDING 2025年 251卷
作者: Jhaa, Ankit Singhab, Mainak Bhattacharyab, Avigyan Banerjeeb, Biplab LNM Inst Informat Technol Jaipur 302031 India Indian Inst Technol Mumbai 400076 India
Tackling domain and class generalization challenges remains a significant hurdle in the realm of remote sensing (RS). Recently, large-scale pre-trained vision-language models (VLMs), exemplified by CLIP, have showcase... 详细信息
来源: 评论
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
收藏 引用
INTERNATIONAL JOURNAL OF COMPUTER vision 2025年 1-24页
作者: Cui, Shuang Li, Yi Li, Jiangmeng Tang, Xiongxin Su, Bing Xu, Fanjiang Xiong, Hui Chinese Acad Sci Natl Key Lab Space Integrated Informat Syst Inst Software Beijing Peoples R China Univ Chinese Acad Sci Beijing Peoples R China Renmin Univ China Gaoling Sch Artificial Intelligence Beijing Key Lab Big Data Management & Anal Methods Beijing Peoples R China Hong Kong Univ Sci & Technol Guangzhou Thrust Artificial Intelligence Guangzhou Peoples R China Hong Kong Univ Sci & Technol Dept Comp Sci & Engn Guangzhou Peoples R China
Single image defocus deblurring (SIDD) aims to restore an all-in-focus image from a defocused one. Distribution shifts in defocused images generally lead to performance degradation of existing methods during out-of-di... 详细信息
来源: 评论
LAMARS: Large language Model-Based Anticipation Mechanism Acceleration in Real-Time Robotic Systems
收藏 引用
IEEE ACCESS 2025年 13卷 3864-3880页
作者: Gao, Yifang Luo, Wei Wang, Xuye Zhang, Shunshun Goh, Patrick Univ Sains Malaysia Sch Elect & Elect Engn Nibong Tebal 14300 Penang Malaysia Beijing Jiaotong Univ Sch Elect Engn Beijing 100044 Peoples R China Univ Sains Malaysia Sch Pharmaceut Sci Gelugor 11800 Penang Malaysia Guangxi Univ Sci & Technol Sch Automat Liuzhou 545006 Peoples R China
Large language models (LLMs) have assumed an increasingly crucial role in robotic systems because of their ability to leverage the extensive knowledge they possess in robotic inference and task handling. Although LLMs... 详细信息
来源: 评论