Search query: "Subject = Vision-Language Models"
149 records; showing 41-50
Label Propagation for Zero-shot Classification with Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Stojnic, Vladan; Kalantidis, Yannis; Tolias, Giorgos
Affiliations: Czech Tech Univ FEE VRG Prague Czech Republic; NAVER LABS Europe Meylan France
Vision-language models (VLMs) have demonstrated impressive performance on zero-shot classification, i.e., classification when provided merely with a list of class names. In this paper, we tackle the case of zero-shot c...
Multiple Prompt Fusion for Zero-Shot Lesion Detection Using Vision-Language Models
26th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI)
Authors: Guo, Miaotian; Yi, Huahui; Qin, Ziyuan; Wang, Haiying; Men, Aidong; Lao, Qicheng
Affiliations: Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing Peoples R China; Sichuan Univ West China Biomed Big Data Ctr West China Hosp Sichuan University Sichuan Peoples R China; Shanghai Artificial Intelligence Lab Shanghai Peoples R China
The success of large-scale pre-trained vision-language models (VLM) has provided a promising direction of transferring natural image representations to the medical domain by providing a well-designed prompt with medic...
Towards Better Vision-Inspired Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cao, Yun-Hao; Ji, Kaixiang; Huang, Ziyuan; Zheng, Chuanyang; Liu, Jiajia; Wang, Jian; Chen, Jingdong; Yang, Ming
Affiliations: Nanjing Univ Natl Key Lab Novel Software Technol Nanjing Jiangsu Peoples R China; Ant Grp Hangzhou Zhejiang Peoples R China
Vision-language (VL) models have achieved unprecedented success recently, in which the connection module is the key to bridge the modality gap. Nevertheless, the abundant visual clues are not sufficiently exploited in...
The Arrival of Artificial Intelligence Large Language Models and Vision-Language Models: A Potential to Possible Change in the Paradigm of Healthcare Delivery in Dermatology
JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2024, vol. 144, no. 6, pp. 1186-1188
Authors: Gupta, Aditya K.; Talukder, Mesbah; Wang, Tong; Daneshjou, Roxana; Piguet, Vincent
Affiliations: Mediprobe Res Inc London ON Canada; Univ Toronto Dept Med Div Dermatol Toronto ON Canada; BRAC Univ Sch Pharm Dhaka Bangladesh; Stanford Sch Med Dept Dermatol Redwood City CA USA; Stanford Sch Med Dept Biomed Data Sci Redwood City CA USA; Womens Coll Hosp Div Dermatol Toronto ON Canada
Concept-Based Analysis of Neural Networks via Vision-Language Models
1st International Symposium on AI Verification (SAIV)
Authors: Mangal, Ravi; Narodytska, Nina; Gopinath, Divya; Hu, Boyue Caroline; Roy, Anirban; Jha, Susmit; Pasareanu, Corina S.
Affiliations: Carnegie Mellon Univ Pittsburgh PA 15213 USA; VMware Res Palo Alto CA USA; NASA Ames Moffett Field CA USA; Univ Toronto Toronto ON Canada; SRI Int Menlo Pk CA USA
The analysis of vision-based deep neural networks (DNNs) is highly desirable but it is very challenging due to the difficulty of expressing formal specifications for vision tasks and the lack of efficient verification...
Fast Certification of Vision-Language Models Using Incremental Randomized Smoothing
Conference on Safe and Trustworthy Machine Learning (SaTML)
Authors: Nirala, Ashutosh; Joshi, Ameya; Sarkar, Soumik; Hegde, Chinmay
Affiliations: Iowa State Univ Ames IA 50011 USA; New York Univ New York NY USA
A key benefit of deep vision-language models such as CLIP is that they enable zero-shot open vocabulary classification; the user has the ability to define novel class labels via natural language prompts at inference ti...
Unsupervised Prototype Adapter for Vision-Language Models
6th Chinese Conference on Pattern Recognition and Computer Vision (PRCV)
Authors: Zhang, Yi; Zhang, Ce; Hu, Xueting; He, Zhihai
Affiliations: Harbin Inst Technol Harbin Peoples R China; Southern Univ Sci & Technol Shenzhen Peoples R China; Pengcheng Lab Shenzhen Peoples R China
Recently, large-scale pre-trained vision-language models (e.g., CLIP and ALIGN) have demonstrated remarkable effectiveness in acquiring transferable visual representations. To leverage the valuable knowledge encoded wi...
The Potential of Vision-Language Models for Content Moderation of Children's Videos
22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023
Authors: Ahmed, Syed Hammad; Hu, Shengnan; Sukthankar, Gita
Affiliations: University of Central Florida Department of Computer Science Orlando United States
Natural language supervision has been shown to be effective for zero-shot learning in many computer vision tasks, such as object detection and activity recognition. However, generating informative prompts can be chall...
GraphVL: Graph-Enhanced Semantic Modeling via Vision-Language Models for Generalized Class Discovery
15th Indian Conference on Computer Vision, Graphics and Image Processing
Authors: Solanki, Bhupendra; Nair, Ashwin R.; Singha, Mainak; Mukhopadhyay, Souradeep; Jha, Ankit; Banerjee, Biplab
Affiliations: Indian Inst Technol Mumbai Maharashtra India; Indian Inst Sci Educ & Res Thiruvananthapuram Thiruvananthapuram Kerala India; Indian Inst Sci Bangalore Karnataka India; LNM Inst Informat Technol Jaipur Rajasthan India
Generalized Category Discovery (GCD) aims to cluster unlabeled images into known and novel categories using labeled images from known classes. To address the challenge of transferring features from known to unknown cl...
Cross-Modal Concept Learning and Inference for Vision-Language Models
NEUROCOMPUTING, 2024, vol. 583
Authors: Zhang, Yi; Zhang, Ce; Tang, Yushun; He, Zhihai
Affiliations: Harbin Inst Technol Harbin 150001 Peoples R China; Southern Univ Sci & Technol Shenzhen 518055 Peoples R China; Pengcheng Lab Shenzhen 518000 Peoples R China
Large-scale pre-trained vision-language models (VLMs), such as CLIP, establish the correlation between texts and images, achieving remarkable success on various downstream tasks with fine-tuning. In existing fine-tu...