Search query: "Subject term = Vision-language Models"
161 records; showing 81-90
LAMARS: Large Language Model-Based Anticipation Mechanism Acceleration in Real-Time Robotic Systems
IEEE ACCESS, 2025, vol. 13, pp. 3864-3880
Authors: Gao, Yifang; Luo, Wei; Wang, Xuye; Zhang, Shunshun; Goh, Patrick
Affiliations: Univ Sains Malaysia Sch Elect & Elect Engn Nibong Tebal 14300 Penang Malaysia; Beijing Jiaotong Univ Sch Elect Engn Beijing 100044 Peoples R China; Univ Sains Malaysia Sch Pharmaceut Sci Gelugor 11800 Penang Malaysia; Guangxi Univ Sci & Technol Sch Automat Liuzhou 545006 Peoples R China
Large language models (LLMs) have assumed an increasingly crucial role in robotic systems because of their ability to leverage the extensive knowledge they possess in robotic inference and task handling. Although LLMs...
Global-local prompts guided image-text embedding, alignment and aggregation for multi-label zero-shot learning
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, vol. 106
Authors: Song, Tiecheng; Huang, Yu; Yang, Feng; Qin, Anyong; Zhao, Yue; Gao, Chenqiang
Affiliations: Chongqing Univ Posts & Telecommun Sch Commun & Informat Engn Chongqing 400065 Peoples R China; Sun Yat Sen Univ Sch Intelligent Syst Engn Shenzhen Campus Shenzhen 518107 Guangdong Peoples R China
Multi-label zero-shot learning (MLZSL) aims to classify images into multiple unseen label classes, which is a practical yet challenging task. Recent methods have used vision-language models (VLM) for MLZSL, but they d...
MMTF-DES: A fusion of multimodal transformer models for desire, emotion, and sentiment analysis of social media data
NEUROCOMPUTING, 2025, vol. 623
Authors: Aziz, Abdul; Chowdhury, Nihad Karim; Kabir, Muhammad Ashad; Chy, Abu Nowshed; Siddique, Md. Jawad
Affiliations: Univ Chittagong Dept Comp Sci & Engn Chattogram 4331 Bangladesh; Charles Sturt Univ Sch Comp Math & Engn Bathurst NSW 2795 Australia; Southern Illinois Univ Dept Comp Sci Carbondale IL 62901 USA
Desires, emotions, and sentiments are pivotal in understanding and predicting human behavior, influencing various aspects of decision-making, communication, and social interactions. Their analysis, particularly in the...
A two-step concept-based approach for enhanced interpretability and trust in skin lesion diagnosis
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2025, vol. 28, pp. 71-79
Authors: Patricio, Cristiano; Teixeira, Luis F.; Neves, Joao C.
Affiliations: Univ Beira Interior Covilha Portugal; NOVA LINCS Lisbon Portugal; Univ Porto Fac Engn Porto Portugal; INESC TEC Porto Portugal
The main challenges hindering the adoption of deep learning-based systems in clinical settings are the scarcity of annotated data and the lack of interpretability and trust in these systems. Concept Bottleneck Models ...
Language-Guided Object Localization via Refined Spotting Enhancement in Remote Sensing Imagery
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, vol. 63
Authors: Zhang, Peirong; Zhang, Yidan; Wu, Hui; Liu, Xiaoxuan; Hou, Yingyan; Wang, Lei
Affiliations: Chinese Acad Sci Aerosp Informat Res Inst Key Lab Network Informat Syst Technol NIST Beijing 100190 Peoples R China; Chinese Acad Sci Key Lab Target Cognit & Applicat Technol TCAT Beijing 100190 Peoples R China; Univ Chinese Acad Sci Sch Elect Elect & Commun Engn Beijing 100190 Peoples R China
Language-guided remote sensing image object localization uses intuitive natural language interactions to locate objects of interest within satellite or drone imagery, and has a wide range of practical applications. Ea...
LM-CLIP: Adapting Positive Asymmetric Loss for Long-Tailed Multi-Label Classification
IEEE ACCESS, 2025, vol. 13, pp. 71053-71065
Authors: Timmermann, Christoph; Jung, Seunghyeon; Kim, Miso; Lee, Woojin
Affiliations: Dongguk Univ Grad Sch Comp Sci & Artificial Intelligence Seoul 04620 South Korea
Accurate multi-label image classification is essential for real-world applications, especially in scenarios with long-tailed class distributions, where some classes appear frequently while others are rare. This imbala...
Enhanced Cleft Lip and Palate Classification Using SigLIP 2: A Comparative Study with Vision Transformers and Siamese Networks
APPLIED SCIENCES-BASEL, 2025, vol. 15, no. 9, p. 4766
Authors: Nantha, Oraphan; Sathanarugsawait, Benjaporn; Praneetpolgrang, Prasong
Affiliations: Sripatum Univ Sch Informat Technol Bangkok 10900 Thailand
This paper extends our previous work on cleft lip and/or palate (CL/P) classification, which employed vision transformers (ViTs) and Siamese neural networks. We now integrate SigLIP 2, a state-of-the-art multilingual ...
Towards molecular structure discovery from cryo-ET density volumes via modelling auxiliary semantic prototypes
BRIEFINGS IN BIOINFORMATICS, 2025, vol. 26, no. 1, article bbae570
Authors: Nair, Ashwin; Li, Xingjian; Solanki, Bhupendra; Mukhopadhyay, Souradeep; Jha, Ankit; Uddin, Mostofa Rafid; Singha, Mainak; Banerjee, Biplab; Xu, Min
Affiliations: Indian Inst Sci Educ & Res Dept Data Sci Vithura 695551 Kerala India; Carnegie Mellon Univ Computat Biol Dept Pittsburgh PA 15213 USA; Indian Inst Technol Machine Learning & Visual Comp Lab Powai 400076 Maharashtra India; Indian Inst Sci Comp Sci & Automat CV Raman Rd Bengaluru 560012 Karnataka India; LNM Inst Informat Technol Comp Sci & Engn Jaipur 302031 Rajasthan India
Cryo-electron tomography (cryo-ET) is confronted with the intricate task of unveiling novel structures. General class discovery (GCD) seeks to identify new classes by learning a model that can pseudo-label unannotated...
Exploring the Limits of Large Language Models' Ability to Distinguish Between Objects
APPLIED SCIENCES-BASEL, 2025, vol. 15, no. 9, p. 4620
Authors: Ju, Hyeongjin; Park, Incheol; Nalcakan, Yagiz; Jin, Youngwan; Yeo, Sanghyeop; Kim, Shiho
Affiliations: Yonsei Univ Sch Integrated Technol Incheon 21983 South Korea; Yonsei Univ BK21 Grad Program Intelligent Semicond Technol Incheon 21983 South Korea
This paper explores the capability of large language models (LLMs) to accurately classify objects in challenging visual scenarios, focusing on two main tasks: differentiating real objects from artificial replicas and ...
Enhancing open-vocabulary object detection through region-word and region-vision matching
MULTIMEDIA SYSTEMS, 2025, vol. 31, no. 3, pp. 1-15
Authors: Chen, Yi; Wang, Chong; Li, Zhehao; Lin, Sunqi; Xiang, Jinhui; Li, Yuqi; Qian, Jiangbo
Affiliations: Ningbo Univ Fac Elect Engn & Comp Sci Ningbo 315000 Zhejiang Peoples R China
Open-vocabulary object detection (OVOD) aims to detect novel object categories beyond the training set. Existing OVOD methods have made encouraging progress by leveraging large-scale image-caption pairs and pre-traine...