Refine search results

Document type

  • 13 journal articles
  • 4 conference papers

Collection scope

  • 17 electronic documents
  • 0 print holdings

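Date distribution

Subject classification

  • 16 Engineering
    • 11 Computer Science and Technology...
    • 7 Electrical Engineering
    • 2 Electronic Science and Technology (may...
    • 2 Information and Communication Engineering
    • 2 Control Science and Engineering
    • 2 Surveying and Mapping Science and Technology
    • 2 Transportation Engineering
    • 2 Software Engineering
    • 2 Safety Science and Engineering
    • 2 Cyberspace Security
    • 1 Power Engineering and Engineering Thermo...
    • 1 Civil Engineering
    • 1 Oil and Natural Gas Engineering
  • 3 Medicine
    • 2 Clinical Medicine
    • 1 Basic Medicine (may confer medical...
  • 3 Management
    • 3 Management Science and Engineering (may...
    • 1 Library, Information and Archives Manage...
  • 2 Science
    • 1 Physics
    • 1 Geophysics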

Topics

  • 17 visual-language ...
  • 3 representation l...
  • 2 deep learning
  • 2 knowledge distil...
  • 2 visualization
  • 2 semantics
  • 2 feature extracti...
  • 2 training
  • 1 energy auditing
  • 1 pseudo multi-dom...
  • 1 object detection
  • 1 relationship bin...
  • 1 military image c...
  • 1 semantic segment...
  • 1 video action rec...
  • 1 task analysis
  • 1 few-shot segment...
  • 1 unmanned ground ...
  • 1 image understand...
  • 1 collaborative pu...

Institutions

  • 1 south china univ...
  • 1 univ zhengzhou s...
  • 1 beijing forestry...
  • 1 school of cyber ...
  • 1 sci & technol el...
  • 1 huazhong univers...
  • 1 univ surrey surr...
  • 1 tianjin cancer i...
  • 1 university of br...
  • 1 harbin engn univ...
  • 1 peking univ shen...
  • 1 china univ petr ...
  • 1 henan remote sen...
  • 1 hangzhou dianzi ...
  • 1 zhengzhou univ s...
  • 1 nanyang technol ...
  • 1 huazhong univ sc...
  • 1 beijing forestry...
  • 1 aerosp informat ...
  • 1 univ shanghai sc...

Authors

  • 1 chen zhong
  • 1 zhu yuesheng
  • 1 hongru shen
  • 1 liu ruijin
  • 1 tang jingfeng
  • 1 shi zhuang
  • 1 luo guibo
  • 1 lu runuo
  • 1 lewis martha
  • 1 xie yue
  • 1 xu keyu
  • 1 cheng keyang
  • 1 weng zhenyu
  • 1 yang zihe
  • 1 cheng jian
  • 1 cai weijia
  • 1 zhengming ding
  • 1 ye mao
  • 1 gan xiaozheng
  • 1 li zilong

Language

  • 17 English

Search criteria: "Subject term = visual-language model"
17 records; showing 1-10
Ground4Act: Leveraging visual-language model for collaborative pushing and grasping in clutter
IMAGE AND VISION COMPUTING, 2024, vol. 151
Authors: Yang, Yuxiang; Guo, Jiangtao; Li, Zilong; He, Zhiwei; Zhang, Jing
Affiliations: Hangzhou Dianzi Univ Sch Elect & Informat Hangzhou Peoples R China; Univ Sydney Sch Comp Sci Sydney NSW Australia
The challenge in robotics is to enable robots to transition from visual perception and language understanding to performing tasks such as grasping and assembling objects, bridging the gap between "seeing" and ...
Constraint embedding for prompt tuning in vision-language pre-trained model
MULTIMEDIA SYSTEMS, 2025, vol. 31, no. 1, pp. 1-16
Authors: Cheng, Keyang; Wei, Liutao; Tang, Jingfeng; Zhan, Yongzhao
Affiliations: Jiangsu Univ Sch Comp Sci & Commun Engn Zhenjiang 212013 Jiangsu Peoples R China
Prompt tuning, which fine-tunes the feature distributions in pre-trained vision-language (VL) models by adding learnable tokens or contexts into image and text branches, has emerged as a popular method for enhancing t...
Enhancing generalization in camera trap image recognition: Fine-tuning visual language models
NEUROCOMPUTING, 2025, vol. 634
Authors: Yang, Zihe; Tian, Ye; Wang, Lifeng; Zhang, Junguo
Affiliations: Beijing Forestry Univ Sch Technol Beijing 100083 Peoples R China; Beijing Forestry Univ Key Lab State Forestry & Grassland Adm Forestry Eq Beijing 100083 Peoples R China; Beijing Forestry Univ Res Ctr Biodivers Intelligent Monitoring Beijing 100083 Peoples R China
This study introduces a novel fine-tuning approach for enhancing the generalization capabilities of visual language models in the context of wildlife monitoring, particularly for camera trap image recognition. In this...
Pixel-level semantic parsing in complex industrial scenarios using large vision-language models
INFORMATION FUSION, 2025, vol. 116
Authors: Ji, Xiaofeng; Gong, Faming; Wang, Nuanlai; Zhao, Yanpu; Ma, Yuhui; Shi, Zhuang
Affiliations: China Univ Petr East China Coll Comp Sci & Technol Qingdao 266580 Peoples R China; Aerosp Informat Res Inst QiLu Lab 32 Jinan 250100 Peoples R China
The emergence of vision-language models, particularly Contrastive Language-Image Pre-Training (CLIP), has significantly improved the performance of numerous visual tasks, demonstrating notable zero-shot transfer abili...
EDIR: an expert method for describing image regions based on knowledge distillation and triple fusion
APPLIED INTELLIGENCE, 2025, vol. 55, no. 1, pp. 1-16
Authors: Ren, Kai; Hu, Chuanping; Xi, Hao; Li, Yongqiang; Fan, Jinhao; Liu, Lihua
Affiliations: Univ Zhengzhou Sch Elect & Informat Engn Zhengzhou Henan Peoples R China; Zhengzhou Univ Sch Cyber Sci & Engn Zhengzhou Henan Peoples R China; Henan Remote Sensing Inst Zhengzhou Henan Peoples R China
Fine-grained visual features generally require higher image input resolutions, which in turn necessitate a larger parameter count for general visual models to effectively analyze these features. However, the substanti...
Adaptive Face Recognition for Multi-Type Occlusions
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, vol. 34, no. 11, pp. 11400-11412
Authors: Liu, Yuxi; Luo, Guibo; Weng, Zhenyu; Zhu, Yuesheng
Affiliations: Peking Univ Shenzhen Grad Sch Sch Elect & Comp Engn Shenzhen 518055 Peoples R China; Nanyang Technol Univ Sch Elect & Elect Engn Singapore 639798 Singapore
Due to the prevalence of influenza outbreaks and outdoor scenarios with various obstructing decorations, recognizing faces with occlusions has become a pressing challenge to address. However, current research mainly f...
Military Image Captioning for Low-Altitude UAV or UGV Perspectives
DRONES, 2024, vol. 8, no. 9, p. 421
Authors: Pan, Lizhi; Song, Chengtian; Gan, Xiaozheng; Xu, Keyu; Xie, Yue
Affiliations: Beijing Inst Technol Sch Mechatron Engn Beijing 100081 Peoples R China; Sci & Technol Electromech Dynam Control Lab Xian 710065 Peoples R China
Low-altitude unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs), which boast high-resolution imaging and agile maneuvering capabilities, are widely utilized in military scenarios and generate a vast a...
Real-time object detection method with single-domain generalization based on YOLOv8
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, vol. 21, no. 6, pp. 1-12
Authors: Zhou, Yipeng; Qian, Huaming
Affiliations: Harbin Engn Univ Coll Intelligent Syst Sci & Engn Harbin 150001 Peoples R China
The prevailing models for object detection are often beset by a dearth of generalizability across domains. Specifically, while these models may perform exceptionally well on a given dataset, their efficacy can plummet...
Advance One-Shot Multispectral Instance Detection With Text's Supervision
IEEE SIGNAL PROCESSING LETTERS, 2024, vol. 31, pp. 1605-1609
Authors: Feng, Chen; Cheng, Jian; Xiao, Yang; Cao, Zhiguo
Affiliations: Huazhong Univ Sci & Technol Sch Artificial Intelligence & Automat Wuhan 430074 Peoples R China
One key issue within one-shot multispectral instance detection (OMID) is to extract features of strong instance discriminative power, domain adaptation capability, and instance-wise generality. Existing methods genera...
RoboAuditor: Goal-Oriented Robotic System for Assessing Energy-intensive Indoor Appliance via visual language models  23
10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys)
Authors: Cai, Weijia; Huang, Lei; Zou, Zhengbo
Affiliations: Univ British Columbia Vancouver BC Canada
Energy auditing is a crucial step in building retrofitting to enhance building energy efficiency. However, auditing tasks, such as profiling energy-consuming appliances in buildings, rely heavily on human inspectors, ...