咨询与建议

限定检索结果

文献类型

  • 8 篇 期刊文献
  • 7 篇 会议

馆藏范围

  • 15 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 14 篇 工学
    • 8 篇 计算机科学与技术...
    • 4 篇 电气工程
    • 3 篇 控制科学与工程
    • 2 篇 电子科学与技术(可...
    • 1 篇 仪器科学与技术
    • 1 篇 材料科学与工程(可...
    • 1 篇 信息与通信工程
    • 1 篇 测绘科学与技术
    • 1 篇 生物医学工程(可授...
  • 2 篇 理学
    • 2 篇 化学
    • 1 篇 物理学
    • 1 篇 生物学
  • 2 篇 医学
    • 2 篇 临床医学
  • 1 篇 教育学
    • 1 篇 教育学
  • 1 篇 文学
    • 1 篇 外国语言文学

主题

  • 15 篇 visual language ...
  • 4 篇 large language m...
  • 2 篇 visual perceptio...
  • 2 篇 task planning
  • 1 篇 group activity r...
  • 1 篇 decision-making
  • 1 篇 gpt-4
  • 1 篇 generative artif...
  • 1 篇 clip
  • 1 篇 score matching
  • 1 篇 genai
  • 1 篇 error correction...
  • 1 篇 adversarial robu...
  • 1 篇 vlms
  • 1 篇 error correction
  • 1 篇 image captioning
  • 1 篇 transfer learnin...
  • 1 篇 scene understand...
  • 1 篇 gpt-3
  • 1 篇 yolov7

机构

  • 2 篇 fudan univ acad ...
  • 1 篇 korea elect tech...
  • 1 篇 department of el...
  • 1 篇 hubei engn univ ...
  • 1 篇 univ oxford oxfo...
  • 1 篇 korea adv inst s...
  • 1 篇 yildiz tech univ...
  • 1 篇 hisar hlth res c...
  • 1 篇 kocaeli univ inf...
  • 1 篇 hubei univ techn...
  • 1 篇 univ bergen dept...
  • 1 篇 xi an jiao tong ...
  • 1 篇 univ politecn va...
  • 1 篇 agcy sci technol...
  • 1 篇 univ oberta cata...
  • 1 篇 imperial coll lo...
  • 1 篇 chinese acad sci...
  • 1 篇 meituan inc peop...
  • 1 篇 univ bergen berg...
  • 1 篇 auckland univ te...

作者

  • 2 篇 mei aoran
  • 2 篇 zhu guo-niu
  • 2 篇 gan zhongxue
  • 1 篇 luxton-reilly an...
  • 1 篇 sun jiahao
  • 1 篇 wunsche burkhard...
  • 1 篇 feng tony haoran
  • 1 篇 ding caichang
  • 1 篇 gumuskaynak enes
  • 1 篇 liu yang
  • 1 篇 jia xiaojun
  • 1 篇 bayraktar ertugr...
  • 1 篇 pang shanmin
  • 1 篇 dai wei
  • 1 篇 zhu yan
  • 1 篇 geng jiajia
  • 1 篇 denny paul
  • 1 篇 guo qing
  • 1 篇 de zarza i.
  • 1 篇 cheng zehua

语言

  • 15 篇 英文
检索条件"主题词=Visual Language Models"
15 条 记 录,以下是11-20 订阅
排序:
An Eye for an AI: Evaluating GPT-4o's visual Perception Skills and Geometric Reasoning Skills Using Computer Graphics Questions  24
An Eye for an AI: Evaluating GPT-4o's Visual Perception Skil...
收藏 引用
2024 SIGGRAPH Asia Conference-SIGGRAPH Asia
作者: Feng, Tony Haoran Denny, Paul Wunsche, Burkhard C. Luxton-Reilly, Andrew Whalley, Jacqueline Univ Auckland Auckland New Zealand Auckland Univ Technol Auckland New Zealand
CG (Computer Graphics) is a popular field of CS (Computer Science), but many students find this topic difficult due to it requiring a large number of skills, such as mathematics, programming, geometric reasoning, and ... 详细信息
来源: 评论
Anchoring Vision and language Knowledge for Weakly Supervised Group Activity Recognition
Anchoring Vision and Language Knowledge for Weakly Supervise...
收藏 引用
2024 Conference on visual Communications and Image Processing
作者: Nugroho, Muhammad Adi Park, Jinyoung Kim, Donguk Kim, Changick Korea Adv Inst Sci & Technol KAIST Daejeon South Korea
The emergence of Foundation Vision-language models (VLMs) has ignited a surge of research in the computer vision field due to their robust baseline performance. Inspired by this, we propose the Anchoring Vision-Langua... 详细信息
来源: 评论
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition  23
Orthogonal Temporal Interpolation for Zero-Shot Video Recogn...
收藏 引用
31st ACM International Conference on Multimedia (MM)
作者: Zhu, Yan Zhuo, Junbao Ma, Bin Geng, Jiajia Wei, Xiaoming Wei, Xiaolin Wang, Shuhui Univ Chinese Acad Sci Sch Artificial Intelligence Beijing Peoples R China Chinese Acad Sci Inst Comp Technol Beijing Peoples R China Meituan Inc Beijing Peoples R China
Zero-shot video recognition (ZSVR) is a task that aims to recognize video categories that have not been seen during the model training process. Recently, vision-language models (VLMs) pre-trained on large-scale image-... 详细信息
来源: 评论
visual language Model for Preclinical Toxicologic Liver Histopathology Assessment  1
Visual Language Model for Preclinical Toxicologic Liver Hist...
收藏 引用
1st International Workshop on Vision-language models for Biomedical Applications (VLM4Bio)
作者: Cheng, Zehua Dai, Wei Sun, Jiahao Univ Oxford Oxford England Robo Space Beijing Peoples R China FLock Io London England Imperial Coll London London England
Preclinical drug safety assessment is a critical step in drug development that relies on time-consuming manual histopathological examination, which is prone to high inter-observer variability. Artificial intelligence ... 详细信息
来源: 评论
Video Fire Recognition Using Zero-shot Vision-language models Guided by a Task-aware Object Detector
收藏 引用
ACM Transactions on Multimedia Computing, Communications, and Applications 1000年
作者: Diego Gragnaniello Antonio Greco Carlo Sansone Bruno Vento Department of Information and Electrical Engineering and Applied Mathematics (DIEM) University of Salerno Italy Department of Electrical Engineering and Information Technology (DIETI) University of Napoli Federico II Italy
Fire detection from images or videos has gained a growing interest in recent years due to the criticality of the application. Both reliable real-time detectors and efficient retrieval techniques, able to process large... 详细信息
来源: 评论