Search query: "Any field = 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
21,007 records; showing records 1251-1260
VINDLU: A Recipe for Effective Video-and-Language Pretraining
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cheng, Feng; Wang, Xizi; Lei, Jie; Crandall, David; Bansal, Mohit; Bertasius, Gedas
Affiliations: Univ N Carolina Chapel Hill NC 27599 USA; Indiana Univ Bloomington IN 47405 USA
The last several years have witnessed remarkable progress in video-and-language (VidL) understanding. However, most modern VidL approaches use complex and specialized model architectures and sophisticated pretraining ...
Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Tan, Chuangchuang; Zhao, Yao; Wei, Shikui; Gu, Guanghua; Wei, Yunchao
Affiliations: Beijing Jiaotong Univ Inst Informat Sci Beijing Peoples R China; Beijing Key Lab Adv Informat Sci & Network Techno Beijing Peoples R China; Yanshan Univ Sch Informat Sci & Engn Qinhuangdao Peoples R China; Hebei Key Lab Informat Transmiss & Signal Proc Qinhuangdao Peoples R China
Recently, there has been a significant advancement in image generation technology, known as GAN. It can easily generate realistic fake images, leading to an increased risk of abuse. However, most image detectors suffe...
From Images to Textual Prompts: Zero-shot Visual Question Answering with Frozen Large Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Guo, Jiaxian; Li, Junnan; Li, Dongxu; Tiong, Anthony Meng Huat; Li, Boyang; Tao, Dacheng; Hoi, Steven
Affiliations: Univ Sydney Sydney NSW Australia; Salesforce Res San Francisco CA USA; Nanyang Technol Univ Singapore Singapore
Large language models (LLMs) have demonstrated excellent zero-shot generalization to new language tasks. However, effective utilization of LLMs for zero-shot visual question-answering (VQA) remains challenging, primar...
You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Fang, Xiang; Liu, Daizong; Zhou, Pan; Nan, Guoshun
Affiliations: Huazhong Univ Sci & Technol Hubei Key Lab Distributed Syst Secur Hubei Engn Res Ctr Big Data Secur Sch Cyber Sci & Engn Wuhan Peoples R China; Peking Univ Beijing Peoples R China; Beijing Univ Posts & Telecommun Beijing Peoples R China
Given an untrimmed video, temporal sentence grounding (TSG) aims to locate a target moment semantically according to a sentence query. Although previous respectable works have made decent success, they only focus on h...
RepMode: Learning to Re-parameterize Diverse Experts for Subcellular Structure Prediction
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhou, Donghao; Gu, Chunbin; Xu, Junde; Liu, Furui; Wang, Qiong; Chen, Guangyong; Heng, Pheng-Ann
Affiliations: Chinese Acad Sci Shenzhen Inst Adv Technol Guangdong Prov Key Lab Comp Vis & Virtual Real Shenzhen Peoples R China; Univ Chinese Acad Sci Beijing Peoples R China; Chinese Univ Hong Kong Hong Kong Peoples R China; Zhejiang Lab Hangzhou Peoples R China
In biological research, fluorescence staining is a key technique to reveal the locations and morphology of subcellular structures. However, it is slow, expensive, and harmful to cells. In this paper, we model it as a ...
MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Gao, Difei; Zhou, Luowei; Ji, Lei; Zhu, Linchao; Yang, Yi; Shou, Mike Zheng
Affiliations: Natl Univ Singapore Show Lab Singapore Singapore; Microsoft Albuquerque NM USA; Microsoft Res Asia Beijing Peoples R China; Zhejiang Univ Hangzhou Peoples R China; Google Brain Mountain View CA USA
To build Video Question Answering (VideoQA) systems capable of assisting humans in daily activities, seeking answers from long-form videos with diverse and complex events is a must. Existing multi-modal VQA models ach...
All-in-one Image Restoration for Unknown Degradations Using Adaptive Discriminative Filters for Specific Degradations
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Park, Dongwon; Lee, Byung Hyun; Chun, Se Young
Affiliations: Seoul Natl Univ INMC Seoul South Korea; Seoul Natl Univ Dept ECE Seoul South Korea; Seoul Natl Univ IPAI Seoul South Korea; UNIST Dept EE Ulsan South Korea
Image restorations for single degradations have been widely studied, demonstrating excellent performance for each degradation, but can not reflect unpredictable realistic environments with unknown multiple degradation...
SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Li, Pu; Guo, Jianwei; Zhang, Xiaopeng; Yan, Dong-Ming
Affiliations: Chinese Acad Sci Inst Automat MAIS Beijing Peoples R China; Univ Chinese Acad Sci Sch Artificial Intelligence Beijing Peoples R China
Reverse engineering CAD models from raw geometry is a classic but strenuous research problem. Previous learning-based methods rely heavily on labels due to the supervised design patterns or reconstruct CAD shapes that...
Fusing Pre-trained Language Models with Multimodal Prompts through Reinforcement Learning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yu, Youngjae; Chung, Jiwan; Yun, Heeseung; Hessel, Jack; Park, Jae Sung; Lu, Ximing; Zellers, Rowan; Ammanabrolu, Prithviraj; Le Bras, Ronan; Kim, Gunhee; Choi, Yejin
Affiliations: Allen Inst Artificial Intelligence Seattle WA USA; OpenAI Seattle WA USA; Yonsei Univ Dept Artificial Intelligence Seoul South Korea; Seoul Natl Univ Dept Comp Sci & Engn Seoul South Korea; Univ Washington Paul G Allen Sch Comp Sci Seattle WA 98195 USA
Language models are capable of commonsense reasoning: while domain-specific models can learn from explicit knowledge (e.g. commonsense graphs [6], ethical norms [25]), and larger models like GPT-3 [7] manifest broad c...
Leverage Interactive Affinity for Affordance Learning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Luo, Hongchen; Zhai, Wei; Zhang, Jing; Cao, Yang; Tao, Dacheng
Affiliations: Univ Sci & Technol China Hefei Peoples R China; Univ Sydney Camperdown Australia; JD Explore Acad Beijing Peoples R China; Hefei Comprehens Natl Sci Ctr Inst Artificial Intelligence Hefei Peoples R China
Perceiving potential "action possibilities" (i.e., affordance) regions of images and learning interactive functionalities of objects from human demonstration is a challenging task due to the diversity of hum...