咨询与建议

限定检索结果

文献类型

  • 20,860 篇 会议
  • 105 篇 期刊文献
  • 43 册 图书

馆藏范围

  • 21,007 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,620 篇 工学
    • 11,056 篇 计算机科学与技术...
    • 2,652 篇 机械工程
    • 2,252 篇 软件工程
    • 914 篇 光学工程
    • 885 篇 电气工程
    • 529 篇 控制科学与工程
    • 477 篇 信息与通信工程
    • 216 篇 测绘科学与技术
    • 135 篇 生物工程
    • 127 篇 生物医学工程(可授...
    • 98 篇 电子科学与技术(可...
    • 92 篇 仪器科学与技术
    • 46 篇 安全科学与工程
    • 40 篇 建筑学
    • 40 篇 化学工程与技术
    • 39 篇 土木工程
    • 37 篇 交通运输工程
    • 35 篇 力学(可授工学、理...
    • 33 篇 航空宇航科学与技...
  • 3,494 篇 医学
    • 3,489 篇 临床医学
    • 32 篇 基础医学(可授医学...
  • 2,247 篇 理学
    • 1,145 篇 物理学
    • 1,081 篇 数学
    • 401 篇 生物学
    • 384 篇 统计学(可授理学、...
    • 245 篇 系统科学
    • 46 篇 化学
  • 343 篇 管理学
    • 176 篇 管理科学与工程(可...
    • 168 篇 图书情报与档案管...
    • 34 篇 工商管理
  • 31 篇 法学
  • 19 篇 农学
  • 15 篇 教育学
  • 8 篇 经济学
  • 5 篇 艺术学
  • 2 篇 军事学
  • 1 篇 文学

主题

  • 8,141 篇 computer vision
  • 2,886 篇 training
  • 2,841 篇 pattern recognit...
  • 1,809 篇 computational mo...
  • 1,715 篇 visualization
  • 1,493 篇 cameras
  • 1,433 篇 three-dimensiona...
  • 1,433 篇 feature extracti...
  • 1,366 篇 shape
  • 1,360 篇 face recognition
  • 1,243 篇 image segmentati...
  • 1,135 篇 robustness
  • 1,124 篇 semantics
  • 992 篇 computer archite...
  • 985 篇 object detection
  • 982 篇 layout
  • 959 篇 benchmark testin...
  • 935 篇 codes
  • 900 篇 computer science
  • 898 篇 object recogniti...

机构

  • 174 篇 univ sci & techn...
  • 158 篇 univ chinese aca...
  • 153 篇 carnegie mellon ...
  • 145 篇 chinese univ hon...
  • 109 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 95 篇 tsinghua univers...
  • 90 篇 microsoft res as...
  • 90 篇 tsinghua univ pe...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 77 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 72 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 univ oxford oxfo...
  • 65 篇 google res mount...

作者

  • 80 篇 van gool luc
  • 70 篇 zhang lei
  • 58 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 luc van gool
  • 46 篇 xiaoou tang
  • 44 篇 tian qi
  • 43 篇 darrell trevor
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 41 篇 qi tian
  • 40 篇 li stan z.
  • 38 篇 li fei-fei
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 zhou jie
  • 35 篇 vasconcelos nuno
  • 35 篇 liu yang
  • 35 篇 torralba antonio
  • 34 篇 liu xiaoming

语言

  • 20,982 篇 英文
  • 10 篇 中文
  • 7 篇 其他
  • 5 篇 土耳其文
  • 2 篇 日文
  • 2 篇 葡萄牙文
检索条件"任意字段=2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
21008 条 记 录,以下是81-90 订阅
Exploring vision Transformers for 3D Human Motion-Language Models with Motion Patches
Exploring Vision Transformers for 3D Human Motion-Language M...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yu, Qing Tanaka, Mikihiro Fujiwara, Kent LY Corp Tokyo Japan
To build a cross-modal latent space between 3D human motion and language, acquiring large-scale and high-quality human motion data is crucial. However, unlike the abundance of image data, the scarcity of motion data h... 详细信息
来源: 评论
AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving
AIDE: An Automatic Data Engine for Object Detection in Auton...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Liang, Mingfu Sue, Jong-Chyi Schulter, Samuel Garg, Sparsh Zhao, Shiyu Wu, Ying Chandraker, Manmohan Northwestern Univ Evanston IL 60208 USA NEC Labs Amer Princeton NJ USA Rutgers State Univ New Brunswick NJ USA Univ Calif San Diego San Diego CA USA
Autonomous vehicle (AV) systems rely on robust perception models as a cornerstone of safety assurance. However, objects encountered on the road exhibit a long-tailed distribution, with rare or unseen categories posing... 详细信息
来源: 评论
Solving Masked Jigsaw Puzzles with Diffusion vision Transformers
Solving Masked Jigsaw Puzzles with Diffusion Vision Transfor...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Liu, Jinyang Teshome, Wondmgezahu Ghimire, Sandesh Sznaier, Mario Camps, Octavia Northeastern Univ Boston MA 02115 USA Qualcomm San Diego CA USA
Solving image and video jigsaw puzzles poses the challenging task of rearranging image fragments or video frames from unordered sequences to restore meaningful images and video sequences. Existing approaches often hin... 详细信息
来源: 评论
Do vision and Language Encoders Represent the World Similarly?
Do Vision and Language Encoders Represent the World Similarl...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Maniparambil, Mayug Akshulakov, Raiymbek Djilali, Yasser Abdelaziz Dahou Seddik, Mohamed El Amine Narayan, Sanath Mangalam, Karttikeya O'Connor, Noel E. Dublin City Univ ML Labs Dublin Ireland Univ Calif Berkeley Berkeley CA 94720 USA Technol Innovat Inst Dublin Ireland
Aligned text-image encoders such as CLIP have become the de-facto model for vision-language tasks. Furthermore, modality-specific encoders achieve impressive performances in their respective domains. This raises a cen... 详细信息
来源: 评论
Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object recognition
Incorporating Geo-Diverse Knowledge into Prompting for Incre...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Buettner, Kyle Malakouti, Sina Li, Xiang Lorraine Kovashka, Adriana Univ Pittsburgh Intelligent Syst Program Pittsburgh PA 15260 USA Univ Pittsburgh Dept Comp Sci Pittsburgh PA 15260 USA
Existing object recognition models have been shown to lack robustness in diverse geographical scenarios due to domain shifts in design and context. Class representations need to be adapted to more accurately reflect a... 详细信息
来源: 评论
SlowFormer: Adversarial Attack on Compute and Energy Consumption of Efficient vision Transformers
SlowFormer: Adversarial Attack on Compute and Energy Consump...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Navaneet, K. L. Koohpayegani, Soroush Abbasi Sleiman, Essam Pirsiavash, Hamed Univ Calif Davis Davis CA 95616 USA Harvard Univ Cambridge MA 02138 USA
Recently, there has been a lot of progress in reducing the computation of deep models at inference time. These methods can reduce both the computational needs and power usage of deep models. Some of these approaches a... 详细信息
来源: 评论
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Khanna, Mukul Ramrakhya, Ram Chhablani, Gunjan Yenamandra, Sriram Gervet, Theophile Chang, Matthew Kiraly, Zsolt Chaplot, Devendra Singh Batra, Dhruv Mottaghi, Roozbeh Georgia Inst Technol Atlanta GA 30332 USA Carnegie Mellon Univ Pittsburgh PA 15213 USA Univ Illinois Urbana IL USA Mistral AI Paris France Univ Washington Seattle WA USA
The Embodied AI community has made significant strides in visual navigation tasks, exploring targets from 3D coordinates, objects, language descriptions, and images. However, these navigation models often handle only ... 详细信息
来源: 评论
Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting
Attention-Propagation Network for Egocentric Heatmap to 3D P...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Kang, Taeho Lee, Youngki Seoul Natl Univ Seoul South Korea
We present EgoTAP, a heatmap-to-3D pose lifting method for highly accurate stereo egocentric 3D pose estimation. Severe self-occlusion and out-of-view limbs in egocentric camera views make accurate pose estimation a c... 详细信息
来源: 评论
MAFA: Managing False Negatives for vision-Language Pre-training
MAFA: Managing False Negatives for Vision-Language Pre-train...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Byun, Jaeseok Kim, Dohoon Moon, Taesup Seoul Natl Univ Dept ECE Seoul South Korea Seoul Natl Univ Dept ASRI INMC IPAI AIIS Seoul South Korea
We consider a critical issue of false negatives in vision-Language Pre-training (VLP), a challenge that arises from the inherent many-to-many correspondence of image-text pairs in large-scale web-crawled datasets. The...
来源: 评论
Mitigating Object Hallucinations in Large vision-Language Models through Visual Contrastive Decoding
Mitigating Object Hallucinations in Large Vision-Language Mo...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Leng, Sicong Zhang, Hang Chen, Guanzheng Li, Xin Lug, Shijian Miao, Chunyan Bing, Lidong Alibaba Grp DAMO Acad Hangzhou Peoples R China Nanyang Technol Univ Singapore Singapore Hupan Lab Hangzhou 310023 Peoples R China
Large vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned. Despite their su... 详细信息
来源: 评论