咨询与建议

限定检索结果

文献类型

  • 20,860 篇 会议
  • 105 篇 期刊文献
  • 43 册 图书

馆藏范围

  • 21,007 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,620 篇 工学
    • 11,056 篇 计算机科学与技术...
    • 2,652 篇 机械工程
    • 2,252 篇 软件工程
    • 914 篇 光学工程
    • 885 篇 电气工程
    • 529 篇 控制科学与工程
    • 477 篇 信息与通信工程
    • 216 篇 测绘科学与技术
    • 135 篇 生物工程
    • 127 篇 生物医学工程(可授...
    • 98 篇 电子科学与技术(可...
    • 92 篇 仪器科学与技术
    • 46 篇 安全科学与工程
    • 40 篇 建筑学
    • 40 篇 化学工程与技术
    • 39 篇 土木工程
    • 37 篇 交通运输工程
    • 35 篇 力学(可授工学、理...
    • 33 篇 航空宇航科学与技...
  • 3,494 篇 医学
    • 3,489 篇 临床医学
    • 32 篇 基础医学(可授医学...
  • 2,247 篇 理学
    • 1,145 篇 物理学
    • 1,081 篇 数学
    • 401 篇 生物学
    • 384 篇 统计学(可授理学、...
    • 245 篇 系统科学
    • 46 篇 化学
  • 343 篇 管理学
    • 176 篇 管理科学与工程(可...
    • 168 篇 图书情报与档案管...
    • 34 篇 工商管理
  • 31 篇 法学
  • 19 篇 农学
  • 15 篇 教育学
  • 8 篇 经济学
  • 5 篇 艺术学
  • 2 篇 军事学
  • 1 篇 文学

主题

  • 8,141 篇 computer vision
  • 2,886 篇 training
  • 2,841 篇 pattern recognit...
  • 1,809 篇 computational mo...
  • 1,715 篇 visualization
  • 1,493 篇 cameras
  • 1,433 篇 three-dimensiona...
  • 1,433 篇 feature extracti...
  • 1,366 篇 shape
  • 1,360 篇 face recognition
  • 1,243 篇 image segmentati...
  • 1,135 篇 robustness
  • 1,124 篇 semantics
  • 992 篇 computer archite...
  • 985 篇 object detection
  • 982 篇 layout
  • 959 篇 benchmark testin...
  • 935 篇 codes
  • 900 篇 computer science
  • 898 篇 object recogniti...

机构

  • 174 篇 univ sci & techn...
  • 158 篇 univ chinese aca...
  • 153 篇 carnegie mellon ...
  • 145 篇 chinese univ hon...
  • 109 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 95 篇 tsinghua univers...
  • 90 篇 microsoft res as...
  • 90 篇 tsinghua univ pe...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 77 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 72 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 univ oxford oxfo...
  • 65 篇 google res mount...

作者

  • 80 篇 van gool luc
  • 70 篇 zhang lei
  • 58 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 luc van gool
  • 46 篇 xiaoou tang
  • 44 篇 tian qi
  • 43 篇 darrell trevor
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 41 篇 qi tian
  • 40 篇 li stan z.
  • 38 篇 li fei-fei
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 zhou jie
  • 35 篇 vasconcelos nuno
  • 35 篇 liu yang
  • 35 篇 torralba antonio
  • 34 篇 liu xiaoming

语言

  • 20,982 篇 英文
  • 10 篇 中文
  • 7 篇 其他
  • 5 篇 土耳其文
  • 2 篇 日文
  • 2 篇 葡萄牙文
检索条件"任意字段=2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
21008 条 记 录,以下是381-390 订阅
排序:
HALLUSIONBENCH: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large vision-Language Models
HALLUSIONBENCH: An Advanced Diagnostic Suite for Entangled L...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Guan, Tianrui Liu, Fuxiao Wu, Xiyang Xian, Ruiqi Li, Zongxia Liu, Xiaoyu Wang, Xijun Chen, Lichang Huang, Furong Yacoob, Yaser Manocha, Dinesh Zhou, Tianyi Univ Maryland College Pk MD 20742 USA
We introduce "HALLUSIONBENCH(1)," a comprehensive benchmark designed for the evaluation of image-context reasoning. This benchmark presents significant challenges to advanced large visual-language models (LV... 详细信息
来源: 评论
SHViT: Single-Head vision Transformer with Memory Efficient Macro Design
SHViT: Single-Head Vision Transformer with Memory Efficient ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yun, Seokju Ro, Youngmin Univ Seoul Machine Intelligence Lab Seoul South Korea
Recently, efficient vision Transformers have shown great performance with low latency on resource-constrained devices. Conventionally, they use 4x4 patch embeddings and a 4-stage structure at the macro level, while ut... 详细信息
来源: 评论
Lookahead Exploration with Neural Radiance Representation for Continuous vision-Language Navigation
Lookahead Exploration with Neural Radiance Representation fo...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wang, Zihan Li, Xiangyang Yang, Jiahao Liu, Yeqi Hu, Junjie Jiang, Ming Jiang, Shuqiang Chinese Acad Sci Inst Comp Technol Beijing 100190 Peoples R China Univ Chinese Acad Sci Beijing 100049 Peoples R China Univ Wisconsin Dept Comp Sci 1210 W Dayton St Madison WI 53706 USA Univ Wisconsin Dept Biostat & Med Informat Madison WI USA Indiana Univ Dept Human Ctr Comp Indianapolis IN 46204 USA
vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments. At each navigation step, the agent selects from possible candidate... 详细信息
来源: 评论
Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units recognition
Multi-scale Dynamic and Hierarchical Relationship Modeling f...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wang, Zihan Song, Siyang Luo, Cheng Deng, Songhe Xie, Weicheng Shen, Linlin Shenzhen Univ Sch Comp Sci & Software Engn Comp Vis Inst Shenzhen Peoples R China Shenzhen Univ Shenzhen Inst Artificial Intelligence & Robot Soc Shenzhen Peoples R China Shenzhen Univ Natl Engn Lab Big Data Syst Comp Technol Shenzhen Peoples R China Univ Leicester Leicester Leics England Monash Univ Clayton Vic Australia
Human facial action units (AUs) are mutually related in a hierarchical manner, as not only they are associated with each other in both spatial and temporal domains but also AUs located in the same/close facial regions...
来源: 评论
Model Inversion Robustness: Can Transfer Learning Help?
Model Inversion Robustness: Can Transfer Learning Help?
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ho, Sy-Tuyen Hao, Koh Jun Chandrasegaran, Keshigeyan Ngoc-Bao Nguyen Cheung, Ngai-Man Singapore Univ Technol & Design SUTD Singapore Singapore Stanford Univ Stanford CA 94305 USA SUTD Singapore Singapore
Model Inversion (MI) attacks aim to reconstruct private training data by abusing access to machine learning models. Contemporary MI attacks have achieved impressive attack performance, posing serious threats to privac... 详细信息
来源: 评论
SparseOcc: Rethinking Sparse Latent Representation for vision-Based Semantic Occupancy Prediction
SparseOcc: Rethinking Sparse Latent Representation for Visio...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Tang, Pin Wang, Zhongdao Wang, Gum-ling Zheng, Jilai Ren, Xiangxuan Feng, Bailan Ma, Chao Shanghai Jiao Tong Univ AI Inst MoE Key Lab Artificial Intelligence Shanghai Peoples R China Huawei Noahs Ark Lab Montreal PQ Canada
vision-based perception for autonomous driving requires an explicit modeling of a 3D space, where 2D latent representations are mapped and subsequent 3D operators are applied. However, operating on dense latent spaces... 详细信息
来源: 评论
Explaining CLIP's performance disparities on data from blind/low vision users
Explaining CLIP's performance disparities on data from blind...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Massiceti, Daniela Longden, Camilla Slowik, Agnieszka Wills, Samuel Grayson, Martin Morrison, Cecily Microsoft Res Redmond WA 98052 USA World Bank 1818 H St NW Washington DC 20433 USA
Large multi-modal models (LMMs) hold the potential to usher in a new era of automated visual assistance for people who are blind or low vision (BLV). Yet, these models have not been systematically evaluated on data ca... 详细信息
来源: 评论
Enhanced Motion-Text Alignment for Image-to-Video Transfer Learning
Enhanced Motion-Text Alignment for Image-to-Video Transfer L...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Wei Wan, Chaoqun Liu, Tongliang Tian, Xinmei Shen, Xu Ye, Jieping Univ Sci & Technol China Hefei Peoples R China Alibaba Cloud Hangzhou Peoples R China Univ Sydney Sydney Australia Hefei Comprehens Natl Sci Ctr Inst Artificial Intelligence Hefei Peoples R China
Extending large image-text pre-trained models (e.g., CLIP) for video understanding has made significant advancements. To enable the capability of CLIP to perceive dynamic information in videos, existing works are dedi... 详细信息
来源: 评论
Knowledge Combination to Learn Rotated Detection Without Rotated Annotation
Knowledge Combination to Learn Rotated Detection Without Rot...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhu, Tianyu Ferenczi, Bryce Purkait, Pulak Drummond, Tom Rezatofighi, Hamid van den Hengel, Anton Amazon IML Seattle WA 98109 USA Monash Univ Clayton Vic Australia
Rotated bounding boxes drastically reduce output ambiguity of elongated objects, making it superior to axis-aligned bounding boxes. Despite the effectiveness, rotated detectors are not widely employed. Annotating rota... 详细信息
来源: 评论
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
OpenBias: Open-set Bias Detection in Text-to-Image Generativ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: D'Inca, Moreno Peruzzo, Elia Mancini, Massimiliano Xu, Dejia Goel, Vidit Xu, Xinggian Wang, Zhangyang Shi, Humphrey Sebe, Nicu Univ Trento Trento Italy UT Austin Austin TX USA SHI Labs Georgia Tech & UIUC Atlanta GA USA Picsart AI Res PAIR Miami FL USA
Text-to-image generative models are becoming increasingly popular and accessible to the general public. As these models see large-scale deployments, it is necessary to deeply investigate their safety and fairness to n... 详细信息
来源: 评论