Search query: "Any field = IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"
12,857 records; showing results 301-310
Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wang, Xingtao; Wei, Hongliang; Fan, Xiaopeng; Zhao, Debin
Affiliations: Harbin Inst Technol Harbin Peoples R China
Mesh denoising (MD) is a critical task in geometry processing, as meshes from scanning or AIGC techniques are susceptible to noise contamination. The challenge of MD lies in the diverse nature of mesh facets in terms ...
Distilling Vision-Language Models on Millions of Videos
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhao, Yue; Zhao, Long; Zhou, Xingyi; Wu, Jialin; Chu, Chun-Te; Mia, Hui; Schroff, Florian; Adam, Hartwig; Liu, Ting; Gong, Boqing; Krahenbuhl, Philipp; Yuan, Liangzhe
Affiliations: Google Res Mountain View CA 94043 USA; Univ Texas Austin Austin TX 78712 USA
The recent advance in vision-language models is largely attributed to the abundance of image-text data. We aim to replicate this success for video-language models, but there simply is not enough human-curated video-t...
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhang, Yichi; Qiao, Zhiqiao; Gao, Xiaofeng; Shakiah, Suhaila; Gao, Qiaozi; Chai, Joyce
Affiliations: Univ Michigan Ann Arbor MI 48109 USA; Amazon AGI Seattle WA USA
Most multimodal large language models (MLLMs) learn language-to-object grounding through causal language modeling where grounded objects are captured by bounding boxes as sequences of location tokens. This paradigm la...
Proceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021
The proceedings contain 1658 papers. The topics discussed include: single-stage instance shadow detection with bidirectional relation learning; learning Delaunay surface elements for mesh reconstruction; fusing the old ...
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Boyuan; Xu, Zhuo; Kirman, Sean; Ichter, Brian; Sadigh, Dorsa; Guibas, Leonidas; Xia, Fei
Affiliations: Google DeepMind London England; Google Res Mountain View CA USA; MIT 77 Massachusetts Ave Cambridge MA 02139 USA
Understanding and reasoning about spatial relationships is a fundamental capability for Visual Question Answering (VQA) and robotics. While Vision Language Models (VLM) have demonstrated remarkable performance in cert...
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Peng, Wujian; Xi, Sicheng; You, Zuyao; Lan, Shiyi; Wu, Zuxuan
Affiliations: Fudan Univ Sch CS Shanghai Key Lab Intell Info Proc Shanghai Peoples R China; Shanghai Collaborat Innovat Ctr Intelligent Visua Shanghai Peoples R China; NVIDIA Shenzhen Guangdong Peoples R China
Vision language models (VLM) have demonstrated remarkable performance across various downstream tasks. However, understanding fine-grained visual-linguistic concepts, such as attributes and inter-object relationships,...
BigGait: Learning Gait Representation You Want by Large Vision Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Ye, Dingqiang; Fan, Chao; Ma, Jingzhe; Liu, Xiaoming; Yu, Shiqi
Affiliations: Southern Univ Sci & Technol Res Inst Trustworthy Autonomous Syst Shenzhen Peoples R China; Southern Univ Sci & Technol Dept Comp Sci & Engn Shenzhen Peoples R China; Michigan State Univ E Lansing MI USA
Gait recognition stands as one of the most pivotal remote identification technologies and progressively expands across research and industry communities. However, existing gait recognition methods heavily rely on task...
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cai, Mu; Liu, Haotian; Mustikovela, Siva Karthik; Meyer, Gregory P.; Chai, Yuning; Park, Dennis; Lee, Yong Jae
Affiliations: Univ Wisconsin Madison WI 53706 USA; Cruise LLC San Francisco CA USA
While existing large vision-language multimodal models focus on whole image understanding, there is a prominent gap in achieving region-specific comprehension. Current approaches that use textual coordinates or spatia...
Language-aware Visual Semantic Distillation for Video Question Answering
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zou, Bo; Yang, Chao; Qiao, Yu; Quan, Chengbin; Zhao, Youjian
Affiliations: Tsinghua Univ Beijing Peoples R China; Shanghai AI Lab Shanghai Peoples R China; Zhongguancun Lab Beijing Peoples R China
Significant progress in video question answering (VideoQA) has been made thanks to thriving large image-language pretraining frameworks. Although image-language models can efficiently represent both video and languag...
SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Song, Chull Hwan; Hwang, Taebaek; Yoon, Jooyoung; Choi, Shunghyun; Gu, Yeong Hyeon
Affiliations: Dealicious Inc Seoul South Korea; Sejong Univ Seoul South Korea
Vision-language models (VLMs) have made significant strides in cross-modal understanding through large-scale paired datasets. However, in the fashion domain, datasets often exhibit a disparity between the information conv...