Refine search results

Document type

  • 17,638 conference papers
  • 255 books
  • 189 journal articles
  • 1 thesis

Collection scope

  • 18,082 electronic documents
  • 2 print holdings

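Subject classification

  • 10,443 papers: Engineering
    • 6,148 papers: Computer Science and Technology...
    • 3,929 papers: Electrical Engineering
    • 3,741 papers: Control Science and Engineering
    • 2,823 papers: Software Engineering
    • 1,836 papers: Information and Communication Engineering
    • 1,551 papers: Optical Engineering
    • 1,405 papers: Mechanical Engineering
    • 997 papers: Instrument Science and Technology
    • 549 papers: Biomedical Engineering (degree may be conferred in...
    • 498 papers: Electronic Science and Technology (degree may be...
    • 433 papers: Bioengineering
    • 232 papers: Materials Science and Engineering (degree may be...
    • 195 papers: Transportation Engineering
    • 163 papers: Safety Science and Engineering
    • 153 papers: Chemical Engineering and Technology
    • 137 papers: Mechanics (degree may be conferred in Engineering, Science...
    • 114 papers: Architecture
    • 109 papers: Civil Engineering
  • 3,398 papers: Science
    • 2,546 papers: Physics
    • 805 papers: Mathematics
    • 486 papers: Biology
    • 295 papers: Systems Science
    • 209 papers: Statistics (degree may be conferred in Science, ...
    • 134 papers: Chemistry
  • 1,654 papers: Medicine
    • 1,577 papers: Clinical Medicine
    • 185 papers: Basic Medicine (degree may be conferred in Medicine...
  • 759 papers: Management
    • 580 papers: Management Science and Engineering (degree may be...
    • 190 papers: Library, Information and Archives Manage...
    • 120 papers: Business Administration
  • 107 papers: Agriculture
    • 104 papers: Crop Science
  • 78 papers: Law
  • 43 papers: Economics
  • 42 papers: Education
  • 39 papers: Art Studies
  • 37 papers: Military Science
  • 18 papers: Literature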

Topic

  • 2,731 papers: computer vision
  • 1,685 papers: cameras
  • 1,485 papers: signal processin...
  • 1,441 papers: robot vision sys...
  • 1,352 papers: image processing
  • 1,169 papers: robot sensing sy...
  • 907 papers: signal processin...
  • 875 papers: mobile robots
  • 835 papers: feature extracti...
  • 767 papers: machine vision
  • 549 papers: image segmentati...
  • 504 papers: object detection
  • 439 papers: visualization
  • 423 papers: deep learning
  • 408 papers: robustness
  • 391 papers: estimation
  • 367 papers: stereo vision
  • 356 papers: navigation
  • 343 papers: training
  • 318 papers: robot kinematics

Institution

  • 83 papers: centre for visio...
  • 63 papers: xi an jiao tong ...
  • 54 papers: centre for visio...
  • 37 papers: school of electr...
  • 37 papers: centre for visio...
  • 29 papers: carnegie mellon ...
  • 28 papers: chinese acad sci...
  • 27 papers: shanghai jiao to...
  • 27 papers: center for machi...
  • 27 papers: university of ch...
  • 23 papers: centre for visio...
  • 23 papers: harbin inst tech...
  • 21 papers: univ chinese aca...
  • 21 papers: nanyang technol ...
  • 17 papers: centre for visio...
  • 16 papers: university of sc...
  • 16 papers: tsinghua univers...
  • 13 papers: chinese acad sci...
  • 13 papers: univ sci & techn...
  • 13 papers: chinese univ hon...

Author

  • 52 papers: j. kittler
  • 40 papers: josef kittler
  • 28 papers: nakadai kazuhiro
  • 19 papers: anil fernando
  • 18 papers: wang wei
  • 15 papers: chen chen
  • 14 papers: yang yang
  • 14 papers: nascimento jacin...
  • 13 papers: jing zhang
  • 13 papers: liu yang
  • 13 papers: sun fuchun
  • 12 papers: sun lining
  • 12 papers: hansung kim
  • 11 papers: zhang lei
  • 11 papers: bartolozzi chiar...
  • 11 papers: hong liu
  • 10 papers: wang lei
  • 10 papers: li yang
  • 10 papers: aguiar pedro m. ...
  • 10 papers: qiuqiang kong

Language

  • 17,904 papers: English
  • 87 papers: Chinese
  • 78 papers: Other
  • 12 papers: Turkish
  • 3 papers: Russian
  • 2 papers: Spanish

Search query: "Any field=International Conference on Robot Vision and Signal Processing"
18,083 records; showing 51-60

18th International Workshop on Design and Architecture for Signal and Image Processing, DASIP 2025
The proceedings contain 10 papers. The special focus in this conference is on Design and Architectures for Signal and Image Processing. The topics include: LiFT: Lightweight, FPGA-Tailored 3D Object Detection Based on...

Rethinking Mamba in Speech Processing by Self-Supervised Models
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Zhang, Xiangyu; Ma, Jianbo; Shahin, Mostafa; Ahmed, Beena; Epps, Julien
Affiliations: The University of New South Wales, Australia; Dolby Laboratories, United States
The Mamba-based model has demonstrated outstanding performance across tasks in computer vision, natural language processing, and speech processing. However, in the realm of speech processing, the Mamba-based model'...

Spatiotemporal-Aware Visual Captioning using Vision-Language Pre-Training Model
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Wu, Shuai; Yang, Weidong; Wu, Shuyan
Affiliations: School of Computer Science, Fudan University, Shanghai, China; Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
Current visual captioning technologies typically transform 3D/2D visual information into one-dimensional sequential data and employ language models to generate corresponding descriptions. This approach, however, compr...

MixSense: Mixture of Vision Sense
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Lin, Jian; Wang, Zhuoran; Qiu, Qibo; Chen, Jianzhong; Ge, Zixian; Jin, Weizhong; Yan, Yuchao; Yu, Li
Affiliations: Hangzhou, China; School of Computing and Information, University of Pittsburgh, Pittsburgh, United States
It is a new trend to fine-tune Large Multimodal Models (LMMs) to adapt to specific visual tasks through task-related conversation data. This approach provides a new paradigm for solving various vision-language tasks, ...

PeT-KeyStAtion: Parameter-efficient Transformer with Keypoint-guided Spatial-temporal Aggregation for Video-based Person Re-identification
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Ma, Xingan; Yi, Jinhui; Gall, Juergen
Affiliations: University of Bonn, Bonn, Germany
Video-based Person Re-identification (ReID) is crucial in visual surveillance, focusing on matching video snippets of individuals across multiple non-overlapping cameras. Existing methods either conduct ReID at the im...

VisTa: Visual-contextual and Text-augmented Zero-shot Object-level OOD Detection
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Zhang, Bin; Qu, Xiaoyang; Li, Guokuan; Wan, Jiguang; Wang, Jianzong
Affiliations: Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China; Co. Ltd., Shenzhen, China
As object detectors are increasingly deployed as black-box cloud services or pre-trained models with restricted access to the original training data, the challenge of zero-shot object-level out-of-distribution (OOD) d...

Text-Guided Few-Shot Semantic Segmentation with Training-Free Multimodal Feature Matching
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Buthmann, Guillaume; Sakai, Tomoya; Qiu, Haoxiang; Katsuki, Takayuki; Kimura, Daiki
Affiliations: IBM Research, Tokyo, Japan; Mines Paris - PSL University, Paris, France
This paper addresses few-shot semantic segmentation (FSS) guided by text, where we classify unseen novel classes using image and text references as in-context examples, without the need for training. We enhance the qu...

Efficient Localized Perception for Resource-Constrained Vision Systems
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Subramanyam, A.V.; Singal, Niyati; Verma, Vinay K.
Affiliations: Department of ECE, IIIT Delhi, India; Department of CSE, IIIT Delhi, India
Despite the rapid advancement in the field of image recognition, the processing of high-resolution imagery remains a computational challenge. However, this processing is pivotal for extracting detailed object insights...

Dynamic SpikFormer: Low-Latency & Energy-Efficient Spiking Neural Networks with Dynamic Time Steps for Vision Transformers
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Datta, Gourav; Liu, Zeyu; Li, Anni; Beerel, Peter A.
Affiliations: Dept. of Electrical Computer & Systems Engineering, Case Western Reserve University, Cleveland, United States; Ming Hsieh Dept. of Electrical and Computer Engineering, University of Southern California, Los Angeles, United States
Spiking Neural Networks (SNNs) have emerged as a popular spatio-temporal computing paradigm for complex vision tasks. Recently proposed SNN training algorithms have significantly reduced the number of time steps (down...

Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Lee, Donghoon; Luu, Tung M.; Lee, Younghwan; Yoo, Chang D.
Affiliations: Robotics Program, KAIST, Daejeon, Korea, Republic of; Electrical Engineering, KAIST, Daejeon, Korea, Republic of
Recent research highlights the potential of multimodal foundation models in tackling complex decision-making challenges. However, their large parameters make real-world deployment resource-intensive and often impracti...