咨询与建议

限定检索结果

文献类型

  • 25,252 篇 会议
  • 277 篇 期刊文献
  • 21 册 图书
  • 3 篇 学位论文

馆藏范围

  • 25,553 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 15,800 篇 工学
    • 9,866 篇 计算机科学与技术...
    • 6,079 篇 电气工程
    • 5,771 篇 信息与通信工程
    • 5,615 篇 软件工程
    • 2,016 篇 光学工程
    • 1,453 篇 控制科学与工程
    • 1,280 篇 机械工程
    • 1,155 篇 电子科学与技术(可...
    • 873 篇 生物医学工程(可授...
    • 833 篇 生物工程
    • 793 篇 仪器科学与技术
    • 265 篇 网络空间安全
    • 253 篇 化学工程与技术
    • 245 篇 安全科学与工程
    • 239 篇 交通运输工程
    • 183 篇 材料科学与工程(可...
    • 162 篇 土木工程
    • 159 篇 建筑学
  • 5,716 篇 理学
    • 3,480 篇 物理学
    • 2,207 篇 数学
    • 886 篇 生物学
    • 564 篇 统计学(可授理学、...
    • 420 篇 系统科学
    • 310 篇 化学
  • 3,023 篇 医学
    • 2,897 篇 临床医学
    • 312 篇 基础医学(可授医学...
    • 229 篇 药学(可授医学、理...
  • 1,390 篇 管理学
    • 850 篇 管理科学与工程(可...
    • 612 篇 图书情报与档案管...
    • 169 篇 工商管理
  • 181 篇 法学
  • 133 篇 农学
  • 55 篇 教育学
  • 52 篇 文学
  • 51 篇 经济学
  • 51 篇 军事学
  • 22 篇 艺术学

主题

  • 3,122 篇 image processing
  • 2,084 篇 image coding
  • 2,020 篇 visualization
  • 1,752 篇 image segmentati...
  • 1,486 篇 feature extracti...
  • 1,081 篇 image reconstruc...
  • 907 篇 cameras
  • 885 篇 signal processin...
  • 833 篇 image color anal...
  • 756 篇 humans
  • 712 篇 image edge detec...
  • 688 篇 image enhancemen...
  • 667 篇 computer vision
  • 649 篇 training
  • 582 篇 image analysis
  • 567 篇 deep learning
  • 536 篇 image quality
  • 481 篇 conferences
  • 472 篇 object detection
  • 472 篇 robustness

机构

  • 51 篇 school of electr...
  • 50 篇 shanghai jiao to...
  • 39 篇 ieee
  • 38 篇 university of sc...
  • 36 篇 shanghai jiao to...
  • 36 篇 school of comput...
  • 34 篇 shanghai jiao to...
  • 33 篇 university of ch...
  • 32 篇 microsoft resear...
  • 26 篇 national institu...
  • 25 篇 department of el...
  • 24 篇 hendisli&#x011f
  • 23 篇 institute for in...
  • 23 篇 institute of ima...
  • 23 篇 istanbul teknik ...
  • 23 篇 institute of dig...
  • 22 篇 peking univ inst...
  • 21 篇 institute of inf...
  • 21 篇 univ chinese aca...
  • 21 篇 univ sci & techn...

作者

  • 62 篇 guangtao zhai
  • 46 篇 song li
  • 45 篇 zhai guangtao
  • 32 篇 jie yang
  • 27 篇 li li
  • 25 篇 m. vetterli
  • 25 篇 bovik alan c.
  • 25 篇 li sumei
  • 25 篇 li song
  • 25 篇 sarp ertürk
  • 24 篇 jing zhang
  • 24 篇 b. macq
  • 23 篇 zhang lei
  • 23 篇 li zhuo
  • 23 篇 d.r. bull
  • 22 篇 jürgen seiler
  • 21 篇 shi guangming
  • 20 篇 liu yang
  • 20 篇 zhang wenjun
  • 18 篇 mohamed-chaker l...

语言

  • 24,740 篇 英文
  • 489 篇 土耳其文
  • 209 篇 其他
  • 132 篇 中文
  • 2 篇 西班牙文
  • 2 篇 葡萄牙文
检索条件"任意字段=IEEE Visual Communications and Image Processing Conference"
25553 条 记 录,以下是221-230 订阅
排序:
Non-Autoregressive Multimodal Machine Translation
Non-Autoregressive Multimodal Machine Translation
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Guojing Liu Xiangqian Ding Huili Gong Xiangyu Qu Zhenyu Yang Kai Yan Faculty of Information Science and Engineering Ocean University of China Qingdao China School of Cyber Science and Technology Shandong University Qingdao China Shandong Academy of Sciences Qilu University of Technology Jinan China Laiwu Vocational and Technical College Jinan China
Performing better text translation by integrating auxiliary inputs from visual information has gained widespread attention in recent years. While existing methods outperform the text-only translation models, the step-... 详细信息
来源: 评论
Explore the Hallucination on Low-level Perception for MLLMs
Explore the Hallucination on Low-level Perception for MLLMs
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Yinan Sun Zicheng Zhang Haoning Wu Xiaohong Liu Weisi Lin Guangtao Zhai Xiongkuo Min Shanghai Jiao Tong University S-Lab Nanyang Technological University Nanyang Technological University Shanghai Jiao Tong University Shanghai China
The rapid development of Multi-modality Large Language Models (MLLMs) has significantly influenced various aspects of industry and daily life, showcasing impressive capabilities in visual perception and understanding.... 详细信息
来源: 评论
SentiFormer: Metadata Enhanced Transformer for image Sentiment Analysis
SentiFormer: Metadata Enhanced Transformer for Image Sentime...
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Bin Feng Shulan Ruan Mingzheng Yang Dongxuan Han Huijie Liu Kai Zhang Qi Liu University of Science and Technology of China State Key Laboratory of Cognitive Intelligence Shenzhen International Graduate School Tsinghua University
As more and more internet users post images online to express their daily emotions, image sentiment analysis has attracted increasing attention. Recently, researchers generally tend to design different neural networks... 详细信息
来源: 评论
Structural-Aware Disentangled Learning with CLIP for Hyperbolic Zero-Shot Sketch-Based image Retrieval*
Structural-Aware Disentangled Learning with CLIP for Hyperbo...
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Qing Zhang Jing Zhang Feilong Bao Xiangdong Su Guanglai Gao School of Computer Science Inner Mongolia University Hohhot China Inner Mongolia Key Laboratory of Multilingual Artificial Intelligence Technology Hohhot China National and Local Joint Engineering Research Center of Mongolian Information Processing Technology Hohhot China
The zero-shot sketch-based image retrieval task faces two key challenges: domain gap and knowledge transfer. Our innovation is recognizing that directly aligning cross-domain features weakens the discriminative abilit... 详细信息
来源: 评论
OSLO-IC: On-the-Sphere Learned Omnidirectional image Compression with Attention Modules and Spatial Context
OSLO-IC: On-the-Sphere Learned Omnidirectional Image Compres...
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Paul Wawerek-López Navid Mahmoudian Bidgoli Pascal Frossard André Kaup Thomas Maugey Multimedia Communications and Signal Processing Friedrich-Alexander-Universität Erlangen-Nürnberg Germany Trimble Inc. École Polytechnique Fédérale de Lausanne (EPFL) Lausanne Switzerland Institut National de Recherche en Informatique et en Automatique (INRIA) Rennes France
Developing effective 360-degree (spherical) image compression techniques is crucial for technologies like virtual reality and automated driving. This paper advances the state-of-the-art in on-the-sphere learning (OSLO... 详细信息
来源: 评论
U-SAM: Upgrade Segment Anything Model With Semantic-Aware and Memory-Efficient
U-SAM: Upgrade Segment Anything Model With Semantic-Aware an...
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Xiaofeng Jin Jie Hu Jianghang Lin Shengchuan Zhang Liujuan Cao Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University P.R. China Learning and Vision Lab National University of Singapore Singapore
Segment Anything Model (SAM) has achieved remarkable success in the field of class-agnostic image segmentation by utilizing points or boxes as prompts. However, we identify two significant limitations when compared to... 详细信息
来源: 评论
PointActionCLIP: Preventing Transfer Degradation in Point Cloud Action Recognition with a Triple-Path CLIP
PointActionCLIP: Preventing Transfer Degradation in Point Cl...
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Wei Tao Shenglin He Xiaoyang Qu Jiguang Wan Jianzong Wang Huazhong University of Science and Technology Wuhan China Ping An Technology (Shenzhen) Co. Ltd Shenzhen China
Directly applying CLIP to point cloud action recognition can cause severe accuracy collapse. In this paper, we propose PointActionCLIP, which successfully prevents this transfer degradation with a triplepath CLIP, inc... 详细信息
来源: 评论
A Computer Vision and Vibrohaptic Glove-Based Piano Learning System for the visually Impaired
A Computer Vision and Vibrohaptic Glove-Based Piano Learning...
收藏 引用
International conference on Advanced Communication Technology (ICACT)
作者: Ian Juha Cho Jin Park Hosung Bae Hankuk Academy of Foreign Studies Yongin South Korea
The visually impaired are unable to enjoy leisure activities as much as ordinary people due to various limitations. To expand the scope of leisure activities for the visually impaired, we have developed a vibration gl... 详细信息
来源: 评论
Enhancing Vision: Harmonizing Frequency for Imaging Quality and Perception Accuracy
Enhancing Vision: Harmonizing Frequency for Imaging Quality ...
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Hongyang Chen Kaisheng Ma Xi’an Jiaotong University Tsinghua University
In low-level vision tasks, achieving harmony between visual quality and recognition accuracy is often challenging, as the two do not always align. Many existing approaches focus on optimizing downstream tasks by linki... 详细信息
来源: 评论
From Pixels to Voice: A Simple and Efficient End-to-End Spoken image Description Approach via Vision Codec Language Models
From Pixels to Voice: A Simple and Efficient End-to-End Spok...
收藏 引用
International conference on Acoustics, Speech, and Signal processing (ICASSP)
作者: Chung Tran Sakriani Sakti Graduate School of Science and Technology Nara Institute of Science and Technology Ikoma Japan
Neural audio codecs provide a powerful tool for compressing audio signals into discrete codec representations. This compact discrete representation has made it possible to successfully apply a natural language process... 详细信息
来源: 评论