Search query: "Any field = IEEE Visual Communications and Image Processing Conference"
25,553 records; showing results 171-180
Object-Centric Discriminative Learning for Text-Based Person Retrieval
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Haiwen Li, Delong Liu, Fei Su, Zhicheng Zhao
Affiliations: Beijing University of Posts and Telecommunications Beijing Key Laboratory of Network System and Network Culture China
Text-based person retrieval (TBPR) is a vision-language task that aims to find specific pedestrians in a large image gallery using the textual description. However, due to the heterogeneity between modalities and the ...

VisTa: Visual-contextual and Text-augmented Zero-shot Object-level OOD Detection
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Bin Zhang, Xiaoyang Qu, Guokuan Li, Jiguang Wan, Jianzong Wang
Affiliations: Wuhan National Laboratory for Optoelectronics Huazhong University of Science and Technology Wuhan China Ping An Technology (Shenzhen) Co. Ltd Shenzhen China
As object detectors are increasingly deployed as black-box cloud services or pre-trained models with restricted access to the original training data, the challenge of zero-shot object-level out-of-distribution (OOD) d...

LV-ReID: Large Language-Vision Alignment Model for Text-based Person Re-identification
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Yinghui Xia, Chao Wang, Jinsong Yang
Affiliations: HKUST(GZ) Wuhan University AutoAgents.ai
Person Re-Identification (ReID) is a critical task in computer vision that involves identifying individuals across different cameras or video frames. It’s challenging due to variations in appearance, lighting, viewpo...

Adapting Without Seeing: Text-Aided Domain Adaptation for Adapting CLIP-like Models to Novel Domains
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Louis Hémadou, Héléna Vorobieva, Ewa Kijak, Frédéric Jurie
Affiliations: Digital Sciences & Technologies Department Safran Tech Université de Rennes IRISA INRIA CNRS Université de Caen Normandie ENSICAEN CNRS
This paper addresses the challenge of adapting large vision models, such as CLIP, to domain shifts in image classification tasks. While these models, pre-trained on vast datasets like LAION 2B, offer powerful visual r...

Minimizing Disparities between Real and Pseudo Queries for Unsupervised Visual Grounding
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Hui Jiang, Changkai Ji, Jilan Xu, Yanhao Zhu, Yuejie Zhang, Rui Feng, Tao Zhang, Shang Gao
Affiliations: School of Computer Science Shanghai Key Laboratory of Intelligent Information Processing Fudan University Shanghai China School of Information Management and Engineering Shanghai Key Laboratory of Financial Information Technology Shanghai University of Finance and Economics Shanghai China School of Information Technology Deakin University Victoria Australia
Visual grounding involves the identification and localization of image regions given textual descriptions. To reduce the manual labeling effort on region-text pairs, unsupervised visual grounding aims to generate pseu...

Bridging the Machine-Human Gap in Blurred-Image Classification via Entropy Maximisation
International Image Processing, Applications and Systems Conference (IPAS)
Authors: Emilio Sansano-Sansano, Marina Martínez-García, Javier Portilla
Affiliations: INIT Universitat Jaume I Castellón de la Plana Spain IMAC Universitat Jaume I Castellón de la Plana Spain Instituto de Óptica CSIC Madrid Spain
Recent studies point to an accuracy gap between humans and Artificial Neural Network (ANN) models when classifying blurred images, with humans outperforming ANNs. To bridge this gap, we introduce a spectral channel-ba...

A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Xavier Juanola, Gloria Haro, Magdalena Fuentes
Affiliations: Universitat Pompeu Fabra Barcelona Spain MARL-IDM New York University New York USA
The task of Visual Sound Source Localization (VSSL) involves identifying the location of sound sources in visual scenes, integrating audio-visual data for enhanced scene understanding. Despite advancements in state-of...

CoF: Coarse to Fine-Grained Image Understanding for Multi-modal Large Language Models
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Yeyuan Wang, Dehong Gao, Bin Li, Rujiao Long, Lei Yi, Xiaoyan Cai, Libin Yang, Jinxia Zhang, Shanqing Yu, Qi Xuan
Affiliations: School of Automation Northwestern Polytechnical University Xi’an China School of Cybersecurity Northwestern Polytechnical University Xi’an China Alibaba Group Hangzhou China The Key Laboratory of Measurement and Control of CSE Ministry of Education School of Automation Southeast University Nanjing China Advanced Ocean Institute of Southeast University Nantong China Zhejiang University of Technology Hangzhou China Binjiang Institute of Artificial Intelligence Hangzhou China
The impressive performance of Large Language Model (LLM) has prompted researchers to develop Multi-modal LLM (MLLM), which has shown great potential for various multi-modal tasks. However, current MLLM often struggles...

Enhanced Satellite Image Fusion Using Deep Learning and Feature Extraction Techniques: A Survey
1st International Conference on Intelligent Systems in Computing and Communications, ISCComm 2023
Authors: Nallagachu, Swathi; Sandanalakshmi, R.
Affiliations: Department of Electronics and Communication Engineering Puducherry Technological University Puducherry India
This paper presents an overview and analysis of numerous research projects on image fusion methods, with a particular emphasis on deep learning-based methods. The research analyses the inadequacies of current fusion m...

VitaCap: A Vision Transformer-Based Framework for Image Captioning
29th International Computer Conference, Computer Society of Iran, CSICC 2025
Authors: Nia, Amirhossein Hossein; Feizi, Fatemehzahra; Ahmadi, Ali
Affiliations: School of Computer Engineering K. N. Toosi University of Technology Tehran Iran School of Computer Engineering Iran University of Science and Technology Tehran Iran
Automatic image captioning, which involves generating textual descriptions from visual content, is a challenging and multidisciplinary task combining computer vision and natural language processing. This paper introdu...