咨询与建议

限定检索结果

文献类型

  • 916 篇 期刊文献
  • 866 篇 会议
  • 2 册 图书

馆藏范围

  • 1,784 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 1,223 篇 工学
    • 911 篇 计算机科学与技术...
    • 788 篇 软件工程
    • 291 篇 信息与通信工程
    • 199 篇 生物工程
    • 137 篇 光学工程
    • 127 篇 控制科学与工程
    • 101 篇 生物医学工程(可授...
    • 72 篇 电气工程
    • 72 篇 化学工程与技术
    • 67 篇 电子科学与技术(可...
    • 66 篇 机械工程
    • 32 篇 仪器科学与技术
    • 29 篇 建筑学
    • 28 篇 土木工程
    • 25 篇 交通运输工程
    • 25 篇 安全科学与工程
  • 701 篇 理学
    • 356 篇 数学
    • 218 篇 生物学
    • 192 篇 物理学
    • 96 篇 统计学(可授理学、...
    • 71 篇 化学
    • 33 篇 系统科学
  • 370 篇 管理学
    • 238 篇 图书情报与档案管...
    • 146 篇 管理科学与工程(可...
    • 54 篇 工商管理
  • 79 篇 医学
    • 64 篇 临床医学
    • 52 篇 基础医学(可授医学...
    • 30 篇 公共卫生与预防医...
    • 29 篇 药学(可授医学、理...
  • 48 篇 法学
    • 47 篇 社会学
  • 17 篇 经济学
  • 13 篇 农学
  • 10 篇 教育学
  • 6 篇 文学
  • 3 篇 军事学
  • 3 篇 艺术学

主题

  • 97 篇 semantics
  • 65 篇 feature extracti...
  • 52 篇 training
  • 36 篇 convolution
  • 34 篇 deep learning
  • 34 篇 machine learning
  • 32 篇 object detection
  • 29 篇 deep neural netw...
  • 28 篇 task analysis
  • 28 篇 visualization
  • 27 篇 computational li...
  • 27 篇 computer vision
  • 26 篇 image segmentati...
  • 24 篇 signal processin...
  • 22 篇 computational mo...
  • 20 篇 contrastive lear...
  • 20 篇 predictive model...
  • 20 篇 benchmarking
  • 19 篇 data mining
  • 19 篇 accuracy

机构

  • 236 篇 school of comput...
  • 210 篇 shanghai key lab...
  • 154 篇 shanghai key lab...
  • 148 篇 shanghai key lab...
  • 101 篇 school of comput...
  • 72 篇 shanghai key lab...
  • 46 篇 academy for engi...
  • 44 篇 school of comput...
  • 44 篇 institute of sci...
  • 37 篇 shanghai key lab...
  • 33 篇 institute of mod...
  • 31 篇 shanghai key lab...
  • 31 篇 school of data s...
  • 29 篇 school of inform...
  • 27 篇 shanghai center ...
  • 24 篇 fudan university
  • 23 篇 school of comput...
  • 22 篇 university of ch...
  • 21 篇 shanghai enginee...
  • 21 篇 school of comput...

作者

  • 124 篇 huang xuanjing
  • 114 篇 qiu xipeng
  • 78 篇 xue xiangyang
  • 62 篇 jiang yu-gang
  • 60 篇 zhang qi
  • 55 篇 zhang yuejie
  • 54 篇 zhang junping
  • 53 篇 zhang wenqiang
  • 52 篇 feng rui
  • 48 篇 fu yanwei
  • 43 篇 zhang zhongzhi
  • 41 篇 gui tao
  • 41 篇 li wei
  • 40 篇 zhou shuigeng
  • 38 篇 shuigeng zhou
  • 35 篇 shan hongming
  • 34 篇 li bin
  • 32 篇 tao zhang
  • 32 篇 junping zhang
  • 32 篇 rui feng

语言

  • 1,679 篇 英文
  • 75 篇 其他
  • 32 篇 中文
检索条件"机构=Shanghai Key Lab of Intelligent Information Processing and School of Computer Science"
1784 条 记 录,以下是131-140 订阅
排序:
Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization
arXiv
收藏 引用
arXiv 2024年
作者: Zhao, Minyi Wang, Jie Li, Zhaoyang Zhang, Jiyuan Sun, Zhenbang Zhou, Shuigeng Shanghai Key Lab of Intelligent Information Processing School of Computer Science Fudan University Shanghai200438 China ByteDance China
Recent studies have shown that Vision Language Large Models (VLLMs) may output content not relevant to the input images. This problem, called the hallucination phenomenon, undoubtedly degrades VLLM performance. Theref... 详细信息
来源: 评论
Denoising Diffusion Path: Attribution Noise Reduction with An Auxiliary Diffusion Model  38
Denoising Diffusion Path: Attribution Noise Reduction with A...
收藏 引用
38th Conference on Neural information processing Systems, NeurIPS 2024
作者: Lei, Yiming Li, Zilong Zhang, Junping Shan, Hongming Shanghai Key Laboratory of Intelligent Information Processing School of Computer Science Fudan University China Institute of Science and Technology for Brain-Inspired Intelligence MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence MOE Frontiers Center for Brain Science Fudan University China
The explainability of deep neural networks (DNNs) is critical for trust and reliability in AI systems. Path-based attribution methods, such as integrated gradients (IG), aim to explain predictions by accumulating grad...
来源: 评论
TOWARDS GENERATIVE ABSTRACT REASONING: COMPLETING RAVEN’S PROGRESSIVE MATRIX VIA RULE ABSTRACTION AND SELECTION
arXiv
收藏 引用
arXiv 2024年
作者: Shi, Fan Li, Bin Xue, Xiangyang Shanghai Key Laboratory of Intelligent Information Processing School of Computer Science Fudan University China
Endowing machines with abstract reasoning ability has been a long-term research topic in artificial intelligence. Raven’s Progressive Matrix (RPM) is widely used to probe abstract visual reasoning in machine intellig... 详细信息
来源: 评论
TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
arXiv
收藏 引用
arXiv 2024年
作者: Li, Jinglun Zhou, Xinyu Jiang, Kaixun Hong, Lingyi Guo, Pinxue Chen, Zhaoyu Ge, Weifeng Zhang, Wenqiang Shanghai Engineering Research Center of AI & Robotics Academy for Engineering & Technology Fudan University Shanghai China Shanghai Key Lab of Intelligent Information Processing School of Computer Science Fudan University Shanghai China Key Lab of Intelligent Information Processing School of Computer Science Fudan University Shanghai China
Multimodal fusion, leveraging data like vision and language, is rapidly gaining traction. This enriched data representation improves performance across various tasks. Existing methods for out-of-distribution (OOD) det... 详细信息
来源: 评论
Optimizing V-information for Self-Supervised Pre-training Data-Effective Medical Foundation Models
arXiv
收藏 引用
arXiv 2024年
作者: Yang, Wenxuan Zhang, Hanyu Tan, Weimin Sun, Yuqi Yan, Bo Shanghai Key Laboratory of Intelligent Information Processing School of Computer Science Fudan University China
Self-supervised pre-training medical foundation models on large-scale datasets demonstrate exceptional performance. However, recent research questions this traditional notion, exploring whether an increase in pre-trai... 详细信息
来源: 评论
X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation  24
X-Prompt: Multi-modal Visual Prompt for Video Object Segment...
收藏 引用
32nd ACM International Conference on Multimedia, MM 2024
作者: Guo, Pinxue Li, Wanyun Huang, Hao Hong, Lingyi Zhou, Xinyu Chen, Zhaoyu Li, Jinglun Jiang, Kaixun Zhang, Wei Zhang, Wenqiang Shanghai Engineering Research Center of Ai & Robotics Academy for Engineering & Technology Fudan University China Shanghai Key Lab of Intelligent Information Processing School of Computer Science Fudan University China Engineering Research Center of Ai & Robotics Ministry of Education Academy for Engineering & Technology China
Multi-modal Video Object Segmentation (VOS), including RGB-Thermal, RGB-Depth, and RGB-Event, has garnered attention due to its capability to address challenging scenarios where traditional VOS methods struggle, such ... 详细信息
来源: 评论
EAFormer: Scene Text Segmentation with Edge-Aware Transformers
arXiv
收藏 引用
arXiv 2024年
作者: Yu, Haiyang Fu, Teng Li, Bin Xue, Xiangyang Shanghai Key Laboratory of Intelligent Information Processing School of Computer Science Fudan University China
Scene text segmentation aims at cropping texts from scene images, which is usually used to help generative models edit or remove texts. The existing text segmentation methods tend to involve various text-related super... 详细信息
来源: 评论
Unsupervised Learning of Global Object-Centric Representations for Compositional Scene Understanding
收藏 引用
IEEE Transactions on Visualization and computer Graphics 2025年 PP卷 PP页
作者: Chen, Tonglin Huang, Yinxuan Huang, Jinghao Li, Bin Xue, Xiangyang Fudan University Shanghai Key Lab of Intelligent Information Processing School of Computer Science Shanghai200433 China
The ability to extract invariant visual features of objects from complex scenes and identify the same objects in different scenes is inborn for humans. To endow AI systems with such capability, we introduce a novel co... 详细信息
来源: 评论
Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection
arXiv
收藏 引用
arXiv 2024年
作者: Huang, Yinxuan Gao, Chengmin Li, Bin Xue, Xiangyang Shanghai Key Laboratory of Intelligent Information Processing School of Computer Science Fudan University China
Given the complexities inherent in visual scenes, such as object occlusion, a comprehensive understanding often requires observation from multiple viewpoints. Existing multi-viewpoint object-centric learning methods t... 详细信息
来源: 评论
Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic Retinopathy Analysis on OCTA Images  25th
Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic R...
收藏 引用
25th International Conference on Medical Image Computing and computer-Assisted Intervention , MICCAI 2022
作者: Hou, Junlin Xiao, Fan Xu, Jilan Zhang, Yuejie Zou, Haidong Feng, Rui School of Computer Science Shanghai Key Laboratory of Intelligent Information Processing Fudan University Shanghai China Academy for Engineering and Technology Fudan University Shanghai China Department of Ophthalmology Shanghai General Hospital School of Medicine Shanghai Jiao Tong University Shanghai China
The ultra-wide optical coherence tomography angiography (OCTA) has become an important imaging modality in diabetic retinopathy (DR) diagnosis. However, there are few researches focusing on automatic DR analysis using... 详细信息
来源: 评论