咨询与建议

限定检索结果

文献类型

  • 288 篇 期刊文献
  • 221 篇 会议

馆藏范围

  • 509 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 318 篇 工学
    • 263 篇 计算机科学与技术...
    • 224 篇 软件工程
    • 67 篇 信息与通信工程
    • 47 篇 生物工程
    • 31 篇 控制科学与工程
    • 24 篇 电子科学与技术(可...
    • 21 篇 电气工程
    • 21 篇 化学工程与技术
    • 17 篇 光学工程
    • 16 篇 生物医学工程(可授...
    • 9 篇 机械工程
    • 6 篇 力学(可授工学、理...
    • 6 篇 土木工程
    • 5 篇 仪器科学与技术
    • 5 篇 材料科学与工程(可...
    • 5 篇 动力工程及工程热...
  • 211 篇 理学
    • 115 篇 物理学
    • 67 篇 数学
    • 57 篇 生物学
    • 20 篇 化学
    • 18 篇 统计学(可授理学、...
    • 6 篇 系统科学
    • 4 篇 地质学
  • 65 篇 管理学
    • 45 篇 图书情报与档案管...
    • 21 篇 管理科学与工程(可...
    • 8 篇 工商管理
  • 13 篇 医学
    • 13 篇 基础医学(可授医学...
    • 12 篇 临床医学
    • 10 篇 药学(可授医学、理...
  • 12 篇 法学
    • 12 篇 社会学
  • 2 篇 经济学
  • 1 篇 教育学
  • 1 篇 文学

主题

  • 28 篇 speech recogniti...
  • 26 篇 semantics
  • 23 篇 training
  • 18 篇 signal processin...
  • 14 篇 speech enhanceme...
  • 12 篇 acoustics
  • 12 篇 machine learning
  • 12 篇 embeddings
  • 11 篇 computational li...
  • 11 篇 adaptation model...
  • 10 篇 computational mo...
  • 10 篇 syntactics
  • 10 篇 neural machine t...
  • 9 篇 speech processin...
  • 9 篇 feature extracti...
  • 9 篇 degradation
  • 9 篇 robustness
  • 8 篇 self-supervised ...
  • 8 篇 decoding
  • 7 篇 object detection

机构

  • 153 篇 moe key lab of a...
  • 131 篇 department of co...
  • 60 篇 key laboratory o...
  • 53 篇 moe key lab of a...
  • 32 篇 department of co...
  • 28 篇 department of co...
  • 28 篇 x-lance lab depa...
  • 23 篇 suzhou laborator...
  • 22 篇 x-lance lab depa...
  • 16 篇 key lab. of shan...
  • 16 篇 research center ...
  • 15 篇 aispeech co. ltd...
  • 15 篇 ji hua laborator...
  • 15 篇 shanghai jiao to...
  • 10 篇 shanghai jiao to...
  • 10 篇 auditory cogniti...
  • 9 篇 kyoto
  • 8 篇 department of co...
  • 8 篇 aispeech ltd
  • 8 篇 microsoft resear...

作者

  • 106 篇 yu kai
  • 93 篇 zhao hai
  • 61 篇 chen lu
  • 56 篇 qian yanmin
  • 40 篇 zhang zhuosheng
  • 39 篇 yan junchi
  • 38 篇 yanmin qian
  • 36 篇 chen xie
  • 32 篇 li zuchao
  • 28 篇 wu mengyue
  • 23 篇 zhu su
  • 22 篇 guo yiwei
  • 20 篇 kai yu
  • 19 篇 yang xiaokang
  • 18 篇 chen zhengyang
  • 17 篇 xu hongshen
  • 17 篇 du chenpeng
  • 17 篇 junchi yan
  • 16 篇 cao ruisheng
  • 16 篇 ma ziyang

语言

  • 464 篇 英文
  • 45 篇 其他
  • 1 篇 中文
检索条件"机构=Dep. of Computer Science and Engineering & MoE Key Lab of AI"
509 条 记 录,以下是231-240 订阅
排序:
ScanDTM: A Novel Dual-Temporal Modulation Scanpath Prediction Model for Omnidirectional Images
收藏 引用
IEEE Transactions on Circuits and Systems for Video Technology 2025年
作者: Zhu, Dandan Zhang, Kaiwei Min, Xiongkuo Zhai, Guangtao Yang, Xiaokang East China Normal University School of Computer Science and Technology Shanghai200333 China Shanghai Jiao Tong University Institute of Image Communication and Network Engineering Shanghai200240 China Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence AI Institute Shanghai200240 China
Scanpath prediction for omnidirectional images aims to effectively simulate the human visual perception mechanism to generate dynamic realistic fixation trajectories. However, the majority of scanpath prediction metho... 详细信息
来源: 评论
Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning
arXiv
收藏 引用
arXiv 2022年
作者: Xu, Xuenan Xie, Zeyu Wu, Mengyue Yu, Kai X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University China
Automated audio captioning (AAC), a task that mimics human perception as well as innovatively links audio processing and natural language processing, has overseen much progress over the last few years. AAC requires re... 详细信息
来源: 评论
A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision-Language Models
arXiv
收藏 引用
arXiv 2024年
作者: Song, Xiujie Wu, Mengyue Zhu, Kenny Q. Zhang, Chunhao Chen, Yanyi X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China University of Texas at Arlington ArlingtonTX United States University of Chicago ChicagoIL United States
Large Vision-Language Models (LVLMs), despite their recent success, are hardly comprehensively tested for their cognitive abilities. Inspired by the prevalent use of the Cookie Theft task in human cognitive tests, we ... 详细信息
来源: 评论
Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters
Robust Cross-Domain Speaker Verification with Multi-Level Do...
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Wen Huang Bing Han Shuai Wang Zhengyang Chen Yanmin Qian AI Institute Department of Computer Science and Engineering Auditory Cognition and Computational Acoustics Lab MoE Key Lab of Artificial Intelligence Shanghai Jiao Tong University Shanghai China Shenzhen Research Institute of Big Data The Chinese University of Hong Kong Shenzhen China
Speaker verification encounters significant challenges when confronted with diverse domain data, often resulting in performance degradation due to domain mismatch. To enhance performance in cross-domain scenarios, we ...
来源: 评论
ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation
arXiv
收藏 引用
arXiv 2024年
作者: Bian, Siyuan Li, Jiefeng Tang, Jiasheng Lu, Cewu Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China DAMO Academy Alibaba Group Hangzhou China Hupan Lab Hangzhou China
Accurate human shape recovery from a monocular RGB image is a challenging task because humans come in different shapes and sizes and wear different clothes. In this paper, we propose ShapeBoost, a new human shape reco... 详细信息
来源: 评论
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
arXiv
收藏 引用
arXiv 2024年
作者: Huang, Wen Han, Bing Chen, Zhengyang Wang, Shuai Qian, Yanmin Auditory Cognition and Computational Acoustics Lab MoE Key Lab of Artificial Intelligence AI Institute Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China Shenzhen Research Institute of Big Data The Chinese University of Hong Kong Shenzhen China
Speaker verification system trained on one domain usually suffers performance degradation when applied to another domain. To address this challenge, researchers commonly use feature distribution matching-based methods... 详细信息
来源: 评论
Is Your Image a Good Storyteller?  39
Is Your Image a Good Storyteller?
收藏 引用
39th Annual AAai Conference on Artificial Intelligence, AAai 2025
作者: Song, Xiujie Pang, Xiaoyi Tang, Haifeng Wu, Mengyue Zhu, Kenny Q. X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China China Merchants Bank Credit Card Center Shanghai China University of Texas at Arlington Arlington TX United States
Quantifying image complexity at the entity level is straightforward, but the assessment of semantic complexity has been largely overlooked. In fact, there are differences in semantic complexity across images. Images w...
来源: 评论
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition
Exploring Effective Distillation of Self-Supervised Speech M...
收藏 引用
IEEE Workshop on Automatic Speech Recognition and Understanding
作者: Yujin Wang Changli Tang Ziyang Ma Zhisheng Zheng Xie Chen Wei-Qiang Zhang Department of Electronic Engineering Tsinghua University Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Shanghai Jiao Tong University Shanghai China Peng Cheng Laboratory Shenzhen China
Self-supervised learning (SSL) has achieved great success in speech processing, but always with a large model size to increase the modeling capacity. This may limit its potential applications due to the expensive comp...
来源: 评论
EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting
arXiv
收藏 引用
arXiv 2024年
作者: Wang, Kailing Yang, Chen Wang, Yuehao Li, Sikuang Wang, Yan Dou, Qi Yang, Xiaokang Shen, Wei MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University China Dept. of Computer Science and Engineering The Chinese University of Hong Kong Hong Kong Shanghai Key Laboratory of Multidimensional Information Processing East China Normal University China
Precise camera tracking, high-fidelity 3D tissue reconstruction, and real-time online visualization are critical for intrabody medical imaging devices such as endoscopes and capsule robots. However, existing SLAM (Sim... 详细信息
来源: 评论
Relation-Aware Multi-hop Reasoning forVisual Dialog  10th
Relation-Aware Multi-hop Reasoning forVisual Dialog
收藏 引用
10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021
作者: Zhao, Yao Chen, Lu Yu, Kai X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China State Key Lab of Media Convergence Production Technology and Systems Beijing China
Visual dialog is a multi-modal task that requires a dialog agent to answer a series of progressive questions grounded in an image. In this paper, we propose Relation-aware Multi-hop Reasoning Network (i.e. R2N for sho... 详细信息
来源: 评论