咨询与建议

限定检索结果

文献类型

  • 288 篇 期刊文献
  • 221 篇 会议

馆藏范围

  • 509 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 318 篇 工学
    • 263 篇 计算机科学与技术...
    • 224 篇 软件工程
    • 67 篇 信息与通信工程
    • 47 篇 生物工程
    • 31 篇 控制科学与工程
    • 24 篇 电子科学与技术(可...
    • 21 篇 电气工程
    • 21 篇 化学工程与技术
    • 17 篇 光学工程
    • 16 篇 生物医学工程(可授...
    • 9 篇 机械工程
    • 6 篇 力学(可授工学、理...
    • 6 篇 土木工程
    • 5 篇 仪器科学与技术
    • 5 篇 材料科学与工程(可...
    • 5 篇 动力工程及工程热...
  • 211 篇 理学
    • 115 篇 物理学
    • 67 篇 数学
    • 57 篇 生物学
    • 20 篇 化学
    • 18 篇 统计学(可授理学、...
    • 6 篇 系统科学
    • 4 篇 地质学
  • 65 篇 管理学
    • 45 篇 图书情报与档案管...
    • 21 篇 管理科学与工程(可...
    • 8 篇 工商管理
  • 13 篇 医学
    • 13 篇 基础医学(可授医学...
    • 12 篇 临床医学
    • 10 篇 药学(可授医学、理...
  • 12 篇 法学
    • 12 篇 社会学
  • 2 篇 经济学
  • 1 篇 教育学
  • 1 篇 文学

主题

  • 28 篇 speech recogniti...
  • 26 篇 semantics
  • 23 篇 training
  • 18 篇 signal processin...
  • 14 篇 speech enhanceme...
  • 12 篇 acoustics
  • 12 篇 machine learning
  • 12 篇 embeddings
  • 11 篇 computational li...
  • 11 篇 adaptation model...
  • 10 篇 computational mo...
  • 10 篇 syntactics
  • 10 篇 neural machine t...
  • 9 篇 speech processin...
  • 9 篇 feature extracti...
  • 9 篇 degradation
  • 9 篇 robustness
  • 8 篇 self-supervised ...
  • 8 篇 decoding
  • 7 篇 object detection

机构

  • 153 篇 moe key lab of a...
  • 131 篇 department of co...
  • 60 篇 key laboratory o...
  • 53 篇 moe key lab of a...
  • 32 篇 department of co...
  • 28 篇 department of co...
  • 28 篇 x-lance lab depa...
  • 23 篇 suzhou laborator...
  • 22 篇 x-lance lab depa...
  • 16 篇 key lab. of shan...
  • 16 篇 research center ...
  • 15 篇 aispeech co. ltd...
  • 15 篇 ji hua laborator...
  • 15 篇 shanghai jiao to...
  • 10 篇 shanghai jiao to...
  • 10 篇 auditory cogniti...
  • 9 篇 kyoto
  • 8 篇 department of co...
  • 8 篇 aispeech ltd
  • 8 篇 microsoft resear...

作者

  • 106 篇 yu kai
  • 93 篇 zhao hai
  • 61 篇 chen lu
  • 56 篇 qian yanmin
  • 40 篇 zhang zhuosheng
  • 39 篇 yan junchi
  • 38 篇 yanmin qian
  • 36 篇 chen xie
  • 32 篇 li zuchao
  • 28 篇 wu mengyue
  • 23 篇 zhu su
  • 22 篇 guo yiwei
  • 20 篇 kai yu
  • 19 篇 yang xiaokang
  • 18 篇 chen zhengyang
  • 17 篇 xu hongshen
  • 17 篇 du chenpeng
  • 17 篇 junchi yan
  • 16 篇 cao ruisheng
  • 16 篇 ma ziyang

语言

  • 464 篇 英文
  • 45 篇 其他
  • 1 篇 中文
检索条件"机构=Dep. of Computer Science and Engineering & MoE Key Lab of AI"
509 条 记 录,以下是131-140 订阅
排序:
Robust Audio-Visual ASR with Unified Cross-Modal Attention
Robust Audio-Visual ASR with Unified Cross-Modal Attention
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Jiahong Li Chenda Li Yifei Wu Yanmin Qian Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Shanghai Jiao Tong University Shanghai China
Audio-visual speech recognition (AVSR) takes advantage of noise-invariant visual information to improve the robustness of automatic speech recognition (ASR) systems. While previous works mainly focused on the clean co... 详细信息
来源: 评论
HuBERT-AGG: Aggregated Representation Distillation of Hidden-Unit Bert for Robust Speech Recognition
HuBERT-AGG: Aggregated Representation Distillation of Hidden...
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Wei Wang Yanmin Qian Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Shanghai Jiao Tong University Shanghai China
Self-supervised learning (SSL) has attracted widespread research interest since many successful SSL approaches such as wav2vec 2.0 and Hidden-unit BERT (HuBERT) have achieved promising results on speech-related tasks ... 详细信息
来源: 评论
EXPRESSIVE TTS DRIVEN BY NATURAL LANGUAGE PROMPTS USING FEW HUMAN ANNOTATIONS
arXiv
收藏 引用
arXiv 2023年
作者: Zhang, Hanglei Guo, Yiwei Liu, Sen Chen, Xie Yu, Kai MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China
Expressive text-to-speech (TTS) aims to synthesize speeches with human-like tones, moods, or even artistic attributes. Recent advancements in expressive TTS empower users with the ability to directly control synthesis... 详细信息
来源: 评论
Diverse and Vivid Sound Generation from Text Descriptions
Diverse and Vivid Sound Generation from Text Descriptions
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Guangwei Li Xuenan Xu Lingfeng Dai Mengyue Wu Kai Yu Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute X-Lance Lab Shanghai Jiao Tong University Shanghai China
Previous audio generation mainly focuses on specified sound classes such as speech or music, whose form and content are greatly restricted. In this paper, we go beyond specific audio generation by using natural langua... 详细信息
来源: 评论
Adaptive Large Margin Fine-Tuning For Robust Speaker Verification
Adaptive Large Margin Fine-Tuning For Robust Speaker Verific...
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Leying Zhang Zhengyang Chen Yanmin Qian Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Shanghai Jiao Tong University Shanghai China
Large margin fine-tuning (LMFT) is an effective strategy to improve the speaker verification system’s performance and is widely used in speaker verification challenge systems. Because the large margin in the loss fun... 详细信息
来源: 评论
MULTI-SPEAKER MULTI-LINGUAL VQTTS SYSTEM FOR LIMMITS 2023 CHALLENGE
arXiv
收藏 引用
arXiv 2023年
作者: Du, Chenpeng Guo, Yiwei Shen, Feiyu Yu, Kai MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China
In this paper, we describe the systems developed by the SJTU X-LANCE team for LIMMITS 2023 Challenge, and we mainly focus on the winning system on naturalness for track 1. The aim of this challenge is to build a multi... 详细信息
来源: 评论
Improving Dino-Based Self-Supervised Speaker Verification with Progressive Cluster-Aware Training
Improving Dino-Based Self-Supervised Speaker Verification wi...
收藏 引用
Acoustics, Speech, and Signal Processing Workshops (ICASSPW), IEEE International Conference on
作者: Bing Han Wen Huang Zhengyang Chen Yanmin Qian Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Shanghai Jiao Tong University Shanghai China
Self-supervised contrastive learning has recently emerged as one of the promising approaches in speaker verification task, due to its indep.ndence from labeled data. Among them, the DINO-based self-supervised framewor...
来源: 评论
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Ch...
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Chenpeng Du Yiwei Guo Feiyu Shen Kai Yu Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Shanghai Jiao Tong University Shanghai China
In this paper, we describe the systems developed by the SJTU X-LANCE team for LIMMITS 2023 Challenge, and we mainly focus on the winning system on naturalness for track 1. The aim of this challenge is to build a multi... 详细信息
来源: 评论
On the Structural Generalization in Text-to-SQL
arXiv
收藏 引用
arXiv 2023年
作者: Li, Jieyu Chen, Lu Cao, Ruisheng Zhu, Su Xu, Hongshen Chen, Zhi Zhang, Hanchong Yu, Kai X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence Ai Institute Shanghai Jiao Tong University Shanghai China
Exploring the generalization of a text-to-SQL parser is essential for a system to automatically adapt the real-world databases. Previous works provided investigations focusing on lexical diversity, including the influ... 详细信息
来源: 评论
ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL
arXiv
收藏 引用
arXiv 2023年
作者: Cao, Ruisheng Zhang, Hanchong Xu, Hongshen Li, Jieyu Ma, Da Chen, Lu Yu, Kai X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China
Text-to-SQL aims to generate an executable SQL program given the user utterance and the corresponding database schema. To ensure the well-formedness of output SQLs, one prominent approach adopts a grammar-based recurr... 详细信息
来源: 评论