咨询与建议

限定检索结果

文献类型

  • 575 篇 期刊文献
  • 385 篇 会议
  • 4 册 图书

馆藏范围

  • 964 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 641 篇 工学
    • 511 篇 计算机科学与技术...
    • 444 篇 软件工程
    • 129 篇 信息与通信工程
    • 111 篇 生物工程
    • 68 篇 控制科学与工程
    • 67 篇 生物医学工程(可授...
    • 63 篇 光学工程
    • 47 篇 电气工程
    • 47 篇 电子科学与技术(可...
    • 38 篇 化学工程与技术
    • 27 篇 机械工程
    • 18 篇 仪器科学与技术
    • 17 篇 土木工程
    • 14 篇 材料科学与工程(可...
    • 14 篇 动力工程及工程热...
    • 13 篇 建筑学
  • 399 篇 理学
    • 177 篇 物理学
    • 163 篇 数学
    • 126 篇 生物学
    • 51 篇 统计学(可授理学、...
    • 40 篇 化学
    • 24 篇 系统科学
  • 158 篇 管理学
    • 93 篇 图书情报与档案管...
    • 71 篇 管理科学与工程(可...
    • 32 篇 工商管理
  • 53 篇 医学
    • 51 篇 临床医学
    • 40 篇 基础医学(可授医学...
    • 30 篇 药学(可授医学、理...
  • 22 篇 法学
    • 22 篇 社会学
  • 12 篇 经济学
    • 12 篇 应用经济学
  • 4 篇 教育学
  • 2 篇 农学
  • 1 篇 文学
  • 1 篇 军事学

主题

  • 35 篇 semantics
  • 30 篇 speech recogniti...
  • 29 篇 training
  • 27 篇 machine learning
  • 20 篇 signal processin...
  • 18 篇 embeddings
  • 16 篇 speech enhanceme...
  • 16 篇 computational mo...
  • 14 篇 object detection
  • 14 篇 reinforcement le...
  • 14 篇 artificial intel...
  • 13 篇 acoustics
  • 13 篇 decoding
  • 13 篇 adaptation model...
  • 12 篇 deep learning
  • 12 篇 speech processin...
  • 12 篇 computational li...
  • 11 篇 syntactics
  • 11 篇 predictive model...
  • 11 篇 feature extracti...

机构

  • 179 篇 moe key lab of a...
  • 147 篇 department of co...
  • 60 篇 key laboratory o...
  • 53 篇 moe key lab of a...
  • 44 篇 department of co...
  • 32 篇 department of co...
  • 30 篇 moe key lab of a...
  • 28 篇 department of co...
  • 28 篇 x-lance lab depa...
  • 23 篇 suzhou laborator...
  • 22 篇 x-lance lab depa...
  • 21 篇 shanghai jiao to...
  • 17 篇 key lab. of shan...
  • 15 篇 aispeech co. ltd...
  • 15 篇 shanghai jiao to...
  • 13 篇 moe key lab of a...
  • 12 篇 school of artifi...
  • 11 篇 department of el...
  • 11 篇 moe key lab of a...
  • 10 篇 school of artifi...

作者

  • 121 篇 yu kai
  • 96 篇 zhao hai
  • 68 篇 yan junchi
  • 63 篇 chen lu
  • 59 篇 qian yanmin
  • 41 篇 zhang zhuosheng
  • 38 篇 yanmin qian
  • 37 篇 chen xie
  • 34 篇 yang xiaokang
  • 33 篇 li zuchao
  • 32 篇 wu mengyue
  • 29 篇 zhu su
  • 28 篇 junchi yan
  • 25 篇 niu li
  • 22 篇 guo yiwei
  • 22 篇 gao xiaofeng
  • 21 篇 kai yu
  • 20 篇 zhang liqing
  • 18 篇 cao ruisheng
  • 18 篇 chen zhengyang

语言

  • 864 篇 英文
  • 89 篇 其他
  • 11 篇 中文
  • 1 篇 西班牙文
检索条件"机构=Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence"
964 条 记 录,以下是31-40 订阅
排序:
DiffVoice: Text-to-Speech with Latent Diffusion  48
DiffVoice: Text-to-Speech with Latent Diffusion
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Liu, Zhijun Guo, Yiwei Yu, Kai Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-Lance Lab Department of Computer Science and Engineering Shanghai China
In this work, we present DiffVoice, a novel text-to-speech model based on latent diffusion. We propose to first encode speech signals into a phoneme-rate latent representation with a variational autoencoder enhanced b... 详细信息
来源: 评论
Emodiff: Intensity Controllable Emotional Text-to-Speech with Soft-label Guidance  48
Emodiff: Intensity Controllable Emotional Text-to-Speech wit...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Guo, Yiwei Du, Chenpeng Chen, Xie Yu, Kai Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
Although current neural text-to-speech (TTS) models are able to generate high-quality speech, intensity controllable emotional TTS is still a challenging task. Most existing methods need external optimizations for int... 详细信息
来源: 评论
Exploring Binary Classification Loss for Speaker Verification  48
Exploring Binary Classification Loss for Speaker Verificatio...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Han, Bing Chen, Zhengyang Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
The mismatch between close-set training and open-set testing usually leads to significant performance degradation for speaker verification task. For existing loss functions, metric learning-based objectives depend str... 详细信息
来源: 评论
Factorized AED: Factorized Attention-Based Encoder-Decoder for Text-Only Domain Adaptive ASR  48
Factorized AED: Factorized Attention-Based Encoder-Decoder f...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Gong, Xun Wang, Wei Shao, Hang Chen, Xie Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
End-to-end automatic speech recognition (ASR) systems have gained popularity given their simplified architecture and promising results. However, text-only domain adaptation remains a big challenge for E2E systems. Tex... 详细信息
来源: 评论
HuBERT-AGG: Aggregated Representation Distillation of Hidden-Unit Bert for Robust Speech Recognition  48
HuBERT-AGG: Aggregated Representation Distillation of Hidden...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Wang, Wei Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
Self-supervised learning (SSL) has attracted widespread research interest since many successful SSL approaches such as wav2vec 2.0 and Hidden-unit BERT (HuBERT) have achieved promising results on speech-related tasks ... 详细信息
来源: 评论
ECAPA++: Fine-grained Deep Embedding Learning for TDNN Based Speaker Verification  24
ECAPA++: Fine-grained Deep Embedding Learning for TDNN Based...
收藏 引用
24th International Speech Communication Association, Interspeech 2023
作者: Liu, Bei Qian, Yanmin MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China
In this paper, we aim to bridge the performance gap between TDNN and 2D CNN based speaker verification systems. Specifically, three types of architectural enhancements to ECAPA-TDNN are proposed: 1) follow depth-first... 详细信息
来源: 评论
LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer  48
LongFNT: Long-Form Speech Recognition with Factorized Neural...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Gong, Xun Wu, Yu Li, Jinyu Liu, Shujie Zhao, Rui Chen, Xie Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering China Microsoft
Traditional automatic speech recognition (ASR) systems usually focus on individual utterances, without considering long-form speech with useful historical information, which is more practical in real scenarios. Simply... 详细信息
来源: 评论
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge  48
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Ch...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Du, Chenpeng Guo, Yiwei Shen, Feiyu Yu, Kai Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
In this paper, we describe the systems developed by the SJTU X-LANCE team for LIMMITS 2023 Challenge, and we mainly focus on the winning system on naturalness for track 1. The aim of this challenge is to build a multi... 详细信息
来源: 评论
CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with...
收藏 引用
2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
作者: Zhang, Hanchong Cao, Ruisheng Xu, Hongshen Chen, Lu Yu, Kai X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence SJTU AI Institute Shanghai Jiao Tong University Shanghai China Suzhou Laboratory Suzhou China
Recently, Large Language Models (LLMs) have been demonstrated to possess impressive capabilities in a variety of domains and tasks. We investigate the issue of prompt design in the multi-turn text-to-SQL task and atte... 详细信息
来源: 评论
UP2ME: univariate pre-training to multivariate fine-tuning as a general-purpose framework for multivariate time series analysis  24
UP2ME: univariate pre-training to multivariate fine-tuning a...
收藏 引用
Proceedings of the 41st International Conference on Machine Learning
作者: Yunhao Zhang Minghao Liu Shengyang Zhou Junchi Yan School of Artificial Intelligence & Department of Computer Science and Engineering & MoE Key Lab of Artificial Intelligence Shanghai Jiao Tong University School of Artificial Intelligence & Department of Computer Science and Engineering & MoE Key Lab of Artificial Intelligence Shanghai Jiao Tong UniversitySchool of Artificial Intelligence & Department of Computer Science and Engineering & MoE Key Lab of Artificial Intelligence Shanghai Jiao Tong University
Despite the success of self-supervised pretraining in texts and images, applying it to multivariate time series (MTS) falls behind tailored methods for tasks like forecasting, imputation and anomaly detection. We prop...
来源: 评论