咨询与建议

限定检索结果

文献类型

  • 288 篇 期刊文献
  • 221 篇 会议

馆藏范围

  • 509 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 318 篇 工学
    • 263 篇 计算机科学与技术...
    • 224 篇 软件工程
    • 67 篇 信息与通信工程
    • 47 篇 生物工程
    • 31 篇 控制科学与工程
    • 24 篇 电子科学与技术(可...
    • 21 篇 电气工程
    • 21 篇 化学工程与技术
    • 17 篇 光学工程
    • 16 篇 生物医学工程(可授...
    • 9 篇 机械工程
    • 6 篇 力学(可授工学、理...
    • 6 篇 土木工程
    • 5 篇 仪器科学与技术
    • 5 篇 材料科学与工程(可...
    • 5 篇 动力工程及工程热...
  • 211 篇 理学
    • 115 篇 物理学
    • 67 篇 数学
    • 57 篇 生物学
    • 20 篇 化学
    • 18 篇 统计学(可授理学、...
    • 6 篇 系统科学
    • 4 篇 地质学
  • 65 篇 管理学
    • 45 篇 图书情报与档案管...
    • 21 篇 管理科学与工程(可...
    • 8 篇 工商管理
  • 13 篇 医学
    • 13 篇 基础医学(可授医学...
    • 12 篇 临床医学
    • 10 篇 药学(可授医学、理...
  • 12 篇 法学
    • 12 篇 社会学
  • 2 篇 经济学
  • 1 篇 教育学
  • 1 篇 文学

主题

  • 28 篇 speech recogniti...
  • 26 篇 semantics
  • 23 篇 training
  • 18 篇 signal processin...
  • 14 篇 speech enhanceme...
  • 12 篇 acoustics
  • 12 篇 machine learning
  • 12 篇 embeddings
  • 11 篇 computational li...
  • 11 篇 adaptation model...
  • 10 篇 computational mo...
  • 10 篇 syntactics
  • 10 篇 neural machine t...
  • 9 篇 speech processin...
  • 9 篇 feature extracti...
  • 9 篇 degradation
  • 9 篇 robustness
  • 8 篇 self-supervised ...
  • 8 篇 decoding
  • 7 篇 object detection

机构

  • 153 篇 moe key lab of a...
  • 131 篇 department of co...
  • 60 篇 key laboratory o...
  • 53 篇 moe key lab of a...
  • 32 篇 department of co...
  • 28 篇 department of co...
  • 28 篇 x-lance lab depa...
  • 23 篇 suzhou laborator...
  • 22 篇 x-lance lab depa...
  • 16 篇 key lab. of shan...
  • 16 篇 research center ...
  • 15 篇 aispeech co. ltd...
  • 15 篇 ji hua laborator...
  • 15 篇 shanghai jiao to...
  • 10 篇 shanghai jiao to...
  • 10 篇 auditory cogniti...
  • 9 篇 kyoto
  • 8 篇 department of co...
  • 8 篇 aispeech ltd
  • 8 篇 microsoft resear...

作者

  • 106 篇 yu kai
  • 93 篇 zhao hai
  • 61 篇 chen lu
  • 56 篇 qian yanmin
  • 40 篇 zhang zhuosheng
  • 39 篇 yan junchi
  • 38 篇 yanmin qian
  • 36 篇 chen xie
  • 32 篇 li zuchao
  • 28 篇 wu mengyue
  • 23 篇 zhu su
  • 22 篇 guo yiwei
  • 20 篇 kai yu
  • 19 篇 yang xiaokang
  • 18 篇 chen zhengyang
  • 17 篇 xu hongshen
  • 17 篇 du chenpeng
  • 17 篇 junchi yan
  • 16 篇 cao ruisheng
  • 16 篇 ma ziyang

语言

  • 464 篇 英文
  • 45 篇 其他
  • 1 篇 中文
检索条件"机构=Dep. of Computer Science and Engineering & MoE Key Lab of AI"
509 条 记 录,以下是261-270 订阅
排序:
Quantifying the Knowledge in a DNN to Explain Knowledge Distillation for Classification
arXiv
收藏 引用
arXiv 2022年
作者: Zhang, Quanshi Cheng, Xu Chen, Yilan Rao, Zhefan The Department of Computer Science and Engineering The John Hopcroft Center The MoE Key Lab of Artificial Intelligence AI Institute The Shanghai Jiao Tong University China
Compared to traditional learning from scratch, knowledge distillation sometimes makes the DNN achieve superior performance. In this paper, we provide a new perspective to explain the success of knowledge distillation ... 详细信息
来源: 评论
EVASION: Efficient KV CAche CompreSsion vIa PrOduct QuaNtization
EVASION: Efficient KV CAche CompreSsion vIa PrOduct QuaNtiza...
收藏 引用
Design, Automation and Test in Europe Conference and Exhibition
作者: Zongwu Wang Fangxin Liu Peng Xu Qingxiao Sun Junping Zhao Li Jiang Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai Qi Zhi Institute Dept. of CST SSSLab China University of Petroleum-Beijing China Ant Group MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University
Large language models (LLMs) are increasingly utilized for complex tasks requiring longer context lengths, with some models supporting up to 128K or 1M tokens. This trend, however, presents significant challenges in i... 详细信息
来源: 评论
Is Your Image a Good Storyteller?
arXiv
收藏 引用
arXiv 2024年
作者: Song, Xiujie Pang, Xiaoyi Tang, Haifeng Wu, Mengyue Zhu, Kenny Q. X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China China Merchants Bank Credit Card Center Shanghai China University of Texas at Arlington ArlingtonTX United States
Quantifying image complexity at the entity level is straightforward, but the assessment of semantic complexity has been largely overlooked. In fact, there are differences in semantic complexity across images. Images w...
来源: 评论
LONGFNT: LONG-FORM SPEECH RECOGNITION WITH FACTORIZED NEURAL TRANSDUCER
arXiv
收藏 引用
arXiv 2022年
作者: Gong, Xun Wu, Yu Li, Jinyu Liu, Shujie Zhao, Rui Chen, Xie Qian, Yanmin MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai Jiao Tong University China Microsoft China
Traditional automatic speech recognition (ASR) systems usually focus on individual utterances, without considering long-form speech with useful historical information, which is more practical in real scenarios. Simply... 详细信息
来源: 评论
LEVERAGING SPEECH PTM, TEXT LLM, AND EMOTIONAL TTS FOR SPEECH EMOTION RECOGNITION
arXiv
收藏 引用
arXiv 2023年
作者: Ma, Ziyang Wu, Wen Zheng, Zhisheng Guo, Yiwei Chen, Qian Zhang, Shiliang Chen, Xie MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China Department of Engineering University of Cambridge Cambridge United Kingdom Speech Lab of DAMO Academy Alibaba Group China
In this paper, we explored how to boost speech emotion recognition (SER) with the state-of-the-art speech pre-trained model (PTM), data2vec, text generation technique, GPT-4, and speech synthesis technique, Azure TTS.... 详细信息
来源: 评论
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning
arXiv
收藏 引用
arXiv 2023年
作者: Wang, Qi Yang, Junming Wang, Yunbo Jin, Xin Zeng, Wenjun Yang, Xiaokang MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University China Ningbo Institute of Digital Twin Eastern Institute of Technology China School of Computer Science and Engineering Southeast University China
Training offline RL models using visual inputs poses two significant challenges, i.e., the overfitting problem in representation learning and the overestimation bias for expected future rewards. Recent work has attemp... 详细信息
来源: 评论
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
arXiv
收藏 引用
arXiv 2022年
作者: Li, Chenda Yang, Lei Wang, Weiqin Qian, Yanmin MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China China
Continuous speech separation for meeting pre-processing has recently become a focused research topic. Compared to the data in utterance-level speech separation, the meeting-style audio stream lasts longer, has an unce... 详细信息
来源: 评论
SP3: Enhancing Structured Pruning via PCA Projection
arXiv
收藏 引用
arXiv 2023年
作者: Hu, Yuxuan Zhang, Jing Zhao, Zhe Zhao, Chen Chen, Xiaodong Li, Cuiping Chen, Hong School of Information Renmin University of China Beijing China Key Laboratory of Data Engineering and Knowledge Engineering MOE China Engineering Research Center of Database and Business Intelligence MOE China Tencent AI Lab Tencent Beijing China School of Computer Science and Technology Xi'an Jiaotong University Xi'An China
Structured pruning is a widely used technique for reducing the size of pre-trained language models (PLMs), but current methods often overlook the potential of compressing the hidden dimension (d) in PLMs, a dimension ...
来源: 评论
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching for Speaker Diarization
Flow-TSVAD: Target-Speaker Voice Activity Detection via Late...
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Zhengyang Chen Bing Han Shuai Wang Yidi Jiang Yanmin Qian Department of Computer Science and Engineering Auditory Cognition and Computational Acoustics Lab MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China Shenzhen Research Institute of Big Data Shenzhen China School of Data Science The Chinese University of Hong Kong Shenzhen China National University of Singapore Singapore
Speaker diarization is typically considered as a discriminative task, using discriminative approaches to produce fixed diarization results. In this paper, we explore for the first time the use of neural network-based ... 详细信息
来源: 评论
End-to-End Streaming Customizable keyword Spotting Based on Text-Adaptive Neural Search  1
收藏 引用
18th National Conference on Man-Machine Speech Communication, NCMMSC 2023
作者: Yang, Baochen Guo, Jiaqi Li, Haoyu Xi, Yu Zhuo, Qing Yu, Kai X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China State Key Laboratory of Media Convergence Production Technology and Systems Beijing China AISpeech Ltd. Suzhou China Department of Automation Tsinghua University Beijing China
Streaming keyword spotting (KWS) is an important technique for voice assistant wake-up. While KWS with a preset fixed keyword has been well studied, test-time customizable keyword spotting in streaming mode remains a ... 详细信息
来源: 评论