咨询与建议

限定检索结果

文献类型

  • 1,558 篇 期刊文献
  • 1,518 篇 会议
  • 13 册 图书

馆藏范围

  • 3,089 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 2,104 篇 工学
    • 1,605 篇 计算机科学与技术...
    • 1,374 篇 软件工程
    • 383 篇 信息与通信工程
    • 338 篇 生物工程
    • 330 篇 控制科学与工程
    • 226 篇 生物医学工程(可授...
    • 207 篇 光学工程
    • 158 篇 电气工程
    • 154 篇 机械工程
    • 136 篇 电子科学与技术(可...
    • 107 篇 化学工程与技术
    • 62 篇 仪器科学与技术
    • 57 篇 交通运输工程
    • 56 篇 安全科学与工程
    • 53 篇 土木工程
    • 47 篇 建筑学
  • 1,158 篇 理学
    • 589 篇 数学
    • 366 篇 生物学
    • 363 篇 物理学
    • 196 篇 统计学(可授理学、...
    • 112 篇 化学
    • 109 篇 系统科学
  • 538 篇 管理学
    • 293 篇 图书情报与档案管...
    • 262 篇 管理科学与工程(可...
    • 131 篇 工商管理
  • 162 篇 医学
    • 144 篇 临床医学
    • 127 篇 基础医学(可授医学...
    • 86 篇 药学(可授医学、理...
    • 54 篇 公共卫生与预防医...
  • 88 篇 法学
    • 80 篇 社会学
  • 39 篇 教育学
  • 34 篇 经济学
  • 20 篇 农学
  • 9 篇 文学
  • 2 篇 军事学
  • 1 篇 哲学
  • 1 篇 历史学

主题

  • 85 篇 semantics
  • 82 篇 machine learning
  • 81 篇 training
  • 71 篇 deep learning
  • 62 篇 computational mo...
  • 49 篇 artificial intel...
  • 48 篇 reinforcement le...
  • 46 篇 accuracy
  • 41 篇 feature extracti...
  • 38 篇 computer vision
  • 36 篇 speech recogniti...
  • 35 篇 predictive model...
  • 32 篇 visualization
  • 30 篇 data models
  • 29 篇 object detection
  • 29 篇 image segmentati...
  • 29 篇 embeddings
  • 29 篇 robustness
  • 28 篇 neural networks
  • 26 篇 graph neural net...

机构

  • 170 篇 moe key lab of a...
  • 142 篇 department of co...
  • 59 篇 key laboratory o...
  • 51 篇 moe key lab of a...
  • 47 篇 department of co...
  • 44 篇 department of co...
  • 38 篇 shanghai artific...
  • 36 篇 computer science...
  • 36 篇 tencent ai lab
  • 36 篇 computer science...
  • 33 篇 faculty of compu...
  • 32 篇 department of co...
  • 32 篇 computer science...
  • 30 篇 institute for ar...
  • 30 篇 gaoling school o...
  • 30 篇 moe key lab of a...
  • 29 篇 shanghai jiao to...
  • 27 篇 state key lab on...
  • 27 篇 department of co...
  • 26 篇 computer science...

作者

  • 121 篇 yu kai
  • 94 篇 zhao hai
  • 70 篇 yan junchi
  • 57 篇 chen lu
  • 56 篇 qian yanmin
  • 49 篇 sun maosong
  • 46 篇 ayman atia
  • 45 篇 daniela rus
  • 43 篇 liu zhiyuan
  • 41 篇 zhang zhuosheng
  • 41 篇 atia ayman
  • 37 篇 chen xie
  • 32 篇 rus daniela
  • 32 篇 li zuchao
  • 32 篇 yanmin qian
  • 29 篇 zhu su
  • 29 篇 huang minlie
  • 28 篇 wu mengyue
  • 28 篇 yang xiaokang
  • 27 篇 junchi yan

语言

  • 2,836 篇 英文
  • 239 篇 其他
  • 21 篇 中文
  • 1 篇 德文
检索条件"机构=LIACC- Artificial Intelligence and Computer Science Lab"
3089 条 记 录,以下是111-120 订阅
排序:
Exploring Binary Classification Loss for Speaker Verification  48
Exploring Binary Classification Loss for Speaker Verificatio...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Han, Bing Chen, Zhengyang Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
The mismatch between close-set training and open-set testing usually leads to significant performance degradation for speaker verification task. For existing loss functions, metric learning-based objectives depend str... 详细信息
来源: 评论
Robust Audio-Visual ASR with Unified Cross-Modal Attention  48
Robust Audio-Visual ASR with Unified Cross-Modal Attention
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Li, Jiahong Li, Chenda Wu, Yifei Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
Audio-visual speech recognition (AVSR) takes advantage of noise-invariant visual information to improve the robustness of automatic speech recognition (ASR) systems. While previous works mainly focused on the clean co... 详细信息
来源: 评论
DiffVoice: Text-to-Speech with Latent Diffusion  48
DiffVoice: Text-to-Speech with Latent Diffusion
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Liu, Zhijun Guo, Yiwei Yu, Kai Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-Lance Lab Department of Computer Science and Engineering Shanghai China
In this work, we present DiffVoice, a novel text-to-speech model based on latent diffusion. We propose to first encode speech signals into a phoneme-rate latent representation with a variational autoencoder enhanced b... 详细信息
来源: 评论
A Civil Aviation Customer Service Ontology and Its Applications
收藏 引用
Data intelligence 2023年 第4期5卷 1063-1081页
作者: Meixiang Lv Xudong Cao Tianxing Wu Yuehua Li School of Computer Science and Engineering Southeast UniversityNanjingChina Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications(Southeast University) Ministry of EducationChina Zhejiang Lab HangzhouChina
In the process of developing the C919 large aircraft customer service intelligence system,we find that heterogeneous and incomplete data cause the inefficient and inaccurate decision ***,to solve this problem,we propo... 详细信息
来源: 评论
ALPSolver: A Solver for Assumable Logic Programming  5
ALPSolver: A Solver for Assumable Logic Programming
收藏 引用
5th International Conference on Intelligent Computing and Human-computer Interaction, ICHCI 2024
作者: Zhang, Zhizheng Chen, Jiayi Tian, Huangdezhong School of Computer Science and Engineering Southeast University Nanjing China Key Lab. of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications Ministry of Education China
Assumable Logic Programming (ALP), an extension of Answer Set Programming (ASP), has been theoretically demonstrated to possess significant advantages in addressing problems involving incomplete information. Therefore... 详细信息
来源: 评论
MetaSTC: A Backbone Agnostic Spatio-Temporal Framework for Traffic Forecasting  24
MetaSTC: A Backbone Agnostic Spatio-Temporal Framework for T...
收藏 引用
24th IEEE International Conference on Data Mining, ICDM 2024
作者: Xu, Kexin Yu, Zhemeng Gao, Yucen Zhang, Songjian Fang, Jun Gao, Xiaofeng Chen, Guihai Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Department of Computer Science and Engineering Shanghai China Didi Chuxing Technology Co. Beijing China
Traffic flow prediction is a critical issue in transportation engineering and presents distinct challenges when handling large-scale datasets in the real world. Existing complex spatio-temporal forecasting paradigms u... 详细信息
来源: 评论
Adaptive Large Margin Fine-Tuning For Robust Speaker Verification  48
Adaptive Large Margin Fine-Tuning For Robust Speaker Verific...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Zhang, Leying Chen, Zhengyang Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
Large margin fine-tuning (LMFT) is an effective strategy to improve the speaker verification system's performance and is widely used in speaker verification challenge systems. Because the large margin in the loss ... 详细信息
来源: 评论
Factorized AED: Factorized Attention-Based Encoder-Decoder for Text-Only Domain Adaptive ASR  48
Factorized AED: Factorized Attention-Based Encoder-Decoder f...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Gong, Xun Wang, Wei Shao, Hang Chen, Xie Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
End-to-end automatic speech recognition (ASR) systems have gained popularity given their simplified architecture and promising results. However, text-only domain adaptation remains a big challenge for E2E systems. Tex... 详细信息
来源: 评论
LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer  48
LongFNT: Long-Form Speech Recognition with Factorized Neural...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Gong, Xun Wu, Yu Li, Jinyu Liu, Shujie Zhao, Rui Chen, Xie Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering China Microsoft
Traditional automatic speech recognition (ASR) systems usually focus on individual utterances, without considering long-form speech with useful historical information, which is more practical in real scenarios. Simply... 详细信息
来源: 评论
Emodiff: Intensity Controllable Emotional Text-to-Speech with Soft-label Guidance  48
Emodiff: Intensity Controllable Emotional Text-to-Speech wit...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Guo, Yiwei Du, Chenpeng Chen, Xie Yu, Kai Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
Although current neural text-to-speech (TTS) models are able to generate high-quality speech, intensity controllable emotional TTS is still a challenging task. Most existing methods need external optimizations for int... 详细信息
来源: 评论