Refine search results

Document type

  • Conference papers (17)
  • Journal articles (12)

Collection scope

  • Electronic documents (29)
  • Print holdings (0)

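Subject classification

  • Engineering (19)
    • Computer Science and Technology... (13)
    • Software Engineering (12)
    • Control Science and Engineering (9)
    • Information and Communication Engineering (5)
    • Electrical Engineering (3)
    • Electronic Science and Technology (may... (3)
    • Optical Engineering (2)
    • Aeronautical and Astronautical Science and Tech... (2)
    • Mechanical Engineering (1)
    • Instrument Science and Technology (1)
    • Materials Science and Engineering (may... (1)
    • Chemical Engineering and Technology (1)
    • Light Industry Technology and Engineering (1)
    • Biomedical Engineering (may confer... (1)
    • Bioengineering (1)
  • Science (11)
    • Physics (9)
    • Mathematics (1)
    • Chemistry (1)
    • Biology (1)
    • Systems Science (1)
  • Law (1)
    • Sociology (1)
  • Management (1)
    • Library, Information and Archives Man... (1)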

Topic

  • signal processin... (5)
  • acoustics (4)
  • speech recogniti... (3)
  • libraries (2)
  • task analysis (2)
  • motion planning (2)
  • speech processin... (2)
  • production (2)
  • diffusion (2)
  • embeddings (2)
  • computer vision (2)
  • data models (2)
  • extensions (1)
  • frames (1)
  • legged locomotio... (1)
  • noise reduction (1)
  • springs (1)
  • grasping (1)
  • fast-u2++ (1)
  • information (1)

Institution

  • horizon robotics (13)
  • wenet open sourc... (13)
  • risc-v internati... (5)
  • shenzhen interna... (4)
  • school of marine... (4)
  • northwestern pol... (3)
  • aispeech ltd (3)
  • tsinghua-berkele... (3)
  • tencent ethereal... (2)
  • open source robo... (2)
  • school of marine... (2)
  • tencent robotics... (2)
  • school of comput... (2)
  • nvidia santa cla... (2)
  • the chinese univ... (2)
  • department of ma... (2)
  • the school of me... (1)
  • the chinese univ... (1)
  • open source robo... (1)
  • george washingto... (1)

Author

  • zhang binbin (9)
  • pan fuping (7)
  • peng zhendong (5)
  • song xingchen (5)
  • zhang xiao-lei (4)
  • binbin zhang (4)
  • liang chengdong (4)
  • fuping pan (3)
  • wu di (3)
  • xie lei (3)
  • ding wenbo (3)
  • deng yanlei (2)
  • wang hongji (2)
  • wang shuai (2)
  • hou jingyong (2)
  • wang zihan (2)
  • wu changsheng (2)
  • chengdong liang (2)
  • mu shilong (2)
  • li shengqiang (2)

Language

  • English (28)
  • Other (1)

Search query: "Institution=Open Source Robotics"
29 records; showing 1-10

Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Liang, Chengdong; Zhang, Xiao-Lei; Zhang, BinBin; Wu, Di; Li, Shengqiang; Song, Xingchen; Peng, Zhendong; Pan, Fuping
Affiliations: Northwestern Polytechnical University, School of Marine Science and Technology, Xi'an, China; Horizon Robotics, Beijing, China; WeNet Open Source Community
Recently, the unified streaming and non-streaming two-pass (U2/U2++) end-to-end model for speech recognition has shown great performance in terms of streaming capability, accuracy and latency. In this paper, we presen...

Wekws: A Production First Small-Footprint End-to-End Keyword Spotting Toolkit
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Wang, Jie; Xu, Menglong; Hou, Jingyong; Zhang, Binbin; Zhang, Xiao-Lei; Xie, Lei; Pan, Fuping
Affiliations: Northwestern Polytechnical University, School of Marine Science and Technology, Xi'an, China; WeNet Open Source Community, China; Horizon Robotics, Beijing, China; School of Computer Science, Xi'an, China
Keyword spotting (KWS) enables speech-based user interaction and gradually becomes an indispensable component of smart devices. Recently, end-to-end (E2E) methods have become the most popular approach for on-device K...

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Chen, Jie; Song, Xingchen; Peng, Zhendong; Zhang, Binbin; Pan, Fuping; Wu, Zhiyong
Affiliations: Tsinghua University, Shenzhen International Graduate School, Shenzhen, China; Horizon Robotics, Beijing, China; WeNet Open Source Community; The Chinese University of Hong Kong, Hong Kong, Hong Kong
Recent advances in neural text-to-speech (TTS) models bring thousands of TTS applications into daily life, where models are deployed in cloud to provide services for customs. Among these models are diffusion probabili...

Unleashing Artificial Cognition: Integrating Multiple AI Systems
arXiv, 2024
Authors: Adnan, Muntasir; Gamage, Buddhi; Xu, Zhiwei; Herath, Damith; Kuhn, Carlos C.N.
Affiliations: Open Source Institute, Faculty of Science and Technology, University of Canberra, Australia; Collaborative Robotics Lab, Faculty of Science and Technology, University of Canberra, Australia
In this study, we present an innovative fusion of language models and query analysis techniques to unlock cognition in artificial intelligence. The introduced open-source AI system seamlessly integrates a Chess engine...

Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Chengdong Liang; Xiao-Lei Zhang; BinBin Zhang; Di Wu; Shengqiang Li; Xingchen Song; Zhendong Peng; Fuping Pan
Affiliations: School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an, China; Horizon Robotics, Beijing, China; WeNet Open Source Community
Recently, the unified streaming and non-streaming two-pass (U2/U2++) end-to-end model for speech recognition has shown great performance in terms of streaming capability, accuracy and latency. In this paper, we presen...

LIGHTGRAD: LIGHTWEIGHT DIFFUSION PROBABILISTIC MODEL FOR TEXT-TO-SPEECH
arXiv, 2023
Authors: Chen, Jie; Song, Xingchen; Peng, Zhendong; Zhang, Binbin; Pan, Fuping; Wu, Zhiyong
Affiliations: Shenzhen International Graduate School, Tsinghua University, Shenzhen, China; Horizon Robotics, Beijing, China; WeNet Open Source Community, China; The Chinese University of Hong Kong, Hong Kong
Recent advances in neural text-to-speech (TTS) models bring thousands of TTS applications into daily life, where models are deployed in cloud to provide services for customs. Among these models are diffusion probabili...

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Jie Chen; Xingchen Song; Zhendong Peng; Binbin Zhang; Fuping Pan; Zhiyong Wu
Affiliations: Shenzhen International Graduate School, Tsinghua University, Shenzhen, China; Horizon Robotics, Beijing, China; WeNet Open Source Community; The Chinese University of Hong Kong, Hong Kong SAR, China
Recent advances in neural text-to-speech (TTS) models bring thousands of TTS applications into daily life, where models are deployed in cloud to provide services for customs. Among these models are diffusion probabili...

WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
arXiv, 2022
Authors: Zhang, Binbin; Wu, Di; Peng, Zhendong; Song, Xingchen; Yao, Zhuoyuan; Lv, Hang; Xie, Lei; Yang, Chao; Pan, Fuping; Niu, Jianwei
Affiliations: Horizon Robotics, Beijing, China; School of Computer Science, Northwestern Polytechnical University, Xi'an, China; WeNet Open Source Community, China
Recently, we made available WeNet [1], a production-oriented end-to-end speech recognition toolkit, which introduces a unified two-pass (U2) framework and a built-in runtime to address the streaming and non-streaming ...

FAST-U2++: FAST AND ACCURATE END-TO-END SPEECH RECOGNITION IN JOINT CTC/ATTENTION FRAMES
arXiv, 2022
Authors: Liang, Chengdong; Zhang, Xiao-Lei; Zhang, BinBin; Wu, Di; Li, Shengqiang; Song, Xingchen; Peng, Zhendong; Pan, Fuping
Affiliations: School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an, China; Horizon Robotics, Beijing, China; WeNet Open Source Community, China
Recently, the unified streaming and non-streaming two-pass (U2/U2++) end-to-end model for speech recognition has shown great performance in terms of streaming capability, accuracy and latency. In this paper, we presen...

Ultra-High-Frequency Harmony: mmWave Radar and Event Camera Orchestrate Accurate Drone Landing
arXiv, 2025
Authors: Wang, Haoyang; Xu, Jingao; Luo, Xinyu; Chen, Xuecheng; Zhang, Ting; Duan, Ruiyang; Liu, Yunhao; Chen, Xinlei
Affiliations: Shenzhen International Graduate School, Tsinghua University, China; Carnegie Mellon University, United States; Meituan Academy of Robotics, Shenzhen, China; School of Software, Tsinghua University, China; Pengcheng Laboratory, Shenzhen, China; RISC-V International Open Source Laboratory, Shenzhen, China
For precise, efficient, and safe drone landings, ground platforms should real-time, accurately locate descending drones and guide them to designated spots. While mmWave sensing combined with cameras improves localizat...