咨询与建议

限定检索结果

文献类型

  • 8 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 13 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 13 篇 工学
    • 11 篇 计算机科学与技术...
    • 8 篇 电气工程
    • 4 篇 控制科学与工程
    • 2 篇 信息与通信工程
    • 1 篇 机械工程
    • 1 篇 电子科学与技术(可...
    • 1 篇 软件工程
    • 1 篇 网络空间安全
  • 1 篇 法学
    • 1 篇 社会学
  • 1 篇 文学
    • 1 篇 新闻传播学
  • 1 篇 医学
    • 1 篇 临床医学
  • 1 篇 管理学
    • 1 篇 图书情报与档案管...

主题

  • 13 篇 audio-visual rec...
  • 3 篇 deep learning
  • 2 篇 multi-modality f...
  • 2 篇 hidden markov mo...
  • 1 篇 wavelet transfor...
  • 1 篇 structure tensor
  • 1 篇 computational co...
  • 1 篇 hidden markov mo...
  • 1 篇 optical flow
  • 1 篇 conversational s...
  • 1 篇 speaker verifica...
  • 1 篇 system error ide...
  • 1 篇 k-nearest neighb...
  • 1 篇 hidden markov mo...
  • 1 篇 feature
  • 1 篇 lip-movements
  • 1 篇 convolutional ne...
  • 1 篇 fuzzy logical mo...
  • 1 篇 hmm
  • 1 篇 visual synthesis

机构

  • 1 篇 ocean univ china...
  • 1 篇 ocean university...
  • 1 篇 comilla universi...
  • 1 篇 horia hulubei na...
  • 1 篇 taylors univ lak...
  • 1 篇 univ nottingham ...
  • 1 篇 univ technol syd...
  • 1 篇 saitama universi...
  • 1 篇 kth royal inst t...
  • 1 篇 northwestern uni...
  • 1 篇 delin inst techn...
  • 1 篇 tatung univ dept...
  • 1 篇 fdn res & techno...
  • 1 篇 idemia courbevoi...
  • 1 篇 kth royal inst t...
  • 1 篇 sfax univ ecole ...
  • 1 篇 usmba fsdm dept ...
  • 1 篇 inst polytech pa...
  • 1 篇 nstu department ...
  • 1 篇 west virginia un...

作者

  • 2 篇 li xiaomei
  • 2 篇 guo zhongwen
  • 2 篇 pao tsang-long
  • 2 篇 yang chao
  • 2 篇 liao wen-yuan
  • 2 篇 wang jinxin
  • 2 篇 chen yu-te
  • 1 篇 perez javier
  • 1 篇 nasrabadi nasser
  • 1 篇 spanakis emmanou...
  • 1 篇 uddin md. kamal
  • 1 篇 darrell trevor
  • 1 篇 nicolin alexandr...
  • 1 篇 leijon arne
  • 1 篇 kjellstroem hedv...
  • 1 篇 markopoulos ioan...
  • 1 篇 hasan mahmudul
  • 1 篇 narr christian
  • 1 篇 dawson jeremy
  • 1 篇 jahan meskat

语言

  • 11 篇 英文
  • 2 篇 其他
检索条件"主题词=Audio-visual recognition"
13 条 记 录,以下是1-10 订阅
排序:
MULTI-SCALE HYBRID FUSION NETWORK FOR MANDARIN audio-visual SPEECH recognition
MULTI-SCALE HYBRID FUSION NETWORK FOR MANDARIN AUDIO-VISUAL ...
收藏 引用
IEEE International Conference on Multimedia and Expo (ICME)
作者: Wang, Jinxin Guo, Zhongwen Yang, Chao Li, Xiaomei Cui, Ziyuan Ocean Univ China Fac Informat Sci & Engn Qingdao Peoples R China Univ Technol Sydney Sch Comp Sci Sydney Australia
Compared to feature or decision fusion, hybrid fusion can beneficially improve audio-visual speech recognition accuracy. Existing works are mainly prone to design the multi-modality feature extraction process, interac... 详细信息
来源: 评论
An End-to-End Mandarin audio-visual Speech recognition Model with a Feature Enhancement Module
An End-to-End Mandarin Audio-Visual Speech Recognition Model...
收藏 引用
2023 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2023
作者: Wang, Jinxin Yang, Chao Guo, Zhongwen Li, Xiaomei Wang, Weigang Ocean University of China Faculty of Information Science and Engineering Qingdao China School of Computer Science University of Technology Sydney Sydney Australia
Compared to relying only on audio information, incorporating visual information improves speech recognition accuracy in noisy environments. Existing works are prone to design specific architecture for feature extracti... 详细信息
来源: 评论
3D Convolutional Neural Networks for Cross audio-visual Matching recognition
收藏 引用
IEEE ACCESS 2017年 5卷 22081-22091页
作者: Torfi, Amirsina Iranmanesh, Seyed Mehdi Nasrabadi, Nasser Dawson, Jeremy West Virginia Univ Coll Engn & Mineral Resources Lane Dept Comp Sci & Elect Engn Morgantown WV 26506 USA
audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenario... 详细信息
来源: 评论
Multimodal Emotion recognition through Deep Fusion of audio-visual Data  26
Multimodal Emotion Recognition through Deep Fusion of Audio-...
收藏 引用
26th International Conference on Computer and Information Technology, ICCIT 2023
作者: Sultana, Tamanna Jahan, Meskat Uddin, Md. Kamal Kobayashi, Yoshinori Hasan, Mahmudul Comilla University Department of Computer Science and Engineering Cumilla Bangladesh NSTU Department of Computer Science & Telecommunication Engineering Noakhali Bangladesh Saitama University Interactive Systems Lab. Saitama Japan
The field of emotion recognition in artificial intelligence focuses on enabling machines to comprehend and react to the range of emotions experienced by humans. This paper presents a novel approach that integrates the... 详细信息
来源: 评论
Methodologies of audio-visual Biometric Performance Evaluation for the H2020 SpeechXRays Project  5
Methodologies of Audio-Visual Biometric Performance Evaluati...
收藏 引用
5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)
作者: Mtibaa, Aymen Hmani, Mohamed Amine Petrovska-Delacretaz, Dijana Boudy, Jerome Ben Hamida, Ahmed Bauzou, Claude Crucianu, Iacob Markopoulos, Ioannis Spanakis, Emmanouil Nicolin, Alexandru Narr, Christian Kockmann, Marcel Perez, Javier Inst Polytech Paris Telecom SudParis Paris France Sfax Univ Ecole Natl Ingenieurs Sfax ATMS Sfax Tunisia Fdn Res & Technol Hellas Inst Comp Sci Athens Greece Horia Hulubei Natl Inst Phys & Nucl Engn Magurele Romania IDEMIA Courbevoie France SIVECO Bucharest Romania FORTHNET Athens Greece LumenVox Berlin Germany
Biometric recognition is nowadays widely used in different services and applications, making the user authentication easier and more secure than the traditional authentication system. Starting from this idea, the EU S... 详细信息
来源: 评论
audio-visual recognition System in Compression Domain
收藏 引用
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2011年 第5期21卷 637-646页
作者: Wong, Yee Wan Seng, Kah Phooi Ang, Li-Minn Taylors Univ Lakeside Campus Selangor 47500 Darul Ehsan Malaysia Univ Nottingham Malaysia Campus Selangor 43500 Darul Ehsan Malaysia
This paper presents a highly efficient audio-visual recognition system in compression domain. For face recognition systems, the multiband feature fusion method selects the wavelet subbands that are invariant to illumi... 详细信息
来源: 评论
Amazigh audiovisual Speech recognition System Design
Amazigh Audiovisual Speech Recognition System Design
收藏 引用
Intelligent Systems and Computer Vision (ISCV)
作者: Addarrazi, Ilham Satori, Hassan Satori, Khalid USMBA FSDM Dept Math & Comp Sci Fes Morocco UMP FPN Dept Math & Comp Sci Nador Morocco
It is well known that speech recognition is a multimodal process which uses information not only from audio but also from vision. This paper describes our experience to design an audio visual speech recognition system... 详细信息
来源: 评论
MANDARIN audio-visual SPEECH recognition WITH EFFECTS TO THE NOISE AND EMOTION
收藏 引用
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL 2010年 第2期6卷 711-723页
作者: Pao, Tsang-Long Liao, Wen-Yuan Chen, Yu-Te Wu, Tsan-Nung Tatung Univ Dept Comp Sci & Engn Taipei 104 Taiwan DeLin Inst Technol Dept Comp Sci & Informat Engn Tucheng City 236 Taipei County Taiwan
This paper presents;a Mandarin audio-visual recognition system dealing with noisy and emotional speech signal. In the proposed approach, we extract the visual features of the lips. These features are very important to... 详细信息
来源: 评论
Human audio-visual Consonant recognition Analyzed with Three Bimodal Integration Models
Human Audio-Visual Consonant Recognition Analyzed with Three...
收藏 引用
10th INTERSPEECH 2009 Conference
作者: Ma, Zhanyu Leijon, Arne KTH Royal Inst Technol Sound & Image Proc Lab Stockholm Sweden
With A-V recordings. ten normal hearing people took recognition tests at different signal-to-noise ratios (SNR). The AV recognition results are predicted by the fuzzy logical model of perception (FLMP) and the post-la... 详细信息
来源: 评论
An audio-visual speech recognition with a new mandarin audio-visual database
An audio-visual speech recognition with a new mandarin audio...
收藏 引用
4th International Conference on Cybernetics and Information Technologies, Systems and Applications/5th Int Conf on Computing, Communications and Control Technologies
作者: Liao, Wen-Yuan Pao, Tsang-Long Chen, Yu-Te Chang, Tsun-Wei De Lin Inst Technol Dept Comp Sci & Engn Taipei Taiwan Tatung Univ Dept Comp Sci & Engn Taipei Taiwan
Automatic speech recognition(ASR) by machine has been a goal and an attractive research area for past several decades. In recent years, there has been growing attractive research topic for overcoming certain audio-onl... 详细信息
来源: 评论