Search query: Institution = "Speech and Vision Laboratory Department of Computer Science and Engineering"
709 records; showing 121-130
Unpaired Overwater Image Defogging Using Prior Map Guided Cycle-Consistent Generative Adversarial Network
SSRN, 2024
Authors: Mo, Yaozong; Li, Chaofeng; Ren, Wenqi; Wang, Wenwu; Wu, Xiao-Jun
Affiliations: Shanghai 201306 China; School of Cyber Science and Technology Sun Yat-sen University Shenzhen 518000 China; Center for Vision Speech and Signal Processing Department of Electrical and Electronic Engineering University of Surrey Surrey Surrey GU2 7XH United Kingdom; School of Artificial Intelligence and Computer Science Jiangnan University Wuxi 214122 China
Existing image defogging approaches have made significant advancements. But their effectiveness in addressing overwater foggy images remains limited. Current methods are predominantly optimized for land scenes, which ...
ZCS-CDiff: A Zero-Shot Code-Switching TTS System with Conformer-Based Diffusion Model
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Ke Chen; Zhihua Huang; Liang He; Yonghong Yan
Affiliations: School of Computer Science and Technology Xinjiang University Urumqi China; Xinjiang Key Laboratory of Signal Detection and Processing Urumqi China; Department of Electronic Engineering Tsinghua University Beijing China; University of Chinese Academy of Sciences Beijing China; Key Laboratory of Speech Acoustics and Content Understanding Institute of Acoustics CAS Beijing China
Code-Switching (CS) Text-To-Speech (TTS) models have gained attention due to the increasing prevalence of multilingual communication. However, existing models struggle to meet the growing demand for personalized CS TT...
Unified Domain Adaptive Semantic Segmentation
arXiv, 2023
Authors: Zhang, Zhe; Wu, Gaochang; Zhang, Jing; Zhu, Xiatian; Tao, Dacheng; Chai, Tianyou
Affiliations: The State Key Laboratory of Synthetical Automation for Process Industries Northeastern University Shenyang China; The Surrey Institute for People-Centred Artificial Intelligence Centre for Vision Speech and Signal Processing University of Surrey Guildford United Kingdom; The School of Computer Science Faculty of Engineering University of Sydney Sydney Australia
Unsupervised Domain Adaptive Semantic Segmentation (UDA-SS) aims to transfer the supervision from a labeled source domain to an unlabeled and shifted target domain. The majority of existing UDA-SS works typically cons...
Enhanced multi-stage network for defocus deblurring using dual-pixel images
13th International Conference on Signal Processing Systems, ICSPS 2021
Authors: Li, Ru; Xie, Junwei; Xue, Yuyang; Zou, Wenbin; Tong, Tong; Luo, Ming; Gao, Qinquan
Affiliations: College of Physics and Information Engineering Fuzhou University China; Fujian Key Lab of Medical Instrumentation & Pharmaceutical Technology Fuzhou University China; Imperial Vision Technology Fujian China; Department of Computer Science University of Tsukuba Tsukuba Japan; Fujian Provincial Key Laboratory of Photonics Technology Fujian Normal University China
The defocus deblurring raised from the finite aperture size and exposure time is an essential problem in the shooting process, which seriously affects the quality of the images. However, studies based on defocus deblu...
Deep Neural Decision Forest for Acoustic Scene Classification
European Signal Processing Conference (EUSIPCO)
Authors: Jianyuan Sun; Xubo Liu; Xinhao Mei; Jinzheng Zhao; Mark D. Plumbley; Volkan Kılıç; Wenwu Wang
Affiliations: Centre for Vision Speech and Signal Processing (CVSSP) University of Surrey UK; College of Computer Science and Technology Qingdao University China; Department of Electrical and Electronics Engineering Izmir Katip Celebi University Turkey
Acoustic scene classification (ASC) aims to classify an audio clip based on the characteristic of the recording environment. In this regard, deep learning based approaches have emerged as a useful tool for ASC problem...
Riemannian Self-Attention Mechanism for SPD Networks
arXiv, 2023
Authors: Wang, Rui; Wu, Xiao-Jun; Li, Hui; Kittler, Josef
Affiliations: School of Artificial Intelligence and Computer Science Jiangnan University Wuxi 214122 China; Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence Jiangnan University China; Centre for Vision Speech and Signal Processing University of Surrey Guildford GU2 7XH United Kingdom
Symmetric positive definite (SPD) matrix has been demonstrated to be an effective feature descriptor in many scientific areas, as it can encode spatiotemporal statistics of the data adequately on a curved Riemannian m...
Subspace Gaussian Mixture Modeling for low-resource non-native Punjabi Language Speech Recognition
6th International Conference on Futuristic Trends in Networks and Computing Technologies, FTNCT 2024
Authors: Bawa, Puneet; Kadyan, Virender; Chhabra, Gunjan; Chopra, Ashish
Affiliations: Centre of Excellence for Speech and Multimodal Laboratory Chitkara University Institute of Engineering and Technology Chitkara University Punjab India; Machine Intelligence Research Centre School of Computer Science UPES Energy Acres Bidholi Uttarakhand Dehradun 248007 India; Department of CSE Graphic Era Hill University Graphic Era Deemed to Be University Uttarakhand Dehradun 248007 India; Department of Computer Science and Applications Seth Jai Parkash Mukand Lal Institute of Engineering and Technology Haryana Radaur India
The advancement of non-native recognition of speech is becoming more significant as individual research interest in communicating with various languages has developed. However, because of the limited ability to includ...
Joint Design of Radar Receive Filter and Unimodular ISAC Waveform with Sidelobe Level Control
arXiv, 2025
Authors: Zhang, Kecheng; Liu, Ya-Feng; Wang, Zhongbin; Yuan, Weijie; Keskin, Musa Furkan; Wymeersch, Henk; Xia, Shuqiang
Affiliations: School of System Design and Intelligent Manufacturing The Shenzhen Key Laboratory of Robotics and Computer Vision Southern University of Science and Technology Shenzhen 518055 China; State Key Laboratory of Scientific and Engineering Computing Institute of Computational Mathematics and Scientific/Engineering Computing Academy of Mathematics and Systems Science Chinese Academy of Sciences Beijing 100190 China; Department of Electrical Engineering Chalmers University of Technology Gothenburg 41296 Sweden; ZTE Corporation The State Key Laboratory of Mobile Network and Mobile Multimedia Technology Shenzhen 518055 China
Integrated sensing and communication (ISAC) has been considered a key feature of next-generation wireless networks. This paper investigates the joint design of the radar receive filter and dual-functional transmit wav...
FIRST-SHOT UNSUPERVISED ANOMALOUS SOUND DETECTION WITH UNKNOWN ANOMALIES ESTIMATED BY METADATA-ASSISTED AUDIO GENERATION
arXiv, 2023
Authors: Zhang, Hejing; Zhu, Qiaoxi; Guan, Jian; Liu, Haohe; Xiao, Feiyang; Tian, Jiantong; Mei, Xinhao; Liu, Xubo; Wang, Wenwu
Affiliations: Group of Intelligent Signal Processing College of Computer Science and Technology Harbin Engineering University Harbin China; National Engineering Laboratory for Modeling and Emulation in E-Government Harbin Engineering University Harbin China; Centre for Audio Acoustics and Vibration University of Technology Sydney Ultimo Australia; Centre for Vision Speech and Signal Processing University of Surrey Guildford United Kingdom
First-shot (FS) unsupervised anomalous sound detection (ASD) is a brand-new task introduced in DCASE 2023 Challenge Task 2, where the anomalous sounds for the target machine types are unseen in training. Existing meth...
GraspGPT: Leveraging Semantic Knowledge from a Large Language Model for Task-Oriented Grasping
arXiv, 2023
Authors: Tang, Chao; Huang, Dehao; Ge, Wenqi; Liu, Weiyu; Zhang, Hong
Affiliations: Shenzhen Key Laboratory of Robotics and Computer Vision Southern University of Science and Technology Shenzhen China; Department of Electronic and Electrical Engineering Southern University of Science and Technology Shenzhen China; Institute for Robotics and Intelligent Machines Georgia Institute of Technology Atlanta United States
Task-oriented grasping (TOG) refers to the problem of predicting grasps on an object that enable subsequent manipulation tasks. To model the complex relationships between objects, tasks, and grasps, existing methods i...