Search query: "Institution=Speech and Vision Laboratory Department of Computer Science and Engineering"
709 records; showing results 21-30

DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Kaijun Deng, Dezhi Zheng, Jindong Xie, Jinbao Wang, Weicheng Xie, Linlin Shen, Siyang Song
Affiliations: Computer Vision Institute School of Computer Science and Software Engineering Shenzhen University; National Engineering Laboratory for Big Data System Computing Technology Shenzhen University; Guangdong Provincial Key Laboratory of Intelligent Information Processing; Department of Computer Science University of Exeter
Accurately synthesizing talking face videos and capturing fine facial features for individuals with long hair presents a significant challenge. To tackle these challenges in existing methods, we propose a decomposed p...

Data-Driven Analysis of Skin Cancer Classification with Convolutional Neural Networks for E-Health Applications
2024 IEEE Global Communications Conference, GLOBECOM 2024
Authors: Ahmed, Imran; Ahmad, Misbah; Chehri, Abdellah; Jeon, Gwanggil
Affiliations: Anglia Ruskin University School of Computing and Information Science Cambridge United Kingdom; Hartpury University Animal and Agriculture Department Gloucester United Kingdom; University of the West of England Centre for Machine Vision Bristol Robotics Laboratory Bristol United Kingdom; Department of Mathematics and Computer Science Canada; Incheon National University Department of Embedded Systems Engineering Incheon Korea Republic of
This study explores the effectiveness of Convolutional Neural Networks (CNNs) in automatically classifying skin cancer for e-health applications. The trained model showcases impressive performance by leveraging the HA...

An Ensemble Approach to Multi-Class Classification of Vocal Disorders: Laryngocele and Vox Senilis
2024 IEEE International Conference on Intelligent Signal Processing and Effective Communication Technologies, INSPECT 2024
Authors: Bawa, Puneet; Kadyan, Virender; Mantri, Archana; Sethi, Monika
Affiliations: Chitkara University Institute of Engineering & Technology Chitkara University Centre of Excellence for Speech and Multimodal Laboratory Punjab India; Machine Intelligence Research Centre School of Computer Science UPES Uttarakhand Dehradun India; Anurag University Department of Electronics and Communication Engineering Hyderabad India; Chitkara University Institute of Engineering & Technology Chitkara University Punjab India
The classification of audio signals has been a significant challenge in machine learning, especially with regard to the early identification of voice disorders. However, traditional techniques based on raw audio featu...

Human Orientation Estimation Under Partial Observation
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Authors: Jieting Zhao, Hanjing Ye, Yu Zhan, Hao Luan, Hong Zhang
Affiliations: Department of Electronic and Electrical Engineering SUSTech; Shenzhen Key Laboratory of Robotics and Computer Vision Southern University of Science and Technology (SUSTech); Department of Computer Science School of Computing National University of Singapore
Reliable Human Orientation Estimation (HOE) from a monocular image is critical for autonomous agents to understand human intention. Significant progress has been made in HOE under full observation. However, the existi...

Inverse Kinematics of Robotic Manipulators Using a New Learning-by-Example Method
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Authors: Jacket Demby’s, Ramy Farag, Guilherme N. DeSouza
Affiliations: Department of Electrical Engineering and Computer Science (EECS) Vision-Guided and Intelligent Robotics (ViGIR) Laboratory University of Missouri-Columbia Columbia Missouri
Inverse Kinematics (IK) is one of the most fundamental challenges in robotics. It refers to the process of determining the joint configurations required to achieve the desired position and orientation (pose) of a robo...

Generative Adversarial Network-Based Voice Synthesis from Spectrograms for Low-Resource Speech Recognition in Mismatched Conditions
15th International Conference on Computing Communication and Networking Technologies, ICCCNT 2024
Authors: Bawa, Puneet; Kadyan, Virender; Chhabra, Gunjan
Affiliations: Chitkara University Institute of Engineering & Technology Chitkara University Centre of Excellence for Speech and Multimodal Laboratory Punjab India; University of Petroleum & Energy Studies Machine Intelligence Research Centre School of Computer Science Energy Acres Bidholi Uttarakhand Dehradun 248007 India; Department of Computer Science and Engineering Graphic Era Hill University Uttarakhand Dehradun 248007 India; Graphic Era Deemed to be University Uttarakhand Dehradun 248007 India
The use of Generative Adversarial Networks (GANs) has been increasing in speech recognition tasks but there has been significant hurdle due to limited availability. The use of GAN have shown promise in speech synthesi...

NORPPA: NOvel Ringed Seal Re-Identification by Pelage Pattern Aggregation
IEEE Winter Applications and Computer Vision Workshops (WACVW)
Authors: Ekaterina Nepovinnykh, Tuomas Eerola, Heikki Kälviäinen, Ilia Chelak
Affiliations: Department of Computational Engineering School of Engineering Sciences Computer Vision and Pattern Recognition Laboratory (CVPRL) Lappeenranta-Lahti University of Technology LUT Lappeenranta Finland; Department of Computer Science Faculty of Science University of Helsinki Helsinki Finland
We propose a method for Saimaa ringed seal (Pusa hispida saimensis) re-identification. Access to large image volumes through camera trapping and crowdsourcing provides novel possibilities for animal conservation and m...

MULTIMODALITY HELPS FEW-SHOT 3D POINT CLOUD SEMANTIC SEGMENTATION
arXiv, 2024
Authors: An, Zhaochong; Sun, Guolei; Liu, Yun; Li, Runjia; Wu, Min; Cheng, Ming-Ming; Konukoglu, Ender; Belongie, Serge
Affiliations: Department of Computer Science University of Copenhagen Denmark; Computer Vision Laboratory ETH Zurich Switzerland; College of Computer Science Nankai University China; Department of Engineering Science University of Oxford United Kingdom; Institute for Infocomm Research A*STAR Singapore
Few-shot 3D point cloud segmentation (FS-PCS) aims at generalizing models to segment novel categories with minimal annotated support samples. While existing FS-PCS methods have shown promise, they primarily focus on u...

HGDiffuser: Efficient Task-Oriented Grasp Generation via Human-Guided Grasp Diffusion Models
arXiv, 2025
Authors: Huang, Dehao; Dong, Wenlong; Tang, Chao; Zhang, Hong
Affiliations: Shenzhen Key Laboratory of Robotics and Computer Vision Southern University of Science and Technology Shenzhen China; Department of Electronic and Electrical Engineering Southern University of Science and Technology Shenzhen China
Task-oriented grasping (TOG) is essential for robots to perform manipulation tasks, requiring grasps that are both stable and compliant with task-specific constraints. Humans naturally grasp objects in a task-oriented...

Zero-Shot Audio Captioning Using Soft and Hard Prompts
IEEE Transactions on Audio, Speech and Language Processing, 2025, vol. 33, pp. 2045-2058
Authors: Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma
Affiliations: Pattern Recognition and Intelligent System Laboratory School of Artificial Intelligence Beijing University of Posts and Telecommunications Beijing China; Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China; Centre for Vision Speech and Signal Processing University of Surrey Guildford U.K.; Department of Electronic Systems Aalborg University Aalborg Denmark
In traditional audio captioning methods, a model is usually trained in a fully supervised manner using a human-annotated dataset containing audio-text pairs and then evaluated on the test set from the same dataset. Su...