咨询与建议

限定检索结果

文献类型

  • 438 篇 期刊文献
  • 256 篇 会议

馆藏范围

  • 694 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 544 篇 工学
    • 432 篇 计算机科学与技术...
    • 154 篇 软件工程
    • 137 篇 电气工程
    • 51 篇 信息与通信工程
    • 46 篇 生物医学工程(可授...
    • 29 篇 生物工程
    • 23 篇 控制科学与工程
    • 14 篇 光学工程
    • 11 篇 电子科学与技术(可...
    • 10 篇 动力工程及工程热...
    • 10 篇 测绘科学与技术
    • 10 篇 化学工程与技术
    • 7 篇 建筑学
    • 7 篇 土木工程
  • 177 篇 理学
    • 61 篇 生物学
    • 54 篇 物理学
    • 53 篇 数学
    • 31 篇 统计学(可授理学、...
    • 15 篇 化学
    • 8 篇 地球物理学
    • 8 篇 系统科学
    • 5 篇 地理学
  • 110 篇 医学
    • 63 篇 临床医学
    • 34 篇 基础医学(可授医学...
    • 28 篇 特种医学
  • 47 篇 管理学
    • 28 篇 管理科学与工程(可...
    • 16 篇 图书情报与档案管...
  • 15 篇 教育学
    • 13 篇 教育学
  • 15 篇 文学
    • 15 篇 外国语言文学
  • 10 篇 法学
    • 9 篇 社会学
  • 9 篇 农学
  • 2 篇 经济学

主题

  • 43 篇 deep learning
  • 36 篇 training
  • 27 篇 task analysis
  • 27 篇 feature extracti...
  • 25 篇 semantics
  • 22 篇 computational mo...
  • 19 篇 machine learning
  • 16 篇 visualization
  • 14 篇 deep neural netw...
  • 13 篇 object detection
  • 13 篇 three-dimensiona...
  • 13 篇 transformers
  • 12 篇 image segmentati...
  • 12 篇 self-supervised ...
  • 12 篇 neural networks
  • 12 篇 computer vision
  • 10 篇 contrastive lear...
  • 10 篇 data models
  • 10 篇 robustness
  • 9 篇 reinforcement le...

机构

  • 55 篇 tencent ai lab
  • 53 篇 tencent ai lab p...
  • 50 篇 shenzhen researc...
  • 33 篇 shenzhen res ins...
  • 30 篇 beijing key lab ...
  • 26 篇 the chinese univ...
  • 25 篇 gaoling school o...
  • 25 篇 beijing key labo...
  • 25 篇 renmin univ chin...
  • 24 篇 shenzhen univ na...
  • 23 篇 shanghai ai lab ...
  • 19 篇 chinese univ hon...
  • 14 篇 south china univ...
  • 14 篇 shenzhen researc...
  • 14 篇 shenzhen univ co...
  • 14 篇 peng cheng lab p...
  • 14 篇 school of data s...
  • 13 篇 tsinghua univers...
  • 13 篇 key laboratory o...
  • 13 篇 pazhou laborator...

作者

  • 36 篇 wang shuai
  • 34 篇 wu baoyuan
  • 33 篇 li chongxuan
  • 25 篇 li haizhou
  • 25 篇 zhu jun
  • 25 篇 tan mingkui
  • 24 篇 zhao peilin
  • 23 篇 bao fan
  • 23 篇 fan yanbo
  • 23 篇 zhang yong
  • 20 篇 liu qi
  • 19 篇 shen linlin
  • 18 篇 chen enhong
  • 17 篇 huang wenbing
  • 16 篇 wang jue
  • 16 篇 qian yanmin
  • 15 篇 zhou jie
  • 15 篇 li zhen
  • 15 篇 huang junzhou
  • 14 篇 gan chuang

语言

  • 660 篇 英文
  • 30 篇 其他
检索条件"机构=Big Data and AI Lab"
694 条 记 录,以下是11-20 订阅
排序:
AMPHION: AN OPEN-SOURCE AUDIO, MUSIC, AND SPEECH GENERATION TOOLKIT
AMPHION: AN OPEN-SOURCE AUDIO, MUSIC, AND SPEECH GENERATION ...
收藏 引用
2024 Spoken Language Technology Workshop
作者: Zhang, Xueyao Xue, Liumeng Gu, Yicheng Wang, Yuancheng Li, Jiaqi He, Haorui Wang, Chaoren Liu, Songting Chen, Xi Zhang, Junan Fang, Zihao Chen, Haopeng Tang, Tze Ying Zou, Lexiao Wang, Mingxuan Han, Jun Chen, Kai Li, Haizhou Wu, Zhizheng Chinese Univ Hong Kong Shenzhen Peoples R China Shanghai AI Lab Shanghai Peoples R China Shenzhen Reseach Inst Big Data Shenzhen Peoples R China
Amphion is an open-source toolkit for Audio, Music, and Speech Generation, targeting to ease the way for junior researchers and engineers into these fields. It presents a unified framework that includes diverse genera... 详细信息
来源: 评论
ControlVideo: conditional control for one-shot text-driven video editing and beyond
收藏 引用
Science China(Information Sciences) 2025年 第3期68卷 150-162页
作者: Min ZHAO Rongzhen WANG Fan BAO Chongxuan LI Jun ZHU Department of Computer Science and Technology Institute for AI Tsinghua-Bosch Joint ML CenterTsinghua Laboratory of Brain and Intelligence Lab Tsinghua University ShengShu Technology Gaoling School of Artificial Intelligence Renmin University of China Beijing Key Laboratory of Big Data Management and Analysis Methods Pazhou Laboratory (Huangpu)
This paper presents ControlVideo for text-driven video editing — generating a video that aligns with a given text while preserving the structure of the source video. Building on a pre-trained text-to-image diffusion ... 详细信息
来源: 评论
Transformer-Based Visual Segmentation: A Survey
收藏 引用
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024年 第12期46卷 10138-10163页
作者: Li, Xiangtai Ding, Henghui Yuan, Haobo Zhang, Wenwei Pang, Jiangmiao Cheng, Guangliang Chen, Kai Liu, Ziwei Loy, Chen Change Nanyang Technol Univ S Lab Singapore 639798 Singapore Fudan Univ Inst Big Data Shanghai 200437 Peoples R China Shanghai AI Lab Shanghai 200240 Peoples R China Univ Liverpool Liverpool L69 7ZX Merseyside England
Visual segmentation seeks to partition images, video frames, or point clouds into multiple segments or groups. This technique has numerous real-world applications, such as autonomous driving, image editing, robot sens... 详细信息
来源: 评论
Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training  14
Low-Resourced Speech Recognition for Iu Mien Language via We...
收藏 引用
14th International Symposium on Chinese Spoken Language Processing
作者: Dong, Lukuan Qin, Donghong Bai, Fengbo Song, Fanhua Liu, Yan Xu, Chen Ou, Zhijian Guangxi Minzu Univ Sch Artificial Intelligence AI & Big Data Int Cooperat Joint Lab Nanning Peoples R China Tsinghua Univ Speech Proc & Machine Intelligence SPMI Lab Beijing Peoples R China
The mainstream automatic speech recognition (ASR) technology usually requires hundreds to thousands of hours of annotated speech data. Three approaches to low-resourced ASR are phoneme or subword based supervised pre-... 详细信息
来源: 评论
USED: Universal Speaker Extraction and Diarization
收藏 引用
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2025年 33卷 96-110页
作者: Ao, Junyi Yldrm, Mehmet Sinan Tao, Ruijie Ge, Meng Wang, Shuai Qian, Yanmin Li, Haizhou Chinese Univ Hong Kong Shenzhen Res Inst Big Data Sch Data Sci Shenzhen 518172 Peoples R China Natl Univ Singapore Dept Elect & Comp Engn Singapore 119077 Singapore Natl Univ Singapore Saw Swee Hock Sch Publ Hlth Singapore 117549 Singapore Shenzhen Res Inst Big Data Shenzhen 518172 Peoples R China Shanghai Jiao Tong Univ Auditory Cognit & Computat Acoust Lab AI Inst Dept Comp Sci & Engn Shanghai 200240 Peoples R China Shanghai Jiao Tong Univ AI Inst MoE Key Lab Artificial Intelligence Shanghai 200240 Peoples R China
Speaker extraction and diarization are two enabling techniques for real-world speech applications. Speaker extraction aims to extract a target speaker's voice from a speech mixture, while speaker diarization demar... 详细信息
来源: 评论
A survey on cross-user federated recommendation
收藏 引用
Science China(Information Sciences) 2025年 第4期68卷 7-32页
作者: Enyue YANG Yudi XIONG Wei YUAN Weike PAN Qiang YANG Zhong MING College of Computer Science and Software Engineering Shenzhen University School of Electrical Engineering and Computer Science The University of Queensland WeBank AI Lab WeBank Department of Computer Science and Engineering Hong Kong University of Science and Technology College of Big Data and Internet Shenzhen Technology University Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)
Recommender systems are effective in mitigating information overload, yet the centralized storage of user data raises significant privacy concerns. Cross-user federated recommendation(CUFR) provides a promising distri... 详细信息
来源: 评论
SaliencyMix plus : Noise-Minimized Image Mixing Method With Saliency Map in data Augmentation
收藏 引用
IEEE ACCESS 2025年 13卷 21734-21743页
作者: Lee, Hajeong Jin, Zhixiong Woo, Jiyoung Noh, Byeongjoon Soonchunhyang Univ Dept AI & Big Data Asan 31538 South Korea Univ Gustave Eiffel ENTPE LICIT ECO7 F-69500 Lyon France Ecole Polytech Fed Lausanne EPFL Urban Transport Syst Lab LUTS CH-1015 Ecublens Switzerland
data augmentation is vital in deep learning for enhancing model robustness by artificially expanding training datasets. However, advanced methods like CutMix blend images and assign labels based on pixel ratios, often... 详细信息
来源: 评论
Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents
收藏 引用
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2023年 31卷 1916-1926页
作者: Zhou, Yi Wu, Zhizheng Tian, Xiaohai Li, Haizhou Natl Univ Singapore Dept Elect & Comp Engn Singapore 119077 Singapore Chinese Univ Hong Kong Shenzhen Res Inst Big Data Sch Data Sci Shenzhen 518172 Peoples R China Bytedance AI lab Speech & Audio Dept Singapore 569933 Singapore
Cross-lingual voice conversion (XVC) transforms the speaker identity of a source speaker to that of a target speaker who speaks a different language. Due to the intrinsic differences between languages, the converted s... 详细信息
来源: 评论
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
DocReal: Robust Document Dewarping of Real-Life Images via A...
收藏 引用
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
作者: Yu, Fangchen Xie, Yina Wu, Lei Wen, Yafei Wang, Guozhi Ren, Shuai Chen, Xiaoxin Mao, Jianfeng Li, Wenye Chinese Univ Hong Kong Shenzhen Peoples R China Vivo AI Lab Shenzhen Peoples R China Shenzhen Res Inst Big Data Shenzhen Peoples R China
Document image dewarping is a crucial task in computer vision with numerous practical applications. The control point method, as a popular image dewarping approach, has attracted attention due to its simplicity and ef... 详细信息
来源: 评论
Understanding adversarial robustness against on-manifold adversarial examples
收藏 引用
PATTERN RECOGNITION 2025年 159卷
作者: Xiao, Jiancong Yang, Liusha Fan, Yanbo Wang, Jue Luo, Zhi-Quan Chinese Univ Hong Kong Shenzhen 518172 Peoples R China Shenzhen Res Inst Big Data Shenzhen 518172 Peoples R China Tencent AI Lab Shenzhen 518063 Peoples R China Univ Penn Philadelphia PA USA Shenzhen Technol Univ Shenzhen Peoples R China Ant Res Shenzhen Peoples R China Dzine AI Kortrijk Belgium
Deep neural networks (DNNs) are shown to be vulnerable to adversarial examples. A well-trained model can be easily attacked by adding small perturbations to the original data. One of the hypotheses of the existence of... 详细信息
来源: 评论