Refine search results

Document type

  • 846 conference papers
  • 296 journal articles
  • 3 books

Collection scope

  • 1,145 electronic documents
  • 0 print holdings

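Subject classification

  • 726 Engineering
    • 571 Computer Science and Technology...
    • 417 Software Engineering
    • 127 Information and Communication Engineering
    • 78 Electronic Science and Technology (may...
    • 73 Control Science and Engineering
    • 55 Bioengineering
    • 54 Mechanical Engineering
    • 38 Electrical Engineering
    • 25 Power Engineering and Engineering Thermo...
    • 24 Instrument Science and Technology
    • 21 Chemical Engineering and Technology
    • 18 Materials Science and Engineering (may...
    • 14 Civil Engineering
    • 14 Cyberspace Security
    • 12 Mechanics (may be conferred in Engineering, Sci...
    • 12 Architecture
    • 12 Agricultural Engineering
    • 11 Transportation Engineering
    • 11 Environmental Science and Engineering (may...
  • 279 Science
    • 176 Mathematics
    • 58 Biology
    • 47 Physics
    • 39 Statistics (may be conferred in Science, ...
    • 38 Systems Science
    • 22 Chemistry
  • 185 Management
    • 124 Management Science and Engineering (may...
    • 64 Library, Information and Archives Manage...
    • 37 Business Administration
  • 18 Law
    • 15 Sociology
  • 17 Economics
    • 17 Applied Economics
  • 13 Agronomy
  • 9 Education
  • 7 Medicine
  • 3 Literature
  • 3 Military Science
  • 2 Art Studies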

Topics

  • 46 distributed proc...
  • 44 laboratories
  • 42 computational mo...
  • 31 kernel
  • 29 algorithm design...
  • 28 concurrent compu...
  • 28 computer archite...
  • 27 graphics process...
  • 27 benchmark testin...
  • 25 fault tolerance
  • 25 hardware
  • 24 feature extracti...
  • 23 throughput
  • 23 semantics
  • 22 servers
  • 22 cloud computing
  • 21 parallel process...
  • 21 protocols
  • 21 training
  • 20 deep learning

Institution

  • 169 national laborat...
  • 134 science and tech...
  • 103 college of compu...
  • 89 national laborat...
  • 81 national laborat...
  • 38 school of comput...
  • 38 national laborat...
  • 29 national key lab...
  • 22 science and tech...
  • 22 national key lab...
  • 21 national key lab...
  • 18 national laborat...
  • 16 national laborat...
  • 16 laboratory of di...
  • 15 national univers...
  • 14 national laborat...
  • 14 national key lab...
  • 13 national key lab...
  • 13 school of comput...
  • 12 national laborat...

Author

  • 44 wang huaimin
  • 42 yong dou
  • 40 li dongsheng
  • 38 dou yong
  • 38 liu jie
  • 37 wang ji
  • 36 dongsheng li
  • 36 huaimin wang
  • 35 ji wang
  • 31 jie liu
  • 30 yijie wang
  • 30 wang yijie
  • 29 xiaodong wang
  • 27 yin gang
  • 26 peng yuxing
  • 25 yuxing peng
  • 25 gang yin
  • 22 tao wang
  • 21 zhigang luo
  • 21 xicheng lu

Language

  • 1,072 English
  • 63 Chinese
  • 10 other
Search query: "Institution=National Key Laboratory of Parallel and Distributed Processing"
1,145 records; showing 1-10
Automatic parallelism strategy generation with minimal memory redundancy
Frontiers of Information Technology & Electronic Engineering, 2025, Vol. 26, No. 1, pp. 109-118
Authors: Yanqi SHI, Peng LIANG, Hao ZHENG, Linbo QIAO, Dongsheng LI (National Key Laboratory of Parallel and Distributed Computing, National University of Defense Technology, Changsha 410000, China)
Large-scale deep learning models are trained distributedly due to memory and computing resource *** existing strategy generation approaches take optimal memory minimization as the *** fill in this gap, we propose a nov...
Training large-scale language models with limited GPU memory: a survey
Frontiers of Information Technology & Electronic Engineering, 2025, Vol. 26, No. 3, pp. 309-331
Authors: Yu TANG, Linbo QIAO, Lujia YIN, Peng LIANG, Ao SHEN, Zhilin YANG, Lizhi ZHANG, Dongsheng LI (National Key Laboratory of Parallel and Distributed Computing, College of Computer, National University of Defense Technology, Changsha 410073, China)
Large-scale models have gained significant attention in a wide range of fields, such as computer vision and natural language processing, due to their effectiveness across various ***, a notable hurdle in training these l...
An intelligent mesh-smoothing method with graph neural networks
Frontiers of Information Technology & Electronic Engineering, 2025, Vol. 26, No. 3, pp. 367-384
Authors: Zhichao WANG, Xinhai CHEN, Junjun YAN, Jie LIU (Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology, Changsha 410073, China; Laboratory of Digitizing Software for Frontier Equipment, National University of Defense Technology, Changsha 410073, China)
In computational fluid dynamics (CFD), mesh-smoothing methods are widely used to refine the mesh quality for achieving high-precision numerical ***, optimization-based smoothing is used for high-quality mesh smoothing, bu...
Optimizing Fine-Tuning in Quantized Language Models: An In-Depth Analysis of Key Variables
Computers, Materials & Continua, 2025, Vol. 82, No. 1, pp. 307-325
Authors: Ao Shen, Zhiquan Lai, Dongsheng Li, Xiaoyu Hu (National Key Laboratory of Parallel and Distributed Computing, National University of Defense Technology, Changsha 410073, China; Strategic Assessments and Consultation Institute, Academy of Military Science, Beijing 100091, China)
Large-scale Language Models (LLMs) have achieved significant breakthroughs in Natural Language Processing (NLP), driven by the pre-training and fine-tuning *** this approach allows models to specialize in specific tasks w...
FMCC-RT: a scalable and fine-grained all-reduce algorithm for large-scale SMP clusters
Science China (Information Sciences), 2025, Vol. 68, No. 5, pp. 362-379
Authors: Jintao PENG, Jie LIU, Jianbin FANG, Min XIE, Yi DAI, Zhiquan LAI, Bo YANG, Chunye GONG, Xinjun MAO, Guo MAO, Jie REN (School of Computer Science and Technology, National University of Defense Technology; Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology; Laboratory of Digitizing Software for Frontier Equipment, National University of Defense Technology; National Supercomputer Center in Tianjin; School of Computer Science, Shaanxi Normal University)
All-reduce is a widely used communication technique for distributed and parallel applications, typically implemented using either a tree-based or ring-based scheme. Each of these approaches has its own limitations: tre...
LSSM-SpMM: A Long-Row Splitting and Short-Row Merging Approach for Parallel SpMM on PEZY-SC3s
24th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2024
Authors: Cao, Ligang; Wang, Qinglin; Yang, Shun; Xia, Rui; Guo, Weihao; Liu, Jie (Laboratory of Digitizing Software for Frontier Equipment, National University of Defense Technology, Changsha 410073, China; National Key Laboratory of Parallel and Distributed Computing, National University of Defense Technology, Changsha 410073, China)
Sparse Matrix-Dense Matrix Multiplication (SpMM) is a crucial kernel used in a wide range of fields including machine learning and linear algebra solvers. Thus, enhancing the performance of SpMM is essential. The unev...
AFMA-Track: Adaptive Fusion of Motion and Appearance for Robust Multi-object Tracking
27th International Conference on Pattern Recognition, ICPR 2024
Authors: Liao, Wei; Luo, Lei; Zhang, Chunyuan (College of Computer Science and Technology, National University of Defence Technology, Changsha, China; Science and Technology on Parallel and Distributed Processing Laboratory, College of Computer Science and Technology, National University of Defense Technology, Changsha, China)
Motion and appearance cues play a crucial role in Multi-object Tracking (MOT) algorithms for associating objects across consecutive frames. While most MOT methods prioritize accurate motion modeling and distincti...
Comprehensive Deadlock Prevention for GPU Collective Communication
20th European Conference on Computer Systems, EuroSys 2025, co-located 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 2025
Authors: Pan, Lichen; Liu, Juncheng; Fu, Yongquan; Yuan, Jinhui; Zhang, Rongkai; Li, Pengze; Xiao, Zhen (School of Computer Science, Peking University, China; OneFlow Research, China; National Key Laboratory of Parallel and Distributed Computing, College of Computer Science and Technology, National University of Defense Technology, China)
Distributed deep neural network training necessitates efficient GPU collective communications, which are inherently susceptible to deadlocks. GPU collective deadlocks arise easily in distributed deep learning applicat...
YFLM: An Improved Levenberg-Marquardt Algorithm for Global Bundle Adjustment
41st Computer Graphics International Conference, CGI 2024
Authors: Peng, Jiaxin; Li, Tao; Jiang, Qin; Liu, Jie; Wang, Ruibo (Laboratory of Software Engineering for Complex Systems, School of Computer Science, National University of Defense Technology, Changsha 410073, Hunan, China; Parallel and Distributed Processing Laboratory, School of Computer Science, National University of Defense Technology, Changsha 410073, Hunan, China)
The conventional Levenberg-Marquardt (LM) algorithm is a state-of-the-art trust-region optimization method for solving bundle adjustment problems in the Structure-from-Motion community, which not only takes advantage ...
MARO: Enabling Full MPI Automatic Refactoring in DSL-Based Programming Framework
24th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2024
Authors: Lei, Tong; Chen, Zongjing; Che, Yonggang; Xu, Chuanfu (Laboratory of Digitizing Software for Frontier Equipment, National University of Defense Technology, Changsha 410073, China; National Key Laboratory of Parallel and Distributed Computing, College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China)
Currently, the landscape of computer hardware architecture presents the characteristics of heterogeneity and diversity, prompting widespread attention to cross-platform portable parallel programming techniques. Most e...