咨询与建议

限定检索结果

文献类型

  • 237 篇 会议
  • 71 篇 期刊文献

馆藏范围

  • 308 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 170 篇 工学
    • 153 篇 计算机科学与技术...
    • 122 篇 软件工程
    • 20 篇 信息与通信工程
    • 20 篇 生物工程
    • 18 篇 控制科学与工程
    • 13 篇 机械工程
    • 12 篇 电子科学与技术(可...
    • 10 篇 电气工程
    • 7 篇 动力工程及工程热...
    • 7 篇 化学工程与技术
    • 7 篇 生物医学工程(可授...
    • 4 篇 力学(可授工学、理...
    • 3 篇 材料科学与工程(可...
    • 3 篇 交通运输工程
    • 3 篇 网络空间安全
  • 76 篇 理学
    • 43 篇 数学
    • 21 篇 生物学
    • 10 篇 化学
    • 9 篇 物理学
    • 7 篇 统计学(可授理学、...
    • 4 篇 系统科学
  • 53 篇 管理学
    • 33 篇 管理科学与工程(可...
    • 21 篇 图书情报与档案管...
    • 14 篇 工商管理
  • 5 篇 医学
    • 4 篇 基础医学(可授医学...
    • 4 篇 临床医学
    • 3 篇 药学(可授医学、理...
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 4 篇 法学
    • 4 篇 社会学
  • 3 篇 教育学
    • 3 篇 教育学
  • 2 篇 农学

主题

  • 30 篇 distributed comp...
  • 25 篇 concurrent compu...
  • 21 篇 laboratories
  • 13 篇 parallel process...
  • 10 篇 application soft...
  • 10 篇 data mining
  • 10 篇 computational mo...
  • 9 篇 computer science
  • 9 篇 grid computing
  • 9 篇 accuracy
  • 8 篇 routing
  • 8 篇 kernel
  • 8 篇 data models
  • 7 篇 java
  • 6 篇 runtime
  • 6 篇 scheduling algor...
  • 6 篇 computer archite...
  • 6 篇 neural networks
  • 6 篇 contracts
  • 6 篇 algorithm design...

机构

  • 21 篇 national key lab...
  • 15 篇 college of compu...
  • 13 篇 national laborat...
  • 12 篇 national laborat...
  • 11 篇 shanghai key lab...
  • 11 篇 national key lab...
  • 10 篇 national key lab...
  • 8 篇 john von neumann...
  • 7 篇 science and tech...
  • 7 篇 laboratory of di...
  • 6 篇 parallel and dis...
  • 6 篇 laboratory of pa...
  • 6 篇 mta sztaki labor...
  • 5 篇 key laboratory o...
  • 5 篇 óbuda university...
  • 5 篇 parallel and dis...
  • 5 篇 department of co...
  • 5 篇 national univers...
  • 5 篇 institute of par...
  • 5 篇 mta sztaki/labor...

作者

  • 20 篇 li kuan-ching
  • 20 篇 yang chao-tung
  • 15 篇 li dongsheng
  • 11 篇 huaimin wang
  • 10 篇 dongsheng li
  • 10 篇 chen haibo
  • 10 篇 v. chaudhary
  • 9 篇 gang yin
  • 9 篇 dou yong
  • 9 篇 ji wang
  • 9 篇 tao wang
  • 8 篇 wang yijie
  • 8 篇 zang binyu
  • 7 篇 guan haibing
  • 7 篇 lai zhiquan
  • 7 篇 qiao peng
  • 7 篇 huang zhen
  • 7 篇 yue yu
  • 6 篇 yijie wang
  • 6 篇 s. roy

语言

  • 298 篇 英文
  • 5 篇 其他
  • 5 篇 中文
检索条件"机构=Parallel Distributed Computing Laboratory"
308 条 记 录,以下是1-10 订阅
排序:
Automatic parallelism strategy generation with minimalmemory redundancy
收藏 引用
Frontiers of Information Technology & Electronic Engineering 2025年 第1期26卷 109-118页
作者: Yanqi SHI Peng LIANG Hao ZHENG Linbo QIAO Dongsheng LI National Key Laboratory of Parallel and Distributed Computing National University of Defense TechnologyChangsha 410000China
Large-scale deep learning models are trained distributedly due to memory and computing resource *** existing strategy generation approaches take optimal memory minimization as the *** fill in this gap,we propose a nov... 详细信息
来源: 评论
Training large-scale language models with limited GPU memory:a survey
收藏 引用
Frontiers of Information Technology & Electronic Engineering 2025年 第3期26卷 309-331页
作者: Yu TANG Linbo QIAO Lujia YIN Peng LIANG Ao SHEN Zhilin YANG Lizhi ZHANG Dongsheng LI National Key Laboratory of Parallel and Distributed Computing College of ComputerNational University of Defense TechnologyChangsha 410073China
Large-scale models have gained significant attention in a wide range of fields,such as computer vision and natural language processing,due to their effectiveness across various ***,a notable hurdle in training these l... 详细信息
来源: 评论
Exploring Quantization Techniques for Large-Scale Language Models: Methods, Challenges and Future Directions  24
Exploring Quantization Techniques for Large-Scale Language M...
收藏 引用
9th International Conference on Cyber Security and Information Engineering, ICCSIE 2024
作者: Shen, Ao Lai, Zhiquan Li, Dongsheng National Key Laboratory of Parallel and Distributed Computing National University of Defense Technology China
Breakthroughs in natural language processing (NLP) by large-scale language models (LLMs) have led to superior performance in multilingual tasks such as translation, summarization, and Q&A. However, the size and co... 详细信息
来源: 评论
Optimizing Fine-Tuning in Quantized Language Models:An In-Depth Analysis of Key Variables
收藏 引用
Computers, Materials & Continua 2025年 第1期82卷 307-325页
作者: Ao Shen Zhiquan Lai Dongsheng Li Xiaoyu Hu National Key Laboratory of Parallel and Distributed Computing National University of Defense TechnologyChangsha410073China Strategic Assessments and Consultation Institute Academy of Military ScienceBeijing100091China
Large-scale Language Models(LLMs)have achieved significant breakthroughs in Natural Language Processing(NLP),driven by the pre-training and fine-tuning *** this approach allows models to specialize in specific tasks w... 详细信息
来源: 评论
U-shaped Dual Attention Transformer: An Efficient Transformer Based on Channel and Spatial Attention  4
U-shaped Dual Attention Transformer: An Efficient Transforme...
收藏 引用
4th International Conference on Artificial Intelligence, Robotics, and Communication, ICAIRC 2024
作者: Zhai, Zhaoyuan Qiao, Peng Li, Rongchun Zhou, Zhen National University of Defense Technology National Key Laboratory of Parallel and Distributed Computing Changsha China
Transformer-based methods have demonstrated remarkable performance on image super-resolution tasks. Due to high computational complexity, researchers have been working to achieve a balance between computation costs an... 详细信息
来源: 评论
Funnel: An Efficient Sparse Attention Accelerator with Multi-Dataflow Fusion  22
Funnel: An Efficient Sparse Attention Accelerator with Multi...
收藏 引用
22nd IEEE International Symposium on parallel and distributed Processing with Applications, ISPA 2024
作者: Ma, Shenghong Xu, Jinwei Jiang, Jingfei Wang, Yaohua Li, Dongsheng National University of Defense Technology National Key Laboratory of Parallel and Distributed Computing College of Computer Changsha China
The self-attention mechanism is the core component of Transformer, which provides a powerful ability to understand the sequence context. However, the self-attention mechanism also suffers from a large amount of redund... 详细信息
来源: 评论
Mbapp: Efficient Memory-Balanced Pipeline parallelism for Large Model Fine-Tuning on Commodity GPU Servers  24
Mbapp: Efficient Memory-Balanced Pipeline Parallelism for La...
收藏 引用
5th International Conference on Computer Information and Big Data Applications, CIBDA 2024
作者: Liu, Yujie Lai, Zhiquan Li, Dongsheng National Key Laboratory of Parallel and Distributed Computing College of Computer National University of Defense Technology Changsha410000 China
Large-scale models have demonstrated outstanding performance across various downstream tasks. Pipeline parallelism is essential for fine-tuning large models on commodity GPU servers, as it plays a crucial role in maki... 详细信息
来源: 评论
Communication Analysis for Multidimensional parallel Training of Large-scale DNN Models  25
Communication Analysis for Multidimensional Parallel Trainin...
收藏 引用
25th IEEE International Conferences on High Performance computing and Communications, 9th International Conference on Data Science and Systems, 21st IEEE International Conference on Smart City and 9th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Applications, HPCC/DSS/SmartCity/DependSys 2023
作者: Lai, Zhiquan Hao, Yanqi Li, Shengwei Li, Dongsheng College of Computer National University of Defense Technology National Key Laboratory of Parallel and Distributed Computing Changsha China
Multidimensional parallel training has been widely applied to train large-scale deep learning models like GPT-3. The efficiency of parameter communication among training devices/processes is often the performance bott... 详细信息
来源: 评论
Efficient Large Models Fine-tuning on Commodity Servers via Memory-balanced Pipeline parallelism  25
Efficient Large Models Fine-tuning on Commodity Servers via ...
收藏 引用
25th IEEE International Conferences on High Performance computing and Communications, 9th International Conference on Data Science and Systems, 21st IEEE International Conference on Smart City and 9th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Applications, HPCC/DSS/SmartCity/DependSys 2023
作者: Liu, Yujie Lai, Zhiquan Liu, Weijie Wang, Wei Li, Dongsheng College of Computer National University of Defense Technology National Key Laboratory of Parallel and Distributed Computing Changsha China
Large models have achieved impressive performance in many downstream tasks. Using pipeline parallelism to fine-tune large models on commodity GPU servers is an important way to make the excellent performance of large ... 详细信息
来源: 评论
Rethinking the distributed DNN Training Cluster Design from the Cost-effectiveness View  25
Rethinking the Distributed DNN Training Cluster Design from ...
收藏 引用
25th IEEE International Conferences on High Performance computing and Communications, 9th International Conference on Data Science and Systems, 21st IEEE International Conference on Smart City and 9th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Applications, HPCC/DSS/SmartCity/DependSys 2023
作者: Lai, Zhiquan Liu, Yujie Wang, Wei Hao, Yanqi Li, Dongsheng College of Computer National University of Defense Technology National Key Laboratory of Parallel and Distributed Computing Changsha China
As deep learning grows rapidly, model training heavily relies on parallel methods and there exist numerous cluster configurations. However, current preferences for parallel training focus on data centers, overlooking ... 详细信息
来源: 评论