Search query: Institution = "SKL Computer Architecture"
106 records; showing 1-10
iNUMAlloc: Towards Intelligent Memory Allocation for AI Accelerators with NUMA
21st IEEE International Symposium on Parallel and Distributed Processing with Applications, 13th IEEE International Conference on Big Data and Cloud Computing, 16th IEEE International Conference on Social Computing and Networking and 13th International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2023
Authors: Xu, Yuanchao; Qian, Ruyi; Wang, Yida; Huo, Qirun
Affiliations: Capital Normal University, College of Information Engineering, Beijing, China; SKL of Computer Architecture, Institute of Computing Technology, CAS, Beijing, China
The amazing success of deep neural network benefits from the rise of big data. As deep learning models are becoming more scale than ever before, their requirements for memory bandwidth are growing at a tremendous pace...
EagerReuse: An Efficient Memory Reuse Approach for Complex Computational Graph
29th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2023
Authors: Qian, Ruyi; Cao, Bojun; Gao, Mengjuan; Shi, Qinwen; Wang, Yida; Xu, Yuanchao; Huo, Qirun; Qiu, Keni
Affiliations: Capital Normal University, College of Information Engineering, Beijing, China; Institute of Computing Technology, CAS, SKL of Computer Architecture, Beijing, China
Memory reuse is a promising approach for deep neural network (DNN) to reduce memory consumption because it does not introduce any additional runtime overhead. We observe that existing memory reuse algorithms consider ...
HiStore: Rethinking Hybrid Index in RDMA-based Key-Value Store
arXiv, 2022
Authors: Han, Shukai; Zhang, Mi; Jiang, Dejun; Xiong, Jin
Affiliations: SKL Computer Architecture, ICT, CAS, China
RDMA (Remote Direct Memory Access) is widely exploited in building key-value stores to achieve ultra low latency. In RDMA-based key-value stores, the indexing time takes a large fraction (up to 74%) of the overall ope...
CDFGNN: a Systematic Design of Cache-based Distributed Full-Batch Graph Neural Network Training with Communication Reduction
arXiv, 2024
Authors: Zhang, Shuai; Jiang, Zite; You, Haihang
Affiliations: Meituan, Beijing, China; SKL Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Graph neural network training is mainly categorized into mini-batch and full-batch training methods. The mini-batch training method samples subgraphs from the original graph in each iteration. This sampling operation ...
NEURAL PROGRAM SYNTHESIS WITH QUERY
10th International Conference on Learning Representations, ICLR 2022
Authors: Huang, Di; Zhang, Rui; Hu, Xing; Zhang, Xishan; Jin, Pengwei; Li, Nan; Du, Zidong; Guo, Qi; Chen, Yunji
Affiliations: SKL of Computer Architecture, Institute of Computing Technology, CAS, China; University of Chinese Academy of Sciences, China; University of Science and Technology of China, China; Cambricon Technologies
Aiming to find a program satisfying the user intent given input-output examples, program synthesis has attracted increasing interest in the area of machine learning. Despite the promising performance of existing metho...
PR-Sketch: Monitoring per-key aggregation of streaming data with nearly full accuracy
47th International Conference on Very Large Data Bases, VLDB 2021
Authors: Sheng, Siyuan; Huang, Qun; Wang, Sa; Bao, Yungang
Affiliations: University of Chinese Academy of Sciences, SKL of Computer Architecture, ICT, CAS, China; Peking University, China
Computing per-key aggregation is indispensable in streaming data analysis formulated as two phases, an update phase and a recovery phase. As the size and speed of data streams rise, accurate per-key information is use...
A Transpose-free Three-dimensional FFT Algorithm on ARM CPUs
23rd IEEE International Conference on High Performance Computing and Communications, 7th IEEE International Conference on Data Science and Systems, 19th IEEE International Conference on Smart City and 7th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Applications, HPCC-DSS-SmartCity-DependSys 2021
Authors: Chen, Tun; Jia, Haipeng; Li, Zhihao; Li, Chendi; Zhang, Yunquan
Affiliations: SKL of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; Huawei Technologies Co. Ltd, Shenzhen, China
According to the traditional multi-dimensional FFT, memory layouts of high-dimensional data are discontinuous. Transposition is introduced to keep high-dimensional data continuous in memory. However, transposition inc...
Progressive Join Algorithms Considering User Preference
11th Annual Conference on Innovative Data Systems Research, CIDR 2021
Authors: Ding, Mengsu; Chen, Shimin; Makrynioti, Nantia; Manegold, Stefan
Affiliations: SKL of Computer Architecture, ICT, CAS, University of Chinese Academy of Sciences, China; CWI, Amsterdam, Netherlands
Progressive query processing is a new attractive paradigm for exploratory data analysis. This paper considers the case where users want to receive results ordered according to their preference, and specifically focuse...
Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations (extended version)
arXiv, 2022
Authors: Huang, Zichun; Chen, Shimin
Affiliations: SKL of Computer Architecture, ICT, CAS, University of Chinese Academy of Sciences, China
A Join-Project operation is a join operation followed by a duplicate eliminating projection operation. It is used in a large variety of applications, including entity matching, set analytics, and graph analytics. Prev...
EagerReuse: An Efficient Memory Reuse Approach for Complex Computational Graph
International Conference on Parallel and Distributed Systems (ICPADS)
Authors: Ruyi Qian; Bojun Cao; Mengjuan Gao; Qinwen Shi; Yida Wang; Yuanchao Xu; Qirun Huo; Keni Qiu
Affiliations: College of Information Engineering, Capital Normal University, Beijing, China; SKL of Computer Architecture, Institute of Computing Technology, CAS, Beijing, China
Memory reuse is a promising approach for deep neural network (DNN) to reduce memory consumption because it does not introduce any additional runtime overhead. We observe that existing memory reuse algorithms consider ...