咨询与建议

限定检索结果

文献类型

  • 1,740 篇 会议
  • 23 篇 期刊文献
  • 7 册 图书

馆藏范围

  • 1,770 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 1,209 篇 工学
    • 1,099 篇 计算机科学与技术...
    • 696 篇 软件工程
    • 221 篇 信息与通信工程
    • 163 篇 电气工程
    • 122 篇 电子科学与技术(可...
    • 97 篇 控制科学与工程
    • 82 篇 生物工程
    • 61 篇 动力工程及工程热...
    • 49 篇 机械工程
    • 46 篇 生物医学工程(可授...
    • 31 篇 交通运输工程
    • 30 篇 光学工程
    • 29 篇 仪器科学与技术
    • 26 篇 环境科学与工程(可...
    • 24 篇 化学工程与技术
    • 20 篇 网络空间安全
    • 17 篇 力学(可授工学、理...
    • 17 篇 材料科学与工程(可...
  • 497 篇 理学
    • 320 篇 数学
    • 96 篇 生物学
    • 83 篇 系统科学
    • 78 篇 物理学
    • 72 篇 统计学(可授理学、...
    • 34 篇 化学
  • 271 篇 管理学
    • 184 篇 管理科学与工程(可...
    • 121 篇 图书情报与档案管...
    • 117 篇 工商管理
  • 62 篇 医学
    • 55 篇 临床医学
  • 36 篇 法学
    • 35 篇 社会学
  • 33 篇 经济学
    • 33 篇 应用经济学
  • 8 篇 教育学
  • 6 篇 农学
  • 3 篇 文学
  • 2 篇 军事学

主题

  • 198 篇 computer archite...
  • 80 篇 hardware
  • 79 篇 computational mo...
  • 66 篇 high performance...
  • 57 篇 cloud computing
  • 50 篇 grid computing
  • 41 篇 distributed comp...
  • 41 篇 field programmab...
  • 40 篇 bandwidth
  • 37 篇 kernel
  • 37 篇 graphics process...
  • 34 篇 resource managem...
  • 34 篇 computer network...
  • 33 篇 performance eval...
  • 32 篇 throughput
  • 31 篇 computer science
  • 31 篇 application soft...
  • 31 篇 analytical model...
  • 30 篇 program processo...
  • 29 篇 supercomputers

机构

  • 26 篇 college of compu...
  • 19 篇 university of ch...
  • 14 篇 school of comput...
  • 10 篇 college of compu...
  • 9 篇 school of comput...
  • 9 篇 college of compu...
  • 8 篇 school of comput...
  • 7 篇 school of data a...
  • 7 篇 school of comput...
  • 7 篇 institute of com...
  • 6 篇 computer network...
  • 6 篇 tsinghua univers...
  • 6 篇 school of cyber ...
  • 6 篇 changsha univers...
  • 5 篇 university of sc...
  • 5 篇 skl of computer ...
  • 5 篇 computer science...
  • 5 篇 school of comput...
  • 5 篇 hubei province k...
  • 5 篇 zhongguancun lab...

作者

  • 11 篇 duan yucong
  • 7 篇 zhang tao
  • 7 篇 li kenli
  • 7 篇 xu xiaolong
  • 6 篇 wang wei
  • 6 篇 gao guang r.
  • 5 篇 li peng
  • 5 篇 yunquan zhang
  • 5 篇 liu qin
  • 5 篇 wang dong
  • 5 篇 bader david a.
  • 5 篇 liu ruicheng
  • 5 篇 zhang rui
  • 5 篇 wan shouhong
  • 5 篇 wu jigang
  • 5 篇 panda dhabaleswa...
  • 5 篇 zhang jie
  • 5 篇 wang xiaoliang
  • 5 篇 chen long
  • 5 篇 li wei

语言

  • 1,717 篇 英文
  • 49 篇 其他
  • 3 篇 中文
  • 1 篇 波兰文
检索条件"任意字段=21st International Symposium on Computer Architecture and High Performance Computing"
1770 条 记 录,以下是41-50 订阅
排序:
BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration  31
BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration
收藏 引用
31st IEEE international symposium on high performance computer architecture, HPCA 2025
作者: Chen, Yuzong Abouelhamayed, Ahmed F. Dai, Xilai Wang, Yang Andronic, Marta Constantinides, George A. Abdelfattah, Mohamed S. Cornell University Computer Systems Lab United States Systems and Networking Research Group Microsoft Research United States Imperial College London Department of Electrical and Electronic Engineering United Kingdom
Large language models (LLMs) have demonstrated remarkable performance across various machine learning tasks. Yet the substantial memory footprint of LLMs significantly hinders their deployment. In this paper, we impro... 详细信息
来源: 评论
Warped-Compaction: Maximizing GPU Register File Bandwidth Utilization via Operand Compaction  31
Warped-Compaction: Maximizing GPU Register File Bandwidth Ut...
收藏 引用
31st IEEE international symposium on high performance computer architecture, HPCA 2025
作者: Jeong, Eunbi Jeong, Ipoom Yoon, Myung Kuk Kim, Nam Sung Ewha Womans University Department of Computer Science and Engineering Seoul Korea Republic of Yonsei University Department of System Semiconductor Engineering Seoul Korea Republic of Yonsei University Department of Electrical and Electronic Engineering Seoul Korea Republic of University of Illinois Urbana-Champaign Urbana United States
The GPU has been successfully used for diverse emerging compute-intensive applications, including imaging, computer vision, and more recently, deep learning, to name a few. To offer high performance for such applicati... 详细信息
来源: 评论
Make LLM Inference Affordable to Everyone: Augmenting GPU Memory with NDP-DIMM  31
Make LLM Inference Affordable to Everyone: Augmenting GPU Me...
收藏 引用
31st IEEE international symposium on high performance computer architecture, HPCA 2025
作者: Liu, Lian Zhao, Shixin Li, Bing Ren, Haimeng Xu, Zhaohui Wang, Mengdi Li, Xiaowei Han, Yinhe Wang, Ying Institute of Computing Technology Chinese Academic of Sciences China University of Chinese Academy of Sciences China Zhongguancun Laboratory China Institute of Microelectronics Chinese Academy of Sciences China ShanghaiTech University School of Information Science and Technology China
The billion-scale Large Language Models (LLMs) necessitate deployment on expensive server-grade GPUs with large-storage HBMs and abundant computation capability. As LLM-assisted services become popular, achieving cost... 详细信息
来源: 评论
Criticality-Aware Instruction-Centric Bandwidth Partitioning for Data Center Applications  31
Criticality-Aware Instruction-Centric Bandwidth Partitioning...
收藏 引用
31st IEEE international symposium on high performance computer architecture, HPCA 2025
作者: Zhu, Liren Li, Liujia Wu, Jianyu Yao, Yiming Shi, Zhan Zhang, Jie Wang, Zhenlin Wang, Xiaolin Luo, Yingwei Zhou, Diyu Huawei Hisilicon China Peking University National Key Laboratory for Multimedia Information Processing School of Computer Science China Zhongguancun Laboratory China Michigan Technological University United States
To reduce operational costs, modern data centers co-locate high-priority latency-critical (LC) tasks and low-priority best-effort (BE) tasks on the same physical node to increase resource utilization. However, such co... 详细信息
来源: 评论
Real-Time Multi-object Tracking Using YOLOv8 and SORT on a SoC FPGA  21st
Real-Time Multi-object Tracking Using YOLOv8 and SORT on a...
收藏 引用
21st international symposium on Applied Reconfigurable computing, ARC 2025
作者: Danilowicz, Michal Kryjak, Tomasz Embedded Vision Systems Group Computer Vision Laboratory Department of Automatic Control and Robotics AGH University of Science and Technology Krakow Poland
Multi-object tracking (MOT) is one of the most important problems in computer vision and a key component of any vision-based perception system used in advanced autonomous mobile robotics. Therefore, its implementation... 详细信息
来源: 评论
GSArch: Breaking Memory Barriers in 3D Gaussian Splatting Training via Architectural Support  31
GSArch: Breaking Memory Barriers in 3D Gaussian Splatting Tr...
收藏 引用
31st IEEE international symposium on high performance computer architecture, HPCA 2025
作者: He, Houshu Li, Gang Liu, Fangxin Jiang, Li Liang, Xiaoyao Song, Zhuoran Shanghai Jiao Tong University Department of Computer Science and Engineering Shanghai China Institute of Automation Chinese Academy of Sciences Beijing China
3D Gaussian Splatting (3DGS) introduces a novel methodology for representing scenes with anisotropic 3D Gaussian primitives, achieving exceptional quality and rendering speed in neural scene representation (NSR). Howe... 详细信息
来源: 评论
Dynamic Function Exchange in FPGA to Redefine RISC-V Multicore architectures at Runtime  21st
Dynamic Function Exchange in FPGA to Redefine RISC-V Multi...
收藏 引用
21st international symposium on Applied Reconfigurable computing, ARC 2025
作者: Alves, Téo Sobrino Bonato, Vanderlei Institute of Mathematical and Computing Sciences The University of São Paulo São Paulo São Carlos Brazil
Dynamic Partial Reconfiguration is a powerful feature available in some FPGAs that enables the reconfiguration of specific regions within the FPGA fabric without halting the whole system. This capability opens new opp... 详细信息
来源: 评论
VQ-LLM: high-performance Code Generation for Vector Quantization Augmented LLM Inference  31
VQ-LLM: High-performance Code Generation for Vector Quantiza...
收藏 引用
31st IEEE international symposium on high performance computer architecture, HPCA 2025
作者: Liu, Zihan Luo, Xinhao Guo, Junxian Ni, Wentao Zhou, Yangjie Guan, Yue Guo, Cong Cui, Weihao Feng, Yu Guo, Minyi Zhu, Yuhao Zhang, Minjia Jin, Chen Leng, Jingwen Shanghai Jiao Tong University China Shanghai Qi Zhi Institute China Duke University United States National University of Singapore Singapore University of Rochester United States University of Illinois Urbana-Champaign United States Magik Compute
Vector quantization (VQ), which treats a vector as a compression unit, gains increasing research interests for its potential to accelerate large language models (LLMs). Compared to conventional element-wise quantizati... 详细信息
来源: 评论
Ultra-Low Latency and Extreme-Throughput Echo state Neural Networks on FPGA  21st
Ultra-Low Latency and Extreme-Throughput Echo State Neural ...
收藏 引用
21st international symposium on Applied Reconfigurable computing, ARC 2025
作者: Jafari, Atousa Platzner, Marco Computer Science Department Paderborn University Paderborn Germany
Echo state networks, as a popular form of reservoir computing models, are recurrent neural networks that consist of three layers, and only the output layer needs to be trained. Compared to other recurrent neural netwo... 详细信息
来源: 评论
Hardware-Accelerated Event-Graph Neural Networks for Low-Latency Time-Series Classification on SoC FPGA  21st
Hardware-Accelerated Event-Graph Neural Networks for Low-La...
收藏 引用
21st international symposium on Applied Reconfigurable computing, ARC 2025
作者: Nakano, Hiroshi Blachut, Krzysztof Jeziorek, Kamil Wzorek, Piotr Dampfhoffer, Manon Mesquida, Thomas Nishi, Hiroaki Kryjak, Tomasz Dalgaty, Thomas Graduate School of Science and Technology Keio University Tokyo Japan Embedded Vision Systems Group Computer Vision Laboratory AGH University of Krakow Krakow Poland CEA-List Université Grenoble Alpes Grenoble France
As the quantities of data recorded by embedded edge sensors grow, so too does the need for intelligent local processing. Such data often comes in the form of time-series signals, based on which real-time predictions c... 详细信息
来源: 评论