咨询与建议

限定检索结果

文献类型

  • 5,157 篇 会议
  • 50 篇 期刊文献
  • 19 册 图书

馆藏范围

  • 5,226 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 2,474 篇 工学
    • 2,331 篇 计算机科学与技术...
    • 1,202 篇 软件工程
    • 559 篇 电气工程
    • 345 篇 信息与通信工程
    • 232 篇 电子科学与技术(可...
    • 202 篇 控制科学与工程
    • 137 篇 网络空间安全
    • 63 篇 动力工程及工程热...
    • 43 篇 机械工程
    • 40 篇 生物工程
    • 29 篇 建筑学
    • 29 篇 生物医学工程(可授...
    • 28 篇 光学工程
    • 28 篇 土木工程
    • 27 篇 仪器科学与技术
    • 22 篇 环境科学与工程(可...
    • 19 篇 材料科学与工程(可...
    • 18 篇 安全科学与工程
  • 525 篇 理学
    • 373 篇 数学
    • 72 篇 物理学
    • 65 篇 系统科学
    • 48 篇 生物学
    • 37 篇 统计学(可授理学、...
  • 443 篇 管理学
    • 262 篇 管理科学与工程(可...
    • 197 篇 图书情报与档案管...
    • 130 篇 工商管理
  • 33 篇 经济学
    • 33 篇 应用经济学
  • 28 篇 医学
    • 21 篇 临床医学
    • 17 篇 基础医学(可授医学...
  • 20 篇 法学
    • 15 篇 社会学
  • 13 篇 农学
  • 9 篇 教育学
  • 1 篇 文学

主题

  • 1,759 篇 computer archite...
  • 677 篇 high performance...
  • 615 篇 hardware
  • 463 篇 computational mo...
  • 366 篇 parallel process...
  • 352 篇 concurrent compu...
  • 304 篇 application soft...
  • 252 篇 bandwidth
  • 247 篇 computer science
  • 233 篇 distributed comp...
  • 211 篇 graphics process...
  • 205 篇 kernel
  • 196 篇 costs
  • 195 篇 scalability
  • 195 篇 grid computing
  • 193 篇 throughput
  • 190 篇 cloud computing
  • 184 篇 resource managem...
  • 174 篇 benchmark testin...
  • 172 篇 processor schedu...

机构

  • 32 篇 university of ch...
  • 15 篇 college of compu...
  • 14 篇 ibm thomas j. wa...
  • 14 篇 barcelona superc...
  • 14 篇 mathematics and ...
  • 13 篇 georgia inst tec...
  • 13 篇 school of comput...
  • 12 篇 oak ridge nation...
  • 12 篇 mathematics and ...
  • 12 篇 department of co...
  • 11 篇 intel corporatio...
  • 11 篇 univ fed rio gra...
  • 10 篇 department of co...
  • 10 篇 intel corp santa...
  • 10 篇 oak ridge nation...
  • 9 篇 univ chicago dep...
  • 9 篇 computer science...
  • 9 篇 oak ridge nation...
  • 9 篇 institute of com...
  • 8 篇 university of sc...

作者

  • 16 篇 navaux philippe ...
  • 13 篇 hai jin
  • 11 篇 dhabaleswar k. p...
  • 11 篇 borin edson
  • 11 篇 xiaofei liao
  • 11 篇 prasanna viktor ...
  • 11 篇 wen-mei w. hwu
  • 10 篇 jack dongarra
  • 10 篇 panda dhabaleswa...
  • 10 篇 i. foster
  • 10 篇 d.k. panda
  • 9 篇 dongarra jack
  • 9 篇 renato ferreira
  • 9 篇 vetter jeffrey s...
  • 9 篇 mutlu onur
  • 9 篇 jie zhang
  • 8 篇 wang lei
  • 8 篇 mateo valero
  • 8 篇 hari subramoni
  • 8 篇 guedes dorgival

语言

  • 5,126 篇 英文
  • 94 篇 其他
  • 7 篇 中文
  • 1 篇 葡萄牙文
检索条件"任意字段=2024 International Symposium on Computer Architecture and High Performance Computing Workshops"
5226 条 记 录,以下是451-460 订阅
排序:
PMBS 2024: 15th IEEE international Workshop on performance Modeling, Benchmarking, and Simulation of high performance computer Systems
PMBS 2024: 15th IEEE International Workshop on Performance M...
收藏 引用
high performance computing, Networking, Storage and Analysis, SC-W: workshops of the international Conference for
来源: 评论
ETTE: Efficient Tensor-Train-based computing Engine for Deep Neural Networks  23
ETTE: Efficient Tensor-Train-based Computing Engine for Deep...
收藏 引用
50th Annual international symposium on computer architecture (ISCA)
作者: Gong, Yu Yin, Miao Huang, Lingyi Xiao, Jinqi Sui, Yang Deng, Chunhua Yuan, Bo Rutgers State Univ New Brunswick NJ 08901 USA ScaleFlux Inc Milpitas CA USA
Tensor-train (TT) decomposition enables ultra-high compression ratio, making the deep neural network (DNN) accelerators based on this method very attractive. TIE, the state-of-the-art TT based DNN accelerator, achieve... 详细信息
来源: 评论
Combining Lossy Compression with Multi-level Caching for Data Staging over Network
Combining Lossy Compression with Multi-level Caching for Dat...
收藏 引用
1st international Conference on Smart Energy Systems and Artificial Intelligence (SESAI)
作者: Aoyagi, Rei Takahashi, Keichi Shimomura, Yoichi Takizawa, Hiroyuki Tohoku Univ Grad Sch Informat Sci Sendai Miyagi Japan Tohoku Univ Cybersci Ctr Sendai Miyagi Japan
Researchers conduct post-processing on the simulation results by running an interactive data analysis tool on a high-performance computing (HPC) system installed at an HPC center and retrieving the post-processed resu... 详细信息
来源: 评论
Sparse Ternary Matrix Multiplication with Tensor Core for Transformer
Sparse Ternary Matrix Multiplication with Tensor Core for Tr...
收藏 引用
international symposium on computing and Networking workshops (CANDARW)
作者: Yushi Ogiwara Hideyuki Kawashima Keio University Fujisawa
The Transformer architecture, despite its scaling law, faces expensive computational cost challenges as the number of parameters increases. Quantization methods like Ternary-BERT and BitNet address this issue using te... 详细信息
来源: 评论
ADTopk: All-Dimension Top-k Compression for high-performance Data-Parallel DNN Training  24
ADTopk: All-Dimension Top-k Compression for High-Performance...
收藏 引用
33rd international symposium on high-performance Parallel and Distributed computing (HPDC)
作者: Ming, Zhangqiang Hu, Yuchong Zhou, Wenxiang Zheng, Xinjue Yao, Chenxuan Feng, Dan Huazhong Univ Sci & Technol Wuhan Hubei Peoples R China Huazhong Univ Sci & Technol Shenzhen Res Inst Shenzhen Guangdong Peoples R China
Data-parallel deep neural networks (DNN) training systems deployed across nodes have been widely used in various domains, while the system performance is often bottlenecked by the communication overhead among workers ... 详细信息
来源: 评论
Enabling FPGA and AI Engine Tasks in the HPX Programming Framework for Heterogeneous high-performance computing  20th
Enabling FPGA and AI Engine Tasks in the HPX Programming Fra...
收藏 引用
20th international symposium on Applied Reconfigurable computing (ARC)
作者: Kalkhof, Torben Heinz, Carsten Koch, Andreas Tech Univ Darmstadt Embedded Syst & Applicat Grp Darmstadt Germany
The increasing complexity of modern exascale computers, with a growing number of cores per node, poses a challenge to traditional programming models. To address this challenge, Asynchronous Many-Task (AMT) runtimes su... 详细信息
来源: 评论
CK-index: A Distribution-Aware Learned Index for Composite Keys  22
CK-index: A Distribution-Aware Learned Index for Composite K...
收藏 引用
22nd IEEE international symposium on Parallel and Distributed Processing with Applications, ISPA 2024
作者: Wei, Zhengyang Ye, Baoliu Cai, Miao Hohai University College of Computer and Software China Nanjing University Department of Computer Science and Technology China Nanjing University of Aeronautics and Astronautics College of Computer Science and Technology China
The learned index is a high-performance index structure that uses machine learning methods to predict key positions in a large key space efficiently. Existing learned indexes suffer from underfitting of key-to-positio... 详细信息
来源: 评论
Modeling and Analyzing the Shared Receive Queue of RDMA  22
Modeling and Analyzing the Shared Receive Queue of RDMA
收藏 引用
22nd IEEE international symposium on Parallel and Distributed Processing with Applications, ISPA 2024
作者: Tian, Zhuang Wang, Kai Jiang, Wanchun Central South University School of Computer Science and Engineering Changsha China
Nowadays, the RDMA (Remote Direct Memory Access) technology has been broadly employed in data centers. The Shared Receive Queue (SRQ) is an embedded mechanism in RDMA protocol, which reduces the memory cost of queue p... 详细信息
来源: 评论
An Innovative Optimization Framework for Aerodynamic Shape Optimization Using Deep Neural Network  3
An Innovative Optimization Framework for Aerodynamic Shape O...
收藏 引用
3rd international symposium on Aerospace Engineering and Systems, ISAES 2024
作者: Wu, Pin Zhou, Zhu Liu, Zhitao Song, Chao School of Computer Engineering and Science Shanghai University Shanghai200444 China China Aerodynamics Research And Development Center State Key Laboratory of Aerodynamics Sichuan Mianyang621000 China
It is necessary to optimize the design method of the airfoil aerodynamic shape for better performance while meeting the design requirements. However, current mainstream design methods for aerodynamic shape are based o... 详细信息
来源: 评论
A Task Dependency-based Deduplicated Task Offloading Mechanism in Vehicular Edge computing  22
A Task Dependency-based Deduplicated Task Offloading Mechani...
收藏 引用
22nd IEEE international symposium on Parallel and Distributed Processing with Applications, ISPA 2024
作者: Shao, Zhenyi Liao, Zhuofan Tang, Xiaoyong Changsha University of Science and Technology School of Computer and Communication Engineering Changsha410114 China
The increasing demand for in-vehicle applications has raised the complexity and computational load, while the in-vehicle tasks exhibit a sensitivity to latency. Previous research has proposed utilizing the idle comput... 详细信息
来源: 评论