检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

文献类型

5,157 篇 会议
50 篇 期刊文献
19 册 图书

馆藏范围

5,226 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

2,474 篇 工学
- 2,331 篇 计算机科学与技术...
- 1,202 篇 软件工程
- 559 篇 电气工程
- 345 篇 信息与通信工程
- 232 篇 电子科学与技术（可...
- 202 篇 控制科学与工程
- 137 篇 网络空间安全
- 63 篇 动力工程及工程热...
- 43 篇 机械工程
- 40 篇 生物工程
- 29 篇 建筑学
- 29 篇 生物医学工程（可授...
- 28 篇 光学工程
- 28 篇 土木工程
- 27 篇 仪器科学与技术
- 22 篇 环境科学与工程（可...
- 19 篇 材料科学与工程（可...
- 18 篇 安全科学与工程
525 篇 理学
- 373 篇 数学
- 72 篇 物理学
- 65 篇 系统科学
- 48 篇 生物学
- 37 篇 统计学（可授理学、...
443 篇 管理学
- 262 篇 管理科学与工程(可...
- 197 篇 图书情报与档案管...
- 130 篇 工商管理
33 篇 经济学
- 33 篇 应用经济学
28 篇 医学
- 21 篇 临床医学
- 17 篇 基础医学(可授医学...
20 篇 法学
- 15 篇 社会学
13 篇 农学
9 篇 教育学
1 篇 文学

主题

1,759 篇 computer archite...
677 篇 high performance...
615 篇 hardware
463 篇 computational mo...
366 篇 parallel process...
352 篇 concurrent compu...
304 篇 application soft...
252 篇 bandwidth
247 篇 computer science
233 篇 distributed comp...
211 篇 graphics process...
205 篇 kernel
196 篇 costs
195 篇 scalability
195 篇 grid computing
193 篇 throughput
190 篇 cloud computing
184 篇 resource managem...
174 篇 benchmark testin...
172 篇 processor schedu...

机构

32 篇 university of ch...
15 篇 college of compu...
14 篇 ibm thomas j. wa...
14 篇 barcelona superc...
14 篇 mathematics and ...
13 篇 georgia inst tec...
13 篇 school of comput...
12 篇 oak ridge nation...
12 篇 mathematics and ...
12 篇 department of co...
11 篇 intel corporatio...
11 篇 univ fed rio gra...
10 篇 department of co...
10 篇 intel corp santa...
10 篇 oak ridge nation...
9 篇 univ chicago dep...
9 篇 computer science...
9 篇 oak ridge nation...
9 篇 institute of com...
8 篇 university of sc...

作者

16 篇 navaux philippe ...
13 篇 hai jin
11 篇 dhabaleswar k. p...
11 篇 borin edson
11 篇 xiaofei liao
11 篇 prasanna viktor ...
11 篇 wen-mei w. hwu
10 篇 jack dongarra
10 篇 panda dhabaleswa...
10 篇 i. foster
10 篇 d.k. panda
9 篇 dongarra jack
9 篇 renato ferreira
9 篇 vetter jeffrey s...
9 篇 mutlu onur
9 篇 jie zhang
8 篇 wang lei
8 篇 mateo valero
8 篇 hari subramoni
8 篇 guedes dorgival

语言

5,126 篇 英文
94 篇 其他
7 篇 中文
1 篇 葡萄牙文

检索条件"任意字段=2024 International Symposium on Computer Architecture and High Performance Computing Workshops"

共 5226 条记录，以下是81-90 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

A Comparative Survey: Reusing Small Pre-Trained Models for Efficient Large Model Training

A Comparative Survey: Reusing Small Pre-Trained Models for E...

引用

2024 workshops of the international Conference for high performance computing, Networking, Storage and Analysis, SC workshops 2024

作者： Pandey, Dhroov Ghebremichael, Jonah Qi, Zongqing Shu, Tong University of North Texas Department of Computer Science and Engineering DentonTX United States North Carolina State University Department of Computer Science RaleighNC United States Institute of Technology Department of Computer Science Stevens HobokenNJ United States

ISBN: (纸本)9798350355543

Training large language models is becoming increasingly complex due to the rapid expansion in their size, resulting in significant computational costs. To address this challenge, various model growth methodologies have been proposed to leverage smaller pre-trained models to incrementally build larger models and reduce computational requirements. These methods typically involve mapping parameters from small models to large ones using either static functions or learned mappings. Although these approaches have demonstrated effectiveness, there is a lack of comprehensive comparative evaluations in the literature. Additionally, combining different methodologies could potentially yield superior performance. This study provides a uniform evaluation of multiple state-of-the-art model growth techniques and their combinations, revealing that efficient combination techniques can reduce the training cost (in TFLOPs) of individual methods by up to 80%. © 2024 IEEE.

关键词： comparative survey efficient training large language models model reuse

来源：评论

学校读者我要写书评

暂无评论

A Hierarchical Deep Learning Approach for Predicting Job Queue Times in HPC Systems

A Hierarchical Deep Learning Approach for Predicting Job Que...

引用

2024 workshops of the international Conference for high performance computing, Networking, Storage and Analysis, SC workshops 2024

作者： Lovell, Austin Wisniewski, Philip Rodenbeck, Sarah Ashish Purdue University Department of Computer Science IN United States Purdue University Rosen Center for Advanced Computing IN United States

ISBN: (纸本)9798350355543

Accurate wait-time prediction for HPC jobs contributes to a positive user experience but has historically been a challenging task. Previous models lack the accuracy needed for confident predictions, and many were developed before the rise of deep *** this work, we investigate and develop TROUT, a neural network-based model to accurately predict wait times for jobs submitted to the Anvil HPC cluster. Data was taken from the Slurm Workload Manager on the cluster and transformed before performing additional feature engineering from jobs' priorities, partitions, and states. We developed a hierarchical model that classifies job queue times into bins before applying regression, outperforming traditional methods. The model was then integrated into a CLI tool for queue time prediction. This study explores which queue time prediction methods are most applicable for modern HPC systems and shows that deep learning-based prediction models are viable solutions. © 2024 IEEE.

关键词： computational efficiency high-performance computing machine learning neural networks operations research performance optimization queue management resource allocation

来源：评论

学校读者我要写书评

暂无评论

Shared Memory-Aware Latency-Sensitive Message Aggregation for Fine-Grained Communication

Shared Memory-Aware Latency-Sensitive Message Aggregation fo...

引用

2024 workshops of the international Conference for high performance computing, Networking, Storage and Analysis, SC workshops 2024

作者： Chandrasekar, Kavitha Kale, Laxmikant University of Illinois at Urbana-Champaign Department of Computer Science United States

ISBN: (纸本)9798350355543

Message aggregation is widely used with a goal to reduce communication cost in HPC applications. The difference in the order of overhead of sending a message and cost of per byte transferred motivates the need for message aggregation, for several irregular fine-grained messaging applications like graph algorithms and parallel discrete event simulation (PDES). While the benefit of message aggregation is often analyzed in terms of reducing the overhead, specifically the per message cost, we also analyze different schemes that can aid in reducing the message latency, i.e. the time from when a message is sent to the time when it is received. Message latency can affect several applications like PDES with speculative execution where reducing message latency could result in fewer rollbacks. Specifically in our work, we demonstrate the effectiveness of process-aware message aggregation schemes for a range of proxy applications with respect to messaging overhead and latency. © 2024 IEEE.

关键词： Charm++ Message aggregation runtime SMP

来源：评论

学校读者我要写书评

暂无评论

performance Analysis of the NICAM Benchmark on MN-Core Processor

Performance Analysis of the NICAM Benchmark on MN-Core Proce...

引用

2024 workshops of the international Conference for high performance computing, Networking, Storage and Analysis, SC workshops 2024

作者： Takayashiki, Hikaru Saito, Natsuko Imachi, Hiroto Sakamoto, Ryo Makino, Junichiro Fixstars Corporation Tokyo Japan Preferred Networks Inc. Tokyo Japan Kobe University Kobe Japan

ISBN: (纸本)9798350355543

Large-scale Computational Fluid Dynamics (CFD) simulations are typical HPC applications that require both high memory bandwidth and large memory capacity. However, it is difficult to achieve high performance for such applications on modern high-performance processors due to their low memory bandwidth compared to their high computational power. Near-memory computing can overcome this problem by placing on-chip memory near arithmetic units and reducing off-chip accesses. MN-Core is a distributed memory SIMD processor with each core having its own addressable memory, realizing a near-memory computing processor. MN-Core can be an attractive platform for executing bandwidth-demanding HPC applications. This paper reports the performance of MN-Core for three kernels from the NICAM benchmark, taken from NICAM global climate model. The evaluation results show that MN-Core realizes 986 GFLOPS at the maximum, which is 13.4% of its peak performance. This efficiency is comparable to those obtained on CPUs with high memory bandwidth, such as Fujitsu A64FX. © 2024 IEEE.

关键词： Accelerator Distributed memory HPC Near-memory computing SIMD

来源：评论

学校读者我要写书评

暂无评论

Experiences in Managing high-performance computing Management and Support Tools while Upgrading a Campus Cluster

Experiences in Managing High-performance Computing Managemen...

引用

2024 workshops of the international Conference for high performance computing, Networking, Storage and Analysis, SC workshops 2024

作者： Chen, Yuwu Cooper, Trevor Irving, Christopher Tatineni, Mahidhar Wolter, Nicole Mishin, Dmitry Sivagnanam, Subhashini University of California San Diego San Diego Supercomputer Center San Diego United States

ISBN: (纸本)9798350355543

The Triton Shared computing Cluster (TSCC) [1] is the San Diego Supercomputer Center ("Center"in the remaining text)'s primary campus research computing system. This paper describes the transition from TSCC 1.0 to TSCC 2.0, focusing on the implementation of new high-performance computing (HPC) infrastructure components and management strategies. We detail our approach to overcoming challenges posed by node heterogeneity, enhancing job scheduling efficiency, and improving resource allocation and billing *** legacy TSCC 1.0 is described first, focusing on some critical issues we want to solve under TSCC 2.0. The HPC tools under TSCC 2.0 are then described. Lastly, the best practices and experiences learned are discussed. © 2024 IEEE.

关键词： campus cluster HPC upgrade User support

来源：评论

学校读者我要写书评

暂无评论

Testing the Unknown: A Framework for OpenMP Testing via Random Program Generation

Testing the Unknown: A Framework for OpenMP Testing via Rand...

引用

2024 workshops of the international Conference for high performance computing, Networking, Storage and Analysis, SC workshops 2024

作者： Laguna, Ignacio Chapman, Patrick Parasyris, Konstantinos Georgakoudis, Giorgis Rubio-Gonzalez, Cindy Lawrence Livermore National Laboratory Center for Applied Scientific Computing United States University of California Department of Computer Science Davis United States

ISBN: (纸本)9798350355543

We present a randomized differential testing approach to test OpenMP implementations. In contrast to previous work that manually creates dozens of verification and validation tests, our approach is able to randomly generate thousands of tests, exposing OpenMP implementations to a wide range of program behaviors. We represent the space of possible random OpenMP tests using a grammar and implement our method as an extension of the Varity program generator. By generating 1,800 OpenMP tests, we find various performance anomalies and correctness issues when we apply them to three OpenMP implementations: GCC, Clang, and Intel. We also present several case studies that analyze the anomalies and give more details about the classes of tests that our approach creates. © 2024 IEEE.

关键词： differential testing OpenMP random program generation software testing

来源：评论

学校读者我要写书评

暂无评论

Scalable and Efficient architecture for Random Forest on FPGA-Based Edge computing

Scalable and Efficient Architecture for Random Forest on FPG...

引用

29th international Conference on Parallel and Distributed computing (Euro-Par)

作者： Cuong Pham-Quoc Ho Chi Minh City Univ Technol HCMUT Ho Chi Minh City Vietnam Vietnam Natl Univ Ho Chi Minh City VNU HCM Ho Chi Minh City Vietnam

ISBN: (纸本)9783031506833;9783031506840

This paper proposes a scalable and efficient architecture to accelerate random forest computation on FPGA devices targeting edge computing platforms. The proposed architecture with efficient decision tree units (DTUs) executes samples in a pipeline model for improving performance. Moreover, a size-effective memory organization is also introduced with the architecture to save the on-chip block ram used for reducing the latency and improving working frequency of the implementation system on FPGA devices. We target edge computing platforms that suffer from the limitations of resources and power consumption. Therefore, the proposed architecture can reconfigure the number of DTUs according to the target platform's available resources. We build a system with a PYNQ Z2 FPGA board for testing, validating, and estimating the proposed architecture. In this system, we exploit different numbers of DTUs, from 1 to 15, to test our scalability. Experimental results with certified datasets show that we achieve speed-ups by up to 170.39x and 90.27x compared to Intel core i7 desktop version and core i9 high-performance computing version processors, respectively.

关键词： FPGA Hardware accelerator Decision tree Random forest Edge computing Scalability

来源：评论

学校读者我要写书评

暂无评论

Evaluation Model and performance Analysis of NIC Aggregations in Containerized Private Clouds 35

Evaluation Model and Performance Analysis of NIC Aggregation...

引用

35th IEEE international symposium on computer architecture and high performance computing (SBAC-PAD)

作者： Maliszewski, Anderson M. Griebler, Dalvan Roloff, Eduardo Righi, Rodrigo da Rosa Navaux, Philippe O. A. Fed Univ Rio Grande Sul UFRGS Informat Inst Porto Alegre RS Brazil Tres de Maio Fac SETREM Lab Adv Res Cloud Comp LARCC Tres De Maio Brazil Pontifical Catholica Univ Rio Grande Sul PUCRS Sch Technol Porto Alegre RS Brazil Univ Vale Rio Sinos UNISINOS Software Innovat Lab SoftwareLab Sao Leopoldo Brazil

ISBN: (纸本)9798350381603

The availability of computational resources changed significantly due to cloud computing. In addition, we have witnessed efforts to execute high-performance computing (HPC) applications in the cloud attracted by the advantages of cost savings and scalable/elastic resource allocation. Allocating more powerful hardware and exclusivity allocating resources such as memory, storage, and CPU can improve performance in the cloud. For network interconnection, significant noise, and other inferences are generated by several simultaneous instances (multitenants) communicating using the same network. As increasing the network bandwidth may be an alternative, we designed an evaluation model, and performance analysis of NIC aggregation approaches in containerized private clouds. The experiments using NAS Parallel Benchmarks revealed that NIC aggregation approach outperforms the baseline up to similar to 98% of the executions with applications characterized by intensive network use. Also, the Balance Round-Robin aggregation mode performed better than the 802.3ad aggregation mode in most assessments.

关键词： Cloud computing Private Cloud high performance computing Network performance NIC Aggregation performance Analysis

来源：评论

学校读者我要写书评

暂无评论

Proceedings of SC 2024-W: workshops of the international Conference for high performance computing, Networking, Storage and Analysis

Proceedings of SC 2024-W: Workshops of the International Con...

引用

2024 workshops of the international Conference for high performance computing, Networking, Storage and Analysis, SC workshops 2024

ISBN: (纸本)9798350355543

The proceedings contain 226 papers. The topics discussed include: analyzing HPC utilization with PIKA and Vampir;portable cross-facility workflows for X-ray ptychography;towards sustainable post-exascale leadership computing;SANReN’s 100 Gbps data transfer service: transferring data fast!;framework for integrating machine learning methods for path-aware source routing;an Ising-based decision method for intra prediction mode in video coding;LLM-inference-bench: inference benchmarking of large language models on AI accelerators;ActorProf: a framework for profiling and visualizing fine-grained asynchronous bulk synchronous parallel execution;LIDC: a location independent multi-cluster computing framework for data intensive science;and Parsl+CWL: towards combining the Python and CWL ecosystems.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Poster: Integration of Wearable and Affective computing via Abstraction and Decision Fusion architecture 25

Poster: Integration of Wearable and Affective Computing via ...

引用

25th IEEE international symposium on a World of Wireless, Mobile and Multimedia Networks (IEEE WoWMoM)

作者： Najafi, Mohammadreza Fallah, Mohammad K. Gorgin, Saeid Jaberipur, Ghassem Lee, Jeong-A Chosun Univ Dept Comp Engn Gwangju South Korea

ISBN: (纸本)9798350394665;9798350394672

This paper introduces an efficient emotion detection method to integrate wearable and affective computing paradigms. Our research contributes to advancing emotion detection technologies, offering potential applications in diverse domains such as healthcare, human-computer interaction, and personalized computing experiences. Our approach addresses the increasing need for real-time emotion recognition while minimizing computational demands. By leveraging low-computation techniques, we propose a novel framework that achieves high accuracy in emotion detection. Besides, advanced data abstraction methods are developed to reduce data workload keeping detection performance. Experimental results demonstrate a notable accuracy rate of 89.77%, affirming the efficacy of our proposed method.

关键词： Emotion Detection Machine Learning Computation Reduction Data Abstraction

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 5 6 7 8 9 10 11 12 13 14 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：