检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

1,995 篇 会议
771 篇 期刊文献

馆藏范围

2,766 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

2,594 篇 工学
- 2,348 篇 计算机科学与技术...
- 1,934 篇 软件工程
- 335 篇 电气工程
- 158 篇 电子科学与技术（可...
- 139 篇 信息与通信工程
- 130 篇 控制科学与工程
- 40 篇 生物工程
- 39 篇 机械工程
- 26 篇 建筑学
- 24 篇 光学工程
- 23 篇 土木工程
- 21 篇 材料科学与工程（可...
- 19 篇 化学工程与技术
- 15 篇 动力工程及工程热...
- 14 篇 仪器科学与技术
- 14 篇 安全科学与工程
- 10 篇 力学（可授工学、理...
691 篇 理学
- 558 篇 数学
- 78 篇 物理学
- 43 篇 生物学
- 39 篇 系统科学
- 36 篇 统计学（可授理学、...
- 19 篇 化学
306 篇 管理学
- 202 篇 管理科学与工程(可...
- 114 篇 图书情报与档案管...
- 86 篇 工商管理
21 篇 经济学
- 20 篇 应用经济学
20 篇 教育学
- 20 篇 教育学
14 篇 法学
- 12 篇 社会学
8 篇 医学
3 篇 农学
2 篇 文学
1 篇 艺术学

主题

2,766 篇 program compiler...
56 篇 compilers
42 篇 embedded systems
29 篇 java
27 篇 computer archite...
26 篇 parallel process...
25 篇 c language
23 篇 software enginee...
23 篇 program diagnost...
23 篇 formal specifica...
22 篇 parallel program...
21 篇 parallel archite...
21 篇 formal verificat...
19 篇 software tools
19 篇 multiprocessing ...
18 篇 programming
18 篇 code generation
17 篇 program processo...
16 篇 unified modeling...
15 篇 optimisation

机构

12 篇 carnegie mellon ...
11 篇 eth zurich
10 篇 inria
10 篇 university of ed...
9 篇 university of ed...
8 篇 purdue universit...
7 篇 stanford univers...
7 篇 microsoft resear...
7 篇 univ nova lisboa...
7 篇 microsoft resear...
7 篇 peking universit...
6 篇 university of sc...
6 篇 shanghai jiao to...
6 篇 tsinghua univers...
6 篇 microsoft resear...
6 篇 university of ca...
6 篇 google
5 篇 meta ai united s...
5 篇 department of co...
5 篇 syracuse univ sy...

作者

14 篇 amarasinghe sama...
13 篇 eigenmann rudolf
10 篇 patrignani marco
9 篇 hwu wen-mei w.
9 篇 cohen albert
9 篇 gomes luis
8 篇 kandemir m
8 篇 tseng chau-wen
8 篇 hendren laurie j...
8 篇 cummins chris
8 篇 padua david
8 篇 kjolstad fredrik
8 篇 chambers craig
8 篇 grosser tobias
8 篇 kennedy ken
7 篇 leather hugh
7 篇 kasahara hironor...
7 篇 eggers susan j.
7 篇 serrano manuel
7 篇 banerjee prithvi...

语言

2,351 篇 英文
389 篇 其他
12 篇 中文
6 篇 日文
3 篇 葡萄牙文
2 篇 德文
2 篇 法文
1 篇 西班牙文
1 篇 塞尔维亚文

检索条件"主题词=Program compilers"

共 2766 条记录，以下是151-160 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Defining and Preserving More C Behaviors: Verified Compilation Using a Concrete Memory Model 15

Defining and Preserving More C Behaviors: Verified Compilati...

引用

15th International Conference on Interactive Theorem Proving, ITP 2024

作者： Tolmach, Andrew Chhak, Chris Anderson, Sean Portland State University OR United States

ISBN: (纸本)9783959773379

We propose a concrete ("pointer as integer") memory semantics for C that supports verified compilation to a target environment having simple "public vs. private"data protection based on tagging or sandboxing (such as the WebAssembly virtual machine). Our semantics gives definition to a range of legacy programming idioms that cause undefined behavior in standard C, and are not covered by existing verified compilers, but that often work in practice. Compiler correctness in this context implies that target programs are secure against all control-flow attacks (although not against data-only attacks). To avoid tying our semantics too closely to particular compiler implementation choices, it is parameterized by a novel form of oracle that non-deterministically chooses the addresses of stack and heap allocations. As a proof-of-concept, we formalize a small RTL-like language and verify two-way refinement for a compiler from this language to a low-level machine and runtime system with hardware tagging. Our Coq formalization and proofs are provided as supplementary material. © 2024 Andrew Tolmach, Chris Chhak, and Sean Anderson.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

REDLC: Learning-driven Reverse Engineering for Deep Learning compilers 35

REDLC: Learning-driven Reverse Engineering for Deep Learning...

引用

35th IEEE International Symposium on Software Reliability Engineering, ISSRE 2024

作者： Li, Minghui Li, Yang Han, Hao Ke, Xiaopeng Wang, Tongyu Xu, Fengyuan Fang, Liming Nanjing University of Aeronautics and Astronautics Nanjing China Nanjing University Nanjing China

ISBN: (纸本)9798350353884

Deep Learning (DL) compilers such as TVM enable the efficient deployment of diverse DL models on heterogeneous and resource-constrained devices to meet the needs for low latency, privacy protection, and enhanced reliability. However, the booming of on-device DL technology will inevitably attract new types of cybercriminals and industrial spies aiming to steal commercial models. Emerging research focused on model-stealing attacks from the perspective of DL compilers mainly uses heuristic approaches, which do not work well with compiler-optimized models. This work proposes an advanced model-stealing attack pipeline that combines code representation learning and binary analysis to efficiently reverse retrainable DL framework models from TVM-compiled executables. To further improve the accuracy of reversed models, we exploit the computational relationships to correct the prediction of operators in the models using Graph Convolutional Networks. Extensive experiments demonstrate that our approach can recover 18 common DL models with different scales downloaded from Keras repositories with 99% accuracy. © 2024 IEEE.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

An Improved Method for Control Dependency in LLVM 5

An Improved Method for Control Dependency in LLVM

引用

5th International Conference on Intelligent Computing and Human-Computer Interaction, ICHCI 2024

作者： Li, Jianan Gao, Wei Li, Yingying Han, Lin School of Computer and Artificial Intelligence of ZZU Zhengzhou University Zhengzhou China Zhengzhou University National Supercomputing Center in Zhengzhou Zhengzhou China

ISBN: (纸本)9798350368284

The existence of control dependencies within programs necessitates intricate data reorganization, significantly hindering the vectorization capabilities in automated SIMD compilation processes. The latest iteration of the LLVM compiler, employs a control flow vectorization approach that is contingent upon platform-specific mask intrinsic functions. This reliance results in many loops with IF constructs forfeiting their vectorization potential. To address this, a novel mask instruction transformation technique is introduced, converting IF structures to select instructions. This conversion facilitates control flow vectorization independent of intrinsic functions, effectively transitioning from control to data dependencies. Furthermore, an enhancement to the existing Phi node generation algorithm is proposed to augment control flow vectorization opportunities. Utilizing the TSVC benchmark suite, the experimental results demonstrate a 48% increase in vectorization recognition rate following the implementation of the mask instruction transformation. The maximum speedup ratio achieved is 2.4, with an average speedup ratio of 1.6, outperforming LLVM's current automatic vectorization capabilities. © 2024 IEEE.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

Analyzing SpecFEM-3D's Performance on ARM A64FX Architecture with Compiler Variations 4

Analyzing SpecFEM-3D's Performance on ARM A64FX Architecture...

引用

4th International Conference on Emerging Trends in Networks and Computer Communications, ETNCC 2024

作者： Jadhav, Om Krishna, Sabbi Vamshi Dinde, Prashant Wandhekar, Sanjay Jat, Dharm Singh HPC-Technologies Group C-DAC Pune India NUST Windhoek Namibia

ISBN: (纸本)9798350353266

This research paper offers a comprehensive performance study of SpecFEM-3D, a well known software package devised for simulating seismic wave propagation in complex 3D geological structures, on the ARM A64FX compute architecture using multiple compilers, including GCC, Arm, and Cray. Then the performance is weighed with the Intel Cascadelake architecture with the native compiler i.e Intel compiler. The purpose of this study is to evaluate the performance and scalability of SpecFEM-3D with diverse compiler setups along with various optimization flags. The methodology incorporates running SpecFEM-3D with different compiler settings and observing the performance with respect to execution time. The results illustrates that, SpecFEM-3D shows scalability on ARM A64FX compute architecture with a single and multinode environment, moreover, demonstrating better performance with the cray compiler. The investigated results gives insights into the performance attributes of SpecFEM-3D on ARM A64FX architectures, aiding informed decision-making for optimizing application performance. © 2024 IEEE.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

Inlined Code Generation for Smalltalk

Inlined Code Generation for Smalltalk

引用

2024 International Workshop on Smalltalk Technologies, IWST 2024

作者： Franklin, Daniel Mason, Dave Toronto Metropolitan University Toronto Canada

In this paper we present our early work at improving Smalltalk performance by inlining message sends during compilation. Smalltalk developers typically write small method bodies with one or two statements, this limits a compiler's ability to perform many optimizations, e.g. common sub-expression elimination. Inlining messages into a method body produces methods with fewer message sends and more statements allowing the compiler to optimize and generate efficient executable code. There are several challenges to inlining messages in Smalltalk that need to be resolved like detecting cycles in the call graph, resolving methods arguments to their equivalent in the calling method, and handling non-local returns in block arguments. In this paper, we describe the inlining approach taken for the Zag Smalltalk compiler that solves for these issues and improves performance. © 2024 Copyright for this paper by its authors.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

GCV-Turbo: End-to-end Acceleration of GNN-based Computer Vision Tasks on FPGA 32

GCV-Turbo: End-to-end Acceleration of GNN-based Computer Vis...

引用

32nd IEEE Annual International Symposium on Field-programmable Custom Computing Machines, FCCM 2024

作者： Zhang, Bingyi Kannan, Rajgopal Busart, Carl Prasanna, Viktor University of Southern California United States DEVCOM Army Research Office DEVCOM Army Research Lab

ISBN: (纸本)9798350372434

Graph neural networks (GNNs) have recently em-powered various novel computer vision (CV) tasks. In GNN-based CV tasks, a combination of CNN layers and GNN layers or only GNN layers are employed. This paper introduces GCV-Turbo, a domain-specific accelerator on FPGA for end-to-end acceleration of GNN-based CV tasks. GCV-Turbo consists of two key components: (1) a novel hardware architecture optimized for the computation kernels in both CNNs and GNNs using the same set of computation resources. (2) a compiler that takes a user-defined model as input, performs end-to-end optimization for the computation graph of a given GNN-based CV task, and produces optimized code for hardware execution. The hardware architecture and the compiler work synergistically to support a variety of GNN-based CV tasks. We implement GCV-Turbo on a state-of-the-art FPGA and evaluate its performance across six representative GNN-based CV tasks with diverse input data modalities (e.g., image, human skeleton, point cloud). Compared with state-of-the-art CPU (GPU) implementations, GCV-Turbo achieves an average latency reduction of 68.4× (4.1x) on these six GNN-based CV tasks. Moreover, GCV-Turbo supports the execution of the standalone CNNs or GNNs, achieving performance comparable to that of state-of-the-art CNN (GNN) accelerators for widely used CNN-only (GNN-only) models. © 2024 IEEE.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

Smalltalk JIT Compilation: LLVM Experimentation

Smalltalk JIT Compilation: LLVM Experimentation

引用

2024 International Workshop on Smalltalk Technologies, IWST 2024

作者： Baig, Janat Mason, Dave Toronto Metropolitan University Toronto Canada

This paper discusses the ongoing development of the Zag Smalltalk LLVM JIT Compiler project. The project is aimed at enhancing the performance of dynamic languages through JIT compilation using LLVM. We highlight the project's rationale, emphasizing LLVM's optimization capabilities and architectural support. Our methodology focuses on generating LLVM IR code and exploring optimization strategies to improve execution efficiency. © 2024 Copyright for this paper by its authors.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

RTLRewriter: Methodologies for Large Models aided RTL Code Optimization 24

RTLRewriter: Methodologies for Large Models aided RTL Code O...

引用

43rd International Conference on Computer Aided Design-ICCAD

作者： Yao, Xufeng Wang, Yiwen Li, Xing Lian, Yingzhao Chen, Ran Chen, Lei Yuan, Mingxuan Xu, Hong Yu, Bei Chinese Univ Hong Kong Hong Kong Peoples R China Huawei Hong Kong Peoples R China

ISBN: (纸本)9798400710773

Register Transfer Level (RTL) code optimization is crucial for enhancing the efficiency and performance of digital circuits during early synthesis stages. Currently, optimization relies heavily on manual efforts by skilled engineers, often requiring multiple iterations based on synthesis feedback. In contrast, existing compiler-based methods fall short in addressing complex designs. This paper introduces RTLRewriter, an innovative framework that leverages large models to optimize RTL code. A circuit partition pipeline is utilized for fast synthesis and efficient rewriting. A multi-modal program analysis is proposed to incorporate vital visual diagram information as optimization cues. A specialized search engine is designed to identify useful optimization guides, algorithms, and code snippets that enhance the model's ability to generate optimized RTL. Additionally, we introduce a Cost-aware Monte Carlo Tree Search (C-MCTS) algorithm for efficient rewriting, managing diverse retrieved contents and steering the rewriting results. Furthermore, a fast verification pipeline is proposed to reduce verification cost. To cater to the needs of both industry and academia, we propose two benchmarking suites: the long Rewriter benchmark, targeting complex scenarios with extensive circuit partitioning, optimization trade-offs, and verification challenges, and the short Rewriter benchmark, designed for a wider range of scenarios and patterns. Our comparative analysis with established compilers such as Yosys and E-graph demonstrates significant improvements, highlighting the benefits of integrating large models into the early stages of circuit design. We provide our benchmarks at https://***/yaoxufeng/RTLRewriter-Bench.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

Detecting Hot Code from Partially Context-Sensitive Profiles 32

Detecting Hot Code from Partially Context-Sensitive Profiles

引用

32nd Telecommunications Forum, TELFOR 2024

作者： Vukasovic, Maja Prokopec, Aleksandar University of Belgrade School of Electrical Engineering Belgrade Serbia Oracle Labs Zurich Switzerland

ISBN: (纸本)9798350391053

In order to achieve the peek program performance, compilers employ numerous optimizations. Some of these optimizations, although highly effective, come with the high price in terms of compilation time, and the compiled code size. This is why it is beneficial to apply optimizations on only selected portions of the most frequently executed code - hot code. In JIT-compiled programs, information about the frequency of code execution is available during the compilation, however, AOT compilers must compensate through the profiling data collected from the previous program runs. In this paper, we use partially context-sensitive profiles, for efficient profile collection and managing, to identify and reconstruct significant hot-code fragments. We show that, with the proper identification of the hot code, significantly better program performance can be achieved with reasonable cost. © 2024 IEEE.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

A Compiler-Like Framework for Optimizing Cryptographic Big Integer Multiplication on GPUs 57

A Compiler-Like Framework for Optimizing Cryptographic Big I...

引用

57th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2024

作者： Ji, Zhuoran Zhao, Jianyu Zhang, Zhaorui Xu, Jiming Yan, Shoumeng Ju, Lei School of Cyber Science and Technology Shandong University Shandong China Quan Cheng Laboratory Shandong China The Hong Kong Polytechnic University Department of Computing Hong Kong Hong Kong Ant Group China

ISBN: (纸本)9798350350579

With the growth of digital data and rising security concerns, techniques for privacy-preserving computation have become increasingly essential. Big integer multiplication, pivotal for these applications, is compute-intensive but poses challenges for GPU acceleration due to its complexity and the need for application-specific tailored implementations. This paper presents IMCompiler, a compiler-like framework that automatically gen-erates optimized GPU kernels for integer multiplications used in cryptosystems. It features a frontend-IR-backend structure, where the Intermediate Representation (IR) employs a segmented integer multiplication algorithm to decouple architecture-specific optimizations from high-level parameters. The frontend can then easily translate integer multiplication with various high-level parameters into the IR, while the backend focuses on fine-tuning a single GPU kernel for each device, enabling automatic code generation. Moreover, we introduce a computation diagram to facilitate the analysis of parallelization strategies, inspiring many optimizations, including two-dimensional parallelization, tailored caching strategy, index transposing, and lazy carrying. Experiments show that IMCompiler achieves a 4.47× speedup compared to the widely used baseline and 1.42 × over Nvidia's official library. The speedup will be even higher for larger integers and higher-capacity GPUs. © 2024 IEEE.

关键词： program compilers

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共277页 << < 12 13 14 15 16 17 18 19 20 21 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：