ISBN:
(Print) 9781538634370
Crafting accelerators using reconfigurable hardware is a promising way to achieve improved performance and power/energy efficiency. However, deploying reconfigurable accelerators is still cumbersome, as it involves overall system integration issues and runtime reconfigurable resource management. We describe the design and implementation of RACOS, a Reconfigurable ACcelerator OS that provides a simple and intuitive software interface to load/unload reconfigurable hardware accelerators and perform data I/O transparently to the user. Multiple partially reconfigurable regions are supported, and each region can host either single- or dual-threaded accelerators, effectively virtualizing the reconfigurable resources. RACOS allows multiple applications to use one or more accelerators each, and schedules accelerators for execution according to four policies: simple and in-order, which respect the order of requests, and out-of-order and forced, which aim to reduce the number of reconfigurations. We evaluate our proposed system varying the number of instances of an accelerated application and show that, despite its generality, RACOS can achieve both high reconfiguration and data communication throughput, close to the maximum reported in the literature, at a very small resource cost, comparable to or better than the current state of the art.
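A rough Python sketch of the difference between the order-respecting and reconfiguration-avoiding policies (the queue model, single-region eviction, and function names are illustrative assumptions, not RACOS internals):

```python
from collections import deque

def schedule(requests, regions, policy="in_order"):
    """Simulate accelerator scheduling and count bitstream reconfigurations.
    requests: accelerator names in arrival order.
    regions: currently loaded accelerator per region (None = empty)."""
    queue = deque(requests)
    loaded = list(regions)
    reconfigs = 0
    order = []
    while queue:
        if policy == "out_of_order":
            # prefer a pending request whose bitstream is already loaded
            pick = next((r for r in queue if r in loaded), queue[0])
            queue.remove(pick)
        else:  # in_order: strictly respect arrival order
            pick = queue.popleft()
        if pick not in loaded:
            loaded[0] = pick  # naive eviction: always reconfigure region 0
            reconfigs += 1
        order.append(pick)
    return order, reconfigs
```

With requests A, B, A, B and accelerator A already loaded, the in-order policy reconfigures three times while the out-of-order policy reconfigures once, illustrating the trade-off the abstract describes.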
ISBN:
(Print) 9781538634370
Due to its computational complexity, the Scale-Invariant Feature Transform (SIFT) algorithm poses a challenge for use in embedded applications. To meet real-time requirements at low power, hardware acceleration is necessary. This paper presents an FPGA-based balanced processor system for real-time SIFT feature detection, containing a dedicated hardware coprocessor coupled to a custom VLIW soft-core processor via a FIFO memory. The coprocessor calculates the scale-space and performs the extrema detection for the extraction of feature candidates, whereas the VLIW soft-core processor performs sub-pixel localization and stability checks to obtain stable SIFT features. The system achieves a peak frame rate of up to 338 fps on 1,024x376 px images at less than 3 W on a Xilinx Virtex-6 FPGA. The filters within the Gaussian pyramid operate in a time-multiplexed scheme at clock frequencies of up to 400 MHz. Furthermore, this paper presents a comprehensive design space exploration, evaluating architectural performance, hardware resources, and power consumption trade-offs, as well as exposing performance-balanced and Pareto-optimal design variants.
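The coprocessor's two stages, scale-space construction and extrema detection, can be sketched in software as follows (sigma values, kernel radii, and the contrast threshold are illustrative; the paper's hardware operates on streaming pixels, not whole frames):

```python
import numpy as np

def gaussian_kernel(sigma, radius):
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x ** 2 / (2.0 * sigma ** 2))
    return k / k.sum()

def blur(img, sigma):
    # separable Gaussian filter: rows first, then columns
    k = gaussian_kernel(sigma, int(3 * sigma) + 1)
    tmp = np.apply_along_axis(np.convolve, 1, img, k, mode="same")
    return np.apply_along_axis(np.convolve, 0, tmp, k, mode="same")

def dog_extrema(img, sigmas=(1.0, 1.6, 2.56, 4.1), thresh=1e-3):
    # scale-space followed by 3x3x3 extrema detection, the two stages
    # the paper assigns to the hardware coprocessor
    scales = [blur(img, s) for s in sigmas]
    dogs = np.stack([b - a for a, b in zip(scales, scales[1:])])
    kps = []
    for s in range(1, dogs.shape[0] - 1):
        for y in range(1, img.shape[0] - 1):
            for x in range(1, img.shape[1] - 1):
                cube = dogs[s - 1:s + 2, y - 1:y + 2, x - 1:x + 2]
                v = dogs[s, y, x]
                if abs(v) > thresh and (v == cube.max() or v == cube.min()):
                    kps.append((y, x))
    return kps
```

On a synthetic Gaussian blob, the blob center shows up as a difference-of-Gaussians extremum, which is the kind of feature candidate handed off to the soft-core processor for refinement.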
ISBN:
(Digital) 9781728109459
ISBN:
(Print) 9781728109466
Deep neural networks (DNNs) have obtained compelling performance on many visual tasks at the cost of a significant increase in computation and memory consumption, which severely impedes their application on resource-constrained systems like smart mobiles or embedded devices. To solve these problems, recent efforts toward compressing DNNs have received increased focus. In this paper, we propose an effective end-to-end channel pruning approach to compress DNNs. To this end, firstly, we introduce additional auxiliary classifiers to enhance the discriminative power of shallow and intermediate layers. Secondly, we impose L1-regularization on the scaling factors and shifting factors in the batch normalization (BN) layers, and adopt the fast iterative shrinkage-thresholding algorithm (FISTA) to effectively prune the redundant channels. Finally, by forcing the selected factors to zero, we can safely prune the corresponding unimportant channels, thus obtaining a compact model. We empirically reveal the prominent performance of our approach with several state-of-the-art DNN architectures, including VGGNet and MobileNet, on different datasets. For instance, on the CIFAR-10 dataset, the pruned MobileNet achieves a 26.9x reduction in model parameters and a 3.9x reduction in computational operations with only a 0.04% increase in classification error.
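A minimal numpy sketch of the shrinkage-and-prune idea, assuming the standard ISTA/FISTA proximal step for the L1 penalty (function names and the pruning threshold are illustrative, not the paper's code):

```python
import numpy as np

def soft_threshold(gamma, lam, lr):
    # proximal (shrinkage) step used by ISTA/FISTA for the L1 penalty on
    # the BN scaling factors: small factors are driven exactly to zero
    return np.sign(gamma) * np.maximum(np.abs(gamma) - lr * lam, 0.0)

def prune_channels(weights, gamma, beta, eps=1e-4):
    # channels whose scaling factor was forced to (near) zero are removed,
    # yielding a structurally smaller, compact layer
    keep = np.abs(gamma) > eps
    return weights[keep], gamma[keep], beta[keep], keep
```

For example, scaling factors [1.0, 0.05, -0.3, 0.02] shrunk with step 0.1 become [0.9, 0.0, -0.2, 0.0], so the second and fourth channels can be pruned safely.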
ISBN:
(Print) 9781538634370
The Number Theoretic Transform (NTT) is a necessary part of most lattice-based cryptographic schemes. In particular, it offers an efficient means to achieve polynomial multiplication within the more efficient ring-based schemes. The NTT is also a crucial component whose implementation is critical, since it is often the bottleneck and the most resource-consuming block of the whole design. As a result, the NTT is an appealing target for exploring different architectures and design trade-offs. In this paper, we compare various optimization strategies applied to maximize performance or to reduce resource utilization. Our analysis covers general-purpose processors as well as dedicated hardware implemented on reconfigurable platforms and on ASICs. Previously explored design strategies range from the traditional computation, where the multiplicative factors (called twiddle factors) are calculated on the fly, versus a memory trade-off (using memory to store pre-computed twiddle factors), to the use of different butterfly designs for implementing the Fast Fourier Transform and its inverse in software, and the sharing of resources between hardware implementations of the forward and inverse NTT. The problem of side-channel resistance is also addressed, discussing designs which are robust against power analysis attacks.
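For reference, a minimal Python NTT with precomputed twiddle factors, using the schoolbook O(n^2) form rather than the butterfly networks the paper compares (the parameters q = 17, omega = 9, n = 8 are a toy instance, far smaller than cryptographic sizes):

```python
def ntt(a, omega, q):
    # textbook O(n^2) transform; a real design would use an FFT-style
    # butterfly network with log(n) stages instead
    n = len(a)
    tw = [pow(omega, k, q) for k in range(n)]  # precomputed twiddle factors
    return [sum(a[j] * tw[(i * j) % n] for j in range(n)) % q
            for i in range(n)]

def intt(A, omega, q):
    # inverse transform: same kernel with omega^-1, scaled by n^-1
    n = len(A)
    a = ntt(A, pow(omega, q - 2, q), q)
    n_inv = pow(n, q - 2, q)
    return [(x * n_inv) % q for x in a]

def polymul(a, b, omega, q):
    # pointwise product in the NTT domain realizes polynomial
    # multiplication modulo x^n - 1 (cyclic convolution)
    A, B = ntt(a, omega, q), ntt(b, omega, q)
    return intt([x * y % q for x, y in zip(A, B)], omega, q)
```

Storing `tw` up front versus recomputing each `pow(omega, k, q)` inside the loop is exactly the memory-versus-computation trade-off the paper explores for twiddle factors.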
ISBN:
(Print) 9783319992297; 9783319992280
Autonomous Vehicles (AVs) are expected to provide relevant benefits to society in terms of safety, efficiency, and accessibility. However, AVs are safety-critical systems, and it is mandatory to assure that they will be safe when operating on public roads; yet the safety of AVs is still an open and challenging issue. A combination of simulation, test-track, and on-road testing approaches is recommended to validate AV safety performance. Testing AVs in real-world scenarios is a widely used, but neither an efficient nor a safe, approach to validating safety. Therefore, simulation-based approaches are in demand. Motivated by this challenge, we have developed a simulation-based safety analysis framework, based on open-source tools, aimed at future road transportation systems. However, the open-source tools we have adopted for the framework have limitations in modeling real-world elements, especially perception sensors. We thus present the extensions made to these open-source tools, focused on the development of a perception sensor model in the native OpenDS tool, which enables detecting obstacles around the vehicle, considering the same main characteristics observed in Radar and LiDAR sensors. As the main conclusion, these tool enhancements have improved the capabilities of the simulation-based safety analysis framework for modeling, simulating, and analyzing, in a more precise way and for safety validation purposes, the behavior of AVs in simulated traffic scenarios when different embedded detection sensor characteristics are considered in their deployment.
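A minimal sketch of such a perception sensor model, reduced to a range and field-of-view test (the pose convention and function names are assumptions, not the actual interface of the OpenDS extension):

```python
import math

def detect(vehicle, heading_deg, obstacles, max_range, fov_deg):
    """Idealized Radar/LiDAR-style sensor: return the obstacles that fall
    inside the sensor's maximum range and horizontal field of view."""
    vx, vy = vehicle
    hits = []
    for ox, oy in obstacles:
        dx, dy = ox - vx, oy - vy
        dist = math.hypot(dx, dy)
        bearing = math.degrees(math.atan2(dy, dx)) - heading_deg
        bearing = (bearing + 180) % 360 - 180  # normalize to [-180, 180)
        if dist <= max_range and abs(bearing) <= fov_deg / 2:
            hits.append((ox, oy, dist))
    return hits
```

Varying `max_range` and `fov_deg` corresponds to studying how different embedded detection sensor characteristics affect AV behavior in the simulated traffic scenarios.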
ISBN:
(Print) 9781538634370
Spark is one of the most widely used frameworks for data analytics that offers fast development of applications like machine learning and graph computations in distributed systems. In this paper, we present SPynq: a framework for the efficient utilization of hardware accelerators over the Spark framework on heterogeneous MPSoC FPGAs, such as Zynq. Spark has been mapped to the Pynq platform, and the proposed framework allows the seamless utilization of the programmable logic for the hardware acceleration of computationally intensive Spark kernels. We have also developed the required libraries in Spark that hide the accelerator details, minimizing the design effort needed to utilize the accelerators. A cluster of 4 nodes (workers) based on the all-programmable MPSoCs has been implemented, and the proposed platform is evaluated on a typical machine learning application based on logistic regression. The logistic regression kernel has been developed as an accelerator and incorporated into Spark. The developed system is compared to a high-performance Xeon cluster that is typically used in cloud computing. The performance evaluation shows that the heterogeneous accelerator-based MPSoC can achieve up to 2.3x system speedup compared with a Xeon system (at 90% accuracy) and 20x better energy efficiency. For embedded applications, the proposed system can achieve up to 40x speedup compared to a software-only implementation on low-power embedded processors, with 30x lower energy consumption.
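The library idea of hiding the accelerator behind an unchanged call can be sketched as follows (class and method names are hypothetical; the FPGA path is stubbed out, since bitstream loading is platform-specific):

```python
import numpy as np

def lr_gradient_sw(X, y, w):
    # software fallback: logistic-regression gradient on the CPU
    p = 1.0 / (1.0 + np.exp(-X @ w))
    return X.T @ (p - y) / len(y)

class AccelLRGradient:
    """Hypothetical wrapper in the spirit of SPynq: user code always calls
    gradient(); whether it runs on the FPGA fabric or in software is
    hidden behind the same interface."""
    def __init__(self, use_fpga=False):
        self.use_fpga = use_fpga  # a real port would load a bitstream here

    def gradient(self, X, y, w):
        if self.use_fpga:
            raise NotImplementedError("FPGA path is platform-specific")
        return lr_gradient_sw(X, y, w)
```

The point of such a wrapper is that Spark application code stays identical whether the kernel executes on the Xeon cluster, the ARM cores, or the programmable logic.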
ISBN:
(Print) 9781538634370
Transport Triggered Architecture (TTA) processors allow unique low-level compiler optimizations such as software bypassing and operand sharing. Previously, these optimizations have mostly been performed inside single basic blocks, leaving much of their potential unused. In this work, software bypassing and operand sharing are integrated with loop scheduling, allowing optimizations across loop iteration boundaries. This considerably reduces register file accesses and immediate value transfers in tight loops, in some cases even eliminating all register file accesses from the loop body. In the 12 benchmarked small loops, compared to traditional VLIW-style processors, on average 63% of register file reads and 77% of register file writes could be eliminated. Compared to a compiler which performs these optimizations only inside a basic block, on average 58% of register file reads and 28% of register file writes could be eliminated. The additional register access reductions allow both direct energy savings from fewer register accesses and indirect energy savings by allowing the use of simpler register files with fewer read and write ports and a simpler interconnect network with fewer transport buses.
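A toy model of how bypassing removes register file traffic (single-assignment registers and a one-instruction-deep bypass are simplifying assumptions; a real TTA compiler works on scheduled transport-level code):

```python
def count_rf_accesses(instrs, bypass=False):
    """instrs: list of (dest, srcs) over register names. Counts register
    file accesses, optionally applying a one-cycle-deep software bypass:
    a read of the previous instruction's result comes straight off the
    function-unit port, and a write whose only consumer is the next
    instruction is dropped entirely."""
    uses = {}
    for _, srcs in instrs:
        for s in srcs:
            uses[s] = uses.get(s, 0) + 1
    reads = writes = 0
    for i, (dest, srcs) in enumerate(instrs):
        prev_dest = instrs[i - 1][0] if i > 0 else None
        for s in srcs:
            if not (bypass and s == prev_dest):
                reads += 1  # read must go through the register file
        sole_next_use = (i + 1 < len(instrs) and uses.get(dest, 0) == 1
                         and dest in instrs[i + 1][1])
        if not (bypass and sole_next_use):
            writes += 1  # result must be committed to the register file
    return reads, writes
```

In a three-instruction chain, bypassing cuts reads from 6 to 3 and writes from 3 to 2; extending the same idea across loop iteration boundaries is what lets the paper eliminate register file accesses from entire loop bodies.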
ISBN:
(Print) 9781538634370
Deep neural networks have been widely applied in many areas, such as computer vision, natural language processing, and information retrieval. However, due to their high computation and memory demands, deep learning applications have not been adopted in edge learning. In this paper, we exploit the sparsity in tensors to reduce computation overheads and memory demands. Unlike other approaches, which rely on hardware accelerator designs or sacrifice model accuracy for performance by pruning parameters, we adaptively partition and deploy the workload to heterogeneous devices to reduce computation and memory requirements and increase computing efficiency. We implemented our partitioning algorithms in Google's TensorFlow and evaluated them on an AMD Kaveri system, an HSA-based heterogeneous computing system. Our method effectively reduces computation time, cache accesses, and cache miss rates without impacting the accuracy of the learning models. Our approach achieves 66% and 88% speedup for the lenet-5 model and the lenet-1024-1024 model, respectively. For memory traffic, our approach reduces instruction cache references by 71% and data cache references by 32%. Our system also improves the cache miss rate from 1.6% to 0.5% during training of the lenet-1024-1024 model.
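The partitioning idea can be sketched for a single sparse matrix-vector product: dense rows go to a throughput-oriented device as a plain GEMV, sparse rows to a device that skips zeros (the density threshold and function names are illustrative, not the paper's TensorFlow implementation):

```python
import numpy as np

def partition_by_sparsity(W, threshold=0.5):
    # per-row nonzero density decides which device handles each row
    density = (W != 0).mean(axis=1)
    return density >= threshold  # True = dense partition

def matvec_partitioned(W, x, dense_mask):
    y = np.zeros(W.shape[0])
    y[dense_mask] = W[dense_mask] @ x          # dense device: plain GEMV
    for i in np.where(~dense_mask)[0]:         # sparse device: skip zeros
        idx = np.nonzero(W[i])[0]
        y[i] = W[i, idx] @ x[idx]
    return y
```

Because both partitions compute exact dot products, the result matches the unpartitioned product, which mirrors the paper's claim of reduced work without any accuracy loss.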
ISBN:
(Print) 9781538634370
Disaggregated computing aims at overcoming the problem of fixed resource proportionality in existing infrastructures while advancing resource allocation to virtual machines, which is currently restricted by the physical boundaries of a server tray. Organizing resources into large homogeneous pools (e.g., compute, memory, accelerators) enables the demand-driven, fine-grained allocation of resources, effectively leading to improved resource utilization and significant power savings. However, the success of this approach relies on how efficiently the underlying resources are utilized by the software application. To facilitate software development in disaggregated computing environments, we introduce a versatile multi-FPGA evaluation platform that can serve as an early exploration tool for the involved trade-offs and execution alternatives given the application at hand. To increase the functionality of the proposed development/evaluation platform, we consider three types of building blocks, namely compute, memory, and accelerator blocks, providing the developer with the option to instantiate and interconnect them in proportion to the application demands, thus facilitating both compute- and memory-intensive applications. We have implemented a fully fledged prototype platform, based on three interconnected Zynq boards, and rely on a thin user-level API to allocate compute and memory resources on remote blocks, transfer data, and deploy reconfigurable accelerators. As a case study, we employ one of the Seven Dwarfs of Symbolic Computation, the matrix multiply benchmark.
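A mock of what such a thin user-level API might look like (all names are illustrative; the real prototype talks to remote Zynq boards over an interconnect rather than to local Python objects):

```python
class RemoteBlock:
    """Mock of a thin disaggregated-resource API: allocate memory on a
    remote block, transfer data in and out, and deploy/invoke an
    accelerator on it. Names here are assumptions, not the paper's API."""
    def __init__(self, size):
        self.mem = bytearray(size)  # stand-in for remote block memory
        self.accels = {}

    def write(self, off, data):
        self.mem[off:off + len(data)] = data  # host-to-block transfer

    def read(self, off, n):
        return bytes(self.mem[off:off + n])   # block-to-host transfer

    def deploy(self, name, fn):
        self.accels[name] = fn  # stand-in for loading a bitstream

    def invoke(self, name, off, n):
        # run the deployed accelerator in place on a memory region
        out = self.accels[name](self.read(off, n))
        self.write(off, out)
```

An application would instantiate one such handle per compute, memory, or accelerator block and compose them in proportion to its demands, which is the exploration workflow the platform is meant to support.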
ISBN:
(Print) 9783319866154
This volume highlights new trends and challenges in research on agents and the new digital and knowledge economy, and includes 23 papers classified into the following categories: business process management, agent-based modeling and simulation, and anthropic-oriented computing. All papers were originally presented at the 11th International KES Conference on Agents and Multi-Agent Systems: Technologies and Applications (KES-AMSTA 2017), held June 21-23, 2017 in Vilamoura, Algarve, Portugal. Today's economy is driven by technologies and knowledge. Digital technologies can free, shift, and multiply choices, and often intrude on the territory of other industries by providing new ways of conducting business operations and creating value for customers and companies. The topics covered in this volume include software agents, multi-agent systems, agent modeling, mobile and cloud computing, big data analysis, business intelligence, artificial intelligence, social systems, computer-embedded systems, and nature-inspired manufacturing, all of which contribute to the modern digital economy. The results presented here will be of theoretical and practical value to researchers and industrial practitioners working in the fields of artificial intelligence, collective computational intelligence, innovative business models, the new digital and knowledge economy and, in particular, agent and multi-agent systems, technologies, tools, and applications.