检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

16,215 篇 会议
373 篇 期刊文献
22 册 图书

馆藏范围

16,610 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

9,329 篇 工学
- 8,530 篇 计算机科学与技术...
- 4,011 篇 软件工程
- 1,983 篇 电气工程
- 1,380 篇 信息与通信工程
- 676 篇 电子科学与技术（可...
- 535 篇 控制科学与工程
- 226 篇 网络空间安全
- 188 篇 仪器科学与技术
- 141 篇 机械工程
- 115 篇 生物医学工程（可授...
- 106 篇 动力工程及工程热...
- 105 篇 测绘科学与技术
- 97 篇 光学工程
- 91 篇 生物工程
- 82 篇 建筑学
- 70 篇 土木工程
- 63 篇 环境科学与工程（可...
- 59 篇 安全科学与工程
1,969 篇 理学
- 1,502 篇 数学
- 244 篇 物理学
- 203 篇 统计学（可授理学、...
- 177 篇 系统科学
- 115 篇 生物学
- 100 篇 地球物理学
- 69 篇 化学
1,461 篇 管理学
- 1,203 篇 管理科学与工程(可...
- 467 篇 工商管理
- 321 篇 图书情报与档案管...
106 篇 医学
- 86 篇 临床医学
96 篇 经济学
- 93 篇 应用经济学
56 篇 法学
53 篇 农学
16 篇 教育学
12 篇 文学
9 篇 军事学
1 篇 艺术学

主题

2,209 篇 parallel process...
1,198 篇 computer archite...
1,130 篇 concurrent compu...
1,119 篇 distributed comp...
1,062 篇 computational mo...
1,037 篇 application soft...
1,018 篇 distributed proc...
989 篇 hardware
906 篇 computer science
702 篇 graphics process...
595 篇 runtime
526 篇 scalability
518 篇 parallel process...
507 篇 algorithm design...
493 篇 parallel program...
490 篇 parallel algorit...
471 篇 graphics process...
458 篇 kernel
447 篇 processor schedu...
440 篇 conferences

机构

38 篇 ibm thomas j. wa...
33 篇 college of compu...
31 篇 school of comput...
27 篇 oak ridge nation...
26 篇 university of ch...
26 篇 oak ridge natl l...
26 篇 ohio state univ ...
25 篇 georgia inst tec...
24 篇 department of co...
23 篇 tsinghua univers...
23 篇 pacific northwes...
21 篇 argonne national...
21 篇 oak ridge nation...
20 篇 georgia inst tec...
19 篇 college of compu...
19 篇 school of comput...
19 篇 department of co...
19 篇 argonne natl lab...
19 篇 pacific northwes...
19 篇 national laborat...

作者

39 篇 jack dongarra
31 篇 dongarra jack
29 篇 zomaya albert y.
26 篇 bader david a.
23 篇 feng wu-chun
22 篇 boukerche azzedi...
19 篇 hoefler torsten
18 篇 gagan agrawal
18 篇 schulz martin
16 篇 dhabaleswar k. p...
16 篇 p. sadayappan
16 篇 wang yijie
15 篇 ito yasuaki
15 篇 yves robert
14 篇 h. casanova
14 篇 alexey lastovets...
14 篇 azad ariful
13 篇 dongsheng li
13 篇 wang guojun
13 篇 kishore kothapal...

语言

16,550 篇 英文
30 篇 其他
27 篇 中文
2 篇 土耳其文
1 篇 葡萄牙文

检索条件"任意字段=IEEE International Symposium on Parallel and Distributed Processing with Applications"

共 16610 条记录，以下是4981-4990 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Cost-driven hybrid configuration prefetching for partial reconfigurable coprocessor

Cost-driven hybrid configuration prefetching for partial rec...

引用

21st international parallel and distributed processing symposium, IPDPS 2007

作者： Chen, Ying Chen, Simon Y. School of Engineering San Francisco State University 1600 Holloway Ave. San Francisco CA 94132 DSP Group Inc. 3120 Scott Blvd. Santa Clara CA 95054

ISBN: (纸本)1424409101

Reconfigurable computing systems have developed the capability of changing the configuration of the reconfigurable coprocessor multiple times during the course of a program. However, in most systems the reconfigurable coprocessor wastes computation cycles while waiting for the reconfiguration to complete. Therefore, the high demand for frequent run-time reconfiguration directly translates into higher reconfiguration overhead. Some studies have introduced the concept of prefetching to reduce the reconfiguration overhead. However, these prefetching algorithms are probability-driven. We believe that including configuration size information in the prediction algorithm directly links the training of the predictor with the performance gain. Therefore we proposed a performanceoriented cost-driven algorithm for coarse-grained configuration prefetching. Our cycle accurate simulation results show that the proposed cost-driven algorithm outperforms the probability-driven predictor by 10.8% to 29.6% in reducing reconfiguration overhead. © 2007 ieee.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Virtual distro dispatcher: A costless distributed virtual environment from trashware

引用

5th international symposium on parallel and distributed processing and applications/ISPA 2007 international Workshops

作者： Bertini, Flavio lamanna, D. Davide Baldoni, Roberto Univ Roma La Sapienza Dipartimento Informat & Sistemist Antonio Ruberti Rome Italy

ISBN: (纸本)9783540747413

Obsolete hardware can be effectively reused through intelligent software optimization, which is possible only when source code is available. Virtual Distro Dispatcher (VDD) is a system that produces virtual machines on a central server and projects them on a number of costless physical terminals. VDD is the result of an extreme software optimisation based on virtualization and terminal servers. VDD creates and projects Linux distros that are completely customizable and different from each other. They are virtual desktop machines that can be used for testing or developing and are completely controllable directly from each terminal. Memory consumption has been strongly reduced without sacrificing performances. Test results are encouraging to proceed with the research towards clustering.

关键词： trashware LTSP user mode linux clustering virtualization

来源：评论

学校读者我要写书评

暂无评论

GPU-ABiSort: Optimal parallel sorting on stream architectures 20

GPU-ABiSort: Optimal parallel sorting on stream architecture...

引用

20th ieee international parallel and distributed processing symposium, IPDPS 2006

作者： Greß, Alexander Zachmann, Gabriel Institute of Computer Science II Rhein. Friedr.-Wilh.-Universität Bonn Bonn Germany Institute of Computer Science Clausthal University of Technology Clausthal Germany

ISBN: (纸本)1424400546

In this paper, we present a novel approach for parallel sorting on stream processing architectures. It is based on adaptive bitonic sorting. For sorting n values utilizing p stream processor units, this approach achieves the optimal time complexity O((n log n)/p). While this makes our approach competitive with common sequential sorting algorithms not only from a theoretical viewpoint, it is also very fast from a practical viewpoint. This is achieved by using efficient linear stream memory accesses (and by combining the optimal time approach with algorithms optimized for small input sequences). We present an implementation on modern programmable graphics hardware (GPUs). On recent GPUs, our optimal parallel sorting approach has shown to be remarkably faster than sequential sorting on the CPU, and it is also faster than previous non-optimal sorting approaches on the GPU for sufficiently large input sequences. Because of the excellent scalability of our algorithm with the number of stream processor units p (up to n/ log2n or even n/log n units, depending on the stream architecture), our approach profits heavily from, the trend of increasing number of fragment processor units on GPUs, so that we can expect further speed improvement with upcoming GPU generations. © 2006 ieee.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Smart Video Hosting and processing Platform for Internet-of-Things

Smart Video Hosting and Processing Platform for Internet-of-...

引用

ieee international Conference on Internet of Things ( iThings) / ieee international Conference on Green Computing and Communications (GreenCom) / ieee international Conference on Cyber-Physical-Social Computing (CPS)

作者： Ting, Wei-Chih Lu, Kun-Hsien Lo, Chi-Wen Chang, Shu-Hsin Liu, Pin-Chuan Ind Technol Res Inst Hsinchu Taiwan

ISBN: (纸本)9781479959679

Due to the rapid improvement in resolution and codec, the number of video sensing device grows fast in the recent years. Tremendous data need to be stored and processed. To meet such a need, we developed a video hosting and processing platform for near-realtime applications. An intelligent device can obtain connection-less upload function from our client-side SDK. Video data are sliced into pieces, distributed over cloud-of-clouds and can be processed in parallel once stored. Various video processing algorithms can be mounted together and processed by multiple CPU cores. Performance evaluations show that our platform has the ability to host and process large-scale video data.

关键词： internet-of-things video hosting parallel processing

来源：评论

学校读者我要写书评

暂无评论

Splitting TCP for MPI applications executed on grids

Splitting TCP for MPI applications executed on grids

引用

作者： Glück, Olivier Mignot, Jean-Christophe ENS Lyon INRIA CNRS Université de Lyon 69364 Lyon Cedex 07 France

ISBN: (纸本)9780769544281

In this paper, we first study the interaction between MPI applications and TCP on grids. Then, we propose MPI5000, a transparent applicative layer between MPI and TCP, using proxies to improve the execution of MPI applications on grids. Proxies aim at splitting TCP connections in order to detect losses faster and avoid to return in ci slow-start phase after an idle time. Finally, we evaluate our layer executing the NAS parallel Benchmarks on Grid'5000, the French research grid. The results show that our architecture reduces the number of idle timeout and of long-distance retransmissions for BT, SP and LU benchmarks. Using MPI5000, these applications can decrease their execution time by 35%, 28%, and, 15% respectively. A comparison with MPICH-G2 performances shows that our layer can even outperform a grid enabled MPI implementation. © 2011 ieee.

关键词： Transmission control protocol

来源：评论

学校读者我要写书评

暂无评论

Partitioning programming environment for a novel parallel architecture

Partitioning programming environment for a novel parallel ar...

引用

Proceedings of the 1996 10th international parallel processing symposium

作者： Hartenstein, R. Becker, J. Herz, M. Kress, R. Nageldinger, U. Universitaet Kaiserslautern Kaiserslautern Germany

The paper presents a partitioning and parallelizing programming environment for a novel parallel architecture. This universal embedded accelerator is based on a reconfigurable datapath hardware. The partitioning and parallelizing programming environment accepts C-programs and carries out both, a profiling-driven host/ accelerator partitioning for performance optimization in a first step, and in a second step a resource-driven sequential/ structural partitioning of the accelerator source code to optimize the utilization of its reconfigurable resources.

关键词： Computer systems programming

来源：评论

学校读者我要写书评

暂无评论

Algorithms for all-to-all personalized exchange in 2D and 3D tori

Algorithms for all-to-all personalized exchange in 2D and 3D...

引用

Proceedings of the 1996 10th international parallel processing symposium

作者： Suh, Young-Joo Yalamanchili, Sudhakar Georgia Inst of Technology Atlanta United States

The inter-processor all-to-all communication patterns can be found in many important parallel algorithms. This paper presents new algorithms for all-to-all personalized exchange for circuit switched or wormhole routed 2D and 3D torus connected multiprocessors. The algorithms use message combining to minimize message startups at the expense of larger message sizes. The unique feature of these algorithms is that they are the first algorithms that we know of that operate in a bottom-up fashion rather than a recursive top-down manner.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Treating a user-defined parallel library as a domain-specific language 16

Treating a user-defined parallel library as a domain-specifi...

引用

16th international parallel and distributed processing symposium, IPDPS 2002

作者： Quinlan, D.J. Miller, B. Philip, B. Schordan, M. Center for Applied Scientific Computing Lawrence Livermore National Laboratory LivermoreCA United States

ISBN: (纸本)0769515738

The software crisis within scientific computing has been that application codes become larger and more complex. The only conceivable solution is to make application codes smaller and less complex. We know of no way to resolve this crisis, except to make each line of code mean more;this is the process of defining high-level abstractions. Achieving high-performance from high-level abstractions represents an essential key to simplifying scientific software. This paper presents several high-level abstractions used within scientific computing. These abstractions are part of multiple object-oriented libraries and represent complex and precise semantics. In each case the semantics of the abstraction is user-defined and ignored by the compilation process at a significant performance penalty for the application code. Our research work presents a mechanism to analyze and optimize the use of high-level abstractions within scientific applications. In this paper, we show that the high-level abstractions are not just significantly easier to use in the development of application code but can be made to perform equivalently to hand-coded C and Fortran. Our research work shows how to effectively treat any object-oriented library and its abstractions as if it where a domain-specific language with equivalent builtin types and specialized compile-time analysis and optimizations. With acceptable performance of high-level abstractions within scientific software, we expect that application codes can be made smaller and less complex;allowing much more complex applications to be built in the future. © 2002 ieee.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Adaptive distributed Data Structure Management for parallel CFD applications

Adaptive Distributed Data Structure Management for Parallel ...

引用

15th international symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC)

作者： Frisch, Jerome Mundani, Ralf-Peter Rank, Ernst Tech Univ Munich Chair Computat Engn D-80290 Munich Germany

ISBN: (纸本)9781479930357

Computational fluid dynamics (CFD) simulations require a lot of computing resources in terms of CPU time and memory in order to compute with a reasonable physical accuracy. If only uniformly refined domains are applied, the amount of computing cells is growing rather fast if a certain small resolution is physically required. This can be remedied by applying adaptively refined grids. Unfortunately, due to the adaptive refinement procedures, errors are introduced which have to be taken into account. This paper is focussing on implementation details of the applied adaptive data structure management and a qualitative analysis of the introduced errors by analysing a Poisson problem on the given data structure, which has to be solved in every time step of a CFD analysis. Furthermore an adaptive CFD benchmark example is computed, showing the benefits of an adaptive refinement as well as measurements of parallel data distribution and performance.

关键词： parallel computation adaptive data structure message passing paradigm multi-grid-like solver concept

来源：评论

学校读者我要写书评

暂无评论

CoMeFa: Compute-in-Memory Blocks for FPGAs 30

CoMeFa: Compute-in-Memory Blocks for FPGAs

引用

ieee 30th international symposium on Field-Programmable Custom Computing Machines (FCCM)

作者： Arora, Aman Anand, Tanmay Borda, Aatman Sehgal, Rishabh Hanindhito, Bagus Kulkarni, Jaydeep John, Lizy K. Univ Texas Austin Austin TX 78712 USA

ISBN: (纸本)9781665483322

Block RAMs (BRAMs) are the storage houses of FPGAs, providing extensive on-chip memory bandwidth to the compute units implemented using Logic Blocks (LBs) and Digital Signal processing (DSP) slices. We propose modifying BRAMs to convert them to CoMeFa (Compute-In-Memory Blocks for FPGAs) RAMs. These RAMs provide highly-parallel compute-in-memory by combining computation and storage capabilities in one block. CoMeFa RAMs utilize the true dual port nature of FPGA BRAMs and contain multiple programmable single-bit bit-serial processing elements. CoMeFa RAMs can be used to compute in any precision, which is extremely important for evolving applications like Deep Learning. Adding CoMeFa RAMs to FPGAs significantly increases their compute density. We explore and propose two architectures of these RAMs: CoMeFa-D (optimized for delay) and CoMeFa-A (optimized for area). Compared to existing proposals, CoMeFa RAMs do not require changing the underlying SRAM technology like simultaneously activating multiple rows on the same port, and are practical to implement. CoMeFa RAMs are versatile blocks that find applications in numerous diverse parallel applications like Deep Learning, signal processing, databases, etc. By augmenting an Intel Arria-10-like FPGA with CoMeFa-D (CoMeFa-A) RAMs at the cost of 3.8% (1.2%) area, and with algorithmic improvements and efficient mapping, we observe a geomean speedup of 2.55x (1.85x), across several representative benchmarks. Replacing all or some BRAMs with CoMeFa RAMs in FPGAs can make them better accelerators of modern compute-intensive workloads.

关键词： Deep learning Power demand Random access memory Signal processing algorithms Digital signal processing parallel processing System-on-chip

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 491 492 493 494 495 496 497 498 499 500 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：