ISBN:
(Print) 0780378407
Designing a Java processor that supports horizontal multithreading has become more attractive as network computing gains importance. Unlike traditional superscalar processors, which issue multiple instructions from a single instruction stream to exploit instruction-level parallelism (ILP), horizontal multithreading Java processors issue multiple instructions (bytecodes) from multiple threads in parallel to exploit not only ILP but also thread-level parallelism (TLP). Such processors have multiple dispatch slots and require the instruction fetch unit to supply instructions at much higher bandwidth than superscalar processors. Using a traditional superscalar cache architecture in a horizontal multithreading Java processor results in a high cache miss ratio caused by interference among the threads. This paper investigates a multibank instruction cache architecture for horizontal multithreading Java processors that meets the requirement of high instruction fetch bandwidth. To evaluate the cache performance as well as the overall performance of the horizontal multithreading Java processor, we developed a trace-driven simulator. The simulator consists of a trace generator that produces Java bytecode execution traces and an architectural simulator that reads the traces and evaluates the performance of the instruction cache and the overall performance of the Java processor. Our simulation results show that performance improvements are obtained from the low cache miss ratio and the high instruction fetch bandwidth of the proposed cache architecture. The IPC is about 19 when both the number of slots and the number of banks are 8, about five times better than a one-bank cache.
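The bank-interleaving idea behind such a cache can be sketched in a few lines. The model below is illustrative only (it is not the paper's simulator, and the line size and bank-selection function are assumptions): cache lines are interleaved across banks by low-order line-address bits, so simultaneous fetches from different threads proceed in parallel unless they map to the same bank.

```java
// Toy model of a multibank instruction cache: lines are interleaved
// across banks, so per-cycle fetches from different threads conflict
// only when they select the same bank.
public class MultibankICache {
    static final int LINE_BITS = 5;   // 32-byte cache lines (assumed)
    final int numBanks;

    MultibankICache(int numBanks) { this.numBanks = numBanks; }

    // Bank selected by the low-order bits of the line address.
    int bankOf(int address) {
        return (address >>> LINE_BITS) % numBanks;
    }

    // How many of this cycle's fetch addresses stall because an
    // earlier thread already claimed the same bank.
    int conflicts(int[] fetchAddresses) {
        boolean[] busy = new boolean[numBanks];
        int stalls = 0;
        for (int addr : fetchAddresses) {
            int b = bankOf(addr);
            if (busy[b]) stalls++;
            else busy[b] = true;
        }
        return stalls;
    }

    public static void main(String[] args) {
        MultibankICache cache = new MultibankICache(8);
        // Four threads fetching consecutive lines: no conflicts.
        System.out.println(cache.conflicts(new int[]{0x00, 0x20, 0x40, 0x60})); // 0
        // Four threads fetching the same line: three stall.
        System.out.println(cache.conflicts(new int[]{0x100, 0x100, 0x100, 0x100})); // 3
    }
}
```

This is why interference among threads matters: with one bank, every multi-thread fetch cycle serializes, while with as many banks as dispatch slots the common case is conflict-free.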
ISBN:
(Print) 9781581137125
Java is becoming the main software platform for consumer and embedded devices such as mobile phones, PDAs, TV set-top boxes, and in-vehicle systems. Since many of these systems are memory constrained, it is extremely important to keep the memory footprint of Java applications under control. The goal of this work is to enable the execution of Java applications using a smaller heap footprint than is possible with current embedded JVMs. We propose a set of memory management strategies to reduce the heap footprint of embedded Java applications that execute under severe memory constraints. Our first contribution is a new garbage collector, referred to as the Mark-Compact-Compress (MCC) collector, that allows an application to run with a heap smaller than its footprint. An important characteristic of this collector is that it compresses objects when heap compaction is not sufficient to create space for the current allocation request. In addition to employing compression, we also consider a heap management strategy and associated garbage collector, called MCL (Mark-Compact-Lazy Allocate), based on lazy allocation of object portions. This new collector operates like the conventional Mark-Compact (MC) collector, but takes advantage of the observation that many Java applications create large objects of which only a small portion is actually used. In addition, we combine MCC and MCL and present MCCL (Mark-Compact-Compress-Lazy Allocate), which outperforms both MCC and MCL. We have implemented these collectors using KVM and performed extensive experiments with a set of ten embedded Java applications. We have found our new garbage collection strategies to be useful in two main aspects. First, they reduce the minimum heap size necessary to execute an application without an out-of-memory exception. Second, our strategies reduce heap occupancy; that is, at a given time, they reduce the heap memory requirement of the application being executed. We have also conducted experiments…
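The compress-on-demand idea can be illustrated with a small sketch. This is not the KVM implementation: the class and method names are invented, compaction and decompression-on-access are elided, and `java.util.zip.Deflater` merely stands in for the paper's customized compression. The point shown is MCC's fallback: when an allocation does not fit, resident objects are compressed in place until the request fits.

```java
// Toy sketch of the MCC fallback: compress resident objects to make
// room when an allocation request exceeds the remaining heap budget.
import java.util.ArrayList;
import java.util.List;
import java.util.zip.Deflater;

public class MccSketch {
    final int capacity;
    int used = 0;
    final List<byte[]> objects = new ArrayList<>();

    MccSketch(int capacity) { this.capacity = capacity; }

    static byte[] compress(byte[] data) {
        Deflater d = new Deflater();
        d.setInput(data);
        d.finish();
        byte[] buf = new byte[data.length + 64]; // large enough for one pass
        int n = d.deflate(buf);
        d.end();
        byte[] out = new byte[n];
        System.arraycopy(buf, 0, out, 0, n);
        return out;
    }

    boolean allocate(byte[] obj) {
        // Compress already-resident objects until the request fits
        // (heap compaction, which MCC tries first, is elided here).
        for (int i = 0; used + obj.length > capacity && i < objects.size(); i++) {
            byte[] old = objects.get(i);
            byte[] packed = compress(old);
            if (packed.length < old.length) {
                objects.set(i, packed);
                used -= old.length - packed.length;
            }
        }
        if (used + obj.length > capacity) return false;
        objects.add(obj);
        used += obj.length;
        return true;
    }

    public static void main(String[] args) {
        MccSketch heap = new MccSketch(1024);
        heap.allocate(new byte[900]);              // fits uncompressed
        boolean ok = heap.allocate(new byte[300]); // forces compression of the first object
        System.out.println(ok);
    }
}
```

A real collector would also track which objects are compressed and decompress them lazily on first access, which is where the runtime overhead the paper measures comes from.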
ISBN:
(Print) 9781581132007
To date, systems offering multitasking for the Java™ programming language use either one process or one class loader for each application. Both approaches are unsatisfactory. Using operating system processes is expensive, scales poorly, and does not fully exploit the protection features inherent in a safe language. Class loaders replicate application code, obscure the type system, and treat 'trusted' and 'untrusted' classes non-uniformly, which leads to subtle but potentially harmful forms of undesirable inter-application interaction. In this paper we propose a novel, simple yet powerful solution. The new model improves on existing designs in terms of resource utilization while offering strong isolation among applications. The approach is applicable both on high-end servers and on small devices. The main idea is to maintain only one copy of every class, regardless of how many applications use it. Classes are transparently and automatically modified so that each application has a separate copy of its static fields. Two prototypes are described and selected performance data is analyzed. Various aspects of the proposed architectural changes to the Java virtual machine are discussed.
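The per-application static-field idea can be sketched as follows. All names here are illustrative (the actual systems rewrite `getstatic`/`putstatic` bytecodes and would index a per-application table rather than hash on strings): one copy of a class serves many applications, while each application reads and writes its own copy of the statics.

```java
// Toy sketch: a static field access becomes a lookup keyed by the
// current application, so a single loaded class yields per-application
// static state.
import java.util.HashMap;
import java.util.Map;

public class IsolatedStatics {
    // One slot per (application, field) pair.
    static final Map<String, Map<String, Object>> perApp = new HashMap<>();

    static void putStatic(String app, String field, Object value) {
        perApp.computeIfAbsent(app, a -> new HashMap<>()).put(field, value);
    }

    static Object getStatic(String app, String field) {
        return perApp.getOrDefault(app, Map.of()).get(field);
    }

    public static void main(String[] args) {
        putStatic("app1", "Counter.value", 1);
        putStatic("app2", "Counter.value", 99);
        // Same class, same field name, two isolated values.
        System.out.println(getStatic("app1", "Counter.value")); // 1
        System.out.println(getStatic("app2", "Counter.value")); // 99
    }
}
```

This captures why the model gives strong isolation without code replication: code is shared, only the mutable static state is duplicated.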
We describe a mechanically checked proof of a property of a small system of Java programs involving an unbounded number of threads and synchronization via monitors. We adopt the output of the javac compiler as the semantics and verify the system at the bytecode level under an operational semantics for the JVM. We assume a sequentially consistent memory model and atomicity at the bytecode level. Our operational semantics is expressed in ACL2, a Lisp-based logic of recursive functions. Our proofs are checked with the ACL2 theorem prover. The proof involves reasoning about arithmetic; infinite loops; the creation and modification of instance objects in the heap, including threads; the inheritance of fields from superclasses; pointer chasing and smashing; the invocation of instance methods (and the concomitant dynamic method resolution); use of the start method on thread objects; the use of monitors to attain synchronization between threads; and consideration of all possible interleavings (at the bytecode level) over an unbounded number of threads. Readers familiar with monitor-based proofs of mutual exclusion will recognize our proof as fairly classical. The novelty here comes from (i) the complexity of the individual operations on the abstract machine; (ii) the dependencies between Java threads, heap objects, and synchronization; (iii) the bytecode-level interleaving; (iv) the unbounded number of threads; (v) the presence in the heap of incompletely initialized threads and other objects; and (vi) the proof engineering permitting automatic mechanical verification of code-level theorems. We discuss these issues. The problem posed here is also put forth as a benchmark against which to measure other approaches to formally proving properties of multithreaded Java programs.
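The shape of program the proof reasons about is the classic monitor-synchronized counter (this example is illustrative, not the paper's benchmark program): many threads perform a read-modify-write under a monitor, and mutual exclusion at the `monitorenter`/`monitorexit` bytecode level guarantees no increment is lost.

```java
// Monitor-based mutual exclusion: the synchronized block compiles to
// monitorenter/monitorexit bytecodes, making the increment atomic
// with respect to the other threads.
public class MonitorDemo {
    static int counter = 0;
    static final Object monitor = new Object();

    public static void main(String[] args) throws InterruptedException {
        Thread[] threads = new Thread[8];
        for (int i = 0; i < threads.length; i++) {
            threads[i] = new Thread(() -> {
                for (int j = 0; j < 10_000; j++) {
                    synchronized (monitor) { // javac emits monitorenter here
                        counter++;           // read-modify-write, now atomic
                    }                        // ... and monitorexit here
                }
            });
            threads[i].start();
        }
        for (Thread t : threads) t.join();
        System.out.println(counter); // 80000 under mutual exclusion
    }
}
```

Without the synchronized block, interleavings at the bytecode level (read, add, write as separate steps) can lose updates, which is exactly the kind of interleaving the proof must enumerate over an unbounded number of threads.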
Dynamic memory management has long been an important part of a large class of computer programs, and with the recent popularity of object-oriented programming languages, more specifically Java, high-performance dynamic memory management algorithms continue to be of great importance. In this paper, an analysis of Java programs provided by the SPECjvm98 benchmark suite, and of their behavior as it relates to fragmentation, is performed. Based on this analysis, a new model is proposed that allows the estimation of the total internal fragmentation that Java systems will incur prior to a program's execution. The proposed model can also accommodate any variation of a segregated-lists implementation. A comparison with a previously introduced fragmentation model is performed, as well as a comparison with actual fragmentation values extracted from SPECjvm98. Finally, the idea of a test-bed application that will use the proposed model to give programmers/developers the ability to know, prior to a program's execution, the fragmentation and memory utilization of their programs is also introduced. With this application at hand, developers as well as designers of applications could better assess the stability, efficiency, and reliability of their applications at compile time. (C) 2002 Elsevier Science Inc. All rights reserved.
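The quantity such a model estimates is easy to state concretely. In a segregated-lists allocator, a request of size s is served from the smallest size class c ≥ s, and the c − s padding bytes are internal fragmentation. The sketch below computes that directly from an allocation trace (the size classes are illustrative, not those of any particular JVM):

```java
// Internal fragmentation under segregated size classes: fraction of
// allocated bytes that is padding between request size and class size.
public class InternalFragmentation {
    static final int[] SIZE_CLASSES = {16, 32, 64, 128, 256}; // assumed classes

    // Smallest size class that can hold a request of the given size.
    static int roundUp(int size) {
        for (int c : SIZE_CLASSES)
            if (c >= size) return c;
        throw new IllegalArgumentException("request too large: " + size);
    }

    // Fraction of allocated bytes wasted as padding.
    static double fragmentation(int[] requests) {
        long requested = 0, allocated = 0;
        for (int s : requests) {
            requested += s;
            allocated += roundUp(s);
        }
        return (double) (allocated - requested) / allocated;
    }

    public static void main(String[] args) {
        int[] requests = {10, 20, 40, 100};
        // requested = 170, allocated = 16 + 32 + 64 + 128 = 240
        System.out.println(fragmentation(requests)); // 70/240 ≈ 0.29
    }
}
```

A predictive model like the paper's replaces the concrete trace with a statistical characterization of request sizes, so the same fraction can be estimated before the program runs.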
Java has become the most important language in the Internet area, but its execution performance is severely limited by the true data dependency inherited from the stack architecture defined by Sun's Java virtual machine (JVM). To enhance the performance of the JVM, a stack operations folding mechanism for the picoJava-II processor was proposed by Sun Microsystems to fold 42.3% of stack push/pop instructions. A systematic folding algorithm, the Producer, Operator, and Consumer (POC) folding model, was proposed in earlier research to eliminate up to 82.9% of stack push/pop instructions. The remaining push and pop instructions cannot be folded due to the sequential checking characteristic of the POC folding model. A new folding algorithm, the enhanced POC (EPOC) folding model, is proposed in this paper to further fold the remaining push and pop instructions. In the EPOC folding model, stack push/pop instructions are folded with the proposed Stack Reorder Buffer (SROB) architecture. With a small SROB size of 584 bits, almost all of the stack push/pop instructions can be folded while retaining precise exception handling capability. Statistical data show that 98.8% of the stack push/pop instructions can be folded, and the average execution performance speedup of a 4-foldable processor with a 7-byte instruction buffer is 1.74 compared to a traditional single-pipelined stack machine without folding. (C) 2002 Elsevier Science B.V. All rights reserved.
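The producer-operator-consumer pattern at the heart of POC folding can be sketched with a toy matcher (this is an illustration of the pattern, not the paper's hardware checker, and the bytecode subset is deliberately tiny): a run of producers (pushes), one operator, and an optional consumer (pop) collapse into a single register-style operation, eliminating the intermediate stack traffic.

```java
// Toy POC pattern matcher: counts how many consecutive bytecodes fold
// into one operation (producers* operator consumer?), or 1 if no fold
// applies at this position.
import java.util.List;

public class PocFolding {
    static boolean isProducer(String op) {
        return op.startsWith("iload") || op.startsWith("iconst");
    }
    static boolean isOperator(String op) {
        return op.equals("iadd") || op.equals("isub") || op.equals("imul");
    }
    static boolean isConsumer(String op) {
        return op.startsWith("istore");
    }

    static int foldLength(List<String> code, int pc) {
        int i = pc;
        while (i < code.size() && isProducer(code.get(i))) i++;
        if (i == pc || i >= code.size() || !isOperator(code.get(i))) return 1;
        i++; // consume the operator
        if (i < code.size() && isConsumer(code.get(i))) i++;
        return i - pc;
    }

    public static void main(String[] args) {
        // i3 = i1 + i2: four stack bytecodes fold into one operation.
        List<String> code = List.of("iload_1", "iload_2", "iadd", "istore_3");
        System.out.println(foldLength(code, 0)); // 4
    }
}
```

EPOC's contribution is precisely the cases this sequential scan misses: when producers and their operator are separated in the instruction stream, the SROB lets non-adjacent push/pop instructions still be folded.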
This paper presents the JAFARDD (a Java Architecture based on a Folding Algorithm, with Reservation stations, Dynamic translation, and Dual processing) processor. JAFARDD dynamically translates Java stack-dependent bytecodes into RISC-style stack-independent instructions to facilitate the use of a general-purpose RISC core. JAFARDD enables the exploitation of instruction-level parallelism among the translated instructions through bytecode folding coupled with Tomasulo's algorithm. We detail the JAFARDD architecture and the global architecture design principles observed while designing each pipeline module. We also illustrate the flow of Java bytecodes through each of the processing phases. Benchmarking of JAFARDD using SPECjvm98 has shown a performance improvement between 1.10 and 2.25.
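The stack-to-register translation step can be sketched in software (the register names and three-address mnemonics below are illustrative, not JAFARDD's actual instruction set): a simulated operand stack tracks which register holds each stack slot, so each stack bytecode becomes a stack-independent three-address instruction suitable for a RISC core and for Tomasulo-style scheduling.

```java
// Toy stack-to-register translator for a tiny bytecode subset:
// the operand stack holds register names instead of values.
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

public class StackToRisc {
    static List<String> translate(List<String> bytecodes) {
        Deque<String> stack = new ArrayDeque<>();
        List<String> risc = new ArrayList<>();
        int temp = 0;
        for (String bc : bytecodes) {
            if (bc.startsWith("iload_")) {
                stack.push("l" + bc.charAt(6));      // local-variable register
            } else if (bc.equals("iadd")) {
                String b = stack.pop(), a = stack.pop();
                String t = "t" + temp++;             // fresh temporary
                risc.add("add " + t + ", " + a + ", " + b);
                stack.push(t);
            } else if (bc.startsWith("istore_")) {
                risc.add("mov l" + bc.charAt(7) + ", " + stack.pop());
            }
        }
        return risc;
    }

    public static void main(String[] args) {
        // i3 = i1 + i2 in bytecode form:
        List<String> out = translate(List.of("iload_1", "iload_2", "iadd", "istore_3"));
        System.out.println(out); // [add t0, l1, l2, mov l3, t0]
    }
}
```

Once expressed this way, the translated instructions carry explicit register dependences instead of implicit stack dependences, which is what lets reservation stations expose ILP among them.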
The Java programming language is being increasingly used for application development for mobile and embedded devices. Limited energy and memory resources are important constraints for such systems. Compression is a useful and widely employed mechanism to reduce the memory requirements of a system. As the leakage energy of a memory system increases with its size, and because of the increasing contribution of leakage to overall system energy, compression also has a significant effect on reducing energy consumption. However, storing compressed data/instructions carries a performance and energy overhead associated with decompression at runtime. The underlying compression algorithm, the corresponding implementation of decompression, and the ability to reuse decompressed information critically impact this overhead. In this paper, we explore the influence of compression on overall memory energy using a commercial embedded Java virtual machine (JVM) and a customized compression algorithm. Our results show that compression is effective in reducing energy, even when considering the runtime decompression overheads, for most applications. Further, we show a mechanism that selectively compresses portions of the memory to enhance energy savings. Finally, a scheme for clustering code and data to improve the reuse of decompressed data is presented.
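The tradeoff being measured can be put in back-of-envelope form. Every constant below is hypothetical (invented for illustration, not taken from the paper): compression shrinks the resident memory, cutting leakage energy, but each access to compressed content pays a decompression cost, so compression wins only while the leakage saved exceeds the decompression overhead.

```java
// Back-of-envelope energy model: leakage proportional to resident
// size, plus a per-access decompression cost when compressed.
// All constants are assumed, illustrative values.
public class CompressionTradeoff {
    static final double LEAKAGE_PER_KB = 1.0;   // energy units per resident KB
    static final double DECOMPRESS_COST = 0.05; // energy units per access

    static double energy(double residentKB, long accesses, boolean compressed) {
        double e = residentKB * LEAKAGE_PER_KB;
        if (compressed) e += accesses * DECOMPRESS_COST;
        return e;
    }

    public static void main(String[] args) {
        double plainKB = 100, packedKB = 60; // assume a 40% size reduction
        // Few accesses: leakage savings dominate, compression wins.
        System.out.println(energy(packedKB, 100, true) < energy(plainKB, 100, false));
        // Many accesses: decompression overhead dominates, compression loses.
        System.out.println(energy(packedKB, 10_000, true) < energy(plainKB, 10_000, false));
    }
}
```

This is also the intuition behind the paper's selective compression and clustering schemes: compress the rarely accessed portions, and cluster hot code and data so decompressed content is reused rather than re-decompressed.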