Capability hardware enhanced risc instructions (cheri) supplement the memory management unit (mmu) with instruction-set architecture (isa) extensions that implement a capability-system model in each address space. The...
详细信息
Capability hardware enhanced risc instructions (cheri) supplement the memory management unit (mmu) with instruction-set architecture (isa) extensions that implement a capability-system model in each address space. The authors propose isa support for sealed capabilities, hardware-assisted checking during protection-domain switching, a lightweight capability flowcontrol model, and fast register clearing, while retaining the flexibility of a software-defined protection-domain transition model.
CRISP is a high-performance 32-bit RISC microprocessor that uses 172,163 transistors in a 1.75?m CMOS technology. The largechip size of 10.35 mmx 12.23 mm and complexity of the design required an extensive top-down ap...
详细信息
CRISP is a high-performance 32-bit RISC microprocessor that uses 172,163 transistors in a 1.75?m CMOS technology. The largechip size of 10.35 mmx 12.23 mm and complexity of the design required an extensive top-down approach in a set of integrateddesign tools. CAD tools ranged from an instructionset interpreter to detailed timing simulation. Practical problems encounteredinclude communication among different tools and limits on process size, file size, and total compute time. Use of both symboliccompaction and physical layout, while improving initial efficiency, created problems later in the design.
This Letter presents a fine-grained hardware switching scheme to choose from the proper hardware for low power computing. It exploits the word-length optimisation opportunities for multiplication unit. With the propos...
详细信息
This Letter presents a fine-grained hardware switching scheme to choose from the proper hardware for low power computing. It exploits the word-length optimisation opportunities for multiplication unit. With the proposed technique, the gate-level simulation result on OpenRISC shows 23.7% power reduction for the multiplication unit, which accounts for 9.5% power reduction for its execution unit.
Progress in industrial electronics during 1990 is reviewed. Fuzzy logic, intelligent control, and computer-integrated manufacturing (CIM) are improving manufacturing flexibility and reducing costs. At the same time, a...
详细信息
Progress in industrial electronics during 1990 is reviewed. Fuzzy logic, intelligent control, and computer-integrated manufacturing (CIM) are improving manufacturing flexibility and reducing costs. At the same time, advanced computer numerical control (CNC) based on a reduced-instruction-set computer (RISC) chip is expected to sharpen precision in high-speed machining. There were notable advances toward an open system architecture for programmable controllers and machine vision systems. An X-ray laminography system for inspecting the solder joints on printed-circuit boards, particularly those of surface-mounted components, with 25 mu m resolution became available.< >
作者:
RIUS, JMDEPORRATADORIA, RAntennas
Microwave and Radar GroupDepartment Teoria del Senyal i Comunicacions Universitat Politecnica de Catalunya Barcelona Spain
This correspondence presents a very short, simple, easy to understand bit-reversal algorithm for radix-2 fast Fourier transform (PPT), which is, furthermore, easily extendable to radix-M. In addition, when implemented...
详细信息
This correspondence presents a very short, simple, easy to understand bit-reversal algorithm for radix-2 fast Fourier transform (PPT), which is, furthermore, easily extendable to radix-M. In addition, when implemented together with Yong's technique, the computing time is comparable to that of the fastest algorithms.
The floating-point unit of a 600-MHz, out-of-order, superscalar RISC Alpha microprocessor is described. The unit achieves 59 SpecFP95 and can transfer register data at up to 9.6 GB/s, It has two independent pipelines ...
详细信息
The floating-point unit of a 600-MHz, out-of-order, superscalar RISC Alpha microprocessor is described. The unit achieves 59 SpecFP95 and can transfer register data at up to 9.6 GB/s, It has two independent pipelines for multiply and add/subtract operations, with iterative divide and square-root circuits, and is fabricated in a 2.2-V, 0.35-mu m CMOS process.
GPUs are widely used for diverse applications, particularly data-parallel tasks like machine learning and scientific computing. However, their efficiency is hindered by architectural limitations, inherited from histor...
详细信息
GPUs are widely used for diverse applications, particularly data-parallel tasks like machine learning and scientific computing. However, their efficiency is hindered by architectural limitations, inherited from historical RISC processors, in handling memory loads causing high register file contention. We observe that a significant number (around 26%) of values present in the register file are typically used only once, contributing to more than 25% of the total register file bank conflicts, on average. This paper addresses the challenge of single-use memory values in the GPU register file (i.e. data values used only once) which wastes space and increases latency. To this end, we introduce a novel mechanism inspired by CISC architectures. It replaces single-use loads with direct memory operands in arithmetic operations. Our approach improves performance by 20% and reduces energy consumption by 18%, on average, with negligible (<1%) hardware overhead.
We are discussing a general-purpose microprocessor architecture that is suitable for implementation using the Rapid Single-Flux-Quantum (RSFQ) family of logic-memory circuits. While the microprocessor provides a funct...
详细信息
We are discussing a general-purpose microprocessor architecture that is suitable for implementation using the Rapid Single-Flux-Quantum (RSFQ) family of logic-memory circuits. While the microprocessor provides a functionally complete RISC instructionset with 16-bit words, a bit-serial self-timed approach makes it simple enough to be fabricated using the current Josephson junction technology. Our plans are to design, fabricate and test a completely operational prototype of this system.
Different standards for video compression demand a flexible approach for integrating this functionality into multimedia terminals and computers. In this paper a programmable processor array (RISC, dataflow, four proce...
详细信息
Different standards for video compression demand a flexible approach for integrating this functionality into multimedia terminals and computers. In this paper a programmable processor array (RISC, dataflow, four processor elements) and the corresponding decoding software is presented for this task. The results show, that this kind of architecture is efficient and its expense can be compared with other implementations.
reduced-instruction-set computer (RISC) architecture entrenched itself more deeply in 1991. Embedded microcontrollers reached new levels of integration, became easier to program, and attained even greater acceptance a...
详细信息
reduced-instruction-set computer (RISC) architecture entrenched itself more deeply in 1991. Embedded microcontrollers reached new levels of integration, became easier to program, and attained even greater acceptance as embedded components. Explosive growth was seen in field-programmable gate arrays. Digital signal processors are being embedded in modems, speech recognition equipment, and other telecommunications products. Most chip suppliers are preparing chips for a shift in power supply voltage from the usual 5 V to 3.3 V or less. The cost of manufacturing systems continued its exponential rise.< >
暂无评论