IJRIT International Journal of Research in Information Technology, Volume 2, Issue 1, January 2014, Pg: 34-39

International Journal of Research in Information Technology (IJRIT) www.ijrit.com

ISSN 2001-5569

Design of A Low Power 16-Bit RISC Processor 1

2

V.Naveen Reddy , E.Srinivas 1

M.Tech Student, Dept of ECE, Anurag Group of Institutions (formerly CVSR College of Engineering), Ghatkesar, R.R Dist, A.P, India

2

Assistant Professor, Dept of ECE, Anurag Group of Institutions (formerly CVSR College of Engineering), Ghatkesar, R.R Dist, A.P, India

Abstract RISC architecture is used across a wide range of platforms from Cellular phones to super computers.In this paper,a 16- bit RISC processor is designed, which utilizes minimum functional units without compromising in performance. The design is based on architectural modification made in the incrementer circuit which is used in program counter.A Low Power Area Efficient carry select adder and a high speed low power modified Wallace tree multiplier has been designed to improving perfomance of ALU in RISC processor. The RISC processor has been realized using Verilog HDL.The individual modules are designed and tested at each level and finally integrated in the top level module.Individual modules, toplevel module are simulated by using Xilinx ISE14.2. Synthesis, power estimation and area estimation is done by using Cadence.The power consumption obtanied is 1174 nw and area is 15041 nm2.As against of referace RISC processor which is used Normal Carry select adder and Wallace tree multipler. Keywords: RISC, Lopower, modified Wallace tree multiplier, Carry select adder. .I.

Introduction

The trend in the recent past shows the RISC processors clearly outsmarting the earlier CISC processor architectures. The reasons have been the advantages, such as its simple, flexible and fixed instruction format and the RISC processor is its ability to support single cycle operation, meaning that the instruction is fetched from the instruction memory at the maximum speed of the memory. RISC processors in general, are designed to achieve this by pipelining, where there is a possibility of stalling of clock cycles due to wrong instruction fetch when jump type instructions are encountered. This reduces the efficiency of the processors. This paper describes a RISC architecture in which, single cycle operation is obtained without using a pipelined design [1][2]. The development of CMOS technology provides very high density and high performance integrated circuits. The performance provided by the existing devices has created a never-ending greed for increasingly better performing devices. This predicts the use of a whole RISC processor as a basic device by the year 2020. However, as the density of IC increases, the power consumption becomes a major threatening issue along with the complexity of the circuits. Hence, it becomes necessary to implement less complex, low power processor designs[3].

Program counter is one of the most complex building blocks of the processor design. It performs mainly two operations, namely, incrementing and loading. In order to address this issue, the present work establishes a novel design of an incrementer structure. The second part of this work concentrates on the complexity reduction in ALU by optimizing the design of arithmetic circuits. In this work, we have designed and developed a 16-bit single cycle RISC processor. In order to improve the performance, modification on incrementer circuit and Low power area efficent carry select adder

V.Naveen Reddy, IJRIT

34

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 1, January 2014, Pg: 34-39

circuit and modified wallace tree multipler have been done and modified structure has been integrated into the design and the performance is validated[5],[8]. In this paper we are main focus on ALU design. Section II presents the design of the RISC CPU. Section III presents the implementation of Low power area efficent carry select adder circuit and modified Wallace tree multipler. Section IV gives the ASIC implementation results and analysis. Section IV concludes.

Proposed Block Diagram of RISC processor

II.DESIGN OF 16-BIT RISC CPU A. Architecture The architecture of the proposed RISC CPU is a uniform 16-bit instruction format, single cycle processor. It has a load/store architecture, where the operations will only be performed on registers, and not on memory locations. It follows the classical von-Neumann architecture with just one common memory bus for both instructions and data The instruction set consists of Load, store and HALT type of instructions. The Halt instruction acts as a border line between the instruction and data memory.Each of the register is of 16-bits width capacity.

B.Program Counter: The Program Counter (PC) is a 16-bit latch that holds the memory address of location, from which the next machine language instruction will be fetched by the processor. The proposed PC is the largest sub-block and second to the control unit in complexity.

C.Arithmetic and Logic unit: The arithmetic and logic unit (ALU) performs arithmetic and logic operations. It also performs the bit operations such as rotate and shift by a defined number of bit positions. The proposed ALU contains three submodules, viz. arithmetic, logic and shift modules. The arithmetic unit involves the execution of addition operations and generates Sign flag and Zero flag as per the result shown in the process. In order to reduce the complexity of the adder circuits used in the arithmetic unit of the RISC CPU, a very fast and low power carry select adder circuit has been introduced. The ALU also consists of a modified Wallace tree multiplier, which uses compressor circuits to achieve low power and improved speed of operation. The multiplier is designed to execute in a single cycle. Hence, it satisfies the requirement of the RISC design, to execute single cycle instructions

V.Naveen Reddy, IJRIT

35

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 1, January 2014, Pg: 34-39

III.ALU DESIGN Mainly ALU design have LHI, LLI ,Xor, left shift, right shift, adder ,multiplier and halt. ALU by optimizing the design of arithmetic circuits. The previous works in literature focus on energy efficient arithmetic circuits. In order to increase the operating speed and power efficiency of the processor, we have come out with A Low Power Carry Select Adder, A Modified Wallace tree multiplier is proposed in ALU.

A.Low power area efficent carry select adder circuit: CSA adder, like ripple-carry adders, is the carry has to to travel through every full adder block. There is a way to improve the speed by duplicating the hardware due to the fact that the carry can only be either 0 or 1. The method is based on the conditional sum adder and extended to a carry-select adder. With one RCA.each computing the case of the one polarity of the carry-in, the sum can be obtained with a 2x1 multiplexer with the carry-in as the select signal. The basic idea of this work is to use Binary to Excess-1 Converter (BEC) instead of RCA with in the regular CSLA to achieve lower area and power consumption . The main advantage of this BEC logic comes from the lesser number of logic gates than the n-bit Full Adder (FA) structure.this work is to use BEC instead of the RCA with cin=1 in order to reduce power consumption of the regular CSA. To replace the n-bit RCA, an n+1bit BEC is required[5].

FIG:Low power area efficent carry select adder circuit

B.Modified Wallace tree multiplier: Wallace tree multiplier is used in ALU in order to perform the multiplication operations, The purpose of defining the Wallace tree multiplier in this ALU is because of the full adders used in the above carry select adder, the Wallace tree multiplier comprises of full adders the full adders used in CSA can be called to the Wallace tree multiplier to perform the multiplication operation which in turn leads to reduction of powe [8][6].

V.Naveen Reddy, IJRIT

36

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 1, January 2014, Pg: 34-39

Figure:-Modified Wallace tree Multiplier

C.Register File: The register file consists of 8 general purpose registers of 16-bits capacity each. These register files are utilized during the execution of arithmetic and datacentric instructions. It is fully visible to the programmer. It can be addressed as both source and destination using a 3-bit identifier. The register addresses are of 3-bit length, with the range of 000 to 111. The load instruction is used to load the values into the registers and store instruction is used to retrieve the values back to the memory to obtain the processed outputs back from the processor.

D.Instruction Decoder Unit(IDU): Program Counter output is input to the IDU, the purpose of IDU is to load and store the data depending on the” WRENA”(write enable) signal ,and also to generate the opcode which is given to the ALU for its operations.

00000

Des addre

Datain

while ALU operations the status of the counter is as given below

Opcode

S add

Des add

00000

IV. ASIC IMPLEMENTATION AND RESULTS The RISC processor has been realized using Verilog HDL.The individual modules are designed and tested at each level and finally integrated in the top level module.Individual modules,toplevel module are simulated by using Xilinx ISE14.2. Synthesis, power estimation and area estimation is done by using Cadence.

V.Naveen Reddy, IJRIT

37

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 1, January 2014, Pg: 34-39

RISC Simulation output The power estimation of the RISC processor is done by using Cadence tool, the power estimation in cadence is done by RC which is a synthesis tool in digital library of Cadence tool. The top module of RISC processor is synthesized by using the following command[7]. “Synthesize –to mapped_effort medium” After synthesis the power estimation is done by the following command “Report power” Power estimation results are as follows

V. CONCLUSIONS The design of a single cycle 16-bit RISC processor has been presented. A Low power adder and multiplier structures have been employed in the RISC architecture. The processor has been designed for executing based on the user requirements

ACKNOWLEDMENT V.Naveen Reddy, IJRIT

38

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 1, January 2014, Pg: 34-39

We place our gratitude on record to the Department of Electronics and Communication Engineering, Anurag Group of Institutions (formerly CVSR College of Engineering). for the support rendered to us in carrying

out this work

REFERENCES [1] “A True Single Cycle RISC Processor without Pipelining”. ESS Design White Paper – RISC Embedded Controller. [2] A Low Power 16-Bit RISC Microprocessor Using ECRL Circuits [3] Springer.Guide to RISC Processors- for Programmers and Engineers. [4] Samiappa Sakthikumaran et al., “A Very Fast and Low Power Incrementer and Decrementer Circuits”, International Journal of Computer Communication and Information System (IJCCIS) Vol2.No.1 – 2011, pp. 200-203 [5]Low-Power and Area-Efficient Carry Select Adder: IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 20, NO. 2, FEBRUARY 2012. [6] A Novel VLSI Architecture for Low power FIR Filter: Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue. [7] Cadence Digital Flow Manual ByCadence. [8] Wallace Tree Multiplier for RISC Processor”, 3rd InternationalConference on Electronics Computer Technology- ICECT 2011. [9] K. Nishimura, T. Kudo, and H. Amano, “Educational 16-bit microprocessor PICO-16,” Proc. 3rd Japanese FPGA/PLD design conference and exhibit (Japanese Edition), Tokyo, July 19–21, 1995, pp. 589–595.

V.Naveen Reddy, IJRIT

39

Design of A Low Power 16-Bit RISC Processor

[8] Wallace Tree Multiplier for RISC Processor”, 3rd InternationalConference on Electronics Computer. Technology- ICECT 2011. [9] K. Nishimura, T. Kudo, and H. Amano, “Educational 16-bit microprocessor PICO-16,” Proc. 3rd Japanese. FPGA/PLD design conference and exhibit (Japanese Edition), Tokyo, July 19–21, ...

358KB Sizes 0 Downloads 135 Views

Recommend Documents

risc processor pdf
Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. risc processor pdf. risc processor pdf. Open. Extract. Open with.

Low-power design - IEEE Xplore
tors, combine microcontroller architectures with some high- performance analog circuits, and are routinely produced in tens of millions per year with a power ...

A Low Power Design for Sbox Cryptographic Primitive ...
cations, including mobile phones, cellular phones, smart cards, RFID tags, WWW ..... the best of our knowledge, there has never been pro- posed such an ...

Quasi-Resonant Interconnects: A Low Power Design ...
In this paper, a low power, low latency on-chip interconnect design methodology is ... man Kodak Company, Manhattan Routing, and Intrinsix Corporation. 641.

modern processor design fundamentals of superscalar processors ...
modern processor design fundamentals of superscalar processors pdf download. modern processor design fundamentals of superscalar processors pdf ...

Semicustom Design Methodology of Power Gated Circuits for Low ...
design methodology for power gated circuits that allows unmodi- fied conventional ... standby mode, alternative elements, which are capable of state retention, must ..... leakage sources are M1, M3, and the nMOS of the inverter with. M3 is less leaky

Experimental exploration of ultra-low power CMOS design space ...
approach for energy efficient high performance computing[ 1,2,3]. However, V, scaling is ultimately limited by increasing subthreshold leakage current.

Chapter 1 LOW-POWER DESIGN OF TURBO ...
COMPILERS AND OPERATING SYSTEMS FOR LOW POWER π. C1. C2 s c1 c2. I. U encoder decoder channel π ... new scheme, called turbo coding [2], has enabled reliable data transmission at very low signal-to-noise ratios. Since it ...... Decoder Design for W

Basics of Low Power Circuit and Logic Design
Full-motion video display. Portable Functions ..... Ultra Low Power System Design. Technology .... on VLSI Circuits),. [Mutoh93] (International ASIC Conference) ...

Implementation of a 32-bit RIsC Processor for the Data ...
The Data-Intensive Archite ctur e(DIVA) system employs Pr ocessing-In-Memory(PIM) chips as smart-memory copr ocessors to a micropr ocessor.

processor design pdf
... was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. processor design pdf.

low power and low complex implementation of turbo ...
It consists of two recursive systematic encoders which are ... second encoder the interleaved version of the ... highly undesirable in the high data rate coding.

A Case Study in Low-Power System-Level Design - Computer Design ...
various problems within the design process[8]. In this case, we focus on the methods used to reduce power requirements in the design of a computer peripheral.

A Comparative Study of Low-Power Techniques for ...
TCAM arrays for ternary data storage, (ii) peripheral circuitry for READ, WRITE, and ... These issues drive the need of innovative design techniques for manufacturing ..... Cypress Semiconductor Corporation, Oct. 27, 2004, [Online], Available:.

High-Level Low-Power System Design Optimization - Cadence
an industrial high-level low-power design methodology that enables ..... Cadence. Joules™. RTL. Power. Solution, https://www.cadence.com/content/cadence-.

A VME RISC processor farm for third level triggering
3.rt or 1 v ni ... -0 +. PP + K0K-a+ and PP + K K n-. Here, the two 'primary' charged tracks of the K-n+ are ... 1. The processor proceeds through a number of stages, assessing the event in a cascade of trigger ... from CES running under the Valet Pl

Design Note 1034: Low Power, Precision Op Amp ... - Linear Technology
L, LT, LTC, LTM, Linear Technology and the Linear logo are registered trademarks of Linear ... data sheet SNR, THD and offset performance with very low power ...

Coupling-Driven Bus Design for Low-Power Application-Specific ...
wire-to-wire spacing is shrinking for higher densities and the as- ... ing wire widths. For example of metal 3 layer in typical 0.35 µm. CMOS process, the lateral component of capacitance reaches 5 times the sum of fringing and vertical components w

Low-power cmos digital design with dual embedded ... - IEEE Xplore
by 57% without degrading circuit performance compared to a conventional CMOS design. Index Terms—Adaptive power-supply system, clustered voltage.