IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249

International Journal of Research in Information Technology (IJRIT)

www.ijrit.com

ISSN 2001-5569

Power Optimization of Low-Power Multi-Bit Flip-Flops In Memory Elements Using Ring Counter K.Jignus1, Department of ECE, Chandy College of Engineering, [email protected]

M.Kavitha2, Department of ECE, Chandy College of Engineering, [email protected]

Abstract-Power consumption is one of the major problems in emerge VLSI design. In modern integrated circuits, the power consumed by clocking progressively takes a dominant part. To reduce its power consumption, some 1-bit flip-flops are replaced with fewer multi-bit flipflops. The proposed delay buffer adopts a ring-counter addressing scheme. In the ring counter, double-edge-triggered (DET) flip-flops are utilized to diminish the operating frequency by half and using C-element gated-clock. A gated clock- driver tree is applied to reduce the activity beside the clock distribution network. Furthermore, the gated-driver-tree is also employed in the input and output ports of the memory block to decline their loading, and saving the power. Index terms: Delay Buffers, Gated Driver Tree, Ring Counter, Double edge triggered flip-flop, C-element.

I.

INTRODUCTION

The delay buffers are used in portable battery operated, multimedia and wireless applications. Most circuits adopt static random access memory (SRAM) and control/addressing logic to execute delay buffers. In former approach, SRAM compilers are optimized to generate memory modules with low power consumption and high operation speed with a dense cell size. In latter approach, shift register can be easily synthesized, though it may consume much power due to unnecessary data movement. As CMOS technology, the driving capability of an inverter-based clock buffer increases considerably. The driving capability of a clock buffer can be evaluated by the number of minimum-sized inverters that it can drive on a given rising or falling time. Hiroshi Kawaguchi [3] et al proposed, Reduced Clock-Swing Flip-Flop (RCSFF) is composed of reduced swing clock driver and a special flip-flop, which embodies the leak current cutoff mechanism. The RCSFF can reduce the clock system power of a VLSI down to one-third compared to the conventional flip-flop. The area and the delay of the RCSFF can also be reduced by a factor of about 20% compared to the conventional flip-flop. The RCSFF can also reduce the RC delay of a long RC interconnect to one-half. Chulwoo Kim [2] et al proposed, Low-Swing Clock Double-Edge Triggered Flip-Flop (LSDFF) is used to reduce power consumption considerably compared to conventional flip-flops. The LSDFF avoids redundant internal node transitions to reduce power consumption. Besides, power consumption in the clock tree is reduced because LSDFF uses a double-edge triggered operation as well as a low-swing clock. Peiyi Zhao [9] et al proposed, high-performance flip-flops, that are analyzed and classified into two categories: the conditional pre-charge and the conditional capture technologies. These are based on how to prevent or reduce the redundant internal switching activities. A new flip-flop is introduced: the conditional discharge flip-flop (CDFF). This CDFF not only reduces the internal switching activities, but also generates less glitches at the output, while maintaining the negative setup time and small D-to-Q delay characteristics. With a data-switching activity of 37.5%, the proposed flip-flop can save up to 39% of the energy with the same speed as that for the fastest pulsed flip-flops. Ya-Ting Shyu [4] et al Proposed, the power consumed by clocking gradually takes a dominant part. To reduce its power consumption by replacing some flip-flops with fewer multi-bit flip-flops. First, perform a co-ordinate transformation to identify those flip-flops that can be merged and their legal regions. Besides, we show how to build a combination table to enumerate possible combinations of flipflops provided by a library. Finally use a hierarchical way to merge flip-flops. According to the experimental results, our algorithm significantly reduces clock power by 20–30% and the running time is very short. The multi-bit flip-flops are used in the Delay buffer for reducing the power consumption. The following design methodology shows how the multi bit flip-flops are used in the ring counter.

K.Jignus,IJRIT

243

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249

II.

DESIGN METHODOLOGY

The design methodology of our project consists of memory organization, ring oscillator which employ in multi bit flip-flop Technology. This technique improves the overall performance and reduces the power consumption of the circuit. A. Multi-Bit Flip-Flop Methodology Multi-Bit Flip-flop is an effectual implementation method to reduce the power consumption by merging single-bit flip-flop. In this Technique, several flip-flops can share a common clock buffer to avoid redundant power waste.

Fig. 1 Merging two 1-bit flip-flops into 2-bit flip flop.

Fig 1 shows an example of merging two 1-bit flip-flops into one 2-bit flip-flop. Each 1-bit flip-flop contains two inverters, master-latch and slave-latch. Merging single-bit flip-flops into one multi-bit flip-flop can avoid duplicate inverters, and lower the total clock dynamic power consumption. The total area contributing to flip-flops can be reduced as well. A. Systematic overview In the proposed memory organization, several power reduction techniques are adopted. Mainly, these circuit techniques are designed with a view to decreasing the loading on high fan-out nets, e.g., clock and read/write ports. Fig 2 shows the delay buffer using gated driver tree. It consists of memory, gated driver tree, ring counter.

Fig.2 Delay Buffer using Gated Driver Tree

Random-access memory (RAM) is a form of computer data storage. It is a volatile types of memory, where the information is lost after the power is switched off. Many other types of memory are RAM as well, including most types of ROM and a type of flash memory called NOR-Flash.

K.Jignus,IJRIT

244

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249

C. Ring Counter Before Merging the Clock This ring counter proposed to replace the R–S flip-flop by a C-element and to use tree-structured clock drivers with gating. It greatly reduces the loading on active clock drivers. Moreover, DET flip-flops are used to reduce the clock rate to half and also reduce the power consumption on the clock signal. The proposed ring counter with hierarchical clock gating and the control logic is shown in above figure 3.

Fig.3 Ring Counter before merging the clock

Each block contains one C-element to control the delivery of the local clock signal “CLK” to the DET flip-flops, and only the “CKE signals along the path passing the global clock source to the local clock signal are active. The “gate” signal (CKE) can also be derived from the output of the DET flip-flops in the ring counter. The C-element is an essential element in asynchronous circuits for handshaking. When the input of the last DET flip-flop in the previous block changes to “1” making both two inputs of the C-element the same, the clock signal in the current block will be turned on. When the output of the first DET flip-flop in the current block is asserted, then both inputs of the C-element in the previous block go to “0” and the clock for the previous block is disabled. In order to further diminish the loading on the global clock signal (“CLK”), propose to use a driver tree distribution network for the global clock and activate only those drivers. D. Ring Counter After Merging the Clock In the existing system, there are two clocks are used to handle the operation. The proposed system, the clocks can be merged into one which operates the entire (Double Edge Triggered) DET flip-flops. Although, the both circuit performs the same operation and power consumption is different. Due to clock merging, the proposed system consumes the low power consumption compared to the existing one. Fig 4 shows the ring counter after merging the clock.

Fig 4. Ring counter after merging the clock

There are M blocks are used in the ring counter. Each block contains DET flop-flops which only operate in positive and negative edge clock. So, the speed of the circuit gets increased.

K.Jignus,IJRIT

245

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249

E. Gated Driver Tree Clock gating is a popular technique in many synchronous circuits for reducing dynamic power dissipation. To save area, the memory module of a delay buffer is often in the form of an SRAM array with input/output data bus. However, all of the memory cells, only two words will be activated. One is written by the input data and the other is read to the output. Figure 5 shows the gated driver tree using memory module.

Fig. 5 Gated Driver Tree

In gated clock tree technique, the input driving/output sensing circuitry in the memory module of the delay buffer. The memory words are also grouped into blocks. Each memory block associates with one DET flip-flop block in the proposed ring counter and one DET flip-flop output addresses a corresponding memory word for read-out and at the same time addresses the word that was read one-clock earlier for write-in.

III.

SIMULATION RESULTS

The Simulation can be performed by Xilinx 8.1 software. It gives the simulation result and power consumption of before and after merging the clock in ring counter. A. Simulation Result of Ring Counter Before Merging Fig 6 shows the simulation result of counter before merging. There are two clocks are used in the ring counter. The both clocks are double edge triggered flip-flop. Its mean, it can perform the operation in positive edge clock and also negative edge clock.

Fig. 6 Simulation result of ring counter before merging

K.Jignus,IJRIT

246

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249

There are two input are given to the ring counter. They are initialize1, initialize2. If initialize1 and initialize2 values are 1, the circuit performs ring counter operation. Otherwise, it doesn’t operate. Reset is one of the input functions. The circuit enable, when the reset is 1. Otherwise, the circuit is disable. The output of the ring counter is q. Initially, it has the value of ‘0000000000000000’. If the reset is set to 1, for one clock the output is ‘00000000000011’. For each clock, the output can be changed from left to right B. Power Consumption of Ring Counter Before Merging

Fig. 7 power consumption of ring counter before merging

The power consumption of the ring counter is simulated by the Xilinx software. The figure 8 shows the power value of ring counter before merging. There are two clocks are used in the ring counter. So, it uses the high power consumption compared to after merging the clock. C. Simulation Result And Power Consumption of Ring Counter After Merging Fig 8 shows the simulation result of counter before merging. There are one clocks are used in the ring counter. This clock is double edge triggered flip-flop. Its mean, it can perform the operation in positive edge clock and also negative edge clock.

Fig 8 Simulation of Ring Counter after merging

There are two input are given to the ring counter. They are initialize1, initialize2. If initialize1 and initialize2 values are 1, the circuit performs ring counter operation. Otherwise, it doesn’t operate. Reset is one of the input functions. The circuit enable, when the reset is 1. Otherwise, the circuit is disable. The output of the ring counter is q. Initially, it has the value of ‘0000000000000000’. If the reset is set to 1, for one clock the output is ‘00000000000011’. For each clock, the output can be changed from left to right. K.Jignus,IJRIT

247

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249

C. Simulation Result And Power Consumption of Ring Counter After Merging The power consumption of the ring counter is simulated by the Xilinx software. The figure 9 shows the power value of ring counter after merging. There are one clock is used in the ring counter. So, it uses the high power consumption compared to after merging the clock.

Fig. 9 Power Consumption of Ring Counter After Merging

E. Comparison of Result The Power Consumption of Ring Counter Before and After merging the flip flop is shown in following S.No

Before Merging (mW)

After Merging (mW)

1

73

62

Comparison of Ring Counter Before and After merging the flip flop using graph. 74 72 70 68 66 64 62 60 58 56 Before Merging (mW) IV.

After Merging (mW)

CONCLUSION

The ring counter with clock gated by the C-elements can effectively eliminate the excessive data transition without increasing loading on the global clock signal. The gated-driver tree technique used for the clock distribution networks can eliminate the power wasted on drivers. Another gated-de-multiplexer tree are used for the input and output driving circuitry to decrease the loading of the input and output data bus. All gating signals are easily generated by a C-element taking inputs from some DET flip-flop outputs of the ring counter. Optimization of power is observed by using tool xilinx 12.3 and observed the simulation results. Power consumption is known to be a crucial issue in current IC designs. K.Jignus,IJRIT

248

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249

REFERENCES [1].

Low Power High Speed Memory Architecture Using Multi-bit Flip-Flops” International Journal of Advanced Information Science and Technology (IJAIST) ISSN: 2319:2682 Vol.13, No.13.

[2].

Chulwoo Kim and Sung-Mo (Steve) Kang (2002) “A Low-Swing Clock Double-Edge Triggered Flip-Flop”, IEEE Journal Of Solid-State Circuits, Vol. 37, No. 5

[3].

Hiroshi Kawaguchi And Takayasu Sakurai (1998) “A Reduced Clock-Swing Flip-Flop (Rcsff) For 63% Power Reduction”, IEEE Journal Of Solid-State Circuits,Vol. 33,No. 5

[4].

Ya-Ting Shyu, Jai-Ming Lin, Chun-Po Huang, Cheng-Wu Lin, Ying-Zu Lin, and Soon-Jyh Chang,(2013) “Effective and Efficient Approach for Power Reduction by Using Multi-Bit Flip-Flops”, IEEE Transactions On Very Large Scale Integration (VLSI) Systems, Vol. 21, No. 4, April 2013.

[5].

Kiran Kumar, M.Venkara Rao,(2013) “Power Optimized Memory Organization Using Multi-Bit-Flip-Flop Approach and Enhanced Ring Counter” International Journal of Engineering Trends and Technology (IJETT) – Volume 5 Number 7- Nov 2013.

[6].

Rajya Lakshmi G, Radha Krishna T, Lowkya Ch, Basheer Ali Sheik, P Mahesh, L Srikanth “A Delayed Buffered Technique Using the Concept of Gated Driven Tree for Optimizing the Power” International Journal of Engineering Research and Applications, Vol. 2, Issue 2,Mar-Apr 2012, pp.727-733.

[7].

Mark Po-Hung Lin, Chih-Cheng Hsu and Yao-Tsung Chang (2011) “Post-Placement Power Optimization With MultiBit Flip-Flops” IEEE Transactions On Computer-Aided Design Of Integrated Circuits And Systems, Vol. 30, No. 12

[8].

Peiyi Zhao, Tarek K. Darwish and Magdy A. Bayoumi, (2004) ”High-Performance And Low-Power Conditional Discharge Flip-Flop” IEEE Transactions On Very Large Scale Integration (Vlsi) Systems, Vol. 12, No. 5

K.Jignus,IJRIT

249

Power Optimization of Low-Power Multi-Bit Flip-Flops In ... - IJRIT

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 5, May 2014, Pg: 243-249. K.Jignus,IJRIT ... Chandy College of Engineering,.

1MB Sizes 3 Downloads 182 Views

Recommend Documents

Power Optimization of Low-Power Multi-Bit Flip-Flops In ... - IJRIT
Index terms: Delay Buffers, Gated Driver Tree, Ring Counter, Double edge triggered ... though it may consume much power due to unnecessary data movement.

compensation of harmonic in 3- phase distribution power system - IJRIT
distortion and harmonic voltage distortion in three phase power system within an ... has been analyzed and demonstrated for the designing of filters. ... The proposed work also provides comparative analysis of various filtering results.

compensation of harmonic in 3- phase distribution power system - IJRIT
distortion and harmonic voltage distortion in three phase power system within an acceptable range. ... Suitable design examples illustrate the sizing of necessary.

The Power of Recourse in Online Optimization
data. This situation can be modeled with a min-max approach: we choose our ...... processing a job in Bi together with a job in Ai on the same machine for each i .... at any intermediate iteration, the instance admits a large number of optimal ...

A Model for the Optimization of the Maintenance Support ... - IJRIT
Embedded System Technology (Dept. of ECE). SRM University ... The proposed model uses a PIC microcontroller for interfacing purposes. An ultrasonic sensor ...

On the Hardness of Optimization in Power-Law ... - Semantic Scholar
Given a graph, we will refer to two types of sequences of integers: y-degree sequences and d-degree sequences. The first type lists the number of vertices with a certain degree (i.e., the degree distribution) and the latter lists the degrees of the v

A Model for the Optimization of the Maintenance Support ... - IJRIT
measurements includes LIDAR, SODAR, radar, AUV, and remote satellite sensing. Because of the previous factors, one of the biggest difficulties with offshore wind farms is the ability to predict loads. Despite high capital cost and the cost of operati

An Overview of Peak-to-Average Power Ratio Reduction ... - IJRIT
IJRIT International Journal of Research in Information Technology, Volume 3, Issue 3, ... BER reduction, and their advantages and disadvantages in detail. ... in communications which can be used in both wired and wireless environments.

Frequency and yield optimization using power gates in ...
Aug 21, 2009 - in Proc. ISLPED, pp:229~232, Aug. 1999. [8] Y. Yasuda et al., “System LSI multi-Vth transistors design methodology for maximizing efficiency of ...

An Overview of Peak-to-Average Power Ratio Reduction ... - IJRIT
BER reduction, and their advantages and disadvantages in detail. Keywords: ... in communications which can be used in both wired and wireless environments.

Improvement in Performance Parameters of Image ... - IJRIT
IJRIT International Journal of Research in Information Technology, Volume 2, Issue 6, ... Department of Computer Science and Engineering, Technocrats Institute of Technology ... Hierarchical Trees (SPIHT), Wavelet Difference Reduction (WDR), and ...

Greening the Internet; Power Optimization - Ashutosh Dhekne
of power has been a consideration only in laptops and devices that are not ..... [9] have connected a low power radio to a PDA so that the main PDA does not.

Power Optimization of Real-Time Embedded Systems ...
Lehoczky et al. [8] shows .... task set [11], an ins [12], a flight control [13], and a cnc machine .... time system design: A case study on a CNC controller,” in Proc.

Multiobjective Optimization of Sleep Vector for Zigzag Power-Gated ...
Circuits]: Design Aids—Layout. General Terms: Algorithms, Design. Keywords: Zigzag power gating, low power, leakage current, sleep vector, standard-cell. 1.

Partial Bus-Invert Coding for Power Optimization of ...
Pa rt ia lBus-Inv er tCoding fo rPo wer Optimization o fSystem Level Bus. Y oungsoo ... we select a su b-group of bus lines in volved in b us encoding to avoid ...

Multi-Objective Optimization of Power Converters Using ...
Netlist file *.net. Optimization assignment ... manipulation. Variation of the variables .... an AMD64-3000+ CPU-based PC system. 0.925. 0.93. 0.935. 0.94. 0.945.

Design and Optimization of Power-Gated Circuits With Autonomous ...
Design and Optimization of Power-Gated Circuits. With Autonomous Data Retention. Jun Seomun and Youngsoo Shin, Senior Member, IEEE. Abstract—Power ...

Greening the Internet; Power Optimization - Ashutosh Dhekne
Building faster, smaller and more powerful computer systems has been a ... of power has been a consideration only in laptops and devices that are not mains .... is set to 0.1, the station remains awake for 10 beacons and then sleeps between.

End-to-end Modeling and Optimization of Power ...
Aug 16, 2016 - Power efficiency growing slower than compute power ... Department of Energy places it at 20MW. – Point of .... Energy efficiency of optical links.

Optimization in
Library of Congress Cataloging in Publication Data. Đata not available .... sumption and labor supply, firms” production, and governments' policies. But all ...