ECE Dept., University of Patras

Applying Decay to Reduce Dynamic Power in Set-Associative Caches

Georgios Keramidas

Polychronis Xekalakis

Stefanos Kaxiras

[email protected] {kaxiras,keramidas}@ee.upatras.gr HiPEAC Network of Excellence IST-004408 and by Intel Research Equipment Grant #15842 HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Introduction z z z z

z

Cache power reduction = Dynamic || Static Dynamic: • Reduce switching activity – i.e. way pred. Static: • Discard Di d useless l li lines – i.e. i cache h d decay Our observation: • Using decay a large # of the lines dead • Searching them is meaningless Our proposal: • Use static power tech. to assist dynamic ones HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Outline z

Way Reduction Techniques

z

What is Wrong with Way-Prediction

z

Presenting Way-Selection

z

Filtering Useful Cache-Ways

z

Decaying Bloom Filters

z

Analog Implementations

z

Evaluation

z

Conclusions

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Reducing the Ways Accessed z

Way-prediction techniques reduce power by predicting ways to be accessed • What happens on miss-predictions??

z

Way-selection avoids this problem • Way-Selection != prediction • Way-Selection == access only useful lines

z

Insight: Using decay most of the ways do not have to be searched

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Way-Prediction a.k.a. Multi-MRU z

The best way-prediction is MMRU tag

Way Presence Vector

index

Requested

off

Hash(tag+index)

0 01 0 Way prediction (e.g. way 2)

• Variable hit latency • No support for prediction of misses HiPEAC 2007, Ghent Jan 2007

address

Cache Ways 0 1 2 3

ECE Dept., University of Patras

Way-Selection Using Bloom Filters z

Instead of predicting we can use BFs to select possible outcomes

Bloom Filter

Requested

tag

index

Bloom Hashing

0 110 Way selection (e.g. way 1,2)

HiPEAC 2007, Ghent Jan 2007

off

address

Normal Cache 0 1 2 3

ECE Dept., University of Patras

Cache Decay z

Cache Decay: wait a “decay interval” after the last access and gate the cache line

z

Decay interval defines behavior: • “decay” the cache lines soon Î induce misses i ((performance f lloss)) • “decay” the cache lines late Î does not filter out enough lines

z

Decay filters out useless lines for 2% performance loss

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Decaying Bloom Filters Decaying Bloom Filter

Requested

tag

Bloom Hashing dead entry

index

off

address

Decaying Cache 0 1 2 3

live lines

0 110 live entry Decaying BFs (e.g. way 2)

z

SDBFs: selects only among live ones! HiPEAC 2007, Ghent Jan 2007

dead lines

ECE Dept., University of Patras

Design Choices Digital and Analog flavours available z Totally equivalent behaviour • Digital: z

— hard to update — larger la ge — easier to design

• Analog: — — — —

harder to design prone to errors automagic updates smaller HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Analog Implementations z

Design based on 4T DRAM cells

Bitline

Bitline Wordline

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Analog Implementations z

Design based on 4T DRAM cells • Need specific “decay” behaviour Vdd

Bitline

Bitline Wordline

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Analog Implementations z

Design based on 4T DRAM cells • Need specific “decay” behaviour • Need to prevent refresh-on-reads — 4T cells are refreshed on access Refresh Bit Access Bit 5T

5T

Low Leak Inverter

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Analog Implementations z

Design based on 4T DRAM cells • Need specific “decay” behaviour • Need to prevent refresh-on-reads — 4T cells are refreshed on access

• Need longer retention time Refresh Bit Access Bit 5T

5T

Low Leak Inverter

Ground T HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Methodology Simulation tools: Wattch, Hotleakage, Cacti z Used Spec2000 benchmarks z Compare with MMRU z We modeled: • 4-issue 4 i processor ((~Power P 4) 4), 4 4-way cache h • Do not count leakage savings • Simulation of Decay Induced Misses • One cycle extra penalty for the MMRU slow hits • No penalization of the MMRU missscheduling z

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Evaluation: The Key – Ways Accessed Number of ways accessed on hits (normalized to base case) 30% 26% SDBF

MMRU

Number of ways accessed on misses(normalized to base case) SDBF

MMRU 10%

HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Evaluation: Exec. Time Relative ED Execution time (normalized to base case) MMRU

SDBF

1.18%(MMRU)

-0.2%(SDBF)

Power Savings SDBF

MMRU

HiPEAC 2007, Ghent Jan 2007

61% (SDBF)

59%(MMRU)

ECE Dept., University of Patras

Evaluation: Scaling with Associativity Scaling of Power Savings

Relative Energy Delay (lower is better for SDBF)

SDBF

MMRU

ways

z

ways

High associativities favour SDBFs • Up to 8% better in Power • Up to 31% better in EDP HiPEAC 2007, Ghent Jan 2007

ECE Dept., University of Patras

Conclusions z

We proposed way-selection coupled with decaying cache

z

Decaying the BFs in orchestration with the cache helps way-selection

z

We proposed W d digital di it l and d analog l implementations

z

Prediction of misses is the key over MMRU

z

Up to 8% in power and 31% in EDP

HiPEAC 2007, Ghent Jan 2007

Applying Decay to Reduce Dynamic Power in Set ...

The best way-prediction is MMRU index tag. Requested ... Compare with MMRU. ○ We modeled: 4 i. ( P ... Do not count leakage savings. • Simulation of Decay ...

528KB Sizes 1 Downloads 222 Views

Recommend Documents

Applying Decay to Reduce Dynamic Power in Set ...
Power savings follow the trend of the total ways accessed and relative EDP improves with each doubling of the .... IEEE Micro, 24(6), 2004. 18. S. Velusamy et al.

Recruiting Decay for Dynamic Power Reduction in Set ...
specific line, we predict, for each way, whether the line could be live in it. We access all the ways that possibly contain the live line and we call this way selection. ...... In In Proc. of the Int. Conference on Supercomputing, 2002. 11. K. Flautn

Applying Low Power Techniques in AES MixColumn ...
including smart cards and cellular phones, WWW servers and automated teller machines (ATMs). ... Comparisons with other works are given in Section VI and ...

Applying Low Power Techniques in AES MixColumn ...
in. )2(8. GF is done by multiplying the polynomials )(),( xbxa which yields a polynomial )( xt with degree less than 15. This step is followed by a modular reduction.

Application of stochastic programming to reduce uncertainty in quality ...
A process analysis of a large European pork processor revealed that in current ... [20] developed a discrete event simulation software package that incorporates.

Active EM to Reduce Noise in Activity Recognition
fying email to activities. For example, Kushmerick and Lau's activity management system [17] uses text classification and clustering to examine email activities ...

BOTEC_A New Drug Court To Reduce Drug Demand In ...
COUNTY EXECUTIVE AND MAYOR. Page 1. Whoops! There was a problem loading this page. Retrying... Page 3 of 28. Table of Contents. SUMMARY .

Vacuum decay in an interacting multiverse
Jun 2, 2016 - to address the cosmological constant problem in the context of string theory [1–3]. ...... where θ = 2Hσt + θ0, with θ0 some constant to fit with the.

BOTEC_A New Drug Court To Reduce Drug Demand In Indianapolis ...
A Strategy To Reduce Drug-Use, Drug-Related Crime, And Improve Infant Health_BOTEC_for Office of Justice Programs_May 1995.pdf. Page 1 of 28.

Methods to Reduce Deadlock Traffic
Definition:“The word “deadlock” means that it is a situation which occurs when a ... Distributed deadlocks can occur in distributed systems when distributed ... will never resolve itself and because of this database management systems will.

Alpha Decay OUT.pdf
Loading… Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Alpha Decay OUT.pdf. Alpha De

Applying Agents to Knowledge Management in ...
The system has different types of agents, each ... management system may be used during the software ..... Problem: Whom Do We Contact, and Why Do We.

Applying to Ph.D. Programs in Computer Science 1 ...
6 Current 2002 Rankings of CS Ph.D. programs in the U.S.. 19 ... task assignment of jobs to hosts in a server farm is considered fundamental research. You.

Applying to Ph.D. Programs in Computer Science 1 ...
www.gre.org for everything you ever wanted to know about the GREs, including test dates, test center locations, ... By age 5, I had taken apart every appliance in our house. At age 6 .... usually leads us to discount such publications. If you don't .

Radioactive Decay Notes.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Radioactive ...

Tooth Decay Mech
These bacteria take the sugar and turn it into acids, and these acids then start to dissolve the tooth enamel! Sugar doesn't turn into tooth-harming acid without these bacteria, so we need to have these bacteria in our mouths for sugar to harm our te

Nucleon decay in supergravity unified theories
A general analysis of nucleon decay in supergravity unified models is given including the full set ... trix arising in supergravity models from soft-breaking.

Some Tips to Reduce Your Exposure to Wireless Radiation2.pdf ...
orang yang bekerja dalam ruangan itu mendapat ruangan udara (cubic space). yang sedikit-sedikit 10 m3. sebaiknya 15 m3 . (2). ... There was a problem loading this page. Retrying... Some Tips to Reduce Your Exposure to Wireless Radiation2.pdf. Some Ti