FLORIDA INTERNATIONAL UNIVERSITY

Hardware-aided Monitoring of L1 and L2 D-Cache Misses in SMT

Lichen Weng and Chen Liu
Electrical and Computer Engineering Department
{lichen.weng, chen.liu}@fiu.edu

The OLS regression is performed for every thread, and instruction fetching is then conducted accordingly (the figure shows a single thread for illustration).

Where are we?

Simultaneous Multithreading (SMT) architectures fully share execution resources among several threads running concurrently in the same core [1].

Long-latency loads are one of the major obstacles to better performance in SMT architectures, an expression of the Memory Wall [2]:

• A load misses in the Level 2 Data Cache
• It has to fetch data from lower levels of the memory hierarchy
• It still holds the shared resources, e.g., the ReOrder Buffer, for hundreds of cycles during such fetching
• Resource efficiency is harmed because the shared resources are held without throughput
• Task Level Parallelism (TLP) is reduced because other threads cannot utilize such shared resources
• Therefore, system performance is decreased

[Figure: flow of the proposed fetch policy, steps I-VI, with stages Regression, Prioritization and Fetching]

• Two-level cache misses are sampled every Sampling Period, i.e., a certain number of CPU cycles
• A certain number of samples (the Window Size) is used for the OLS regression
• The model estimates the future L2 cache miss rate from the immediate L1 cache miss rate for every thread
• The priority descends as the estimated L2 miss rate grows
• Fetch from the thread with the highest priority, then the second, and so on
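The sampling, prioritization and fetching steps of the proposed policy can be sketched in software as follows. This is a minimal illustration, not the authors' hardware design: `MissWindow`, `fetch_order` and the tie-break by thread id are hypothetical names and choices introduced here, the sample values are made up, and the predicted L2 miss rates are assumed to come from the per-thread regression model.

```python
from collections import deque


class MissWindow:
    """Keeps the most recent (L1, L2) miss-rate samples for one thread."""

    def __init__(self, window_size):
        # A deque with maxlen drops the oldest sample automatically.
        self.samples = deque(maxlen=window_size)

    def record(self, l1_miss_rate, l2_miss_rate):
        """Store one sample taken at the end of a sampling period."""
        self.samples.append((l1_miss_rate, l2_miss_rate))


def fetch_order(predicted_l2_miss_rate):
    """Return thread ids from highest to lowest fetch priority.

    A lower predicted L2 miss rate means a higher priority.
    Ties are broken by thread id (an assumption made for this sketch).
    """
    return sorted(predicted_l2_miss_rate,
                  key=lambda tid: (predicted_l2_miss_rate[tid], tid))


# Hypothetical samples: the window keeps only the latest three.
window = MissWindow(window_size=3)
for sample in [(0.10, 0.02), (0.12, 0.03), (0.08, 0.01), (0.11, 0.02)]:
    window.record(*sample)

# Hypothetical predicted L2 miss rates for three threads.
order = fetch_order({0: 0.030, 1: 0.010, 2: 0.020})
print(list(window.samples))
print(order)
```

Here thread 1 has the lowest predicted L2 miss rate, so it is fetched first, followed by thread 2 and then thread 0.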

Fetch policies, which assign thread priorities in the fetch stage, are used to manage the shared resources and handle the long-latency load issue:

Policy      Action Timing      Action
STALL [2]   L2 D-Cache Miss    Suspend the thread
DG [3]      L1 D-Cache Miss    Suspend the thread
DWarn [4]   L1 D-Cache Miss    Reduce the thread priority

What did we achieve?

Linearity confirmation
• F values are used to test the linearity between L1 and L2 cache miss rates for various benchmarks, which confirms its significance

Performance improvement
• The proposed policy adaptively minimizes the influence of long-latency loads, because it utilizes an updated statistical model
• It achieves higher resource efficiency, because it reduces priority rather than gating threads

Sensitivity analysis
• A larger sampling period leads to better performance
• A larger L2 cache size means more throughput
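As a hedged illustration of the linearity confirmation, the F value of a simple linear regression with n samples is F = (SSR / 1) / (SSE / (n - 2)), where SSR is the sum of squares explained by the fitted line and SSE is the residual sum of squares. The poster does not state how its F values were computed, so the function name `f_statistic` and the sample data below are assumptions made only for this sketch.

```python
def f_statistic(xs, ys):
    """F value for the simple linear regression of ys on xs."""
    n = len(xs)
    if n != len(ys) or n < 3:
        raise ValueError("need at least three paired samples")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be equal")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    beta = sxy / sxx                 # OLS slope
    alpha = y_bar - beta * x_bar     # OLS intercept
    fitted = [alpha + beta * x for x in xs]
    ssr = sum((f - y_bar) ** 2 for f in fitted)          # explained
    sse = sum((y - f) ** 2 for y, f in zip(ys, fitted))  # residual
    if sse == 0:
        return float("inf")          # perfect linear fit
    return ssr / (sse / (n - 2))


# Hypothetical data, not taken from the poster.
f_value = f_statistic([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(f_value)
```

For this made-up data set the F value is approximately 4.5; a larger F value indicates stronger evidence of a linear relationship.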

The relationship between L1 and L2 cache misses is more complicated than is commonly assumed.

Correlation between L1 and L2 cache miss rates (gzip):

                     L1 cache miss rate   L2 cache miss rate
L1 cache miss rate   1.0000               -0.1792
L2 cache miss rate   -0.1792              1.0000
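Assuming the gzip values above are Pearson correlation coefficients (the poster does not name the measure), the off-diagonal entry could be computed from sampled miss rates along the lines of the sketch below. The function name `pearson_correlation` and the sample values are hypothetical.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient of two equally long samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two paired samples")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        raise ValueError("samples must not be constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical per-period miss rates, not measurements from the poster.
l1 = [0.10, 0.12, 0.08, 0.11]
l2 = [0.020, 0.018, 0.025, 0.019]
r = pearson_correlation(l1, l2)
print(r)
```

A coefficient close to zero or negative, as reported for gzip, means that a high L1 miss rate does not simply imply a high L2 miss rate over the whole run.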

What do we propose?

Within an interval, Ordinary Least Squares (OLS) regression can be employed to describe the relationship, since knowing about L2 misses in advance benefits the system. For the benchmark apsi, the OLS regression yields β = 0.365134 and α = 0.0003677. The linearity between L1 and L2 cache misses is thus statistically modeled.
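A minimal sketch of the OLS step follows. It assumes the model has the form L2 miss rate = α + β × L1 miss rate, with β as the slope and α as the intercept; the poster does not state which coefficient plays which role, and the names `ols_fit` and `predict_l2` as well as the L1 miss rate of 0.05 are hypothetical.

```python
def ols_fit(l1_rates, l2_rates):
    """Return (alpha, beta) minimizing the squared error of
    l2 = alpha + beta * l1 over the sampled window."""
    n = len(l1_rates)
    if n != len(l2_rates) or n < 2:
        raise ValueError("need at least two paired samples")
    x_bar = sum(l1_rates) / n
    y_bar = sum(l2_rates) / n
    sxx = sum((x - x_bar) ** 2 for x in l1_rates)
    if sxx == 0:
        raise ValueError("L1 miss rates must not all be equal")
    sxy = sum((x - x_bar) * (y - y_bar)
              for x, y in zip(l1_rates, l2_rates))
    beta = sxy / sxx
    alpha = y_bar - beta * x_bar
    return alpha, beta


def predict_l2(alpha, beta, l1_rate):
    """Estimate the L2 miss rate from the current L1 miss rate."""
    return alpha + beta * l1_rate


# Hypothetical window of samples lying on l2 = 0.01 + 0.5 * l1.
alpha, beta = ols_fit([0.02, 0.04, 0.06, 0.08], [0.02, 0.03, 0.04, 0.05])

# Coefficients reported for apsi, applied to an assumed L1 miss rate.
apsi_estimate = predict_l2(0.0003677, 0.365134, 0.05)
print(alpha, beta, apsi_estimate)
```

Under these assumptions, an L1 miss rate of 0.05 for apsi gives an estimated L2 miss rate of about 0.0186.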

Who did we reference?

[1] D.M. Tullsen, S.J. Eggers, J.S. Emer, H.M. Levy, J.L. Lo and R.L. Stamm, “Exploiting choice: instruction fetch and issue on an implementable simultaneous multithreading processor”. ISCA, 1996.
[2] D.M. Tullsen and J.A. Brown, “Handling long-latency loads in a simultaneous multithreading processor”. ISCA, 2001.
[3] A. El-Moursy and D.H. Albonesi, “Front-end policies for improved issue efficiency in SMT processors”. HPCA, 2003.
[4] F.J. Cazorla, A. Ramirez, M. Valero and E. Fernandez, “DCache warn: an I-fetch policy to increase SMT efficiency”. IPDPS, 2004.
[5] T.T. Soong, “Fundamentals of probability and statistics for engineers”. John Wiley and Sons, Ltd, 2004.
