CloudLEGO: Scalable Cross-VM-Type Application Performance Prediction Shicong Meng, Arun K. Iyengar, Ling Liu* , Ting Wang, Jian Tan, Ignacio Silva-Lepe Isabelle M. Rouvellou IBM T.J. Watson Research Center {smeng,aruni,twang,tanji,isilval,rouvellou}@us.ibm.com

Abstract Understanding the performance difference of a multi-tier Cloud application between different provisioning plans and workloads is difficult to achieve. A typical IaaS provider offers a variety of virtual server instances with different performance capacities and rental rates. Such instances are often marked with a high level description of their hardware/software configuration (e.g. 1 or 2 vCPUs) which provides insufficient information on the performance of the virtual server instances. Furthermore, as each tier of an application can be independently provisioned with different types and numbers of VMs, the number of possible provisioning plans grows exponentially with each additional tier. Previous work [10] proposed to perform automatic experiments to evaluate candidate provisioning plans, which leads to high cost due to the exponential increase of candidate provisioning plans with the number of tiers and available VM types. While several existing works [8, 6, 7] studied a variety of performance models for multi-tier applications, these works assume that an application runs on a fixed deployment (with fixed machine type and number for each tier). We present CloudLEGO, an efficient cross-VM-type performance learning and prediction approach. Since building a model for each possible deployment is clearly not scalable, instead of treating each candidate deployment separately, CloudLEGO views them as derivatives from a single, fixed deployment. Accordingly, the task c 2013 by the Association for Computing Machinery, Inc. Copyright ⃝ (ACM). Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author. SoCC’13, 1–3 Oct. 2013, Santa Clara, California, USA. ACM 978-1-4503-2428-1. http://dx.doi.org/10.1145/2523616.2525948

* Georgia Institute of Technology [email protected]

of learning the performance of a targeted deployment can be decoupled into learning the performance of the original fixed deployment and learning the performance difference between the original deployment and the targeted one. The key to efficiently capture performance difference between deployments is to find multiple independent changes that can be used to derive any deployment from the original deployment. CloudLEGO formulates such “modular” changes as VM type changes at a given tier. To capture changes of performance at a tier caused by VM type changes, CloudLEGO uses relative performance models [5] which predict the performance difference between a pair of VMs (rather than the absolute performance of a VM) for a given workload. Moreover, training relative performance models requires only performance data from Cloud monitoring services [1, 4] rather than fine-grain data such as per-tier response time which requires application instrumentation. Training relative performance models with traditional passive learning techniques would require a large amount of training data as performance data are collected uniformly in a single batch. We find that different types of VMs often share similar performance for many “regions” of workloads. To leverage this characteristic and guide the profiling to regions with high performance differences, CloudLEGO uses active learning techniques [2, 3, 9] that split the profiling process into multiple stages where data collected in one stage are used to identify high-value regions for the next profiling stage. As a result, it significantly speeds up the convergence of models and the profiling process due to substantially reduced measurement. We deploy CloudLEGO in IBM’s Research Computing Cloud (RC2), an Infrastructure-as-a-Service Cloud, to evaluate its effectiveness. Our results suggest that CloudLEGO provides accurate predictions for various deployments and workloads with only a fraction of training cost incurred by existing techniques.

CloudLEGO: Scalable Cross-VM-Type Application ...

While several existing works [8, 6, 7] studied a variety of performance models for multi-tier applications, these works assume that an application runs on a fixed deployment (with fixed ma- chine type and number for each tier). We present CloudLEGO, an efficient cross-VM-type performance learning and prediction approach.

20KB Sizes 0 Downloads 85 Views

Recommend Documents

Design of Scalable & Simple SIP Application Development ... - IJRIT
Jun 10, 2013 - enterprise emergency notification, mobile conferencing, ... IM to Conferencing, mobile multiplayer gaming, toll-free calling, location based ...

A Scalable Application Placement Controller for ... - Semantic Scholar
IBM T.J. Watson Research Center. 19 Skyline ... Keywords. Dynamic Application Placement, Performance Management. 1. ... applications typically run on top of a middleware system and rely on it to dynamically allocate resources to meet their performanc

Design of Scalable & Simple SIP Application Development ... - IJRIT
Jun 10, 2013 - SIP Application server are used by most telecom operators to ... Also it takes much time to develop applications in java & host on the platform.

Scalable Component Abstractions - EPFL
from each other, which gives a good degree of type safety. ..... Master's thesis, Technische Universität ... Department of Computer Science, EPFL, Lausanne,.

Scalable Offline Monitoring
3 Department of Computer Science, TU Darmstadt, Germany. 4 ABB Corporate Research .... entries from a two year period, requiring 0.4 TB of storage. ..... (1) (¯D, ¯τ,v,i) |= ϕ, for all valuations v and i ∈ N with (v, τi) ∈ R. (2) (¯Dk .....

Scalable Machine Learning.pdf
Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Scalable Machine Learning.pdf. Scalable Machine Learning.pdf.

Scalable High Quality Object Detection
Dec 9, 2015 - posal quality over its predecessor Multibox [4] method: AP increases from 0.42 to ... call compared to Multiscale Combinatorial Grouping [18] with less proposals on the ... proposal ranking function provides a way to balance recall ....

Scalable Component Abstractions - LAMP | EPFL
We identify three programming language abstractions for the construction of .... out to be a convenient way to express required services of a component at the ..... the method call f() does not constitute a well-formed path. Type selection and ...

Scalable Offline Monitoring
entries from a two year period, requiring 0.4 TB of storage. The monitoring takes ... MFOTL's satisfaction relation |= is defined as expected for (i) a time ... we use terms like free variable and atomic formula, and abbreviations such as ...... Conf

Scalable Component Abstractions - LAMP | EPFL
Classes on every level can create objects ... level might be a simple element on the next level of scale. ...... Department of Computer Science, EPFL, Lausanne,.

Scalable Component Abstractions - LAMP - EPFL
software components with static data and hard references, resulting in a ... aspect-oriented programming (indeed, the fragment system .... An important issue in component systems is how to ab- ... this section gives an introduction to object-oriented

SCALABLE MULTIMEDIA FINGERPRINTING ...
Digital fingerprinting is an emerging forensic tool to protect mul- ... and enables to offer stronger protection of multimedia. .... Other criteria give the same trend. 3.

Scalable search-based image annotation - Semantic Scholar
query by example (QBE), the example image is often absent. 123 ... (CMRM) [15], the Continuous Relevance Model (CRM) [16, ...... bal document analysis.

Megastore: Providing Scalable, Highly Available Storage - CIDR
Jan 12, 2011 - 1. INTRODUCTION. Interactive online services are forcing the storage commu- .... networks that connect them to the outside world and the.

Scalable Hierarchical Multitask Learning ... - Research at Google
Feb 24, 2014 - on over 1TB data for up to 1 billion observations and 1 mil- ..... Wc 2,1. (16). The coefficients λ1 and λ2 govern the trade-off between generic sparsity ..... years for each school correspond to the subtasks of the school. ID. Thus 

Scalable Video Summarization Using Skeleton ... - Semantic Scholar
a framework which is scalable during both the analysis and the generation stages of ... unsuitable for real-time social multimedia applications. Hence, an efficient ...

Megastore: Providing Scalable, Highly Available Storage for ...
Jan 12, 2011 - Schemas declare keys to be sorted ascending or descend- ing, or to avert sorting altogether: the SCATTER attribute in- structs Megastore to prepend a two-byte hash to each key. Encoding monotonically increasing keys this way prevents h

Scalable Precomputed Search Trees
They showed that the concept of precomputation can lead to a faster runtime, but ..... International Conference on Robotics and Automation (2009). 4. Green, C.

Scalable Media Streaming to Interactive Users - CiteSeerX
Computer Science Department. Federal ... a high degree of interactivity has been observed in the ac- cesses to ... width Skimming and Patching, two state-of-the-art stream- ..... lyzed a three-year log of accesses to MANIC, an educational.

Scalable search-based image annotation
have considerable digital images on their personal devices. How to effectively .... The application of both efficient search technologies and Web-scale image set ...

DESIGNING HIGH PERFORMANCE AND SCALABLE ...
high performance computing environments by clustering , which combines the ... multiple transport services, hardware multicast, communication management ...

Highly Interactive Scalable Online Worlds - Semantic Scholar
[34] Macedonia, M. R., Brutzman, D. P., Zyda, M. J., Pratt, D. R., Barham, P. T.,. Falby, J., and Locke, J., “NPSNET: a multi-player 3D virtual environment over the. Internet”, In Proceedings of the 1995 Symposium on interactive 3D Graphics. (Mon

Scalable Cores in Chip Multiprocessors
Without question or hesitation, I dedicate this thesis to my beautiful and loving wife, ...... scale, like centralized register files and bypassing networks. ..... purpose is to recover true program order in the midst of actual out-of-order execution