Online Management of Jobs in Clusters using Virtual Machines

Fabien Hermenier

Cluster computing is an attractive solution to meet the growing computational requirements of scientific applications. In this setting, a user organizes a job as a collection of tasks, each of which requires a finite amount of resources for a bounded amount of time. A scheduling algorithm is responsible for selecting the jobs to execute on the cluster by finding, for each task composing the jobs, a node with a sufficient amount of free resources. Static scheduling algorithms allocate each task to a node, with a constant amount of resources, for the whole duration of the job. Such strategies waste resources and produce a non-optimal schedule, as a task does not use all of its allocated resources at all times. By contrast, dynamic scheduling algorithms manipulate, at runtime, the state of the jobs using preemption and the location of their tasks using process migration. These algorithms make finer use of resources by allocating them according to the current demand instead of the users' estimates.

In practice, dynamic scheduling strategies are hard to deploy on clusters. First, the actions that manipulate running tasks are tedious to implement in a non-intrusive way. Second, executing them consumes time and resources, and misusing them leads to a high computational overhead. Finally, each cluster has its own architecture and its own objectives or constraints, which may not fit the available scheduling strategies.

In this thesis, we investigate how to ease the development and the use of dynamic scheduling strategies. First, we propose to use virtual machines (VMs) to execute the jobs in their legacy environment. In addition, virtualization provides the actions needed to manipulate VMs in the way dynamic scheduling strategies manipulate tasks: live migration relocates a VM that embeds a task with negligible downtime, while the suspend-to-disk and resume actions provide job preemption.
To ease the development of specific dynamic scheduling strategies, we propose to use constraint programming (CP), a flexible approach to model and solve combinatorial problems. With this approach, the developer focuses only on modelling the problem by stating the constraints (logical relations) that the solutions must satisfy. Hence, we have developed a constraint programming model for a core dynamic scheduling problem, as well as additional composable constraints that specify the strategy by restricting the state or the location of VMs. Finally, we introduce the concept of dynamic reconfiguration, a generic method to perform the transition between the current schedule of the VMs and the newly computed one. Relying on a model that estimates the duration and the performance impact of each action, our reconfiguration module, also based on CP, plans the actions to ensure their feasibility and to minimize the total duration of their execution.

We have validated our approach with a prototype named Entropy [1], which was used to implement two use cases. The first provides dynamic consolidation by packing all the running VMs on the minimum number of nodes while satisfying their CPU and memory requirements. This strategy reduces the energy consumption of the cluster when unused nodes are turned off. Our second use case reduces the complexity of developing dynamic scheduling strategies. Using dynamic reconfiguration, the developer only provides an algorithm that selects the jobs to run, while the reconfiguration module finds a host for each selected VM and provides an efficient transition to the new schedule.

We have evaluated Entropy through several experiments, using both simulations with up to 200 nodes and 400 VMs and clusters with up to 35 nodes and 70 VMs. We have observed that our approach to dynamic reconfiguration reduces the duration of a reconfiguration by up to 70% compared with the First Fit Decreasing (FFD) heuristic. This has led to a reactive system that quickly fixes non-optimal schedules and provides an efficient use of resources for each strategy we have developed. For dynamic consolidation, we have compared our implementation with the common FFD heuristic. Our solution performs twice as many reconfigurations and uses half as many resources to execute the same workload.
For our second strategy, we have developed a sample scheduling algorithm, similar to First Come, First Served but which executes a workload of 11 jobs 40% faster.

[1] http://entropy.gforge.inria.fr
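
As an illustration of the FFD heuristic that the consolidation results are compared against, the following is a minimal sketch, not the Entropy implementation and not its CP model. The `VM` and `Node` classes, their field names, the resource units, and the choice to order VMs by memory and then CPU are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class VM:
    name: str
    cpu: int  # CPU demand, arbitrary units (assumed for this sketch)
    mem: int  # memory demand in MB (assumed for this sketch)


@dataclass
class Node:
    name: str
    cpu: int  # CPU capacity
    mem: int  # memory capacity
    vms: list = field(default_factory=list)

    def fits(self, vm: VM) -> bool:
        """True if the node has enough free CPU and memory for vm."""
        used_cpu = sum(v.cpu for v in self.vms)
        used_mem = sum(v.mem for v in self.vms)
        return used_cpu + vm.cpu <= self.cpu and used_mem + vm.mem <= self.mem


def first_fit_decreasing(vms, nodes):
    """Place each VM on the first node that can host it.

    VMs are taken in decreasing order of demand; sorting by (mem, cpu)
    is an assumption of this sketch. Returns {vm name: node name} and
    raises ValueError if some VM fits on no node.
    """
    placement = {}
    for vm in sorted(vms, key=lambda v: (v.mem, v.cpu), reverse=True):
        for node in nodes:
            if node.fits(vm):
                node.vms.append(vm)
                placement[vm.name] = node.name
                break
        else:
            raise ValueError(f"no node can host {vm.name}")
    return placement


def used_nodes(nodes):
    """Names of the nodes hosting at least one VM."""
    return [n.name for n in nodes if n.vms]
```

Nodes left empty by the placement are the candidates for being turned off. The sketch only computes a target placement; it says nothing about the order or cost of the migrations needed to reach it, which is the part the reconfiguration module addresses.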
