Cluster management at Google with Borg
2015-06 dotScale
john wilkes / [email protected]
Principal Software Engineer
Derived from EuroSys'15 paper (http://goo.gl/1C4nuo)

Cluster management at Google with the system we internally call Borg

Image by Connie Zhou

User view

job hello_world = {
  runtime = { cell = 'ic' }             // Cell (cluster) to run in
  binary = '.../hello_world_webserver'  // Program to run
  args = { port = '%port%' }            // Command line parameters
  requirements = {                      // Resource requirements
    ram = 100M
    disk = 100M                         // (optional)
    cpu = 0.1
  }
  replicas = 510000                     // Number of tasks
}

User view

Binary

User view
What just happened?
[Diagram: a config file (via borgcfg) and web browsers talk to a Cell. The cell contains replicated BorgMasters (each with UI, read/UI shard, and link shard) over a persistent store (Paxos), a scheduler, and Borglets.]

User view
[Figure: many copies of "Hello world!"]

Image by Connie Zhou

User view

Failures

[Figure: task-eviction rates and causes]

Failures

A 2000-machine service will have >10 task exits per day.
This is not a problem: it's normal.
Images by Connie Zhou

Efficiency
Advanced binpacking algorithms
[Figure: experimental placement of production VM workload, July 2014. Labels: one machine, available resources, stranded resources.]
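The placement idea on this slide can be illustrated with a minimal sketch. It is not Borg's scheduler: the `Machine` class, the best-fit scoring rule, and the two-resource model are all assumptions made for illustration. "Stranded" is taken here to mean free capacity of one resource on a machine whose other resource is fully allocated.

```python
from dataclasses import dataclass


@dataclass
class Machine:
    cpu: float             # total CPU cores
    ram: float             # total RAM in GiB
    used_cpu: float = 0.0
    used_ram: float = 0.0

    def free(self):
        return self.cpu - self.used_cpu, self.ram - self.used_ram

    def fits(self, cpu, ram):
        free_cpu, free_ram = self.free()
        return cpu <= free_cpu and ram <= free_ram


def place_best_fit(machines, tasks):
    """Place each (cpu, ram) task on the feasible machine with least leftover."""
    unplaced = []
    # Largest CPU request first, a common bin-packing heuristic.
    for cpu, ram in sorted(tasks, reverse=True):
        candidates = [m for m in machines if m.fits(cpu, ram)]
        if not candidates:
            unplaced.append((cpu, ram))
            continue
        # Score = normalized leftover after placement (smaller is tighter).
        best = min(
            candidates,
            key=lambda m: (m.free()[0] - cpu) / m.cpu + (m.free()[1] - ram) / m.ram,
        )
        best.used_cpu += cpu
        best.used_ram += ram
    return unplaced


def stranded(machines, eps=1e-9):
    """Sum free CPU and RAM on machines whose other resource is exhausted."""
    stranded_cpu = stranded_ram = 0.0
    for m in machines:
        free_cpu, free_ram = m.free()
        if free_ram <= eps and free_cpu > eps:
            stranded_cpu += free_cpu
        if free_cpu <= eps and free_ram > eps:
            stranded_ram += free_ram
    return stranded_cpu, stranded_ram
```

With two 4-core, 16 GiB machines and the tasks `(4, 8)` and `(1, 16)`, one machine ends up with no free CPU and the other with no free RAM, so the leftover RAM on the first and the leftover CPU on the second are both stranded.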

Efficiency
Multiple applications per machine
[Figure: tasks per machine. Source: CPI^2 paper, EuroSys 2013.]

Efficiency
Sharing clusters between prod/batch helps.
Segregating them would need more machines.
[Figure: # machines for the shared cell (original), the shared cell (compacted), the prod-only load (compacted), and the non-prod load (compacted). Additional label: overhead.]
[Figure: waste, for 15 production cells from a larger pool, omitting small ones (<5000 machines).]

Efficiency
Resource reclamation
limit: amount of resource requested
reservation: estimate of future usage
usage: actual resource consumption
[Figure: limit, reservation, and usage over time, with potentially reusable resources marked.]
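The relationship between the three quantities can be sketched in a few lines. The slide does not say how the reservation is computed, so the peak-plus-margin estimate below is an assumption, as are the function names and the reading of "potentially reusable" as the gap between limit and reservation.

```python
def reservation(recent_usage, limit, safety_margin=0.1):
    """Estimate future usage as the recent peak plus a margin, capped at the limit.

    The peak-plus-margin rule is an illustrative assumption, not Borg's estimator.
    """
    peak = max(recent_usage)
    return min(limit, peak * (1.0 + safety_margin))


def reusable(recent_usage, limit, safety_margin=0.1):
    """Resources between the reservation and the limit."""
    return limit - reservation(recent_usage, limit, safety_margin)
```

For example, a task with a limit of 4.0 cores whose recent usage peaked at 2.0 cores gets a reservation of 2.5 cores at a 25% margin, leaving 1.5 cores potentially reusable. A larger margin is safer for the task but reclaims less.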

Efficiency
Resource reclamation could be more aggressive
[Figure: data from Nov/Dec 2013]

A few other moving parts
[Diagram: the Borg cell again: config file, borgcfg, and web browsers; BorgMaster replicas with UI, read/UI shards, and link shards; persistent store (Paxos); scheduler; Borglets.]
[Diagram: job config, master, agent, and app. Later builds add system config, security, accounting/planning (accounting/billing in the last build), storage, monitoring, and binaries + data distribution.]
Diagram from an original by Cody Smith.

Kubernetes
κυβερνήτης: pilot or helmsman of a ship
http://kubernetes.io

Kubernetes
Direct Borg analogues:
● Borg containers => Docker containers
● alloc (task group) => pod (container group)
● Borglet => Kubelet
● persistent, declarative specs
● reconciliation loops
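The last two items work together: a persistent, declarative spec states what should be running, and a reconciliation loop repeatedly compares that spec with what is observed and acts on the difference. The sketch below shows one pass of such a loop. It is not Borg or Kubernetes code; the `reconcile` function and its data shapes are hypothetical.

```python
def reconcile(desired, observed):
    """Return actions that move the observed state toward the desired spec.

    desired:  {job_name: replica_count}
    observed: {job_name: [task_id, ...]} of currently running tasks
    """
    actions = []
    for job, want in sorted(desired.items()):
        running = observed.get(job, [])
        if len(running) < want:
            actions.append(("start", job, want - len(running)))
        elif len(running) > want:
            # Stop the surplus tasks (here: the last ones listed).
            actions.append(("stop", job, running[want:]))
    for job in sorted(set(observed) - set(desired)):
        # Tasks with no spec at all are stopped.
        if observed[job]:
            actions.append(("stop", job, list(observed[job])))
    return actions
```

Because each pass is computed from the spec and the current state alone, a crashed task simply shows up as a shortfall on the next pass, and a converged system yields an empty action list.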

Kubernetes
New / improved:
● labels + label queries
● service abstraction
● composable microservices
● IP per pod
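Labels are key/value pairs attached to objects, and a label query selects the objects whose labels match. A minimal sketch of equality-based matching follows. The `matches` and `select` helpers and the sample pod names are hypothetical and are not the Kubernetes API.

```python
def matches(labels, selector):
    """True if every key/value pair in the selector appears in the labels."""
    return all(labels.get(k) == v for k, v in selector.items())


def select(pods, selector):
    """Names of pods whose labels satisfy the selector."""
    return [name for name, labels in pods.items() if matches(labels, selector)]


pods = {
    "a": {"app": "web", "tier": "frontend"},
    "b": {"app": "web", "tier": "backend"},
    "c": {"app": "db"},
}
web_pods = select(pods, {"app": "web"})
```

A service abstraction can be built on the same idea: the service names a selector, and whichever pods currently match it receive the traffic.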

Observations:
1. Resiliency is achieved only by ruthless attention to detail
   a. ubiquitous software fault tolerance
   b. persistent, declarative specs
2. We get efficiency by:
   a. sharing resources
   b. reclaiming unused allocations
3. Containers make users more productive

[email protected]
http://kubernetes.io
http://goo.gl/1C4nuo (Borg paper)
Images by Connie Zhou
