Streamlining HPC Workloads with Containers @DustinKirkland

what does high-performance computing look like?

Wikipedia says...

Or perhaps in China...

Google image search shows...

The university student learns...

HackerNews suggests...

x1.32xlarge Your DevOps engineer launches...

But then there is your real, actual data center...

what do all of these have in common?

a lot, actually

they’re all running Linux

directly on the bare metal itself

performance is maximized

overhead is minimized

big problems are distributed across a cluster

everyone prefers a clean environment

virtual machines always involve overhead

Guest VM Exit VMXON

VM Entry

VM Monitor

VMXOFF

oh, and let’s reboot a datacenter

BIOS is checking memory for problems… Scanning 1,199,511,627,776 bytes… This may take several minutes… Running test 1 of 8: 1.0% complete Overall test status: 0.1% complete Time elapsed: 17m23s Status: No problems have been found yet.

so let’s have a look at containers

first, process containers

awesome for HPC functions

LXD

second, machine containers

➢ Ultra fast “vm-lite” guests (bare metal speed) lxc distribution of Linux lxc - e.g. Ubuntu, CentOS lxc ➢ Any machine ➢ machine Starts in less than machine 1 second ➢ 15x density of KVM or ESX for idle workloads

lxc machine

kernel

lxd

host A

kernel

lxd

kernel

host B

lxd

host C

kernel

lxc machine

lxd

kernel

host D

LXD REST API nova-lxd

lxc cli

other restful apps

host ...

lxd

lxc machine

kernel

lxd

host A

➢ Ultra fast “vm-lite” guests (bare metal speed) ➢ Any lxcdistribution of Linux lxc- e.g. Ubuntu, CentOS lxc ➢ machine Starts in less than 1 machine second machine ➢ 15x density of KVM or ESX for idle workloads kernel

lxd

kernel

host B

lxd

host C

kernel

lxd

lxc machine

kernel

host D

LXD REST API nova-lxd

lxc cli

other restful apps

host ...

lxd

One LXD container, with 100% of the system: CPU Cores CPU Cycles Memory Disk Space Disk IO Network IO

“alloy” mode

exclusive access to system resources

but secured from the underlying hardware and OS

cgroups, user namespaces, apparmor, seccomp

instant startup

looks like a machine, Linux on Linux

zero latency

zero overhead

identical performance

snapshot restore

live migration

demo

ubuntu.com/lxd github.com/lxc linuxcontainers.org

Streamlining HPC Workloads with Containers.pdf

Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying... Download. Connect more ...

4MB Sizes 1 Downloads 199 Views

Recommend Documents

Streamlining HPC Workloads with Containers.pdf
Google image search shows... Page 5 of 47. Streamlining HPC Workloads with Containers.pdf. Streamlining HPC Workloads with Containers.pdf. Open. Extract.

How enterprises are streamlining with Chromebooks Services
Breaking new ground. The DCS had gone through a big transition. The state had merged three different departments into the DCS, which tripled the number of ...

Stressing Linux with Real-World Workloads
More open source tests are micro benchmarks focused on individual ... hardware/software solutions. ○. Utilizing open ..... Recruiting members of the community ...

Stressing Linux with Real-World Workloads
DBT-1 Component Diagram. Database. Server. Application. Server. Driver. Database. Cache. Pool of database connections. Pool of database connections.

Stressing Linux with Real-World Workloads
TPC is a non-profit corporation created to define database benchmarks with which ..... [TPCH10000GB] NCR 5350 Using Teradata. V2R5.0 Executive Summary, ...

HPC-C Labeling - FDA
Blood Center at 1-866-767-NCBP (1-866-767-6227) and FDA at 1-800-FDA- .... DMSO is not removed, and at 4 °C for up to 24 hours if DMSO is removed in a ...... Call the Transplant Unit to advise them that the product is ready for infusion if ...

HPC Requirements.pdf
Sign in. Page. 1. /. 1. Loading… Page 1 of 1. Page 1 of 1. HPC Requirements.pdf. HPC Requirements.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying HPC Requirements.pdf.

The Feasibility of Streamlining Aid.pdf
This extends the analysis. of Dynarski and Scott-Clayton (2006) which was. limited to dependent students. Page 3 of 13. The Feasibility of Streamlining Aid.pdf.

HPC Colony II
System software project funded by DOE Office of Science FastOS Award. ▫ Partners include .... L2 cache local memory .... Scheme 1: checkpoint to file-system.

HPC-ACCOUNT-REQUEST.pdf
pull down, select: USER ACCOUNT REQUEST. 3) In the “Describ your Issue”. box,enter: Request Access. Research HPC Systems. If not the PI of a grant, please.

HPC Processing of LIDAR Data
Feb 4, 2005 - processors, distributed memory, and message-passing software libraries. An enhancement is .... This section describes the software development issues associated with designing ...... Conversely, custom-designed .NET Web ...

Simul8 HPC Environment technical specifications.pdf
Page 1 of 1. Deep dive into technical specifications of our machine. Within the last few years, we have seen the transformative impact of deep learning in many.

SC16 HPC Training Workshop-final
SC16 Workshop: Best Practices for HPC Training. 1. Third SC Workshop on Best Practices for HPC Training. Abstract (150 ... 9:00-9:10 am. Welcome and Goals ... www.hpcuniversity.org portal and multiple social media avenues. Timeline.

HPC Processing of LIDAR Data
high-performance LIDAR data processing—in light of the design criteria set ... for the appropriate parallel computing system, data processing algorithms, and.

Migrate data & workloads to the cloud Services
By moving to Google Cloud Platform and using Datadog to improve application monitoring, [we], can quickly launch new services and features that will help us succeed in a changing market. “ Garrett Plasky, Technical Operations Manager, Evernote. ”

Migrate data & workloads to the cloud - Services
savings compared with rigid, predefined machine types. Our global network delivers ultra-low-latency data transfer. After you migrate to GCP, you can selectively replatform onto a growing ecosystem of modern services: big data analytics, AI/ machine

GPU Support for Batch Oriented Workloads
millions of files. BigTable is the primary storage system for several Google applications and handles tens of thousands of lookup operations per second. BigTable ... Bloom filters a good baseline for comparison, useful in providing a lower bound for

Scheduling Mixed Workloads in Multi-grids: The Grid ...
pools (which we call grids) that vary significantly in their ... tion level for a task is dictated by the task's complexity. .... in any way. ...... In 16th Conference on Un-.

Utilizing new Esri technologies for streamlining web GIS.pdf ...
Sign in. Page. 1. /. 1. Loading… Page 1 of 1. Sl no. Rollno NAME DOB Posting. 1 1201000028 LAKSHMINARAYANA S 23/01/1990 Mysuru. 2 1201000036 ...