MemzNet:  Memory-­‐Mapped  Zero-­‐copy  Network  Channel  for  Moving  Large  Datasets  over  100Gbps  Networks   Mehmet  Balman    (  [email protected])     Memory-­‐mapped  Network  Channel  Framework  

MemzNet’s  Architecture  for  data  streaming  

Computa8onal  Research  Division,  Lawrence  Berkeley  Na8onal  Laboratory   Collaborators:  Eric  Pouyoul,  Yushu  Yao,  E.  Wes  Bethel,  Burlen  Loring,  Prabhat,  John  Shalf,  Alex  Sim,     Arie  Shoshani,  Dean  N.  Williams,  Brian  L.  Tierney    

Increasing   the   bandwidth   is   not   sufficient   by   itself;   we   need   careful   evalua8on   of   future   high-­‐ bandwidth  networks  from  the  applica8ons'  perspec8ve.  We  require  enhancements  in  current   middleware   tools   to   take   advantage   of   future   networking   frameworks.   To   improve   performance   and   efficiency,   we   develop   an   experimental   prototype,   called   MemzNet:   Memory-­‐mapped   Zero-­‐copy   Network   Channel,   which   uses   a   block-­‐based   data   movement   method   in   moving   large   scien8fic   datasets.   We   have   implemented   MemzNet   that   takes   the   approach  of  aggrega8ng  files  into  blocks  and  providing  dynamic  data  channel  management.   We  present  our  ini8al  results  in  100Gbps  networks.    

SC11 100Gbps Demo Configuration

Measurement  in  ANI  100Gbps  Testbed   3  hosts,  each  connected  with  4  10Gbps  NICs  to   100Gbps  router  

Climate  Data-­‐file  characterisGcs   ²  Many  small  files   ²  One  of  the  fastest  growing  scien8fic  datasets   ²  Distributed  among  many  research  ins8tu8ons   around  the  world   ²  Requires  high-­‐performance  data  replica8on.  

Features  

File   size   distribu8on   in   IPCC   Fourth   Assessment   Report   (AR4)   phase   3,   the   Coupled   Model   Intercomparison   Project   (CMIP3)       ² Many   TCP   sockets   oversubscribe   the   network   and   cause   performance     degrada8on.       ² Host  system  performance  could  easily  be  the  bocleneck.    

Moving  Climate  Files  Efficiently  

•  Data   files   are   aggregated   and   divided   into   simple   blocks.     Blocks   are   tagged   and   streamed   over   the   network.   Each   data   block’s   tag   includes   informa8on  about  the  content  inside.     •  Decouples  disk  and  network  IO  opera8ons;  so,  read/write  threads  can   work  independently.       •  Implements   a   memory   cache   managements   system   that   is   accessed   in   blocks.   These   memory   blocks   are   logically   mapped   to   the   memory   cache  that  resides  in  the  remote  site.     •  The   synchroniza8on   of   the   memory   cache   is   accomplished   based   on   the   tag   header.   Applica8on   processes   interact   with   the   memory   blocks.  Enables  out-­‐of-­‐order  and  asynchronous  send  receive     •  MemzNet  is    is  not  file-­‐centric.  Bookkeeping  informa8on  is  embedded   inside   each   block.   Can   increase/decrease   the   number   of   parallel   streams  without  closing  and  reopening  the    data  channel.  

ANI testbed 100Gbps (10x10NICs, three hosts): CPU/Interrupts vs the number of concurrent transfers [1, 2, 4, 8, 16, 32 64 concurrent jobs - 5min intervals], TCP buffer size is 50M

Performance  

GridFTP

Special   Thanks   Peter   Nugent,   Zarija   Lukic   ,   Patrick   Dorn,   Evangelos   Chaniotakis,   John   Christman,   Chin   Guok,   Chris   Tracy,   Lauren   Rotman,   Jason   Lee,   Shane   Canon,   Tina   Declerck,   Cary   Whitney,   Ed   Holohan,     Adam   Scovel,   Linda   Winkler,   Jason   Hill,   Doug   Fuller,     Susan   Hicks,   Hank   Childs,   Mark   Howison,   Aaron   Thomas,  John  Dugan,  Gopal  Vaswani  

(a) total throughput vs. the number of concurrent memory-to-memory transfers, (b) interface traffic, packages per second (blue) and bytes per second, over a single NIC with different number of concurrent transfers. Each peak represents a different test; 1, 2, 4, 8, 16, 32, 64 concurrent streams per job were initiated for 5min intervals  

  Acknowledgements:   This   work   was   supported   by   the   Director,   Office   of   Science,   Office   of   Basic   Energy   Sciences,   of   the   U.S.   Department   of   Energy   under   Contract   No.   DE-­‐AC02-­‐05CH11231.   This   research   used   resources   of   the   ESnet   Advanced   Network   Ini8a8ve  (ANI)  Testbed,  which  is  supported  by  the  Office  of  Science  of  the  U.S.  Department  of  Energy  under  the  contract  above,   funded  through  the  The  American  Recovery  and  Reinvestment  Act  of  2009  

MemzNet

SC11  demo:  GridFTP  vs  memzNet References  

ANI Tetbed: Throughput comparison

²  Mehmet  Balman  et  al.,  Experiences  with  100Gbps  Network  Applica8ons.  In  Proceedings  of  the  fi0h  interna2onal  workshop  on  Data-­‐ Intensive   Distributed   Compu2ng,   in   conjunc8on   with   the   ACM   Symposium   on   High-­‐Performance   Parallel   and   Distributed   Compu8ng   (HPDC’12),  June  2012.   ²  Mehmet    Balman,  Streaming  Exa-­‐scale  data  over  100Gbps  Networks,  IEEE  Compu8ng  Now,  Oct  2012.  

Memory-‐mapped Network Channel Framework ...

Memory-‐mapped Zero-‐copy Network Channel, which uses a block-‐based data movement method in moving large scien^fic datasets. We have implemented ...

6MB Sizes 0 Downloads 49 Views

Recommend Documents

A Unified Framework and Algorithm for Channel ... - Semantic Scholar
with frequency hopping signalling," Proceedings of the IEEE, vol 75, No. ... 38] T. Nishizeki and N. Chiba, \"Planar Graphs : Theory and Algorithms (Annals of ...

A Unified Framework and Algorithm for Channel ...
Key words: Wireless networks, channel assignment, spatial reuse, graph coloring, .... Figure 1: Max. degree and thickness versus (a) number of nodes, with each ...

A SCALING FRAMEWORK FOR NETWORK EFFECT PLATFORMS.pdf
Page 2 of 7. ABOUT THE AUTHOR. SANGEET PAUL CHOUDARY. is the founder of Platformation Labs and the best-selling author of the books Platform Scale and Platform Revolution. He has been ranked. as a leading global thinker for two consecutive years by T

Separation of Source-Network Coding and Channel ...
Email: [email protected]. Abstract—In this paper we prove the separation of source- network coding and channel coding in a wireline network, which is a ...

Adaptive virtual channel partitioning for network-on ... - GT comparch
cс 2013 ACM 1084-4309/2013/10-ART48 $15.00. DOI: http://dx.doi.org/10.1145/2504906. ACM Transactions on Design Automation of Electronic Systems, Vol.

Adaptive Distributed Network-Channel Coding For ...
cooperative wireless communications system with multiple users transmitting independent ...... Cambridge: Cambridge University Press, 2005. [13] SAGE, “Open ...

/ CHANNEL(
May 1, 2009 - endpoints in a data network via a monitoring unit and a control unit. .... Nr. 21, (Oct. 1989), Cited on EP Search RepOIt for EP application.

Degrees of Freedom of the Network MIMO Channel ...
... using AP-ZF with HQ follows then directly. Proposition 6. For K ≥ 3, it is optimal to choose the passive TX to be TX j with j = nHQ defined in (47), for all the data streams. The number of DoFs achieved with Active-Passive ZF based on Hierarchi

Adaptive virtual channel partitioning for network-on-chip ... - CompArch
memory controllers, and this type of on-chip network will have a significant impact ... [email protected]; H. Kim, School of Computer Science, Georgia ...

The Impact of Channel Bonding on 802.11n Network ...
aggregation, channel bonding, and MIMO [6, 25, 27, 22]. ..... formance of 40MHz versus 20MHz channels under varying ..... to Interference-plus-Noise Ratio).

Adaptive virtual channel partitioning for network-on ... - GT comparch
and GPU applications, we can guarantee a minimum service in the network to each ... (1) We propose a feedback-directed virtual channel partitioning (VCP) ...

Separation of Source-Network Coding and Channel ...
Center for Mathematics of Information .... each node a has observed a length-L block of the process. U(a), i.e., U. (a),jL ..... By the data processing inequality [11],.

/ CHANNEL(
May 1, 2009 - Huang, G M., et al., “A New Had Algorithm for Optimal Routing of. Hierarchically ... Kieser, H “Software fur eine digitale Vermittlungsanlage mit modularem ... egies between Lost-call-cleared and Reservation Traf?c”, Proceed.

Linear Network Codes: A Unified Framework for ... - Semantic Scholar
This work was supported in part by NSF grant CCR-0220039, a grant from the Lee Center for. Advanced Networking, Hewlett-Packard 008542-008, and University of ..... While we call the resulting code a joint source-channel code for historical ...

TinyNET: A Tiny Network framework for TinyOS
layer operations and exchange of parameters; iii) allows fast reconfiguration of applications through new protocols and functionalities, that transparently become a part of the layered network stack. Our framework operates on top of TinyOS, but below

Linear Network Codes: A Unified Framework for ... - Caltech Authors
code approaches zero as n grows without bound for any source U with H(U) < R. The fixed-rate, linear encoder is independent of the source distribution; we use distribution-dependent typical set decoders for simplicity. Let an be an ⌈nR⌉ × n matr

Linear Network Codes: A Unified Framework for ... - Semantic Scholar
Page 1 ..... For any n × ⌊nR⌋ matrix bn, we can build a linear channel code with .... For any n × n matrix cn, we can build a joint source-channel code for the.

A. Channel Setting - GitHub
B. User ID setting. D. Sleeping Time Setting. E. Mode Setting. Memory Mode. Normal Mode. 8. 9. CHSET. 6. Data Count. 60 Seconds. 5 Minutes. Data Delete.

channel
95 lbased systems - the standby tirne of the phone. This paper ... There are basically the following operating scenar- ios for the Paging and Access Channel: Call origina- tion, Call ... tonomously enters the System Access State and sends.

channel
95 lbased systems - the standby tirne of the phone. This paper proposes a novel scheme ... The Paging Channel is used to communicate from the base station to ...