A Performance study on Operator-based stream processing systems

Miyuru Dayarathna, Souhei Takeno, Toyotaro Suzumura Department of Computer Science Tokyo Institute of Technology Japan

Stream Computing Systems 

Insights from data in motion ◦ It is impossible to store data on disk ◦ The volume of the data is very large



Process data on-the-fly in-memory Route keyless input events

OP 1

Join the serve and click events

OP 2

BotFilter

OP 3

Compute the correct click throughput rate

OP 4

Streaming Click-Through Rate Computation

2

Essence of our Performance Study System S (IBM) and S4 (Yahoo)  Four benchmarks (60 application Scenarios)  Five metrics 

3

12

140

Thousands

Thousands

Results - Throughput Throughput observed for four applications on S4

Throughput observed for five applications on System S CDR

120

10

VWAP

6 CDR Optimized

4

VWAP Twitter

2

Micro-benchmark

Throughput (Tuples\s)

Throughput (Events\s)

100 8

Micro-benchmark CDR Optimized

80 Twitter

60 40 20

CDR

0

0 0

2

4

6

8

Number of Nodes

(c)

10

12

14

0

2

4

6

8

10

12

14

Number of Nodes

(d)

4

Essence of our Performance Study System S (IBM) and S4 (Yahoo)  Four benchmarks (60 application Scenarios)  Five metrics  Conclusions on Stream Processing system architectures 

5

A Performance study on Operator-based stream processing systems

Department of Computer Science ... It is impossible to store data on disk. ◦ The volume of the data is very large. Process data on-the-fly in-memory. OP. 1. OP.

654KB Sizes 2 Downloads 227 Views

Recommend Documents

A Performance study on Operator-based stream ...
Operator-based stream processing systems. Miyuru Dayarathna, Souhei Takeno, Toyotaro Suzumura. Department of Computer Science. Tokyo Institute of ...

A Performance Study on Operator-based Stream ...
ios) on System S and S4 gave us sufficient insight to what kind of processing happen in the both the systems. It became clear from the throughput comparisons ...

Concurrent Stream Processing - GitHub
... to SQL and execute federated queries across data sources. ... where source data arrives in streams. .... or a single input pipe that is copied to two destinations.

A High-Performance Graph Processing Library on the ...
Gunrock of- fers: -the best performance on GPU graph analytics; ... build/lib/libgunrock.so'). ### read in .... •Gunrock Website: http://gunrock.github.io/. •Author's ...

A comparative study on engine performance and emissions of ...
Page 1 of 7. Indian Journal of Engineering & Materials Sciences. Vol. 21, August 2014, pp. 438-444. A comparative study on engine performance and emissions of biodiesel and JP-8. aviation fuel in a direct injection diesel engine. Hasan Yamika. , Hami

Enhanced Stream Processing in a DBMS Kernel
bounds, a system should provide incremental processing to avoid considering the same data over and over ... Management Systems (DBMS) and thus the pioneering Data Stream. Management Systems (DSMS) architects ... window as a peephole on the data conte

Combining Computer Vision & Data Stream Processing - eSprockets
computer-vision techniques and large-scale-data-stream processing algorithms to .... sub-fingerprint with the maximum score is the best match on that spectral image. .... Finding interesting associations without support pruning. Knowledge and ...

Performance Main Stream Enthusiast
PV-T73P-UDS(*). GF 7600 GS. --. 500. 128bit DDR2. 256MB. 900MHz. Heatsink. S-Video, Dual DVI, HDTV. Standard. Yes. Yes. PV-T73P-UDJ(*). GF 7600 GS.

Performance Main Stream Enthusiast
Heatsink. S-Video, Dual DVI, HDTV. Standard. Yes. Yes. PV-T73P-YDJ(*). GF 7600 GS. --. 400. 128bit DDR2. 512MB. 533MHz. Fansink. S-Video, VGA, DVI, ...

Watershed: A High Performance Distributed Stream ...
each node with an Intel R Core. TM. 2 CPU 6420 @2.13GHz equipped with ... tory landing without abrupt degradation. The graph also shows that Watershed is ...

Network Security on safety-critical systems: a case study ... - GitHub
SFD | Start-of-Frame Delimiter, 1 octet of 0xd5. DA / SA | MAC Destination Address / MAC Source Address ..... 11:56:57.340515 00:00:00:00:00:01 > 00:1f:16:37:b1:3d, ethertype IPv4. (0x0800), length 79: (tos 0x0, ttl 64, id 0, offset 0, flags ..... ht

An Experimental Study on Basic Performance of Flash ...
The simulator is expected to be effective to design flash-based database ... calculated the trend line for each data series. The ... RAID 0, 1, 5 and 10. Seagate ...

Experimental Study on Mobile RFID Performance - Springer Link
a realistic mobile setting. ... understanding about the reading performance for mobile scenario. ..... ysis of energy consumption for ISO 18000-7 RFID networks.

Study on hydraulic performance of drip emitters by ...
the experimental data. To study ... velocity of the flow, R (m) the hydraulic radius of flow, and m .... number of simulation, and m is the total number of data (m = 9.

A Performance-based Approach for Processing Large ...
ing of such files can occur for instance, in the conversion of the XML data to .... fitted with four gigabytes of DDR2 RAM and a 160 gigabyte SATA harddrive.