A Performance Study on Operator-based Stream Processing Systems Miyuru Dayarathna∗ , Souhei Takeno∗ , Toyotaro Suzumura∗† ∗ Department

of Computer Science, Tokyo Institute of Technology, Japan † IBM Research - Tokyo, Japan [email protected], [email protected], [email protected]

Abstract—This short paper compares and contrasts performance characteristics of System S and S4, two stream processing systems which use operator-based programming model. Our aim is to investigate and characterize which architecture is better for handling which type of stream processing workloads and observe the reasons for such characteristics.

Thousands

12

Throughput of five applications on S4 10

140

Throughput of five applications on System S 120

CDR VWAP

100

8

6 CDR Optimized

4

VWAP Twitter

2

Micro-benchmark

CDR

0 0

2

4

6 8 Number of Nodes (a)

10

12

Throughput (Tuples\s)

Throughput (Events\s)

Thousands

Stream processing [1] has emerged as an exciting new filed to support online information processing activities. Currently there is a growing attention towards operator-based stream programming models. We conducted this study on System S [2] and S4 [3] which are currently two of the most prominent systems developed following this model. The contribution of this work is the identification and characterization of performance and trade-offs of the two different stream processing systems with respect to their underlying architectural design principles. Such result will be useful for system designers and developers to identify what type of stream programming models would suit for their needs. We use three different stream programs (ApplicationSpecific Benchmarks) which are are used for different purposes such as Call Detail Record Processing (CDR), Volume Weighted Average Price calculation (VWAP) and Twitter trend detection. Then we use a micro-benchmark to get basic characterization of the two systems’ performance.

Micro-benchmark

80

CDR Optimized Twitter

60 40 20 0

14

0

2

4

6 8 Number of Nodes (b)

10

12

14

Fig. 1: Throughput comparison of sample application on System S and S4. The results we obtained by running ten different applications (totaling 50 applications considering 1,2,4,8,12 node scenarios) on System S and S4 gave us sufficient insight to what kind of processing happen in the both the systems. It became clear from the throughput comparisons shown on Figure 1 that these two stream programming models should be used carefully to maximize the throughput. E.g. A SPADE program written with single source operator might not scale well in different hardware configurations. Also a S4 application that generates

huge numbers of PEs for incoming events cannot scale well. Yet introduction of multiple source operators resulted in a 3.2 times speedup for CDR application on System S and a 1.34 times speed up for reducing 0.1 million Aggregator PEs of S4 to 100 PEs which indicated possible avenues for performance improvements. While Java based stream processing system architectures are gaining considerable attention due to their portability, system designers and stream programmers have to think carefully before choosing the correct solution. E.g. A light weight input event rate job could be easily processed using S4 with few amount of PEs (E.g. Twitter application run on S4). However a large scale application with high commercial importance such as the VWAP and CDR might produce millions of PEs since S4 dynamically generates PEs for each new data events its receives. By analyzing the throughput and profiling results we observed that properly designed stream applications result in high throughput. Another conclusion we arrived at is that choice of a stream processing system need to be made carefully considering factors such as performance, platform independence, size of the jobs. Furthermore we understood the importance of key role played by operating system kernel in stream processing system’s performance. While we observed heavy use of network bandwidth by S4, by using optimized protocols and techniques such as Java New I/O, InfiniBand Remote Direct Memory Access the conditions could be improved. Creating a stream processing system architecture that scales in terms of the number of PEs is a further work that was inspired by this work. In future we hope to extend this work to a micro level performance study on S4, specially to identify which components, code segments are most resource intensive. Moreover we hope to implement Linear Road Benchmark on System S and S4 and observe and characterize the systems’ performance. R EFERENCES [1] D. Turaga, H. Andrade, B. Gedik, C. Venkatramani, O. Verscheure, J. D. Harris, J. Cox, W. Szewczyk, and P. Jones, “Design principles for developing stream processing applications,” Software: Practice and Experience, Aug 2010. [2] B. Gedik, H. Andrade, K.-L. Wu, P. S. Yu, and M. Doo, “Spade: the system s declarative stream processing engine,” in SIGMOD ’08. New York, NY, USA: ACM, 2008, pp. 1123–1134. [3] L. Neumeyer, B. Robbins, A. Nair, and A. Kesari, “S4: Distributed stream computing platform,” in KDCloud 2010, December 2010.

A Performance Study on Operator-based Stream ...

ios) on System S and S4 gave us sufficient insight to what kind of processing happen in the both the systems. It became clear from the throughput comparisons ...

77KB Sizes 0 Downloads 223 Views

Recommend Documents

A Performance study on Operator-based stream ...
Operator-based stream processing systems. Miyuru Dayarathna, Souhei Takeno, Toyotaro Suzumura. Department of Computer Science. Tokyo Institute of ...

A Performance study on Operator-based stream processing systems
Department of Computer Science ... It is impossible to store data on disk. ◦ The volume of the data is very large. Process data on-the-fly in-memory. OP. 1. OP.

A comparative study on engine performance and emissions of ...
Page 1 of 7. Indian Journal of Engineering & Materials Sciences. Vol. 21, August 2014, pp. 438-444. A comparative study on engine performance and emissions of biodiesel and JP-8. aviation fuel in a direct injection diesel engine. Hasan Yamika. , Hami

Performance Main Stream Enthusiast
PV-T73P-UDS(*). GF 7600 GS. --. 500. 128bit DDR2. 256MB. 900MHz. Heatsink. S-Video, Dual DVI, HDTV. Standard. Yes. Yes. PV-T73P-UDJ(*). GF 7600 GS.

Performance Main Stream Enthusiast
Heatsink. S-Video, Dual DVI, HDTV. Standard. Yes. Yes. PV-T73P-YDJ(*). GF 7600 GS. --. 400. 128bit DDR2. 512MB. 533MHz. Fansink. S-Video, VGA, DVI, ...

Watershed: A High Performance Distributed Stream ...
each node with an Intel R Core. TM. 2 CPU 6420 @2.13GHz equipped with ... tory landing without abrupt degradation. The graph also shows that Watershed is ...

An Experimental Study on Basic Performance of Flash ...
The simulator is expected to be effective to design flash-based database ... calculated the trend line for each data series. The ... RAID 0, 1, 5 and 10. Seagate ...

Experimental Study on Mobile RFID Performance - Springer Link
a realistic mobile setting. ... understanding about the reading performance for mobile scenario. ..... ysis of energy consumption for ISO 18000-7 RFID networks.

Study on hydraulic performance of drip emitters by ...
the experimental data. To study ... velocity of the flow, R (m) the hydraulic radius of flow, and m .... number of simulation, and m is the total number of data (m = 9.

A Survey on Data Stream Clustering Algorithms
The storage, querying, processing and mining of such data sets are highly .... problems, a novel approach to manipulate the heterogeneous data stream ...

all eyez on me stream ...
... the apps below to open or edit this item. all eyez on me stream german__________________________________________.pdf. all eyez on me stream ...

Global Winds, Jet Stream, Gulf Stream, El Nino and La Nina Study ...
Global Winds, Jet Stream, Gulf Stream, El Nino and La Nina Study Guide.pdf. Global Winds, Jet Stream, Gulf Stream, El Nino and La Nina Study Guide.pdf. Open.

A Study on Double Integrals
This paper uses the mathematical software Maple for the auxiliary tool to study two types of ... The computer algebra system (CAS) has been widely employed in ...

DataCell: Building a Data Stream Engine on top of a ...
future to avoid duplicate notifications and (b) it adds result tuples to the output ... baskets meaning that each query can be processed indepen- dently at the ...

a blood pledge stream ...
... the apps below to open or edit this item. a blood pledge stream deutsch_________________________________________.pdf. a blood pledge stream ...

A note on performance metrics for Speaker ... - Semantic Scholar
Jun 9, 2008 - regardless of the (analysis) condition it happens to be part of. .... of hard decisions is replaced by a log-error measure of the soft decision score.

A note on performance metrics for Speaker ... - Semantic Scholar
Jun 9, 2008 - performance evaluation tools in the statistical programming language R, we have even used these weighted trials for calculating Cdet and Cllr, ...