Bandwidth usage distribution of multimedia servers ...

Viewer
Transcript

Computer Networks 51 (2007) 569–587 www.elsevier.com/locate/comnet

Bandwidth usage distribution of multimedia servers using Patching q Carlo K. da S. Rodrigues *, Rosa M.M. Lea˜o Federal University of Rio de Janeiro, COPPE/PESC, CxP 68511, Rio de Janeiro RJ 21941-972, Brazil Received 28 June 2005; received in revised form 5 March 2006; accepted 12 May 2006 Available online 13 June 2006 Responsible Editor: J. Misic

Abstract Several multicast bandwidth sharing techniques have been proposed in the literature to provide more scalability to multimedia servers. These techniques are often analyzed in terms of the average bandwidth requirements they demand to satisfy client requests. However, average values do not always provide an accurate estimate of the required bandwidth. Therefore, they cannot be the only parameter used to guarantee a certain level of quality to the clients. In this work we propose a simple analytical model to accurately calculate the distribution of the number of concurrent streams or, equivalently, the bandwidth usage distribution considering the popular Patching technique. We show that the distribution may be modeled as a binomial random variable in the single object case, and as a sum of independent binomial random variables in the multiple object case. Through simulation we validate our results. Moreover, we also illustrate how these results may be practically used for instance (i) to allocate bandwidth to provide a given level of QoS, (ii) to estimate the impact on QoS when some system parameters dynamically change, and (iii) to conﬁgure the overall system. 2006 Elsevier B.V. All rights reserved. Keywords: Patching; Multicast; Multimedia; Bandwidth; QoS

1. Introduction Multimedia servers for applications such as distance learning and video on demand have been the

q

This work is supported in part by grants from CNPq and Faperj. * Corresponding author. Tel.: +55 21 2562 8664; fax: +55 21 2562 8676. E-mail addresses: [email protected] (C.K.d.S. Rodrigues), [email protected] (R.M.M. Lea˜o).

focus of recent studies in the literature. In particular, considerable attention has been given to the issue of providing scalability to these servers. The easiest way to service client requests in such applications is to schedule unicast data streams, one for each client request. The multimedia server bandwidth increases linearly with the client arrival rate since clients arrive and presumably stay in the system for some reasonable time (i.e., it increases linearly with the number of concurrent clients in the system). This increase rate notably precludes

1389-1286/$ - see front matter 2006 Elsevier B.V. All rights reserved. doi:10.1016/j.comnet.2006.05.004

570

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

the use of this service for a very large number of clients and the study of bandwidth sharing techniques becomes essential to provide scalability. Bandwidth sharing techniques may be classiﬁed in multicast or periodic broadcast delivery techniques. Multicast techniques (e.g., [1–3,5–10,12]) are reactive in the sense that data is delivered in response to client requests. Most of them provide immediate service and save server bandwidth by avoiding unnecessary transmission of data. Periodic broadcast delivery techniques (e.g., [13–21]) cyclically transmit object segments in a proactive way. They can guarantee a maximum start up delay. The idea is to divide the media object into a series of segments and broadcast each segment periodically on dedicated server channels. While the client is playing the current object segment, it is guaranteed that the next segment is downloaded on time and the whole object can be played out continuously. Besides multicast and periodic broadcast, an orthogonal scheme for reducing the load on network and video server is video caching (see [44] and references within). In this scheme a set of proxies are strategically placed close to the clients. Another approach to address the scalability issue in video applications is peer-to-peer streaming (see [46] and references within). In this approach clients can serve other clients by forwarding the video stream. Patching [2] is a multicast technique and has been used in recent works [22,45,23–28]. For example, in [22] Patching is used to provide a peer-to-peer video streaming service, and in [45] it is integrated with proxy caching for media distribution. This technique is especially attractive because of its extreme simplicity and competitive eﬃciency in terms of bandwidth savings. The work of [29] for instance evaluated the complexity of several multicast delivery techniques. The analysis is based on the information the server has to keep and the work the server has to perform per client request. One of the main results they obtained is that Patching is one of the simplest schemes. And it was shown in the work of [7,9] that, for reasonably popular media objects, the average bandwidth requirements of Patching are very similar to those of much more complex techniques. It is quite common to evaluate the eﬃciency of multicast techniques in terms of the average bandwidth requirements they demand to satisfy the client requests arriving in the system. A discussion about average bandwidth requirements may be found in [5–7,30–33]. However, average values do not always

provide an accurate estimate of bandwidth requirements and therefore cannot be used solely to guarantee a given level of QoS to the clients. Tam and co-workers [29] were the ﬁrst to measure the performance of multicast techniques using other metrics such as the maximum bandwidth and the bandwidth distribution. The authors have focused on three merge techniques: the dynamic Fibonacci tree [9,34], the Dyadic [10], and the earliest reachable merge target (ERMT) [7]. Their analysis was all based on simulations. The quality of service of multicast techniques was also evaluated in [28,33]. De Souza e Silva et al. [28] proposed an algorithm to compute the distribution of the bandwidth required by the Patching technique. Their analytical model estimates the distribution of the total requested bandwidth in one window of Patching. They showed that the requested data streams in a window are a good estimator of the data streams actually transmitted in a window as t ! 1. The computational cost of their algorithm is O(P2), where P is a function of the estimated error and is usually not very large. Their results indicated that if the server conﬁguration is based on the average bandwidth required by each object, the probability of requiring more than the average value is large for a wide range of system parameters. In [33] approximate analytical models were developed to analyze the quality of service of the following schemes: Patching, Hierarchical Stream Merging (HSM) [7] and Bandwidth Skimming [8]. The authors used a closed queueing network model to estimate two performance measures: the mean client waiting time and the fraction of clients who leave without receiving service when the server is temporarily overloaded (i.e., the balking rate). The performance measures were computed from the iterative solution of a set of equations. Their results showed for instance that if server conﬁguration is based on the average bandwidth there may be unacceptably high client balking rates. We may see that the analytical evaluation of the Patching mechanism done so far is based on the solution of one equation [28] or a set of equations [33], i.e., there is not a closed-form solution for the bandwidth usage distribution that may be practically employed. Moreover, in [28] they only consider the single object case, and in [33] they only analyze the multiple object case. In this paper we propose a simple analytical model to accurately calculate the distribution of the number of concurrent streams or, equivalently,

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

the bandwidth usage distribution for the Patching technique. We consider both the single and the multiple object cases. We show that the distribution may be modeled in the form of a closed-form solution as a binomial random variable in the single object case. In the multiple object case, it can be represented as a sum of independent binomial random variables, which may be easily implemented using Fast Fourier Transform [38]. Through simulation we validate our results. Moreover, we illustrate how our results may be practically used to (i) allocate bandwidth to provide a given level of QoS, (ii) estimate the impact on QoS when some system parameters dynamically change, and (iii) conﬁgure the overall system. The remainder of this paper is organized as follows. Section 2 introduces some basic concepts and terminology. Section 3 is dedicated to the presentation of our analytical model. Analytical and simulation results are presented in Section 4. Lastly, conclusions and ongoing work are included in Section 5. 2. Basic concepts and terminology Consider a multimedia server and a group of clients receiving object streams (e.g., a ﬁlm, a video clip, etc.) across the Internet from this server. The paths from the server to the clients are multicast enabled. Multicast is not currently deployed over the global Internet. However, this can be alleviated through the use of proxies which can bridge unicast and multicast networks. One of the most common proposed architectures consists of a unicast channel from the server to the proxy and multicast channels from the proxy to the clients. Since, in general, the proxy and the clients are in the same local area network, multicast service can be easily implemented. Solutions like application level multicast using tunnelling have also been used over the public Internet [29,42,43]. Still assume that Patching is used to provide immediate service, clients always request playback from the beginning of the object and watch it continuously till the end without any interruptions (i.e., sequential access), and client bandwidth is twice the object play rate. All these assumptions are considered in the original model of Patching presented in [2]. Also, consider that the client buﬀer is large enough to store a portion of the object size (to allow the stream synchronization). This supposition is based on the results presented in the works of

571

Table 1 Key parameters kj Dj Wj Nj

Bj dj Tj wj u

Client arrival rate for object Oj Total length of object Oj in units of time Threshold window for object Oj in units of time Average number of requests for object Oj that arrive during the period of length Dj. It is referred as object popularity and is given by Nj = kjDj Average server bandwidth used to deliver object Oj using the Patching technique in units of the object play rate Unit length used for object Oj Total length of object Oj in units dj Threshold window for object Oj in units dj Total number of objects in the server

[30,4,5]. These works show that, since the client bandwidth is twice the object play rate, in the worst case, the client will listen to two concurrent streams during an interval equal to half of the object size. However, since it is possible to compute an optimal threshold window [30,31], we may relax this initial supposition for the client buﬀer being half of the object size and set it to the optimal threshold window. Lastly, Table 1 summarizes some notation used in the rest of this work. The Patching technique operates as follows [2]. The server begins to deliver a full-object multicast stream upon the arrival of the ﬁrst client. The following clients who request the same object and arrive within a certain time interval, denoted as threshold window, retrieve the stream from the multicast channel (buﬀering the data) and obtain the missing initial portion, denoted as patch, directly from the server over a unicast channel; for clients arriving after the threshold window, a new session is initiated and the process restarts again. Both the multicast and patch streams deliver data at the object play rate so that each client can play the object in real time. Similar models have been proposed in [30,31] to estimate the average server bandwidth for the Patching technique. They assumed a Poisson client arrival process with rate kj for the media object Oj, where j = 1, . . . , u and u is the total number of objects in the server. The average server bandwidth for delivering object Oj is [30,31]: kj W 2

Dj þ 2 j Bj ¼ ; W j þ k1j

ð1Þ

where Dj and Wj are the length and the threshold window, respectively, for object Oj.

572

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

The denominator of Eq. (1) is the average time that elapses between successive full-object multicasts, i.e., the duration of the threshold window plus the average time until the next client request arrives. The numerator is the expected value of the sum of the transmission times of the full-object and the patch streams that are initiated during the interval. Note that the average number of patches that are started before the threshold window expires is kjWj, and the average duration of a patch is Wj/2 [35]. Now diﬀerentiating Eq. (1) with respect to Wj and setting the result to zero gives pﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ 2N j þ 1 1 W j;optimal ¼ : ð2Þ kj As already mentioned, the client buﬀer can be equal to the optimal threshold window Wj,optimal, instead of half of the object size. Substituting (2) into Eq. (1) yields the following result for the average server bandwidth for the optimized Patching technique: pﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ Bj;optimized ¼ 2N j þ 1 1; ð3Þ where Nj = kjDj is the average number of client requests that arrive during Dj. The parameter Nj is often denoted as the object popularity. For simplicity, we will refer to the optimized Patching technique as the Patching technique, and we will denote Wj,optimal and Bj,optimized as Wj and Bj, respectively. When the object popularity Nj is above (below) a certain threshold value, periodic broadcast delivery techniques achieve more (less) bandwidth savings than multicast ones [14]. Thus, it is reasonably to classify media objects in accordance with popularity in order to choose the most adequate type of delivery technique [30,14]. This is usually done as follows [8]: for hot objects (i.e., Nj P 100), use periodic broadcast techniques; for cold objects (i.e., Nj < 10) and lukewarm objects (i.e., 10 6 Nj < 100), use multicast techniques. Since Patching is a multicast technique, we mainly focus on Nj values in the range of 10–100. 3. Bandwidth usage distribution In this section, we present our analytical model and derive the distribution of the number of concurrent streams needed by the Patching technique, or equivalently, the server bandwidth usage distribution. We initially address the case of the single object and then the case of multiple objects.

3.1. Single object Consider a server implementing the Patching technique and that client request arrivals follow a Poisson process. Let C(t) be the stochastic process which denotes the number of Patching windows opened in the interval (0, t). It thus directly follows that C(t) is a renewal process [35] whose renewal points correspond to the instants at which the server schedules a new full-object multicast stream, i.e., the start of a new window. Moreover, we have that a cycle of C(t) has an average duration of Wj + 1/kj = Wj + Dj/Nj, and then this is the base interval we consider in the analysis to follow. Now assume that the object Oj is divided into time units of length dj. For example, a two-hour object can be divided into 20 units of 6 min each, or divided into 100 units of 72 s each. Now let Tj = Dj/dj and wj = Wj/dj. Then, we have that Tj and wj are, respectively, the object size and the window size measured in number of discrete units of length dj. Also consider that if there is at least one client arrival in a time unit, the server initiates a stream (patch or full-object multicast) at the beginning of the next time unit to serve this client. We stress that the value of dj can be as small as desired to obtain a very small probability of more than one arrival in the same time unit and a guarantee of immediate service. Note that since the media object and time in our model are divided into units of length dj, the base interval of our analysis Wj + Dj/Nj may alternatively be rewritten as wj + Tj/Nj and Eq. (2) as pﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ Tj 2N j þ 1 1 : ð4Þ wj ¼ Nj The interval wj + Tj/Nj is composed by a total of wj + Tj/Nj units of length dj. Hence, this interval can still be expressed in the following form: [1, wj + Tj/Nj]. Let us now focus on each one of its wj + Tj/Nj units individually in order to identify a distribution for the number of concurrent streams in each one of them. Once we obtain these individual distributions, we try to identify a unique distribution that may be deployed for the entire base interval of analysis [1, wj + Tj/Nj]. Note that Oj may be any object in the multimedia server, then we drop the subscript j from now on. Consider the kth interval [1, w + T/N] of the process C(t) (see Fig. 1). In this interval we have transmissions due to full-object multicasts and also due to

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

4-th unit w-th unit patch=w arrivals patch=4 w-d w (k-1) interval

Time

w-T/N

d T/N

w

T/N

k interval

Fig. 1. Patches in two consecutive intervals: (k 1) and k.

patches. Then, we have that the number of concurrent streams in each unit of length d of the kth interval is given by a multicast component plus a patch component. Let us denote the multicast component as MC. We consider MC as a constant valuepand estimate ﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ ﬃ it using Little’s result [36]: M C ¼ N = 2N þ 1 (i.e., the average number of multicast streams is given by the arrival rate times the duration of the media object). Although the real number of multicast streams is not constant due to the possible variation of the arrival rate k (and consequently of N), MC is a good approximation since changes on the arrival rate k, within certain limits, do not severely impact on the value of MC. Consider N 1, which refers to the scenarios we are interested in. In this case pﬃﬃﬃﬃﬃﬃ ﬃ MC p can be approximated by M ¼ N = 2N ¼ C ﬃﬃﬃﬃ 0:707 N . Thus we have that the number of multicast streams increases approximately with the square root of the arrival rate. The patch component is associated to client arrivals and we estimate it as a random variable. Our goal now is to determine the distribution of this random variable. Let us examine two successive intervals: the (k 1)th and kth intervals. We want to determine how many units of the kth interval can transmit a patch initiated in the (k 1)th interval. Note that we only need to consider the (k 1)th interval because the patches generated in other previous intervals will ﬁnish before the kth interval starts. Consider that the kth interval is divided into two sub-intervals: [1, w T/N] and [w T/N + 1, w + T/N]. In the former we may have transmissions initiated in the (k 1)th interval, while in the latter we do not have any transmission initiated in the previous interval. Fig. 1 shows a scenario where an arrival occurs in the last unit of the (k 1)th interval. This arrival generates the longest possible patch which is equal to w. We note that the number of units in the kth interval, transmitting a patch stream initiated in the (k 1)th interval, decreases as the value of T/N increases. In the limit, when

573

T/N = w (see Eq. (4)), all patches are initiated and transmitted in the same interval. Note that we do not need to consider scenarios where T/N > w because this condition occurs only for N = 1 (see Eq. (4)). Bandwidth sharing techniques are mostly useful when N > 1. We mainly focus on the values of N in the range of 10–100, as already mentioned. We proceed in three steps to obtain the distribution of the number of concurrent patch streams in each unit of the kth interval. We ﬁrst compute the number of concurrent streams in the ith unit of the (k 1)th and kth intervals assuming that there is a client arrival in each unit of the window w of the (k 1)th interval. In the second step we compute the number of concurrent streams in the ith unit of the kth interval considering that there is a client arrival in each unit of both windows w of the (k 1)th and kth intervals. Finally, in the third step, we analyze the impact of the Poisson client request arrival process. Let us consider the ﬁrst step. As just mentioned, we only observe the patches started in the (k 1)th interval and assume that there is a client arrival in each unit of the window w of this interval. Fig. 1 shows that an arrival in the jth unit of the interval generates a patch of length j units. Then, there is a patch in the ith unit of the (k 1)th interval if, and only if, 2j P i, j = 1, 2, . . . , i 1, and a client arrival occurs in the jth unit. We can also see from Fig. 1 that the longest possible patch initiated in the (k 1)th interval starts at w + 1 and ends at 2w. Therefore the interval we have to evaluate is [1, 2w]. We denote by mi,w the number of concurrent streams in the ith unit of the interval [1, 2w], where i = 1, 2, . . . , 2w. The following lemma determines an expression for mi,w. Lemma 1. Considering one client arrival in each unit of the interval [1, w] and i = 1, 2, . . . , 2w, the number of concurrent patch streams mi,w in the ith unit of the interval [1, 2w] is given by • Case 1: i 2 [1, w]. i1 X i1 mi;w ¼ 1f2j P ig ¼ : 2 j¼1 • Case 2: i 2 [w + 1, 2w]. w X 2w ði 1Þ mi;w ¼ 1f2j P ig ¼ ; 2 j¼1

ð5Þ

ð6Þ

where 1 denotes the indicator function, i.e., 1{x} = 1 if x is true, and 1{x} = 0 if x is false.

574

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

Proof. See Appendix A. h So far we obtained expressions to calculate mi,w, i.e., the number of concurrent streams in the (k 1)th and kth intervals considering the patches exclusively initiated in the (k 1)th interval. We now present the second step of our analysis. Let li,w be the number of concurrent streams in the kth interval [1, w + T/N], considering the patches initiated in this interval as well as in the previous one. We show that li,w can be easily computed by properly superposing the expressions obtained in Lemma 1. Fig. 2 illustrates the superposition of two successive intervals: the (k 1)th and kth intervals. From this ﬁgure we can see that the number of concurrent patches in the ﬁrst unit of the kth interval is equal to m1,w + mw+T/N+1,w, in the second unit it is equal to m2,w + mw+T/N+2,w, and so on. The ﬁnal formulation for li,w is presented in Lemma 2. Lemma 2. Considering one client arrival in each unit of both windows w of the (k 1)th and kth intervals and i = 1, 2, . . . , w + T/N, the number of concurrent streams li,w in the ith unit of the kth interval is given by • Case 1: i 2 1; w NT 8 T < wN þ 1; if ðw T Þ and i are even; N 2 li;w ¼ lw T m N : ; otherwise: 2 ð7Þ

• Case 2: i 2 w NT þ 1; w i1 : li;w ¼ 2

ð8Þ

• Case 3: i 2 w þ 1; w þ NT 2w ði 1Þ li;w ¼ : 2 Proof. See Appendix B.

h

We now proceed to the third step. Assume that client arrivals are represented by a Poisson process. Let Xi be a random variable that denotes the number of concurrent streams due to patches in the ith unit of the interval [1, w + T/N], where i = 1, . . . , w + T/N. Consider the scenario of Fig. 3 and let us examine the possible values taken, for example, by X4. To simplify the ﬁgure we do not represent patches eventually generated in the previous interval. X4 may be equal to 0, 1 or 2, depending on the client arrivals in the previous units. X4 = 1 when there is one arrival in the second or in the third unit (see Fig. 3(a) and (b), respectively). In Fig. 3(c) we have the case where X4 = 2, i.e., when there is an arrival in both the second and third units. Lastly, X4 = 0 when there are arrivals neither in the second nor in the third units. Note that we do not need to make assumptions about the ﬁrst unit because only the second and third units of this interval generate patches long enough to extend over the fourth unit. The following theorem determines the probability distribution function of Xi. Theorem 1. Xi has binomial distribution with parameters p = 1 eN/T and ni,w, i = 1, . . . , w + T/N. The parameter ni,w is given by 1. Case 1: w ¼ NT , then: 8

< i1 ; if i 2 ½1; w; 2 ni;w ¼ l2wði1Þm : ; if i 2 w þ 1; w þ NT : 2 ð10Þ

w T/N

w (k-1)-th interval

1

mw+T/N+1,w

m2w,w w

w T/N

k-th interval m1,w

mw-T/N,w

T 2. Case 2: w > N ,then: • i 2 1; w NT , then: 8 w T < 2 N þ 1; if w NT and i are even; ni;w ¼ l T m : wN ; otherwise: 2

w-T/N k-th interval with patches initiated in the (k-1)-th and k-th intervals

mw+T/N+1,w + m1,w

ð9Þ

ð11Þ

m2w,w + mw-T/N,w

mw-T/N+1,w

w+T/N Fig. 2. Superposition of the (k 1)th and kth intervals.

• i 2 w NT þ 1; w , then: i1 ni;w ¼ : 2

ð12Þ

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

575

patch patch

patch

arrival

patch

arrival

1 2 3 4 w + T/N (a)

w

arrivals

1 2 3 4 5 6

1 2 3 4 5 6

w

w + T/N (b)

w

w + T/N (c)

Fig. 3. Concurrent streams in the fourth unit of an interval [1, w + T/N]: (a) one arrival in the second unit, (b) one arrival in the third unit, (c) one arrival in the second and third units.

• i 2 w þ 1; w þ NT , then: 2w ði 1Þ ni;w ¼ : 2

ð13Þ

Proof. As we know, the interval [1, w + T/N] is divided into units of length d. Let A be a random variable which represents the number of arrivals in one unit of the interval [1, w + T/N]. We have that P[A > 0] = 1 eN/T since the arrivals follow a Poisson process and k is normalized with regard to the value of d. Then we have a sequence of units, each one with probability p = 1 eN/T of success (where success means at least one arrival in the unit). The total number of units which can generate a patch to be transmitted in the ith unit is given by • Case 1: w ¼ NT All patches generated in an interval are transmitted in the same interval, then ni,w is given by Eqs. (5) and (6) obtained in Lemma 1. • Case 2: w > NT Some patches generated in an interval can extend over the next interval. In this case ni,w = li,w obtained in Lemma 2. Thus, it follows that Xi has a binomial distribution with parameters ni,w and p = 1 eN/T, where i = 1, . . . , w + T/N. h Xi is a random variable that denotes the number of concurrent streams due to patches in the ith unit of the interval [1, w + T/N]. Note that the random variables X1, X2, . . . , Xw + T/N have the same parameter p = 1 eN/T while the parameter ni,w depends on the value of i and w. Then, we have a binomial distribution with a diﬀerent parameter in each of the units of the interval. We want to determine a unique distribution to model the number of concur-

rent streams due to patches in the base interval of analysis [1, w + T/N]. Let us examine the values of ni,w for the Case 2 of Theorem 1. We analyze this case because w is larger than T/N for the situations we are interested in, i.e., 10 6 N 6 100 (see Eq. (4) for reference). From the equations of Case 2 we observe that the computation of ni,w varies according to the sub-interval. For the ﬁrst sub-interval, i.e., [1, w T/N], ni,w does not depend on i and can assume at most two values. For the two other sub-intervals, ni,w depends on i and may possibly assume more than two values. To illustrate this aspect consider T = 104 (recall that w = W/d and T = D/d). We calculate the proportion of the interval [1, w + T/N] corresponding to the ﬁrst subinterval [1, w T/N] and the two other sub-intervals together [w T/N + 1, w + T/ N]. Table 2 shows the results obtained for N = 10, 25, 50, 100, 200, 400, 700. It can be seen that the percentage of the ﬁrst sub-interval (where ni,w assumes at most two values) increases as the value of N increases. For example, Fig. 4(a) shows P[Xi = k] for N = 50, T = 103 and k = 1, 2. The value of P[Xi = k] is constant l for mthe sub-interval [1, w T/N] since ni;w ¼ wT2 =N for these units. In Fig. 4(b) the value of P[Xi = k], for N = 100,

Table 2 Size of the sub-intervals for diﬀerent values of N N

[1, w T/N]

[w T/N + 1, w + T/N]

10 25 50 100 200 400 700

56% 72% 80% 86% 90% 93% 95%

44% 28% 20% 14% 10% 7% 5%

576

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

0.18

0.06

0.16

k=2

k=2

0.05

0.14

P[X i =k]

P[X i =k]

0.04 0.12 0.1

0.03

k=1

0.08

0.02

k=1

0.06 0.01 0.04 50

(a)

100

150

200

i

(b)

20

40

60

80

100

120

140

i

Fig. 4. P[Xi = k] for the interval [1, w + T/N]: (a) N = 50, T = 1000; (b) N = 100, T = 1000.

T = 103 and k = 1, 2, oscillates between two values for the sub-interval [1, w T/N] (ni,w is computed from Eq. (11)). In both ﬁgures, at the end of the interval, we see that P[Xi = k] shows more variability. As already explained, this behavior is due to the larger variability of the parameter ni,w in the subinterval [w T/N + 1, w + T/N]. Based on the analysis presented above, we approximate the distribution of the number of concurrent patches in the interval [1, w + T/N] by a binomial distribution with parameters p = 1 eN/T and the expected value of ni,w. Note that the larger N is, the more accurate our approximation tends to be since the variability of the parameter ni,w tends to reduce. More formally, let X be a random variable which represents the number of concurrent patches in the interval [1, w + T/N]. We thus approximate the distribution of X by a binomial distribution with parameters p = 1 eN/T P wþT =N

and nw ¼

i¼1

ni;w

wþT =N

.

Finally, we have to consider the full-object multicasts to obtain the distribution of the total number of concurrent streams required by Patching. It is the sum of the concurrent patches and the concurrent full-object multicast streams. Let Z denote the random variable which represents the total number of concurrent streams. Z has the same distribution of X shifted to the right by MC. As previously mentioned, the number of concurrent full-object pﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ multicasts is approximated by M C ¼ N = 2N þ 1. Thus it follows that P(Z 6 MC 1) = 0. The probability distribution function of Z is given by Eq. (14). We note that both X and Z only depend on the parameters N and T of the object O

8 0; > > > > > < 06 k 6 v 1; k nw P ðZ 6 kÞ ¼ P > ð1 eN =T Þl ðeN =T Þnw l ; > > l > j¼v > : v 6 k 6 nw þ v; PwþT =N

ni;w

ð14Þ pﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ v ¼ M C ¼ N = 2N þ 1,

where nw ¼ , wþT =N and l = j v. Our model is closer to the original continuous model of Patching as the value of d decreases. In the limit, when d ! 0, our model becomes identical to the original Patching model since users have immediate service. Now, we deﬁne two criteria to estimate a value for d. The ﬁrst is based on the probability of more than one arrival in a unit of length d, and the second is based on the relative error between the expected value of Z and the average bandwidth B computed from Eq. (3). Fig. 5 shows the probability of more than one arrival in a unit of length d for several values of N and T. Recall that T = D/d. From this ﬁgure we note that, for T P 104, the probability of more than one arrival is smaller than 104 for all values of N. Let us now evaluate the second criterion. The expectation of Z is given by pﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ E½Z ¼ nw ð1 eN =T Þ þ N = 2N þ 1: ð15Þ i¼1

We deﬁne the relative error between E[Z] and B as EMB ¼ jBE½Zj . In Fig. 6, we plot the relative error as B a function of T for N = 10, 25, 40, 75 and 100. We also show the complementary cumulative distribution function (CCDF) of Z for N = 100 and several values of T. As already expected, the value of the

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587 0.01 3

0.001

T=10

0.0001

T=104

P[A>1]

1e-05 1e-06

T=105

1e-07 1e-08

T=106

1e-09 1e-10 1e-11 10

20

30

40

50

60

70

80

90

100

N

Fig. 5. Probability of more than one arrival in a unit of length d.

relative error decreases as the value of T increases. However, we note that the relative error has no signiﬁcant reduction for T > 104. Moreover, the CCDFs of Z are practically the same for T P 104.

577

Assume that there are n diﬀerent objects stored in the multimedia server. These objects have lengths D1, D2, . . . , Dn, respectively. We may compute the corresponding values T1, T2, . . . , Tn using the two criteria previously deﬁned (see last subsection). Notice that T = max{T1, T2, . . . , Tn} may be used for all these n objects. This is because the larger T is, the more accurate our model becomes, as already observed. We emphasize that to consider T as a constant and the same for all objects stored in the server does not mean that the objects must have a same length D, as we explain in the following. We know that T = D/d (see Table 1 for reference). Since we propose an analytical model and have a steady-state analysis, the value of D does not aﬀect the real distribution. For example, in Fig. 7 we plot the CCDF curves (obtained via simulation) for an object of popularity N = 5 and considering D = 100 min, 200 min, 500 min. We may see that the curves are the same. Similar behavior is observed

3.2. Multiple objects 1.2

A multimedia server usually services client requests for more than one object simultaneously and so the bandwidth usage distribution in this case plays an important role in the overall system conﬁguration. In the last subsection, we show that the random variable Z, which represents the distribution of the number of concurrent streams for a single object, depends on two parameters: N and T. Note that we can consider T equal to a constant value computed in accordance with the two criteria previously deﬁned (see last subsection). If T is set to a constant, the random variable Z depends exclusively on the parameter N.

1

P[ Z > k ]

0.8

N=5 and D=100 min, 200 min, 500 min

0.6 0.4 0.2 0 0

5

10

15

20

k

Fig. 7. Distribution of Z for N = 5 and diﬀerent values of D.

0.1 1 N=40

N=10

N=25

0.8 5

0.01 N=100

P[ Z > k ]

EMB

T=10 , 2x10 N=75

5

0.6 3

T=10

0.4 0.2

4

T=10 0.001 1000

(a)

T

0 0

10000

5

10

k

(b) Fig. 6. Analysis of the value of T: (a) relative error, (b) CCDF of Z.

15

20

25

578

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

for other diﬀerent values of D and a same N in the interval [10, 100]. Still we consider that the n objects stored in the multimedia server have the same play rate. This is a quite common assumption of previous work (e.g., [29,33,18,15]) and is mainly supported by recent researches (e.g., [41,11]) which analyze real media server workloads. Let YN be the random variable that denotes the number of concurrent patches due to the delivery of all objects of popularity N, N 2 {1, . . . , Nmax}, where Nmax is the highest object popularity value in the server. Once we consider T as a constant and the same for all objects stored in the server, we have that YN is a sum of identical binomial random variables, p = 1 eN/T and lPwith parameters m wþT =N ni;w nN ;w ¼ mN , where mN denotes i¼1 wþT =N the total number of objects of popularity N in the server [37]. Now we want to estimate the total required bandwidth due to patches in the multimedia server. Let Y be the random variable that denotes the total number of concurrent patches due to the delivery of all objects in the server. Then, Y is the sum of potentially Nmax binomial random variables with diﬀerent parameters. There is no closed-form solution for the distribution of this sum. However the distribution of Y may be calculated based on the convolution of the individual probability mass functions (PMFs) of YN, for N = 1, . . . , Nmax. This convolution may be eﬃciently implemented with the Fast Fourier Transform [38]. The total number of concurrent streams is the sum of patches and full-object multicasts. The number of full-object multicasts is equal to f ¼ P N max mN N pﬃﬃﬃﬃﬃﬃﬃﬃ N ¼1 2N þ1. Let S be the random variable that represents the total number of concurrent streams due to the delivery of all objects in the server. The distribution of S is the distribution of Y shifted to the right by f. Similarly to the analysis in the single object case, the shift is because we assume that the total number of full-object multicast streams is a constant given by f, which corresponds to the sum of all individual multicast components related to the objects in the server. Thus it follows that P(S 6 f 1) = 0.

tion we obtained (i) to allocate bandwidth to provide a given level of QoS, (ii) to estimate the impact on QoS when some system parameters dynamically change, and (iii) to conﬁgure the overall system. We mention that the simulation results are obtained using the modeling environment Tangram-II [39,40], and have 95% conﬁdence intervals that are within 5% of the reported values. Client arrivals are generated according to a Poisson process and the media objects are 100 min long. Lastly, since Patching is a multicast technique, we mainly focus on the values of N in the range of 10–100. Tangram-II is an environment for computer and communication system modeling and experimentation developed at Federal University of Rio de Janeiro (UFRJ), with participation of UCLA/ USA, for research and educational purposes. It combines a sophisticated user interface based on an object oriented paradigm and new solution techniques for performance and availability analysis. The user speciﬁes a model in terms of objects that interact via a message exchange mechanism. Once the model is compiled, it can be either solved analytically, if it is Markovian or belongs to a class of non-Markovian models, or solved via simulation. There are several solvers available to the user, both for steady state and transient analysis. The organization of this section is detailed in the following. Section 4.1 shows the model accuracy and how to use the bandwidth distribution of a single object to dynamically reserve a given number of channels for a group of clients. In Section 4.2, we study the behavior of the bandwidth distribution of a single object considering diﬀerent values of the threshold window w for a given object popularity. We evaluate the required bandwidth when the server operates using a window that is not the optimal one and quantify the impact on QoS. This analysis is motivated by the fact that the computation of the optimal window to be used at time t depends on the accurate estimation of the arrival rate k at time t. Finally, Section 4.3 evaluates the use of the bandwidth distribution of multiple objects to compute the total server capacity to achieve a given level of QoS to the clients.

4. Results

4.1. Model validation and bandwidth reservation

In this section, we validate our analytical model using simulation and show for instance how we may practically use the bandwidth usage distribu-

To evaluate the accuracy of our model we set T = 104 and use two important criteria: the relative error and the mean squared error (MSE). As for the

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

ﬁrst criterion, we compare the average bandwidth B, computed from Eq. (3) (theoretical), with that computed from our model E[Z] (analytical – Eq. (15)). We calculate the relative error from Error ¼ jBE½Zj . Fig. 8 shows B and E[Z], and Fig. 9 presents B the obtained relative errors for 10 6 N 6 100. We may see that they lie between 102 and 103 and are thus satisfactory. In Table 3 we illustrate the parameters p and nw considering some values of N.

14 13 12 11

Band

10

theoretical band

9 8 analytical band

7 6 5 4 3 10

20

30

40

50

60

70

80

90

100

N

Fig. 8. Average bandwidth.

1

Error

0.1

0.01

579

As for the second criterion, we compute the MSE values for the single object P and multiple objects M

jð1F Z ðzÞÞð1F Z ðzÞÞj2

s i¼1 cases from MSE and M PM single ¼ jð1F S ðsÞÞð1F S s ðsÞÞj2 MSEmultiple ¼ i¼1 , respectively, M where M is the number of samples and the subscript s of the random variables Z and S is used to refer to simulation results. Recall that Z and S represent the number of concurrent streams for the single object and multiple object cases, respectively. Fig. 11 shows the complementary cumulative distribution function (CCDF) of the number of concurrent streams considering N = 10, 25, 40, 50, 75 and 100. In this ﬁgure, we have both analytical and simulation results (solid and dotted lines, respectively). And in Fig. 10 we have the computed MSE values for 10 6 N 6 100. Note that the values of MSE are all smaller than 103. This indicates that the analytical results are satisfactorily close to those obtained using simulation. From Figs. 10 and 11 we can observe that the accuracy of the model tends to increase with the value of N. In other words, the proposed model tends to be more accurate for the most popular objects, i.e., the objects that will signiﬁcantly impact on the server bandwidth. We stress that this is a tendency that has been conﬁrmed by the overall results obtained in the experiments. Therefore we may not state that for any given value of N, the larger it is, the more accurate the model is. We recall that we use the expected value of ni,w to parameterize the binomial distribution. The parameter ni,w exhibits more variability in the sub-interval [w T/N + 1,w + T/N] as shown in Section 3.1. Then, the accuracy of the model is mainly aﬀected by the size of the sub-interval [w T/N + 1,w + T/N] and the

0.001 0.01

1e-04 10

20

30

40

50

60

70

80

90

100

N

Fig. 9. Relative errors for average bandwidth. MSE

0.001

Table 3 Values of parameters p and nw N

p

nw

10 25 40 50 75 100

9.99e04 2.50e03 3.99e03 4.98e03 7.47e03 9.95e03

1401 1057 890 816 692 613

1e-04

1e-05

0

20

40

60

80

N

Fig. 10. Mean squared error (MSE) values.

100

580

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

simulation analytical

1

0.8

P[Z > k]

P[Z > k]

0.8

N=25

0.6

N=40

0.4

0.2 N=10

0.2

0

0

5

10

15

20

25

k

N=75 N=50

0.6

0.4

(a)

simulation analytical

1

0

(b) 0

N=100

5

10

k

15

20

25

Fig. 11. CCDF of the number of concurrent streams for a single object: (a) N = 10, 25, 40; (b) N = 50, 75, 100.

simulation analytical

1 0.8

P[S > k ]

variability of ni,w in this subinterval. The size of the sub-interval [w T/N + 1, w + T/N] decreases with the value of N. However, the variability of ni,w does not always decrease with the value of N. For example, the coeﬃcient of variation of ni,w for N = 10 is smaller than that for N = 25. The behavior of the MSE is a consequence of the variability of ni,w. Fig. 11 also demonstrates that, in general, the analytical results underestimate the simulation ones. The explanation for this result is that we use average values to determine the multicast component (MC) as well as to calculate the parameter nw of the binomial distribution. Consider two binomial random variables Y1 and Y2 with the same parameter p (probability of success) and distinct parameters m1 and m2 (number of trials), where m1 < m2. We have that Y1 always underestimates Y2 [37]. In our model we approximate the number of concurrent patches by a binomial random variable X with parameters p and the expected value of ni,w. We recall that the number of patches in the ith unit of the interval [1, w + T/N] is represented by the random variable Xi with parameters p and ni,w. We thus have the condition stated above, i.e., the random variable X underestimates the random variables Xi with ni,w > nw. Similarly, the average value provided by MC underestimates some of the real instantaneous values of the multicast component. In Fig. 12 we have the CCDF for the multiple object case. In this example we considered the same six objects analyzed in Fig. 11. The analytical and simulation curves are quite close to each other, and the MSE is equal to 2.32e04 which is quite satisfactory. We may also note that this value is smaller than the values obtained for N 6 40. This occurs because although the bandwidth consumed

0.6 0.4 0.2 0

0

20

40

60

80

100

120

140

k

Fig. 12. CCDF of the number of concurrent streams for multiple objects.

for delivery of a given object may signiﬁcantly vary over time, the total bandwidth (i.e., the sum of individual bandwidths) to deliver a large number of objects simultaneously will have lower coeﬃcient of variation over time for independently requested objects and ﬁxed client request rate [6,37]. In the following we focus on applications concerning the single object case and leave the discussion of the multiple object case for Section 4.3. Using our analytical results we have the analysis that follows. With the distribution of the number of concurrent streams we can reserve bandwidth for a group of clients such that these clients may have a very low probability of not being immediately served. Assume, for example, that we reserve k channels in the server for a group of users retrieving object O such that P[Z > k] = 103, that is, the probability of the number of concurrent streams be larger than k is equal to 103. The value of k

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

4.2. Analysis for diﬀerent values of the threshold window Consider a server that dynamically estimates the value of k and then determines the threshold window to be used in accordance with the value of the optimal window (Eq. (2)). Suppose that the value initially obtained for N is equal to 75. From Fig. 13, the bandwidth needed in order to have P[Z > k] 102 is 17. Now assume that the value of k changes and the server does not immediately detect it. The server continues to operate with the optimal window determined for N = 75 and keeps the same previously reserved bandwidth. Considering this scenario, we would like to answer the two following questions: (i) What is the quality of service provided by the previously reserved bandwidth to the clients accessing the server now? (ii)

1 N=100

0.1

P[Z > k]

may be easily obtained from Table 4 for N = 25, 50, 100. This table shows the estimated bandwidth (EB), the probability of the number of concurrent streams to exceed the estimated bandwidth (P[Z > EB]), and the needed increase with respect to the average bandwidth (NI) for N = 25, 50, 100. The values of EB in the ﬁrst line of the table are equal to the average bandwidth computed from Eq. (3). We may conclude that if bandwidth reservation is based on the average bandwidth, the probability of the number of concurrent streams to exceed the reserved value is greater than 0.38 (Table 4) for all values of N. In others words, the probability that a client arrives and has to wait until the server may allocate a channel for him is equal to 0.38 for N = 50. This means that clients have a high probability of waiting until the server may allocate a channel to serve them. We note that for N = 25, approximately 50% of the clients will not be immediately served.

581

0.01 N=75 0.001 N=50 0.0001 1e-05 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 k

Fig. 13. Bandwidth usage distribution for diﬀerent values of N and the optimal window w deﬁned for N = 75.

What is the impact of operating with a threshold window that is not the optimal one? And what are the bandwidth requirements of this threshold window? Let us analyze the ﬁrst question. In our example we assume that N = 75, the reserved bandwidth is equal to k channels, and the threshold window is equal to the optimal value computed for N = 75. If the bandwidth and the threshold window are computed in accordance with a popularity equal to 75 and the arrival rate k increases (decreases), we can observe from Fig. 13 that the probability of the number of concurrent streams to be greater than the reserved value increases (decreases). This is an expected result, but using our model we can precisely quantify the QoS oﬀered to the users when their arrival rate changes and the server does not detect it. Suppose, for example, that the number of reserved channels is 17. We have P[Z > 17] = 6.8e03 for N = 75. If k decreases so that N = 50 and the server does not detect it, we have that P[Z > 17] = 1.1e04. Thus, the probability that

Table 4 Bandwidth evaluation for N = 25, 50, 100 N = 25

N = 50

N = 100

EB

P[Z > EB]

NI

EB

P[Z > EB]

NI

EB

P[Z > EB]

NI

6 7 8 9 10 11 12 13

4.9e01 2.7e01 1.3e01 5.2e02 1.8e02 5.7e03 1.6e03 4.1e04

– 16.67% 33.33% 50.00% 66.67% 83.33% 100.00% 116.67%

9 10 11 12 13 14 15 17

3.8e01 2.3e01 1.2e01 5.5e02 2.3e02 8.9e03 3.1e03 3.1e04

– 11.11% 22.22% 33.33% 44.44% 55.56% 66.67% 88.89%

13 14 15 16 17 19 21 22

4.1e01 2.7e01 1.6e01 8.9e02 4.6e02 9.6e03 1.5e03 5.6e04

– 7.69% 15.38% 23.08% 30.76% 46.15% 61.54% 69.23%

582

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

niﬁcantly aﬀect the quality oﬀered to the clients. The same behavior is observed for other values of N in the interval [10, 100]. Now, we present the analysis of the second question. Fig. 15 shows the distribution of the number of concurrent streams Z when the threshold window varies around 20% of its optimal value for N = 50 and N = 100. We may notice that these distributions are quite similar. Hence, if the server does not detect changes in the value of k, and consequently does not update the value of the threshold window, the required bandwidth for this non-optimal window is quite similar to that computed for the optimal one. In other words, the threshold window may vary within certain limits around the optimal window value without signiﬁcantly aﬀecting the bandwidth requirements. Qualitatively similar results

we need more channels, besides the reserved ones, decreases by one order of magnitude. On the other hand, if k increases so that N = 100, we have that P[Z > 17] = 7.2e02, i.e., the probability increases by one order of magnitude. (Note that the bandwidth distributions for N = 50 and N = 100 in Fig. 13 were computed using the optimal window deﬁned for N = 75.) Fig. 14 presents another example that illustrates the quality of service perceived by the clients when the arrival rate changes. In this example we set an initial value for the arrival rate and evaluate the QoS as the arrival rate increases. In Fig. 14(a) the server is initially conﬁgured for N = 10 and in Fig. 14(b) it is initially conﬁgured for N = 50. We note that, for N = 10 and N = 50, an increase of approximately 20% in the arrival rate does not sig1

1

P[Z > 5]

P[Z > k]

0.1

P[Z > k]

P[Z > 7] 0.01

P[Z > 8]

0.001

0.0001 5

P[Z>12]

0.1

10

15

20

P[Z>14] 0.01

P[Z>17]

0.001

25

30

(a)

35

40

45

50

0.0001

55

50

60

70

(b)

N

80

90

100

N

Fig. 14. Quality of service as the value of N increases: (a) for N = 10: P[Z > 5] = 102; P[Z > 7] = 103; P[Z > 8] = 104, (b) for N = 50: P[Z > 12] = 102; P[Z > 14] = 103; P[Z > 17] = 104. 1

1

0.9

window 20% smaller than the optimal one

0.9 0.8

0.7 optimal window

0.6 0.5

P[Z > k]

P[Z > k]

0.7

window 20% larger than the optimal one

0.4

0.4

0.2

0.2

0.1

0.1

(a)

10

k

15

20

0

25

window 20% larger than the optimal one

0.5

0.3

5

optimal window

0.6

0.3

0 0

window 20% smaller than the optimal one

0.8

0

5

10

k

15

20

25

(b)

Fig. 15. Distribution of the number of concurrent streams for N = 50 and N = 100 when the threshold window w varies around 20% of the optimal window value.

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

are obtained for other values of N in the interval [10, 100]. In summary, we are able to predict the bandwidth requirements for a given object, or for a group of clients, in order to guarantee a certain quality of service based on the distribution of the number of concurrent streams. This distribution also allows us to predict the quality of service when the client arrival rate changes and the server does not detect it quickly. Moreover, increases of 20% in the arrival rate do not considerably impact on the quality oﬀered to the clients. Another important point is related to the value of the threshold window. We observe that changes around 20% of the optimal window value do not signiﬁcantly impact on the distribution. 4.3. Server conﬁguration As already mentioned, the distribution of the number of concurrent streams in the case of the multiple objects may be used, for example, to conﬁgure the server. To exemplify this point, we consider four speciﬁc scenarios based on the real case studies of [41]. They showed that the distribution of access frequencies for all media objects of two educational servers (BIBS and e-Teach) can be approximated by a concatenation of two Zipf distributions. The ﬁrst Zipf is used for the most popular objects and the second for the least requested objects. We only consider the ﬁrst Zipf distribution since the access frequency of the least requested objects is very low and therefore has little or almost no inﬂuence on server bandwidth. The scenarios we

Table 5 Number of objects in each scenario N

Scenario 1, z = 0.60

Scenario 2, z = 0.55

Scenario 3, z = 0.80

Scenario 4, z = 0.65

100 75 50 40 25 10

1 1 1 1 1 1

2 2 2 1 1 1

2 2 2 2 2 2

4 3 3 3 3 2

studied are summarized in Table 5. It presents the number of objects in each scenario for each value of N. Each scenario has a distinct value of skew factor z and we have the most popular 50–55 min media objects of the BIBS server. Fig. 16(a) shows the probability of the total number of concurrent streams to exceed the average server bandwidth. The average server bandwidth is equal to the sum of the individual average bandwidths for the objects being delivered (computed from Eq. (3)). Fig. 16(b) shows the needed increment with respect to the average server bandwidth to achieve a given probability for a client not being immediately served. We can observe that the probability of exceeding the average server bandwidth is greater than 0.18 for all scenarios. The needed increase in bandwidth goes from 32% (Scenario 1 and P[S > k = 104]) to 6% (Scenario 4 and P[S > k = 102]). The experiments also show that the more objects we have, the smaller is the increase in the average server bandwidth to reach a given level of QoS, as similarly observed in [6,33].

35 30

0.24

Needed Increase (%)

P[S > Average server bandwidth]

0.26

0.22

0.2

25 10

10

-3

-4

20 15

10

-2

10

0.18 5 0.16

(a)

583

6

8

10

12

14

Number of Objects

16

0

18

(b)

6

8

10

12

14

16

18

Number of Objects

Fig. 16. Analysis of the multiple object case. (a) P[S > average server bandwidth], (b) % increase in server bandwidth to have P[S > k] = 102, 103 or 104.

584

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

5. Conclusions and ongoing work In this work, we developed a very simple and precise analytical model to calculate the distribution of the number of concurrent streams generated by the Patching technique, or equivalently, the server bandwidth usage distribution. In the case of the single object, we showed that the bandwidth usage distribution can be accurately modeled in the form of a closed-form solution as a binomial distribution. Only two system variables are required to compute the parameters of the binomial distribution: the mean client arrival rate and the media object length. For the multiple object case, the distribution is a sum of independent binomial random variables which may be easily implemented using Fast Fourier Transform. Through simulations and analytical analysis we validate our model and illustrated how our results may be practically used in real project designs to mainly provide quality of service to system clients. Some of our main observations were: • We may easily employ our results to dynamically allocate server bandwidth to guarantee a certain level of QoS for a group of clients retrieving an object from the multimedia server. • In the case of the single object, we observed that the probability of exceeding the well-known average value is signiﬁcant (0.4). This means that the QoS is severely aﬀected if bandwidth reservation is based on average values. • Using our results we may easily predict the QoS if the arrival rate changes and the server is not able to detect it immediately. For instance, we observed that variations below 20% do not severely impact on the system QoS. Similarly, we noticed that the Patching window size may also vary around 20% of its optimal value without signiﬁcantly aﬀecting the bandwidth distribution. • In the case of multiple objects and considering real-case-based scenarios, we observed that to guarantee a low probability of a client not being served immediately (104), we need a server bandwidth which is 15–32% larger than the well-know average value. This conﬁrms the usefulness of our results to guarantee system QoS. As an extension of this work, we include the evaluation of the impact on the server bandwidth usage distribution of both a limited client bandwidth and client interactive behavior. We also plan

to investigate the distribution considering other than a Poisson client arrival process. Appendix A Proof of Lemma 1. We show by mathematical induction that Eqs. (5) and (6) are correct and therefore Lemma 1 is true. • Case 1: i = 1, 2, . . . , w. Base case It is easy to verify that for i = 1 and all w, we have that m1,w = 0. This agrees with the result of Eq. (5). Inductive hypothesis First, assume that the lemma is true for 1 6 i 6 k, where k is even, i.e., k = 2c and c 2 {1, 2, . . .}. We the following: m2c;w ¼ 2c1 therefore may write 2c11

¼ ¼ c, and m ¼ c 1. 2c1;w 2 2 Second, assume that the lemma is true for 1 6 i 6 k, where k is odd, i.e., k = 2c 1 and c 2 {1, 2, . . .}. We therefore may write the follow 2c11

ing: m ¼ ¼ c 1, and m2c2;w ¼ 2c1;w 2 2c21

¼ c 1. 2 We thus obtain the following relation from the inductive hypothesis: (i) If i is even, then mi,w = mi1,w + 1; (ii) if i is odd, then mi,w = mi1,w. Evaluation of Eq. (5) for i + 1 We use the assumptions of the inductive hypothesis and prove that the result of Eq. (5) holds for i + 1. As before, let c 2 {1, 2, . . .}. Two cases must be considered: – i is odd, then i + 1 is even. We have i = 2c 1 and i + 1 = 2c. We obtain m2c;w ¼ 2c1 ¼ c using Eq. (5). Now using the 2 result of the inductive hypothesis we have that m2c,w = m2c1,w + 1 = c 1 + 1 = c. – i is even, then i + 1 is odd. We have i = 2c and

i + 1 = 2c + 1. We obtain ¼ c using Eq. (5). Now m2cþ1;w ¼ 2cþ11 2 using the result of the inductive hypothesis, we have that m2c+1,w = m2c,w = c. This completes the proof of Eq. (5). • Case 2: i = w + 1, w + 2, . . . , 2w. Base case It is easy to verify that for w = 1 and i = 2, we have that m2,1 = 1. This agrees with the result of Eq. (6). Inductive hypothesis Let c1 and c2 2 {1, 2, . . .}. Two cases must be considered: – Assume that the lemma is true for 1 6 i 6 k, where k is even, i.e., k = 2c2. For w even,

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

2 þ1 i.e., w = 2c1, we obtain: m2c2 ;2c1 ¼ 4c1 2c ¼ 2 4c 2c þ2

1 2 ¼ 2c1 2c1 c2 þ 1, and m2c2 1;2c1 ¼ 2 c2 þ 1. For w odd, i.e., þ1w = 2c1 1, we 2 obtain: m2c2 ;2c1 1 ¼ 4c1 22c ¼ 2c1 c2 , and 2 4c 22c þ2

m2c2 1;2c1 1 ¼ 1 2 2 ¼ 2c1 c2 . – Now, assume that the lemma is true for 1 6 i 6 k, where k is odd, i.e., k = 2c2 1. For w even, i.e., w = 2c1, we obtain: 2 þ2 ¼ 2c1 c2 þ 1, and m2c2 1;2c1 ¼ 4c1 2c 2 4c 2c

1 2 þ2þ1 m2c2 2;2c1 ¼ ¼ 2c1 c2 þ 2. For 2 w odd, i.e., w = 2c 1 1, we obtain: 2 þ2 m2c2 1;2c1 1 ¼ 4c1 22c ¼ 2c1 c2 , and 2 4c 22c þ3

1 2 m2c2 2;2c1 1 ¼ ¼ 2c1 c2 þ 1. 2 We thus obtain the following relation from the inductive hypothesis: when i is even, we have that mi,w = mi1,w; when i is odd, we have that mi,w = mi1,w 1. Evaluation of Eq. (6) for i + 1 We use the assumptions of the inductive hypothesis and prove that the result of Eq. (6) holds for i + 1. As before, let c1 and c2 2 {1, 2, . . .}. Two cases must be considered: – i is odd, then i + 1 is even. We have i = 2c2 1 then i + 1 = 2c2. For w even, i.e., w = 2c1, we have m2c2 ;2c1 ¼ 2c1 c2 þ 1 from Eq. (6). The result of the inductive hypothesis gives: m2c2 ;2c1 ¼ m2c2 1;2c1 ¼ 2c1 c2 þ 1. For w odd, i.e., w = 2c1 1, we have m2c2 ;2c1 1 ¼ 2c1 c2 from Eq. (6). The result of the inductive hypothesis gives: m2c2 ;2c1 1 ¼ m2c2 1;2c1 1 ¼ 2c1 c2 . – i is even, then i + 1 is odd. We have i = 2c2 then i + 1 = 2c2 + 1. For w even, i.e., w = 2c1, we have m2c2 þ1;2c1 ¼ 2c1 c2 from Eq. (6). The result of the inductive hypothesis gives: m2c2 þ1;2c1 ¼ m2c2 ;2c1 1 ¼ 2c1 c2 þ 1 1 ¼ 2c1 c2 . For w odd, i.e., w = 2c1 1, we have m2c2 þ1;2c1 1 ¼ 2c1 c2 1 from Eq. (6). The result of the inductive hypothesis gives: m2c2 þ1;2c1 1 ¼ m2c2 ;2c1 1 1 ¼ 2c1 c2 1. This completes the proof of Eq. (6). h Appendix B Proof of Lemma 2. We show that Eqs. (7)–(9) are correct and therefore Lemma 2 is true.

585

• Case 1: i 2 ½1; w NT We have that li,w = mi+w+T/N,w + mi,w, where mi+w+T/N,w is the number of patches initiated in the previous interval and mi,w is the number of patches initiated in the current interval. Substituting Eqs. (5) and (6) of Lemma 1 in the expression obtained for li,w, gives 2w ði þ w þ T =N 1Þ i1 li;w ¼ þ 2 2 w T =N i þ 1 i1 ¼ þ : ðB:1Þ 2 2 Now we consider the possible values for i and (w T/N) (even or odd): – i and w NT are even li;w ¼

w T =N i i w T =N þ1þ ¼ þ 1: 2 2 2 2 ðB:2Þ

– i is even and ðw NT Þ is odd li;w

w T =N 1 i i w T =N þ þ ¼ ¼ : 2 2 2 2 2

T

ðB:3Þ

– i and w N are odd w T =N i1 i1 w T =N li;w ¼ þ ¼ : 2 2 2 2 ðB:4Þ T – i is odd and w N is even w T =N i 1 i 1 w T =N þ ¼ li;w ¼ : 2 2 2 2 ðB:5Þ This completes the proof of Eq. (7). • Case 2: i 2 w NT þ 1; w We have that li,w = mi,w since all patches initiated in the (k 1)th interval have already ﬁnished. From Eq. (5) of Lemma 1 we have i1 li;w ¼ : ðB:6Þ 2 • Case 3: i 2 w þ 1; w þ NT We have that li,w = mi,w since all patches initiated in the (k 1)th interval have already ﬁnished. From Eq. (6) of Lemma 1 we have 2w ði 1Þ li;w ¼ : ðB:7Þ 2 This completes the proof of Eqs. (8) and (9). h

586

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587

References [1] C.C. Aggarwal, J.L. Wolf, P.S. Yu, On optimal piggyback merging policies for video-on-demand systems, in: Proc. 1996 ACM SIGMETRICS Conf. On Measurement and Modeling of Computer Systems, 1996, pp. 200– 209. [2] K.A. Hua, Y. Cai, S. Sheu, Patching: a multicast technique for true video-on-demand services, in: Proc. 6th ACM Int. Multimedia Conference (ACM MULTIMEDIA’98), 1998, pp. 191–200. [3] Y. Cai, K.A. Hua, An eﬃcient bandwidth-sharing technique for true video on demand systems, in: Proc. 7th ACM Int. Multimedia Conference (ACM Multimedia’99), 1999, pp. 211–214. [4] Y. Cai, K.A. Hua, K. Hu, Optimizing patching performance, in: Proc. SPIE/ACM Conference on Multimedia Computing and Networking, January 1999. [5] S. Sen, L. Gao, J. Rexford, D. Towsley, Optimal patching schemes for eﬃcient multimedia streaming, in: Proc. 9th Int. Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV’99), 1999. [6] D. Eager, M. Vernon, J. Zahorjan, Minimizing bandwidth requirements for on-demand data delivery, in: Proc. 5th Int. Workshop on Multimedia Information Systems (MIS’99), 1999, pp. 80–87. [7] D. Eager, M. Vernon, J. Zahorjan, Optimal and eﬃcient merging schedules for video-on-demand servers, in: Proc. 7th ACM Int. Multimedia Conference (ACM Multimedia’99), 1999, pp. 199–202. [8] D. Eager, M. Vernon, J. Zahorjan, Bandwidth skimming: a technique for cost-eﬀective video-on-demand, in: Proc. IS & T/SPIE MMCN’00, 2000, pp. 206–215. [9] A. Bar-Noy, R.E. Ladner, Competitive on-line stream merging algorithms for media-on-demand, in: Proc. 12th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), 2001. [10] E.G.C. Jr., P. Jelenkovic, P. Momcilovic, Provably eﬃcient stream merging, in: Proc. 6th Int. Workshop on Web caching and Content Distribution, Boston, MA, 2001. [11] C. Costa, I. Cunha, A. Borges, C. Ramos, M. Rocha, J.M. Almeida, B. Ribeiro-Neto, Analyzing client interactivity in streaming media, in: Proc. 13th ACM Int. World Wide Web Conference, May, 2004, pp. 534–543. [12] W.-T. Chan, T.-W. Lam, H.-F. Ting, W.-H. Wong, Competitive analysis of on-line stream merging algorithms, in: Proc. of the 27th Int. Symposium on Mathematical Foundations of Computer Science (MFCS), 2002, pp. 188– 200. [13] K.C. Almeroth, M.H. Ammar, The use of multicast delivery to provide a scalable and interactive video-on-demand service, IEEE Journal on Selected Areas in Communications 14 (5) (1996) 1110–1122. [14] A. Hu, Video-on-demand broadcasting protocols: a comprehensive study, in: Proc. IEEE Infocom, 2001, pp. 508– 517. [15] S. Viswanathan, T. Imielinski, Pyramid broadcasting for video-on-demand service, in: Proc. SPIE Multimedia Computing and Networking Conference, 1995, pp. 66–77. [16] L. Juhn, L. Tseng, Fast data broadcasting and receiving scheme for popular video service, IEEE Transactions on Broadcasting 44 (1) (1998) 100–105.

[17] L. Juhn, L. Tseng, Harmonic broadcasting for video-ondemand service, IEEE Transactions on Broadcasting 43 (3) (1997) 268–271. [18] K. Hua, S. Sheu, Skyscraper broadcasting: a new broadcasting scheme for metropolitan video-on-demand systems, in: Proc. ACM SIGCOMM, 1997, pp. 89–100. [19] J.-F. Paris, S.W. Carter, D.D.E. Long, A low bandwidth broadcasting protocol for video on demand, in: Proc. IEEE Int. Conference on Computer Communications and Networks (IC3N’98), 1998, pp. 690–697. [20] J.-F. Paris, S.W. Carter, D.D.E. Long, A hybrid broadcasting protocol for video on demand, in: Proc. SPIE Multimedia Computing Networking Conference (MMCN’99), 1999, pp. 317–326. [21] J.-F. Paris, S.W. Carter, D.D.E. Long, A simple lowbandwidth broadcasting protocol for video on demand, in: Proc. 8th Int. Conference on Computer Communications and Networks (IC3N’99), 1999. [22] Y. Guo, K. Suh, J. Kurose, D. Towsley, P2cast: peer-to-peer patching scheme for VoD service, in: Proc. 12th World Wide Web Conference (WWW-03), 2003, pp. 301–309. [23] M.K. Bradshaw, B. Wang, S. Sen, L. Gao, J. Kurose, P. Shenoy, D. Towsley, Periodic broadcast and patching services – implementation, measurement and analysis in an Internet streaming video testbed, ACM Multimedia Journal 9 (1) (2003) 78–93. [24] D. Guan, S. Yu, A two-level patching scheme for video-ondemand delivery, IEEE Transactions on Broadcasting 50 (1) (2004) 11–15. [25] Y.W. Wong, J.Y.B. Lee, Recursive patching – an eﬃcient technique for multicast video streaming, in: Proc. 5th International Conference on Enterprise Information Systems (ICEIS) 2003, Angers, France, 2003, pp. 23–26. [26] Y. Cai, K. Hua, Sharing multicast videos using patching streams, Journal of Multimedia Tools and Applications 21 (2) (2003) 125–146. [27] Y. Cai, W. Tavanapong, K. Hua, Enhancing patching performance through double patching, in: Proc. Int. Conf. on Distributed Multimedia Systems, Miami, FL, USA, 2003, pp. 72–77. [28] E. de Souza e Silva, R.M.M. Lea˜o, M.C. Diniz, The required capacity distribution for the patching bandwidth sharing technique, in: Proc. IEEE Int. Conference on Communications (ICC 2004), 2004. [29] A. Bar-Noy, G. Goshi, R.E. Ladner, K. Tam, Comparison of stream merging algorithms for media-on-demand, in: Proc. Multimedia Computing and Networking (MMCN’02), San Jose, CA, 2002, pp. 18–25. [30] L. Gao, D. Towsley, Supplying instantaneous videoon-demand services using controlled multicast, in: Proc. IEEE Multimedia Computing Systems, 1999, pp. 117– 121. [31] Y. Cai, K. Hua, K. Vu, Optimizing patching performance, in: Proc. SPIE/ACM Conference on Multimedia Computing and Networking, 1999, pp. 204–215. [32] D.L. Eager, M.C. Ferris, M. Vernon, Optimized regional caching for on-demand data delivery, in: Proc. ISET/SPIE Conf. Multimedia Computing and Networking (MMCN’99), 1999, pp. 301–316. [33] H. Tan, D. Eager, M. Vernon, H. Guo, Quality of service evaluations of multicast streaming protocols, in: Proc. ACM SIGMETRICS, 2002, pp. 183–194.

C.K.d.S. Rodrigues, R.M.M. Lea˜o / Computer Networks 51 (2007) 569–587 [34] A. Bar-Noy, R.E. Ladner, Eﬃcient algorithms for optimal stream merging for media-on-demand, SIAM Journal on Computing 33 (5) (2004) 1011–1034. [35] S.M. Ross, Stochastic Processes, second ed., John Wiley & Sons, Inc., New York, 1996. [36] L. Kleinrock, Queueing Systems, Theory, vol. 1, John Wiley & Sons, New York, 1975. [37] K.S. Trivedi, Probability and Statistics with Reliability, Queuing and Computer Science Applications, second ed., John Wiley & Sons, Inc., New York, 2002. [38] E.O. Brigham, The Fast Fourier Transform and Its Applications, Prentice-Hall, Inc., New Jersey, 1988. [39] R.M.L.R. Carmo, L.R. de Carvalho, E. de Souza e Silva, M.C. Diniz, R.R. Muntz, Performance/availability modeling with the TANGRAM-II modeling environment, Performance Evaluation 33 (1998) 45–65. [40] E. de Souza e Silva, R.M.M. Lea˜o, The Tangram-II environment, in: Proc. 11th Int. Conference TOOLS2000, 2000, pp. 366–369. [41] J.M. Almeida, J. Krueger, D.L. Eager, M.K. Vernon, Analysis of educational media server workloads, in: Proc. 11th Int. Workshop Network and Operating Systems Support for Digital Audio and Video (NOSSDAV’01), 2001, pp. 21–30. [42] E. de Souza e Silva, R.M.M. Lea˜o, B. Ribeiro-Neto, S. Campos, Performance issues of multimedia applications, Lecture Notes in Computer Science 2459 (2002) 374–404. [43] Huadong Ma, K.G. Shin, Multicast video-on-demand services, ACM SIGCOMM, Computer Communication Review 32 (1) (2002) 31–43. [44] K.A. Hua, M.A. Tantaoui, W. Tavanapong, Video delivery technologies for large-scale deployment of multimedia applications, Proceedings of the IEEE 92 (9) (2004) 1439–1451. [45] B. Wang, S. Sen, M. Adler, D. Towsley, Optimal proxy cache allocation for eﬃcient streaming media distribution, IEEE Transactions on Multimedia (2004) 1726–1735.

587

[46] X. Zhang, J. Liu, B. Li, Tak-Shing P. Yum, CoolStreaming/ DONet: a data-driven overlay network for eﬃcient live media streaming, in: IEEE Infocom, 2005.

Carlo K. da S. Rodrigues received the B.Sc. in Electrical Engineering from Federal University of Paraiba in 1993 and the M.Sc from IME in 2000. He has been D.Sc. Student from Federal University of Rio de Janeiro of System Engineering and Computer Science Program since 2002. He is also a member of the LAND – Laboratory for Modeling, Analysis and Development of Networks and Computer System. His current research interests are in the areas of computer networks and streaming media delivery.

Rosa M.M. Lea˜o received the B.Sc. degree in Computer Science from the Federal University of Rio de Janeiro in 1983, the M.Sc. degree in Computer Science from PUC-Rio in 1990, and the Ph.D. degree in Computer Science from the Paul Sabatier University (LAAS) in 1994. Currently she is an Associate Professor in the Systems Engineering and Computer Science Department at the Federal University of Rio de Janeiro. Her research interests include computer networks, computer and communication systems modeling, performance evaluation, and multimedia systems.

Bandwidth usage distribution of multimedia servers ...

across the Internet from this server. The ... nelling have also been used over the public Internet. [29,42,43]. ..... corresponding values T1,T2,...,Tn using the two.

Download PDF

360KB Sizes 1 Downloads 207 Views

Report

Bandwidth usage distribution of multimedia servers ...

Recommend Documents