- Stoch. Syst.
- Volume 6, Number 1 (2016), 90-131.
Randomized assignment of jobs to servers in heterogeneous clusters of shared servers for low delay
We consider the problem of assignning jobs to servers in a multi-server system consisting of $N$ parallel processor sharing servers, categorized into $M$ ($\ll N$) different types according to their processing capacities or speeds. Jobs of random sizes arrive at the system according to a Poisson process with rate $N\lambda$. Upon each arrival, some servers of each type are sampled uniformly at random. The job is then assigned to one of the sampled servers based on their states. We propose two schemes, which differ in the metric for choosing the destination server for each arriving job. Our aim is to reduce the mean sojourn time of the jobs in the system.
It is shown that the proposed schemes achieve the maximal stability region, without requiring the knowledge of the system parameters. The performance of the system operating under the proposed schemes is analyzed in the limit as $N\to\infty$. This gives rise to a mean field limit. The mean field is shown to have a unique, globally asymptotically stable equilibrium point which approximates the stationary distribution of load at each server. Asymptotic independence among the servers is established using a notion of intra-type exchangeability which generalizes the usual notion of exchangeability. It is further shown that the tail distribution of server occupancies decays doubly exponentially for each server type. Numerical evidence shows that at high load the proposed schemes perform at least as well as other schemes that require more knowledge of the system parameters.
Stoch. Syst., Volume 6, Number 1 (2016), 90-131.
Received: February 2015
First available in Project Euclid: 16 November 2016
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Primary: 60K35: Interacting random processes; statistical mechanics type models; percolation theory [See also 82B43, 82C43]
Secondary: 60K25: Queueing theory [See also 68M20, 90B22] 90B15: Network models, stochastic
Mukhopadhyay, Arpan; Karthik, A.; Mazumdar, Ravi R. Randomized assignment of jobs to servers in heterogeneous clusters of shared servers for low delay. Stoch. Syst. 6 (2016), no. 1, 90--131. doi:10.1214/15-SSY179. https://projecteuclid.org/euclid.ssy/1479287406