Abstract
Coded computation is a framework which provides redundancy in distributed computing systems to speed up large-scale tasks. Although most existing works assume error-free scenarios, the link failures are common in current wired/wireless networks. In this paper, we consider the straggler problem in distributed computing systems with link failures, by modeling the links between the master node and worker nodes as packet erasure channels. We first analyze the latency in this setting using an (n, k) maximum distance separable (MDS) code. Then, we consider a setup where the number of retransmissions is limited due to the bandwidth constraint. By formulating practical optimization problems related to latency, bandwidth and probability of successful computation, we obtain achievable performance curves as a function of packet erasure probability.
Original language | English |
---|---|
Title of host publication | 2019 IEEE International Symposium on Information Theory, ISIT 2019 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 717-721 |
Number of pages | 5 |
ISBN (Electronic) | 9781538692912 |
DOIs | |
Publication status | Published - 2019 Jul |
Event | 2019 IEEE International Symposium on Information Theory, ISIT 2019 - Paris, France Duration: 2019 Jul 7 → 2019 Jul 12 |
Publication series
Name | IEEE International Symposium on Information Theory - Proceedings |
---|---|
Volume | 2019-July |
ISSN (Print) | 2157-8095 |
Conference
Conference | 2019 IEEE International Symposium on Information Theory, ISIT 2019 |
---|---|
Country/Territory | France |
City | Paris |
Period | 19/7/7 → 19/7/12 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Information Systems
- Modelling and Simulation
- Applied Mathematics