Abstract
Fast network quality analysis in the telecom industry is an important method used to provide quality service. SK Telecom, based in South Korea, built a Hadoop-based analytical system consisting of a hundred nodes, each of which only contains hard disk drives (HDDs). Because the analysis process is a set of parallel I/O intensive jobs, adding solid state drives (SSDs) with appropriate settings is the most cost-efficient way to improve the performance, as shown in previous studies. Therefore, we decided to configure SSDs as a write-through cache instead of increasing the number of HDDs. To improve the cost-per-performance of the SSD cache, we introduced a selective I/O bypass (SIB) method, redirecting the automatically calculated number of read I/O requests from the SSD cache to idle HDDs when the SSDs are I/O over-saturated, which means the disk utilization is greater than 100 percent. To precisely calculate the disk utilization, we also introduced a combinational approach for SSDs because the current method used for HDDs cannot be applied to SSDs because of their internal parallelism. In our experiments, the proposed approach achieved a maximum 2x faster performance than other approaches.
Original language | English |
---|---|
Pages (from-to) | 589-595 |
Number of pages | 7 |
Journal | IEEE Transactions on Computers |
Volume | 67 |
Issue number | 4 |
DOIs | |
Publication status | Published - 2018 Apr 1 |
Bibliographical note
Funding Information:We gratefully acknowledge the financial support of the German Ministry of Education, Science, Research and Technology (BMBF), the Deutsche Forschungsgemeinschaft (SFB 5 13) and Land Baden-Wurttemberg (Optik Zentrum).
Publisher Copyright:
© 1968-2012 IEEE.
All Science Journal Classification (ASJC) codes
- Software
- Theoretical Computer Science
- Hardware and Architecture
- Computational Theory and Mathematics