SSDStreamer: Specializing I/O Stack for Large-Scale Machine Learning

Jonghyun Bae, Hakbeom Jang, Jeonghun Gong, Wenjing Jin, Shine Kim, Jaeyoung Jang, Tae Jun Ham, Jinkyu Jeong, Jae W. Lee

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

This article presents SSDStreamer, an SSD-based caching system for large-scale machine learning. By using DRAM as stream buffer, instead of an upper-level cache, SSDStreamer significantly outperforms state-of-the-art multilevel caching systems on Apache Spark, while requiring much less DRAM capacity.

Original languageEnglish
Article number8770099
Pages (from-to)73-81
Number of pages9
JournalIEEE Micro
Volume39
Issue number5
DOIs
Publication statusPublished - 2019 Sept 1

Bibliographical note

Publisher Copyright:
© 1981-2012 IEEE.

All Science Journal Classification (ASJC) codes

  • Software
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'SSDStreamer: Specializing I/O Stack for Large-Scale Machine Learning'. Together they form a unique fingerprint.

Cite this