601.817: Selected Topics in Systems Research

Fall 2018

Fridays 1-2:15pm, Malone 338

Prof. Ryan Huang


601.817 is a weekly seminar organized by the OrderLab. It covers the latest advances in computer systems research, including operating systems, distributed systems, and mobile and cloud computing. Students will read and discuss recent papers from top systems conferences such as OSDI, SOSP, NSDI, EuroSys, MobiSys, ASPLOS and FAST.

Each week, one student will present a paper and lead the discussion. Other students MUST read the paper before the seminar. Do not come to the seminar to read the paper. This seminar is supposed to generate in-depth discussions, which is impossible without reading the paper first. If the discussion leaves some questions unanswered that require further investigation, the discussion lead should start an email thread to systalk@cs.jhu.edu afterwards to follow up.

The focus topics covered in the papers vary from semester to semester. Example topics include fault tolerance, reliability, verification, energy efficiency, and virtualization. The presenter decides which paper to present. In general, select papers that are relevant to your research project first (i.e., depth-first). If you are not sure, it is a good idea to check with me before preparing the presentation. Also, try to avoid papers that have already been presented in the past (the past schedules are linked in the left-side menu).

Presentation announcements are sent via the mailing list systalk@cs.jhu.edu. Students registered for the course should email me to sign up for the mailing list.


08/31/2018 Organizational Meeting
09/07/2018 Justin Shafer

Fail-Slow at Scale: Evidence of Hardware Performance Faults in Large Production Systems

Haryadi S. Gunawi, Rita O. Suminto (U. Chicago); Russell Sears, Casey Golliher (Pure Storage); Swaminathan Sundaraman (Parallel Machines); Xing Lin, Tim Emami (NetApp); Weiguang Sheng, Nematollah Bidokhti (Huawei); Caitie McCaffrey (Twitter); Gary Grider, Parks M. Fields (Los Alamos National Laboratory); Kevin Harms, Robert B. Ross (Argonne National Laboratory); Andree Jacobson (New Mexico Consortium); Robert Ricci, Kirk Webb (U. Utah); Peter Alvaro (UCSC); H. Birali Runesha (U. Chicago Research Computing Center); Mingzhe Hao, Huaicheng Li (U. Chicago)

FAST '18 Paper
09/14/2018 Yigong Hu

WAFL Iron: Repairing Live Enterprise File Systems

Ram Kesavan, NetApp, Inc.; Harendra Kumar, Composewell Technologies; Sushrut Bhowmik, NetApp, Inc.

FAST '18 Paper
09/21/2018 Canceled for Amy Baby's defense
09/28/2018 Disa Mhembere

Improving Docker Registry Design Based on Production Workload Analysis

Ali Anwar, Virginia Tech; Mohamed Mohamed and Vasily Tarasov, IBM Research—Almaden; Michael Littley, Virginia Tech; Lukas Rupprecht, IBM Research—Almaden; Yue Cheng, George Mason University; Nannan Zhao, Virginia Tech; Dimitrios Skourtis, Amit S. Warke, Heiko Ludwig, and Dean Hildebrand, IBM Research—Almaden; Ali R. Butt, Virginia Tech

FAST '18 Paper
10/05/2018 Shiwei Weng

The benefits and costs of writing a POSIX kernel in a high-level language

Cody Cutler, M. Frans Kaashoek, and Robert T. Morris, MIT CSAIL

OSDI '18 Paper
10/12/2018 Steven Cheng

Finding Crash-Consistency Bugs with Bounded Black-Box Crash Testing

Jayashree Mohan, Ashlie Martinez, Soujanya Ponnapalli, and Pandian Raju, University of Texas at Austin; Vijay Chidambaram, University of Texas at Austin and VMware Research

OSDI '18 Paper
10/19/2018 Kunal Lillaney

To FUSE or Not to FUSE: Performance of User-Space File Systems

Bharath Kumar Reddy Vangoor, Stony Brook University; Vasily Tarasov, IBM Research-Almaden; Erez Zadok, Stony Brook University

FAST '17 Paper
10/26/2018 Brian Choi

REPT: Reverse Debugging of Failures in Deployed Software

Weidong Cui and Xinyang Ge, Microsoft Research Redmond; Baris Kasikci, University of Michigan; Ben Niu, Microsoft Research Redmond; Upamanyu Sharma, University of Michigan; Ruoyu Wang, Arizona State University; Insu Yun, Georgia Institute of Technology

OSDI '18 Paper
11/02/2018 Chang Lou

Fault-Tolerance, Fast and Slow: Exploiting Failure Asynchrony in Distributed Systems

Ramnatthan Alagappan, Aishwarya Ganesan, Jing Liu, Andrea Arpaci-Dusseau, and Remzi Arpaci-Dusseau, University of Wisconsin - Madison

OSDI '18 Paper
11/09/2018 Justin Shafer

Orca: Differential Bug Localization in Large-Scale Services

Ranjita Bhagwan, Rahul Kumar, Chandra Sekhar Maddila, and Adithya Abraham Philip, Microsoft Research India

OSDI '18 Paper
11/16/2018 Yigong Hu

Differential Energy Profiling: Energy Optimization via Diffing Similar Apps

Abhilash Jindal and Y. Charlie Hu, Purdue University and Mobile Enerlytics, LLC

OSDI '18 Paper
11/23/2018 Thanksgiving break
11/30/2018 Ziyan Wang

LegoOS: A Disseminated, Distributed OS for Hardware Resource Disaggregation

Yizhou Shan, Yutong Huang, Yilun Chen, and Yiying Zhang, Purdue University

OSDI '18 Paper
12/07/2018 Shiwei Weng

TVM: An Automated End-to-End Optimizing Compiler for Deep Learning

Tianqi Chen and Thierry Moreau, University of Washington; Ziheng Jiang, University of Washington, AWS; Lianmin Zheng, Shanghai Jiao Tong University; Eddie Yan, Haichen Shen, and Meghan Cowan, University of Washington; Leyuan Wang, UC Davis, AWS; Yuwei Hu, Cornell; Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy, University of Washington

OSDI '18 Paper