Schedule

Mon 24/1

Introduction Part 1 (slides)

The datacenter as a computer (BCH chapters 1, 2)

Course syllabus

Wed 26/1

Introduction Part 2 (slides)

The datacenter as a computer (BCH chapters 1, 2)


Mon 31/1

Datacenter hardware (slides)

(Lecture notes, BCH chapters 3, 6)

Wed 23/2

Application frameworks

Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing.

X-Stream: edge-centric graph processing using streaming partitions.

Presenters:


Mon 28/2

Winter break (No lecture)


Wed 2/3

Serverless compute

Occupy the Cloud: Distributed Computing for the 99%

ExCamera -- Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads

Presenters:


Mon 7/3

Microservices

Introduction to microservices

An Open-Source Benchmark Suite for Microservices and Their Hardware-Software Implications for Cloud and Edge Systems

Presenters:


Wed 9/3

Performance Analysis

Profiling a Warehouse-Scale Computer

CPI2: CPU performance isolation for shared compute clusters

Presenters:


Mon 14/3 (online on zoom)

Tail latency

The Tail at Scale

IX: a protected dataplane operating system for high throughput and low latency

Presenters:


Wed 16/3

Midterm (in class)


Mon 21/3

Security

CryptDB: Protecting Confidentiality with Encrypted Query Processing

Shielding applications from an untrusted cloud with Haven

Presenters:


Wed 23/3

Privacy

Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds

HomeAlone: Co-residency Detection in the Cloud via Side-Channel Analysis

Presenters:


Mon 28/3

Monitoring

Dapper, a Large-Scale Distributed Systems Tracing Infrastructure

The Mystery Machine: End-to-end Performance Analysis of Large-scale Internet Services

Presenters:


Wed 30/3

Performance Debugging

X-Trace: A Pervasive Network Tracing Framework

Sage: Practical & Scalable ML-Driven Performance Debugging in Microservices


Mon 4/4

Spring Break (no lecture)


Wed 6/4

Spring Break (no lecture)


Mon 11/4

Low-latency service management

Retail: Opting for Learning Simplicity to Enable QoS-Aware Power Management in the Cloud

Caladan: Mitigating Interference at Microsecond Timescales

Presenters:


Wed 13/4

Datacenter management (slides)

(Lecture notes)


Mon 18/4

Cluster scheduling

Sparrow: distributed, low latency scheduling

Shinjuku: Preemptive Scheduling for μsecond-scale Tail Latency

Presenters:


Wed 20/4

ML in Systems

Quasar: Resource-Efficient and QoS-Aware Cluster Management

Resource Central: Understanding and Predicting Workloads for Improved Resource Management inLarge Cloud Platforms

Presenters:


Mon 25/4

Cluster management

Large-scale cluster management at Google with Borg

Omega: Flexible, scalable schedulers for large compute clusters

Presenters:


Wed 27/4

Project presentations


Mon 2/5

Project presentations


Wed 4/5

Project presentations


Mon 9/5

Project presentations



Note: Paper presentations can be found in this folder.