Schedule
Wed 26/1
Introduction Part 2 (slides)
The datacenter as a computer (BCH chapters 1, 2)
Mon 31/1
Datacenter hardware (slides)
(Lecture notes, BCH chapters 3, 6)
Wed 2/2
Power management (slides)
(Lecture notes, BCH chapters 4, 5)
Mon 7/2
Hardware architectures
A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services
Architecting to Achieve a Billion RPS Throughput on a Single Key-Value Store Server Platform
Presenters:
Wed 9/2
Energy & power
Heracles: improving resource efficiency at scale
Towards Energy Proportionality for Large-Scale Latency-Critical Workloads
Presenters:
Mon 14/2
Datacenter storage
Pocket: Elastic Ephemeral Storage for Serverless Analytics
Presenters:
Wed 16/2
Reliability (slides)
(Lecture notes, BCH chapter 7)
Mon 21/2
Datacenter networking
Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google’s Datacenter Network
Azure Accelerated Networking: SmartNICs in the Public Cloud
Presenters:
Wed 23/2
Application frameworks
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing.
X-Stream: edge-centric graph processing using streaming partitions.
Presenters:
Mon 28/2
Winter break (No lecture)
Wed 2/3
Serverless compute
Occupy the Cloud: Distributed Computing for the 99%
ExCamera -- Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads
Presenters:
Mon 7/3
Microservices
Presenters:
Wed 9/3
Performance Analysis
Profiling a Warehouse-Scale Computer
CPI2: CPU performance isolation for shared compute clusters
Presenters:
Mon 14/3 (online on zoom)
Tail latency
IX: a protected dataplane operating system for high throughput and low latency
Presenters:
Wed 16/3
Midterm (in class)
Mon 21/3
Security
CryptDB: Protecting Confidentiality with Encrypted Query Processing
Shielding applications from an untrusted cloud with Haven
Presenters:
Wed 23/3
Privacy
Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds
HomeAlone: Co-residency Detection in the Cloud via Side-Channel Analysis
Presenters:
Mon 28/3
Monitoring
Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
The Mystery Machine: End-to-end Performance Analysis of Large-scale Internet Services
Presenters:
Wed 30/3
Performance Debugging
X-Trace: A Pervasive Network Tracing Framework
Sage: Practical & Scalable ML-Driven Performance Debugging in Microservices
Mon 4/4
Spring Break (no lecture)
Wed 6/4
Spring Break (no lecture)
Mon 11/4
Low-latency service management
Retail: Opting for Learning Simplicity to Enable QoS-Aware Power Management in the Cloud
Caladan: Mitigating Interference at Microsecond Timescales
Presenters:
Wed 13/4
Datacenter management (slides)
(Lecture notes)
Mon 18/4
Cluster scheduling
Sparrow: distributed, low latency scheduling
Shinjuku: Preemptive Scheduling for μsecond-scale Tail Latency
Presenters:
Wed 20/4
ML in Systems
Quasar: Resource-Efficient and QoS-Aware Cluster Management
Presenters:
Mon 25/4
Cluster management
Large-scale cluster management at Google with Borg
Omega: Flexible, scalable schedulers for large compute clusters
Presenters:
Wed 27/4
Project presentations
Mon 2/5
Project presentations
Wed 4/5
Project presentations
Mon 9/5
Project presentations
Note: Paper presentations can be found in this folder.