Schedule
Thu 8/2
Introduction Part 2 (slides)
The datacenter as a computer (BCH chapters 1, 2)
Tue 13/2
No lecture
Thu 22/2
Power management (slides)
(Lecture notes, BCH chapters 4, 5)
Tue 27/2
Hardware acceleration
In datacenter performance analysis of a tensor processing unit
Dagger: Efficient and Fast RPCs in Cloud Microservices with Near-Memory Reconfigurable NICs
Thu 29/2
Hardware architectures
A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services
Architecting to Achieve a Billion RPS Throughput on a Single Key-Value Store Server Platform
Presenters:
Tue 5/3
Energy & power
Towards Energy Proportionality for Large-Scale Latency-Critical Workloads
Heracles: improving resource efficiency at scale
Presenters:
Thu 7/3
Datacenter storage
Pocket: Elastic Ephemeral Storage for Serverless Analytics
Presenters:
Tue 12/3
Reliability (slides)
(Lecture notes, BCH chapter 7)
Thu 14/3
Datacenter networking
Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google’s Datacenter Network
Azure Accelerated Networking: SmartNICs in the Public Cloud
Presenters:
Tue 19/3
Application frameworks
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing.
X-Stream: edge-centric graph processing using streaming partitions.
Presenters:
Thu 21/3
Serverless compute
Occupy the Cloud: Distributed Computing for the 99%
ExCamera -- Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads
Presenters:
Tue 26/3
Spring Break (no lecture)
Thu 28/3
Spring Break (no lecture)
Tue 2/4
Microservices
Presenters:
Thu 4/4
Midterm (in class)
Tue 9/4
Performance Analysis
Profiling a Warehouse-Scale Computer
CPI2: CPU performance isolation for shared compute clusters
Presenters:
Thu 11/4
Tail latency
IX: a protected dataplane operating system for high throughput and low latency
Presenters:
Tue 16/4
Security & Privacy
CryptDB: Protecting Confidentiality with Encrypted Query Processing
Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds
Presenters:
Thu 18/4
Monitoring
Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
The Mystery Machine: End-to-end Performance Analysis of Large-scale Internet Services
Presenters:
Tue 23/4
Performance Debugging
X-Trace: A Pervasive Network Tracing Framework
Sage: Practical & Scalable ML-Driven Performance Debugging in Microservices
Thu 25/4
Low-latency service management
Retail: Opting for Learning Simplicity to Enable QoS-Aware Power Management in the Cloud
Caladan: Mitigating Interference at Microsecond Timescales
Presenters:
Tue 30/4
Datacenter management (slides)
(Lecture notes)
Thu 2/5
ML in Systems
Quasar: Resource-Efficient and QoS-Aware Cluster Management
Presenters:
Tue 7/5
Cluster management
Sparrow: distributed, low latency scheduling
Large-scale cluster management at Google with Borg
Omega: Flexible, scalable schedulers for large compute clusters
Presenters:
Thu 9/5
Project presentations
Tue 14/5
Project presentations
Note: Paper presentations can be found in this folder.