Schedule

Mon 24/1

Introduction Part 1 (slides)

The datacenter as a computer (BCH chapters 1, 2)

Course syllabus

Wed 26/1

Introduction Part 2 (slides)

The datacenter as a computer (BCH chapters 1, 2)

Mon 31/1

Datacenter hardware (slides)

(Lecture notes, BCH chapters 3, 6)

Wed 2/2

Power management (slides)

(Lecture notes, BCH chapters 4, 5)

Mon 7/2

Hardware architectures

A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services

Architecting to Achieve a Billion RPS Throughput on a Single Key-Value Store Server Platform

Presenters:

Wed 9/2

Energy & power

Heracles: improving resource efficiency at scale

Towards Energy Proportionality for Large-Scale Latency-Critical Workloads

Presenters:

Mon 14/2

Datacenter storage

Pocket: Elastic Ephemeral Storage for Serverless Analytics

The Google File System

Presenters:

Wed 16/2

Reliability (slides)

(Lecture notes, BCH chapter 7)

Mon 21/2

Datacenter networking

Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google’s Datacenter Network

Azure Accelerated Networking: SmartNICs in the Public Cloud

Presenters:

Wed 23/2

Application frameworks

Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing

X-Stream: edge-centric graph processing using streaming partitions

Presenters:

Mon 28/2

Winter Break (no lecture)

Wed 2/3

Serverless compute

Occupy the Cloud: Distributed Computing for the 99%

ExCamera -- Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads

Presenters:

Mon 7/3

Microservices

Introduction to microservices

An Open-Source Benchmark Suite for Microservices and Their Hardware-Software Implications for Cloud and Edge Systems

Presenters:

Wed 9/3

Performance Analysis

Profiling a Warehouse-Scale Computer

CPI²: CPU performance isolation for shared compute clusters

Presenters:

Mon 14/3 (online on Zoom)

Tail latency

The Tail at Scale

IX: a protected dataplane operating system for high throughput and low latency

Presenters:

Wed 16/3

Midterm (in class)

Mon 21/3

Security

CryptDB: Protecting Confidentiality with Encrypted Query Processing

Shielding applications from an untrusted cloud with Haven

Presenters:

Wed 23/3

Privacy

Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds

HomeAlone: Co-residency Detection in the Cloud via Side-Channel Analysis

Presenters:

Mon 28/3

Monitoring

Dapper, a Large-Scale Distributed Systems Tracing Infrastructure

The Mystery Machine: End-to-end Performance Analysis of Large-scale Internet Services

Presenters:

Wed 30/3

Performance Debugging

X-Trace: A Pervasive Network Tracing Framework

Sage: Practical & Scalable ML-Driven Performance Debugging in Microservices

Mon 4/4

Spring Break (no lecture)

Wed 6/4

Spring Break (no lecture)

Mon 11/4

Low-latency service management

Retail: Opting for Learning Simplicity to Enable QoS-Aware Power Management in the Cloud

Caladan: Mitigating Interference at Microsecond Timescales

Presenters:

Wed 13/4

Datacenter management (slides)

(Lecture notes)

Mon 18/4

Cluster scheduling

Sparrow: distributed, low latency scheduling

Shinjuku: Preemptive Scheduling for μsecond-scale Tail Latency

Presenters:

Wed 20/4

ML in Systems

Quasar: Resource-Efficient and QoS-Aware Cluster Management

Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms

Presenters:

Mon 25/4

Cluster management

Large-scale cluster management at Google with Borg

Omega: Flexible, scalable schedulers for large compute clusters

Presenters:

Wed 27/4

Project presentations

Mon 2/5

Project presentations

Wed 4/5

Project presentations

Mon 9/5

Project presentations

Note: Paper presentations can be found in this folder.
