Advanced Parallel Computing (240199)
Tue. 19:30~22:30 / Online
Instructor: Duksu Kim (bluekds at koreatech.ac.kr / #435, 2nd Eng. Building)
Prerequisite
(Required) C Programming
(Strongly Recommended) Multi-core Programming (Undergraduate level)
(Recommended) System Programming, Data structure
(Required) PC or Laptop with a multi-core CPU / (Recommended) PC or Laptop with a NVIDIA GPU
We will rent a development kit (e.g., Jetson kit) for CUDA if you need
However, you need to prepare a monitor and a keyboard/mouse yourself to use that.
Textbooks
Lecture notes in this page
Lecture videos for the multi-core programming class (CPH351) [link]
Sample codes are available at the git repository [link]
References
This page may not work with Internet Explorer 10 or older versions (Recommended browser: Chrome)
Setup CUDA Dev. environments
Windows Dev. environments [Kor]
Linux(Ubuntu) Dev. environments on Jetson Kit [Kor]
Trouble shooting
Q. My laptop has a Nvidia GPU, but CUDA does not work properly
A. Check the GPU system on your laptop whether a hybrid GPU system (e.g., Intel HD graphics + Nvidia GPU)
In this case, disabling the intel GPU on the device manager of you OS may fix the problem
Lecture Notes and Videos
- Lecture 1. Parallel Processing Overview (Week 1)
- Lecture 2. OpenMP Overview (Week 2-3)
- Lecture 3. CUDA Overview (Week 4-6)
Paper Seminar 1 (10/13)
Efficient parallel optimizations of a high-performance SIFT on GPUs [paper] [slides] [video]
Li et al, Journal of Parallel and Distributed Computing, 2019
Presenter: Young-Wo Kim
A New GPU Bundle Adjustment Method for Large-Scale Data [paper] [slides] [video]
Maoteng Zheng, Photogrammetric Engineering and Remote Sensing, 2017
Presenter: ByungHyun Yoon
Paper Seminar 2 (10/20)
Accelerating reduction and scan using tensor core units [paper][slides][Video]
Abdul Dakkak et al., ACM International Conference on Supercomputing, 2019
Presenter: Jaehong Lee
Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs [paper] [slides][Video]
Chao Li et al, International Conference for High Performance Computing, Networking, Storage and Analysis, 2016
Presenter: Jun Wook Jeong
[Project] Proposal presentation (10/27)
Paper Seminar 3 (11/03)
Paper Seminar 4 (11/10)
GPU Fast Convolution via the Overlap-and-Save Method in Shared Memory [paper][slides][video]
Adamek et al., ACM TACO, 2020
Presenter: Young-Wo Kim
Real-time enhancement of sparse 3D maps using a parallel segmentation scheme based on superpixels [paper][slides][video]
Cruz-Martinez et al., Journal of Real-time Image Processing, 2018
Presenter: ByungHyun Yoon
[Project] Midterm presentation (11/17)
Paper Seminar 5 (11/24)
Overcoming the difficulty of large-scale CGH generation on multi-GPU cluster [paper][slides][video]
Yokota et al., Proceedings of the 11th Workshop on General Purpose GPUs, 2018
Presenter: Jaehong Lee
NeuGraph: Parallel Deep Neural Network Computation on Large Graphs [paper][slides][video]
Ma et al., USENIX, 2019
Presenter: Jun Wook Jeong
Paper Seminar 6 (12/01)
Parallel Tracking and Mapping for Small AR Workspaces [paper][slides][video]
Klein et al., IEEE and ACM International Symposium on Mixed and Augmented Reality, 2007
Presenter: Hyun-wo Jeong
Pragma-Oriented Parallelization of the Direct Sparse Odometry SLAM Algorithm [paper][slides][video]
Pereira et al., PDP, 2019
Presentor: Sang-Won Choi