Dominik Kempa
Postdoctoral Fellow (Langmead Lab)
Department of Computer Science,
Johns Hopkins University
Email: lastname (at) cs.jhu.edu
(replace lastname with kempa)
Short Bio
I am currently a postdoc at Johns Hopkins University fortunate to be hosted by Ben Langmead. Prior to joining JHU, I was a postdoc at the University of California, Berkeley, and before that, and the University of Warwick.
I obtained my PhD in Computer Science in 2015 at the University of Helsinki, under the supervision of Esko Ukkonen and Juha Kärkkäinen. During my PhD, I implemented a collection of parallel and external-memory algorithms on strings (code: here or Github). As an undergraduate, I enjoyed algorithm competitions (and practiced here). I am a recipient of the Junior Researcher Award and Outstanding Doctoral Dissertation Award.
Recent (+Selected) Publications
See the full list here (alternative: Google Scholar, DBLP).
Dominik Kempa, Tomasz Kociumaka: Resolution of the Burrows-Wheeler Transform Conjecture, FOCS 2020 Arxiv
Dominik Kempa, Tomasz Kociumaka: String Synchronizing Sets: Sublinear-Time BWT Construction and Optimal LCE Data Structure, STOC 2019 PDF | Arxiv
Dominik Kempa: Optimal Construction of Compressed Indexes for Highly Repetitive Texts, SODA 2019 PDF | Slides
Dominik Kempa, Nicola Prezza: At the Roots of Dictionary Compression: String Attractors, STOC 2018 PDF
Juha Kärkkäinen, Dominik Kempa: Better External Memory LCP Array Construction, ACM JEA 2019 View | Code
Juha Kärkkäinen, Dominik Kempa, Simon J. Puglisi, Bella Zhukova: Engineering External Memory Induced Suffix Sorting, ALENEX 2017 View | Code
Research Interests
String Algorithms, Data Compression, Compressed Data Structures, Bioinformatics, Parallel and External-Memory Algorithms. A one-slide summary of Compressed Indexing, one of the most exciting recent topics that I am interested in: View
News
2021/02: I will give an invited talk at SEA 2021
2021/01: I joined Langmead Lab at Johns Hopkins University as a Postdoctoral Scholar
2020/07: Paper accepted to FOCS 2020 Arxiv
2020/07: Github release of pSAscan, a parallel external-memory algorithm for suffix array construction
2020/06: Our STOC 2019 paper on String Synchronizing Sets and fast BWT construction featured in "Highlights of CPM" Link
2019/11: Code release of fSAIS, and LZ-End parser
2019/10: I will be on the PC of CSR 2020, CPM 2020, SPIRE 2020
2019/06: Invited talk about BWT in Dagstuhl Slides | Seminar
Recent Teaching
Algorithms for Data Science (Co-Instructor), UC Berkeley, Fall 2020