Join us for an intensive 5-day workshop on computational reproducibility and scientific computing organized by the MCML. Each day runs from 9:00 AM to approximately 5:00 PM, starting with lecture-style sessions in the morning. After a lunch break, most afternoons will be dedicated to hands-on tutorials, where you can apply the concepts learned in the morning sessions. This workshop is designed to provide a balanced mix of theoretical background and practical skills, ensuring you gain a thorough understanding of key topics, including writing reproducible manuscripts for small-scale data analysis projects, high-performance computing (HPC) for large-scale projects, packaging and software development.
"Really happy overall with the workshop. Many advanced concept that are also useful in the daily work, and that are hard to get the time to get a proper idea."
"Thanks for this great workshop! I will definitely recommend it to other colleagues :) I hope you will provide it again in the future."
"Very relevant topics, great hands-on sessions, helpful slides. Thank you very much for making the workshop possible."
Details of the workshop
Who: Anybody interested, primary target group are junior MCML PhD students
(Max. 50 participants, please see prerequisites)
Where: Munich, in-person, Location: Richard-Wagner-Str 10, 80333 Munich, Room D 105
What: Topics include reproducible manuscripts, HPC, Packaging, CI/CD, both for R and Python users
How: Lecture-style morning sessions, hands-on tutorials in the afternoons
When: We-Fr, 24.09. - 26.09. &
Mo-Tu, 29.09. - 30.09.
If it is organizationally feasible, we will try to enable participation on individual days.
Joint session for Python and R
Welcome and introduction to WS
Background: Open Science, reproducibility and the replication crisis
Overview of computational reproducibility
Separate sessions for Python and R
Project-oriented workflows
Reproducible analyses & manuscripts
Reproducible environments
Joint session for Python and R
HPC: slurm & LRZ
Workflow management: batchtools, make, snakemake, nextflow
Containerization
Separate sessions for Python and R
What are packages
Documentation
Tests
Packaging tools
Dependencies
Joint and separate sessions
CI & CD
Licensing
MLOps
Participants must be able to program in R or Python and use Git.
In general, we consider MCML PhD students to fulfill the requirements.
If you are a student of the Department of Statistics at the LMU and an R user, you should have finished the Introduction to Statistical Software ("StatSoft") and Programming with Statistical Software ("ProgR") courses. We refer other users of R to this list.
Python users should have a level of programming skills that is equivalent to the topics covered in this course: Data Science with Python
You must bring your own laptop! Depending on the practicals you wish to participate in, we expect you to have a suitable version of Python and/or R and other dependencies installed. We will give a more detailed list of requirements before the workshop.
Participation is open to all members of LMU and TUM as well as students and researchers from other institutions.
Seats will be given on a first-register, first-serve basis. Half of the seats are reserved for members of the MCML.
Please register for the workshop by 05.09.2025 by filling in this form.
If you have any questions, please reach out to moritz.herrmann@lmu.de or ma.binder@lmu.de.
MCML Reproducibility and Open Science Transfer Coordinator
MCML Open Source and Open Data Transfer Coordinator