General Guidelines for Projects
Each group project involves the four items described below.
All deadlines will be provided on Brightspace.
1. Progress report
The labs of week 7 are dedicated to supporting your project work. At least two representatives of each group must attend the lab and report their progress to the instructors (e.g. by preparing a couple of slides or printing a 1-page handout). You'll receive feedback and advice on the spot. Here are some ideas for points to discuss in your progress report:
Division of roles among team members
Current literature review stage - which sources have you read, and which have you found most useful for proceeding with your experiments?
Current experimental stage - which experiments have been conducted so far, and preliminary results if available.
Which research direction do you plan on taking beyond the shared mandatory part? Do you plan to work on the challenge?
Provide non-exhaustive time planning for the weeks leading up to the submission of the report for every member of the team.
2. Final written report
Complete all sections of the report template and submit the final PDF version through Brightspace.
Length: between 5 and 8 pages (not counting references and appendices).
You don’t need to fill in the maximum number of pages to get a good grade. Substance, clarity and conciseness are more important than length. We encourage you to follow the instructions provided in the template, and to focus your writing on clearly discussing/motivating your choices and commenting on results, following the format of a typical research report.
3. GitHub repository
Create a GitHub repository with all relevant data & code for the project. Add a link to the repository in the written report.
Students are encouraged to collaborate using GitHub to version their code, but it is not mandatory (i.e. you can simply upload everything at the end if you prefer). Important: The repository needs to be public by the time the written report is submitted.
Data:
Any newly created datasets (including new annotations of existing datasets) should also be hosted on the repository (unless they are very large and easy to regenerate with a script).
For existing datasets, only provide links to the original sources.
Include output files for all numbers reported in the report (e.g. output logs from running evaluation scripts).
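One way to keep reported numbers traceable is to have the evaluation script write its metrics to a file that is committed to the repository. The snippet below is a minimal sketch of that idea. The function name save_results, the metric names and the file path are illustrative assumptions, not a required format.

```python
import json
from pathlib import Path


def save_results(metrics, output_path):
    """Write a dictionary of evaluation metrics to a JSON file.

    The file layout is an assumption for illustration; any format works
    as long as every number in the report can be traced back to a file.
    """
    path = Path(output_path)
    # Create the output folder (e.g. "results/") if it does not exist yet.
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w", encoding="utf-8") as f:
        json.dump(metrics, f, indent=2, sort_keys=True)
    return path
```

A call such as save_results({"accuracy": 0.75}, "results/dev_baseline.json") at the end of an evaluation script would then produce a file you can point to from the report.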
Code:
Add all code for preprocessing, training, feature extraction, predicting, evaluating, analyzing, etc.
You can use Jupyter Notebooks for analysis and visualization, but the code needs to be modular, well-commented and well-structured. If you have no experience with documenting code, the Google Python Style Guide is a good starting point for docstrings and comments.
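As an illustration of the docstring layout described in the Google Python Style Guide, here is a small hypothetical evaluation helper. The function itself (accuracy) is only an example and not part of any provided project code.

```python
def accuracy(gold_labels, predicted_labels):
    """Compute the fraction of predictions that match the gold labels.

    Args:
        gold_labels: Sequence of reference labels.
        predicted_labels: Sequence of predicted labels, aligned with
            gold_labels.

    Returns:
        The accuracy as a float between 0.0 and 1.0.

    Raises:
        ValueError: If the sequences are empty or differ in length.
    """
    if len(gold_labels) != len(predicted_labels):
        raise ValueError("gold_labels and predicted_labels must have the same length")
    if len(gold_labels) == 0:
        raise ValueError("cannot compute accuracy on empty input")
    # Count the positions where prediction and gold label agree.
    correct = sum(g == p for g, p in zip(gold_labels, predicted_labels))
    return correct / len(gold_labels)
```

The docstring states what the function does, what it expects, what it returns and how it can fail, so a reader does not need to study the body to use it.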
GitHub README:
The most important part of your repository! It should tell us exactly how to use your code and reproduce your results step-by-step.
Include at least the following sections:
How to install all dependencies
How to preprocess the data so it is in the correct format
How to train the models on the data
How to use one of your already trained models to predict unseen data
How to evaluate the output
Example entry for installing dependencies: pip install -r requirements.txt
To check if everything works as it should: create a new Python virtual environment, install the dependencies and follow along with your own README file. You can ask another student group to test (help each other)!
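The clean-environment check can also be scripted. The sketch below uses Python's standard venv and subprocess modules. The helper names and the environment folder name are assumptions chosen for illustration, and requirements.txt is assumed to sit in the current directory.

```python
import os
import subprocess
import venv
from pathlib import Path


def venv_python(env_dir):
    """Return the path of the Python interpreter inside a virtual environment."""
    # Windows places executables in "Scripts", other platforms in "bin".
    bin_dir = "Scripts" if os.name == "nt" else "bin"
    exe = "python.exe" if os.name == "nt" else "python"
    return Path(env_dir) / bin_dir / exe


def install_command(env_dir, requirements="requirements.txt"):
    """Build the pip command that installs the dependencies into env_dir."""
    return [str(venv_python(env_dir)), "-m", "pip", "install", "-r", str(requirements)]


def create_fresh_env(env_dir, requirements="requirements.txt"):
    """Create a clean virtual environment and install the dependencies into it."""
    # clear=True wipes an existing folder so no stale packages remain.
    venv.create(env_dir, with_pip=True, clear=True)
    # check=True raises an error if the installation fails.
    subprocess.run(install_command(env_dir, requirements), check=True)
```

Calling create_fresh_env("readme_test_env") builds the environment. After that, run each README step with the interpreter returned by venv_python("readme_test_env") to confirm that nothing depends on packages installed elsewhere on your machine.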
4. Results presentation
Each team delivers an oral presentation at the Final Project Fair (April 11th, 2024 between 9:00 and 13:00):
Time: 10 minutes + 3 minutes for Q&A session. We expect everyone's active participation during all Q&A sessions.
Presenter(s): All team members must be present in person and deliver the part of the presentation that most closely matches their personal contribution to the project. The presentation will be followed by a Q&A session in which presenters will be asked to answer questions from the teachers and the general audience.
Format: Submit the PDF version of your presentation via Nestor by 16:00 the day before the presentation. We will download the files onto the PC on the morning of the presentation. See Brightspace for details on the naming of the file.
Style: Do not get lost in the details; 10 minutes is short! Use the time to get the main message across and to raise some interesting questions/discussion points.
Project Grading Sheet
[Subject to change. If needed, changes will be communicated in due time before the deadline]
Clarity (2): structure, coherence & language of the final writeup
Introduction (2): motivation & previous work, research questions
Method (3): description of models, soundness of experiments, choice of baselines
Results (3): overview, performance, discussion & analysis
Reproducibility (2): quality and comprehensiveness of GitHub code & README
Presentation (3): quality of the presentation, answers to questions from the audience