cSE 291 Project
Introduction
In this course, you will learn advanced topics in deep learning. Part of the learning will be through in-class lectures and take-home assignments, but you will really gain hands-on experience with deep learning in your course project. We would like you to choose wisely a project that fits your interests. One that would be both motivating and technically challenging.
Project Format
You will participate in the project in small groups, with a maximum of 3 people. The project contributes to 50% of the course credits:
5 % proposal
15 % milestone report
20 % final report
10 % final presentation
In the end, you will write an 8-page report about your project. The quality of this report should be close to a top machine learning conference workshop paper. See Hint for more details.
Project Area
Most students choose one of four types of projects:
Theoretical project. Analyze and prove some interesting properties of a deep learning model or a learning algorithm.
Algorithmic project. Develop a new learning algorithm, or a novel variant of an existing algorithm, for deep learning.
Modeling project. Develop a new deep neural network architecture, or a novel variant of an existing model.
Application project. Apply a model and a learning algorithm to solve a novel application of interests more efficiently.
Many fantastic projects come from students picking either an application that they’re interested in, or picking some subfield of deep learning that they want to explore more. So, pick something that you are passionate about! Be brave rather than timid, and do feel free to propose ambitious things that you’re excited about. ( Just be sure to ask us for help if you’re uncertain how to best get started.) Alternatively, if you’re already working on a research project that deep learning might apply to, then you may already have a great project idea.
Evaluation Criteria
Your project will be evaluated based on criteria that are similar to a research paper.
Novelty. Is this project applying a common technique to a well-studied problem, or is the problem or method relatively unexplored?
Significance. Did the authors choose an interesting or a “real” problem to work on, or only a small “toy” problem? Is this work likely to be useful and/or have an impact?
Technical Quality. Does the technical material make sense? Are the things tried reasonable? Are the proposed algorithms or applications clever and interesting? Do the authors convey novel insight about the problem and/or algorithms?
Presentation and Writing. Is the idea and the solution clearly conveyed? Are the figures and tables carefully crafted? Is the report well structured and well reasoned?
Hint: A very good project will be a publishable or nearly-publishable piece of work. Three of the main machine learning conferences are ICML, NeurIPS and ICLR. All papers from these machine learning conferences are available online.
You can browse some of the recent machine learning papers to get inspired.
Resources
Data: many benchmark datasets are freely available, e.g. MNIST, CIFAR-10, SQuAD. You can also search public datasets at:
Compute:
UC San Diego provides Data Science/Machine Learning platform for this course. You can log in with your Active Directory ID.
In addition, you can apply for student cloud computing credits at Google Cloud and Amazon AWS.
Example Projects
COVID-19 Forecasting
As COVID-19 continues to spread, accurately forecasting the number of newly infected, removed and death cases has become a crucial task in public health. This project aims to develop deep generative models for COVID-19 forecasting for four weeks ahead. The data is publically available at JHU CSSE repo. You can learn more about other forecasting models at CDC COVID-19 forecasting website.
Image Animation
Image animation translates the motion of a driving video to animate an object in a target image. This project aims to generate a realistic video to animate objects in a diverse set of target images. A popular method in image animation is First Order Motion Model. You can learn more about image animation in this NeurIPS paper.
Automatic Drug Design
Drug discovery finds target molecules with desired chemical properties. Current drug discovery and development pipelines are long, complex and depend on numerous factors. This project aims to speed up drug discovery by automatically generating molecular graphs. The common datasets include ChEMBL, ZINC. You can learn more about this problem and solutions from this ICML paper and this github repo.
Predicting Machine Learning Trends
Discovering trendy topics in machine learning can assist researchers in the field and prepare for the future. However, there does not exist an automatic tool that can make such predictions. This project aims to leverage deep generative models to generate trendy topics from a Arxiv machine learning paper abstracts. You can download the collection of paper abstracts from past few decades following this link.