Teaching college level students how machine learning and AI work and how to buil models from intro to advanced topics.
Analyze foundational concepts in data science
Define data science
Examine data science related case studies
Define foundational concepts in data analysis
Examine components of data analysis pipelines
Outline the data science process
Demonstrate understanding of the reasoning for each step in the data analytics pipeline
Design and execute programs using a high-level language to solve basic data science problems
Utilize data types, operations, control structures and iterations
Utilize functions and follow scoping rules
Demonstrate use of a debugging method
Analyze key concepts in data science project management
Demonstrate an understanding of how a data science project progresses
Explain what advantages or disadvantages exist for popular project methodologies
Evaluate ethical issues in data science
Argue the difference between patents, copyrights, designs and trademarks and illustrate their use in the context of data science
Explain how laws and technology safeguard from cyberattacks
Describe the role of trade secrets in relation to data science
Discuss how organizations with international ties must consider privacy laws, regulations, and standards across countries in which they operate.
Compare and contrast individual privacy and security
Compare the needs of society to individual rights to privacy
Discuss potential concerns involving microtargeting and algorithmic fairness
Compare and contrast broad classes of learning approaches.
Identify inputs of various learning approaches
Identify outputs of various learning approaches
Describe ranges of problem types to which learning approaches can be applied
Analyze and report data collection needs
Document collecting high quality data for a particular purpose
Document the resources needed to carry out a particular investigation
Perform data analysis
Format/cleanse a dataset so that it can be better analyzed
Identify interesting information from a dataset that could be used to make better business decisions
Report data findings
Write a professional report based on a data analysis findings
Present findings orally
Examine data acquisition
Define data acquisition
Describe exploratory data analysis
Incorporate data acquisition
Discuss various storage structures and methods for data gathering
Write a web-scraper to gather data
Access a database in a program
Read data from a file in a program
Discuss how ethics play a role in data acquisitions
Evaluate methods of data delivery
Analyze types and uses of various data displays
Apply best practices in displaying data
Construct numerical summary of data
Create visual summaries of data
Write a program to construct a pipeline for data analysis
Perform data filtering
Apply data transformation
Perform data aggregation
Construct data visualizations
Model data
Outline the data exploration lifecycle
Demonstrate understanding of the data exploration lifecycle and its components
Explain the necessity of this lifecycle in producing quality results
Incorporate visual tools, techniques and strategies for data analysis
Use graphical tools for data exploration
Use appropriate techniques for data visualizations
Evaluate the interpretation of results
Describe the four types of data analytics: descriptive, diagnostic, predictive and prescriptive
Explain the role of statistical significance in data analytics
Compare and contrast reproducibility and repeatability
Examine ethical issues in data science
Identify ethical issues in data science
Discuss ethical behaviors of a data scientist
Report data findings
Write a professional report based on a data analysis findings
Present findings orally
DAT303 - In this final course in the certificate, students were introduced to advanced topics and completed a capstone project to demonstrate their learning.
The competencies for this course include:
Compare and contrast general machine learning concepts
Describe the advantages/disadvantages of each model class
Explain when a supervised leaning model should be used vs. an unsupervised one
Discuss problems related to algorithmic and data bias, as well as privacy and integrity of data
Evaluate unsupervised learning models and their concepts
Explain when and why a hierarchical clustering model is the appropriate tool for analyzing a dataset
Discuss when and why k-Means clustering is the appropriate tool for analyzing a dataset
Analyze neural networks and their concepts
Explain the advantages and disadvantages of using a neural network and when it is appropriate for making predictions
Demonstrate understanding of backward propagation and how it applies to neural networks
Demonstrate an understanding of supervised learning models and the advantages/disadvantages of each
Explain when and why a regression model is the appropriate tool for analyzing a dataset
Discuss when and why a Naïve Bayes model is the appropriate tool for analyzing a dataset
Describe when and why a decision trees model is the appropriate tool for analyzing a dataset
Build models to analyze datasets utilizing open source desktop machine learning tools
Construct an unsupervised learning model to analyze a dataset
Develop a supervised learning model to analyze a dataset
Create a neural network model to analyze a dataset
Explore topics in data sampling
Describe the different types of biases in data sampling
Demonstrate the danger of overfitting
Explain the purpose of training, validation and test datasets
Use k-fold cross validation to evaluate the performance of a model
Evaluate modeling results and interpret the meaning/value of the results
Define true/false positive/negative
Give examples of recall, precision and accuracy
Generate and use a ROC curve to evaluate prediction performance
Interpret model quality by applying performance metrics such as root mean squared error (RMSE), confusion matrices, gain charts and silhouette scores
Demonstrate an understanding of overfitting and underfitting and their causes
Construct machine learning models in Python to do data analysis on datasets
Design a project that utilizes an unsupervised machine learning model to analyze a dataset
Develop a project that utilizes a supervised machine learning model to analyze a dataset
Create an ensemble learning model
Build an ensemble learning model that utilizes multiple machine learning model techniques to analyze a dataset
Demonstrate an understanding of why and when you would most effectively utilize ensemble learning.
My role as instructor for these courses was to run the courses, grade submissions, and ensure students had the knowledge they needed to be successful data scientists. I was also responsible for justifying the certificate, developing the content, and setting up all courses in Blackboard.