Projects

Object detection using tensorflow 2 and yolov5

4 weeks

Developing an AI-driven & Secure Online Marketplace

4 weeks

The objective of this study is to analyze the online market places available in Bangladesh e.g. daraz, bikroy.com, chandal, foodpanda, etc. to analyze the currently offered technologies, products, compare prices with physical stores, analyze user reviews and sentiments, and identify the shortcomings related to technology, trust, financial transactions, and business policy.

The Project Goals:

  • Collect data from online market places in Bangladesh related to product, technology in use, user-reviews, complaints, survey-reports etc.

  • Process the data following a systematic methodology, and do exploratory data analysis

  • Develop an AI-driven product and deal recommendation system.outline an AI-driven solution to improve the online marketplace eco-system in Bangladesh.

Covid 19 predictive analysis of severity illness

3 weeks

Four kind of analysis is performed in this big dataset and they are Descriptive and Diagnostic analysis,Predictive analysis and Prescriptive analysis. Descriptive and Diagnostic analysis is done to identify hidden insights from data; Predictive analysis is done to predict severity illness determination from machine learning algorithm and prescriptive analysis is done to show how this machine learning algorithm going to help identifying severity illness.

This dataset is imbalanced and has a lot of missing values. As we know working with missing value is challenging and thats why data cleaning and feature engineering is one of the core part of this project.

This is a classification problem and I selected Gradient Boosting Classifiers model for predicting severity illness.Gradient boosting classifiers are a group of machine learning algorithms that combine many weak learning models together to create a strong predictive model. Decision trees are usually used when doing gradient boosting. Gradient boosting models are becoming popular because of their effectiveness at classifying complex datasets.

To handle imbalanced data BorderlineSMOTE was used. I over sampled the minority class by BorderlineSMOTE and then again trained the model to see how it performs. Difference between severity illness prediction with imbalanced data and balanced data results were compared .

For prescriptive analysis LIME(Local Interpret-able Model-agnostic Explanations) was used to explain the model’s result. Prescriptive model describes how this model can be applied in the real life scenario.

Market analysis report for national clothing chain

3 weeks

In this project, I used population statistics from the US Census Bureau to determine and understand thingss that was our main objective of the projects. what i found in this projects are given below:

  • Correlation (R^2 value) between sales and income which is 0.78

  • Correlation (R^2 value) between customer ratings and product return rate which is 0.50

  • Prdicted the incomes of the customers from purchases and sales data

  • Which products should we advertise most for most profits

  • Which market should we target

Cross Filteration

Salaries in Different Buckets

Robo Race

4 weeks

Made an arduino based robot that was able to pass several obstacles to win the race. That was a great experience and became "runner up" of that competition.

A video of the robot functioning:

"https://www.dropbox.com/s/uwssdkk8cqnuk73/robot.mp4?dl=0 "

robot.mp4

Fashion MNIST -Machine Learning project

4 weeks

A neural network that can recognize images of articles of clothing. Fashion MNIST data was used. The MNIST dataset contains images of handwritten digits (0, 1, 2, etc) in an identical format to the articles of clothing I used here. The images are 28x28 arrays, with pixel values in the range [0,255].

Fashion MNIST Dataset

Plot the first X test images, their predicted label and the true label

SMTP correlation analysis to find out customer

interest

3 weeks

On behalf of my company, I performed many data science related projects which helped the company finding out the targeted customers and business stategies. This is one of the those projects related to email marketing.

SMTP Correlation

Email subject line analysis to find out subject

line effect on email marketing

3 weeks

On behalf of my company, I performed many data science related projects which helped the company finding out the targeted customers and business stategies. This is one of the those projects related to email marketing.

Email Subject line analysis

The Android App Market on Google Play

4 weeks

Mobile apps are everywhere. They are easy to create and can be lucrative. Because of these two factors, more and more apps are being developed. In this notebook, we will do a comprehensive analysis of the Android app market by comparing over ten thousand apps in Google Play across different categories. We'll look for insights in the data to devise strategies to drive growth and retention.

Size and price of an app


Sentiment analysis of user reviews


Relation between app category and app price


Do Left handed People Really Die Young

2 weeks

Barack Obama is left-handed. So are Bill Gates and Oprah Winfrey; so were Babe Ruth and Marie Curie. A 1991 study reported that left-handed people die on average nine years earlier than right-handed people. Nine years! Could this really be true?

Applying Bayes' rule

The probability of dying at a certain age given that you're left-handed is not equal to the probability of being left-handed given that you died at a certain age. This inequality is why we need Bayes' theorem, a statement about conditional probability which allows us to update our beliefs after seeing evidence.

We want to calculate the probability of dying at age A given that you're left-handed. Let's write this in shorthand as P(A | LH). We also want the same quantity for right-handers: P(A | RH).

Here's Bayes' theorem for the two events we care about: left-handedness (LH) and dying at age A.

P(A|LH)=P(LH|A)P(A)P(LH)

P(LH | A) is the probability that you are left-handed given that you died at age A. P(A) is the overall probability of dying at age A, and P(LH) is the overall probability of being left-handed. We will now calculate each of these three quantities, beginning with P(LH | A).

To calculate P(LH | A) for ages that might fall outside the original data, we will need to extrapolate the data to earlier and later years. Since the rates flatten out in the early 1900s and late 1900s, we'll use a few points at each end and take the mean to extrapolate the rates on each end. The number of points used for this is arbitrary, but we'll pick 10 since the data looks flat-ish until about 1910.

Final comments

We got a pretty big age gap between left-handed and right-handed people purely as a result of the changing rates of left-handedness in the population, which is good news for left-handers: you probably won't die young because of your sinisterness. The reported rates of left-handedness have increased from just 3% in the early 1900s to about 11% today, which means that older people are much more likely to be reported as right-handed than left-handed, and so looking at a sample of recently deceased people will have more old right-handers.

When do people normally die?


Plotting the distributions of conditional probabilities


The Hottest Topics in Machine Learning-NLP Project

2 weeks

Neural Information Processing Systems (NIPS) is one of the top machine learning conferences in the world where groundbreaking work is published. In this Project, I analyzed a large collection of NIPS research papers from the past decade to discover the latest trends in machine learning. The techniques used here to handle large amounts of data can be applied to other text datasets as well. Here i found out the top 10 machine learning topics from 1987 until 2017.

Plotting how machine learning has evolved over time

Plotting how machine learning has evolved over time


A word cloud to visualize the preprocessed text data


Analysing trends with LDA