This section comprises of my PYTHON, POWER BI, SQL, R, and TABLEAU PROJECTS
This section comprises of my PYTHON, POWER BI, SQL, R, and TABLEAU PROJECTS
This project was part of the Udacity Data Analyst Nanodegree program's data wrangling segment and is mostly focused on wrangling data from the WeRateDogs Twitter account using Python and documenting it in Jupyter Notebook. I gathered data in three (3) different file formats from three (3) different sources. I then analyzed using both visual and programmatic criteria. Then cleaned any issues discovered during the assessment step. Finally, I created various visualizations using Python's plotting libraries.
For this project, I began by reviewing the dataset, which was a soccer database provided by Kaggle, and brainstorming questions I could answer with it. Then I used pandas and NumPy to answer the questions that piqued my curiosity, and I wrote a report to share the results.
In this project, I cleaned and analyzed exit surveys from employees of the Department of Education, Training and Employment (DETE)}) and the Technical and Further Education (TAFE) body of the Queensland government in Australia.
In this project, I used data provided by a bike share system provider for many major cities in the United States, to uncover bike share usage patterns. Iwill compared the system usage between three large cities: Chicago, New York City, and Washington, DC. This is an interactive experience in which the user may ask questions regarding the dataset and receive answers. The experience is interactive since the answers to the questions changes based on the user's interaction!
I scraped the match results from the English Premier League for this project. I downloaded the data using a python library called requests, then parse it using beautiful soup to extract what I needed, and finally loaded everything into a pandas data frame so I can clean it up and prepare it for analysis.
In this project I worked with a dataset of used cars from eBay Kleinanzeigen, a classifieds section of the German eBay website. The aim of this project is to clean the data and analyse the included used car listings.
In this project, I compared two different types of posts from Hacker News and answered some questions, this is a popular site where technology related stories (or 'posts') are voted and commented upon. The two types of posts I explored began with either Ask HN or Show HN.
The aim in this project is to find mobile app profiles that are profitable for the App Store and Google Play markets. I'm to enable the team of developers make data-driven decisions with respect to the kind of apps they build.
I analyzed a real-world job posting dataset for DataSearch, a recruiting agency, to identify insights. I used Power Query to investigate and clean the data to determine which skills are most in demand for data scientists, analysts, and engineers. I utilized DAX to create meaningful visualizations of my findings, which I then combined using a dashboard.
I used a churn dataset from a Telecom provider called Databel where I was hired as a consultant, and my task was to analyze why customers were churning, or in other words, leaving Databel.
Better price here. Better service there. Best for small businesses here. Best for young urbanites there. But what do customers really want? Our client, a big telecom company needs to know.
In this project, I analyzed data from CIA World Factbook, a website that provides information on the history, people and society, government, economy, energy, geography, communications, transportation, military, and transnational issues.
In this project, I explored data from BusinessFinancing.co.uk on the world's oldest businesses: when they were founded, and which industries do they belong to? Like many business problems, the data I'll explore is contained in several different datasets. In order to understand the world's oldest businesses, I used joining techniques to merge the data. From there, I used manipulation tools such as grouping and filtering to answer questions about these historic businesses.
In this project, I analyzed international debt data collected by The World Bank. The dataset contains information about the amount of debt (in USD) owed by developing countries across several categories. I also used the data to answer to business-related questions
I worked as a junior data analyst in the marketing analyst team of Cyclistic, a bike-share company based in Chicago. According to the director of marketing, the company's future success is dependent on increasing the number of yearly subscribers. As a result, my team was curious about how casual riders and annual members used their Cyclistic bikes.
On December 31, 2019, the first international report of a pneumonia of unknown cause was made from Wuhan, China. This virus is now known as Coronavirus. As a result, we have seen societal estrangement and the deaths of many people. Several groups did not hesitate to provide several datasets permitting the performance of various types of analyses in order to comprehend this epidemic in the spirit of unity in the face of this unprecedented global calamity. Because I cannot always rely on the news and am a data analyst, it is logical for me to study these datasets on my own to answer questions.
The goal: I want to extract data from the top 30 movies between March and July 2020. I can extract various details, including a movie's title, description, actors, director, genre, runtime, and ratings. Then,I want to check whether ratings correlate to user votes. For instance, do the highest-rated movies also have the highest user vote scores?
I am a data analyst at a medical institute. I was assigned to assist in the development of a mobile app intended to guide lottery addicts through exercises that will let them better estimate their chances of winning. The hope is that this app will help them realize that buying too many tickets will do little to improve their chances of winning. The institute has a team of engineers that will build the app, but they need me to build the logic behind the app and calculate probabilities.
In this project, I analyzed data from the New York City school department to determine whether parent, teacher, and student views of various factors influence average school SAT scores.
I analyzed forest fire data for this project. This project's objectives were to practice making bar charts, box plots, scatter plots, and other data visualizations. Develop insight on when to employ various data visualization techniques. Investigate how to utilize data visualization to address data-related problems.
In this project, I want to extract solar data from New York City. Such data, for example, can help us estimate the most productive times of the year for solar panel placement.
In this project, l analyzed more recent movie ratings data to determine whether there has been any change in Fandango's rating system after Hickey's analysis.
In this project, I analyzed a dataset from a company to answer a business question they had. I explored the data, cleaned it and produced some valuable information for the book company in the form of a report.
In this project, I created visualizations to reveal insights from a three different datasets. I created data visualizations that told a story or highlight patterns in the datasets. There are 3 different data sets were.
Flight Delays and Cancellations
US Census Demographic Data
YouTube Data from the US