Kennedy Wakura
Nairobi, Kenya
I am a junior data engineer passionate about building data-driven solutions and developing architectures for data generation and collection. I am experienced in working with different databases. I have also worked with Apache technologies such as Kafka, Spark and Airflow for building data pipelines.
Machine Learning, Data Engineering, and Web3 Engineering training
May 2022 - July 2022
Courses:
Programming in Python and SQL
Machine learning
Data Engineering
Web3 Engineering
Career Skills
Code deployment
Bachelor of Technology in Communication and Computer Networks
September 2017 - December 2022
Courses:
Programming in Java, Python
Web Development
Database design and administration
Statistics
Linear Algebra
Apache Kafka, Spark, Airflow
SQL Databases (MySQL, SQLite, PostgreSQL, Oracle, MS Access)
NoSQL Databases (MongoDB)
Graph Databases (Neo4j)
Dockerization
AWS (S3)
Git, GitHub, BitBucket
Model and Data Versioning with DVC and MLflow
CI/CD, CML, Travis CI
Python
Java
SQL
VB.NET
C#
Bash
JavaScript
PHP
ReactJS
Django
NodeJS
Laravel
ICT Intern - Mukuru Slums Development Projects
(March 2022 – Present)
Developing and maintaining a student management system
Maintaining IT resources in the organization
Developing data collection and analysis systems and pipelines
Training staff on ICT skills
Supporting staff on IT-related issues
The objective of this project was to predict sales across the stores owned by Rossmann Pharmaceuticals, using the Rossmann Pharmaceuticals dataset. I loaded the data with pandas and visualized it with matplotlib and seaborn. I then built a linear regression model to predict sales for each store, and designed an interactive Streamlit dashboard that serves these predictions.
In this project, I built an easy-to-use data pipeline that fetches AgriTech LiDAR data from the USGS 3DEP (United States Geological Survey 3D Elevation Program) API. The pipeline allowed the AgriTech company to extract, visualize, and transform the data for easy use. The company intended to use the data to model water flow across its farms.
Before investing in a business, it is essential to understand it thoroughly. In this project, I conducted user analytics to determine whether investing in Tellco was worthwhile. I was provided with a telecommunication dataset containing information about Tellco's customers and their activities on the network. I conducted a user overview analysis, a user engagement analysis, and a user satisfaction analysis, using pandas, matplotlib, seaborn, and NumPy. From the analysis, I concluded that Tellco was worth investing in, and I presented these results in a Streamlit dashboard and a report.