Nahom Habtemichael
Data Engineer
Addis Ababa, Ethiopia
Addis Ababa Science and Technology University(2017-2021)
Email: nahomhabtemichael
Medium: @nahomhabtemichael
Python,
Java
C++
HTML 5
CSS
React/Next.js
Flask, Django
Docker
AWS
Airflow
MLflow
DVC
Unit Test, Pytest
Kafka
Airflow
Spark
DBT
SQL
Snowflake
Unix/Linux
Windows
Seaborn,Matplotlib, Plotly
Streamlit
Tableau
Redash
Ci/CD
CML
Github Actions
Jenkins, Travis CI
Scikit-Learn
Tensorflow
OpenCV
Linear Algebra
Applied Mathematics I
Calculus
Discrete Mathematics
About me
Nahom is a Data Engineer with a Software Engineering background who is proficient in Python, SQL, OOP, relational and non-relational databases, data cleaning, visualization & analysis. He has experience working on ETL and ELT data integration frameworks. He is also well versed in building end-to-end data pipelines using Kafka, Airflow and Spark.
Education
Relevant Courses taken:
Database Management and Relational Database
Object-oriented-programming (C++, Java)
Advanced Programming (Java)
Data Structures and Algorithms
Software Quality Assurance and Testing
Web design and development
Software Requirements Engineering
Software Design and Architecture
Systems Programming
Operating Systems
Probability and Statistics, Calculus I , Discrete Mathematics I, II
Work Experience
Built a data collection pipeline for Speech-to-Text processing using Kafka, Airflow, Spark, and AWS S3 bucket.
Built an ELT pipeline for Traffic Data warehouse collected from swarm UAVs (drones).
Developed Machine Learning Engineering pipelines.
Projects
We were tasked with building a scalable data warehouse for this project, which will hold the vehicle trajectory data that was gleaned from video captured by swarms of drones and stationary roadside cameras. Using DBT, we employ the Extract Load Transform (ELT) structure. The ELT framework assists analytic engineers in the city traffic department in setting up transformation workflows as needed, in contrast to Extract, Transform, and Load (ETL).
A data engineering pipeline that allows recording millions of Amharic and Swahili speakers reading digital texts in-app and web platforms. There are a number of large text corpora we will use We will design and build a robust, large scale, fault tolerant, highly available Kafka cluster that can be used to post a sentence and receive an audio file.
A user overview analysis project to aid an investor to decide in buying a telecom company. This project will help the investor understand if the chosen investment is profitable or not.