Python, SQL, Java, C++,
C#, Bash, HTML, CSS, MATLAB
Applied Maths I, II & III
Probability and Statistics
Windows
Unix/Linux
Docker
AWS
MLflow
Tensorflow
GitHub
CI/CD
Travis CI
CML
DVC
MLflow
Kafka
Spark
Airflow
Streamlit
Tableau
Redash
About me
Dagmawi Yohannes is a diligent junior data engineer with a background in C++, Java, and Python and experience working with relational databases such as PostgreSQL. His expertise includes building pipelines with tools like Kafka and Airflow, and designing and managing SQL-based, high-performance, scalable, and reliable end-to-end data pipelines.
Educational Background
Computer Architecture and Organization
Object Oriented Programming
C++
Applied Mathematics I, II, III
Signal Processing
Network Analysis
Digital Systems Process
Probability and Random Process
Introduction to Computing
Computational Methods
B.A. in Business Management
Accounting I & II
Cost Accounting I & II
Financial Accounting
Financial Management
Statistics I & II
Principles of Marketing
Microeconomics and Macroeconomics
Operations Research
Research Methodologies
Management of Financial Institutions
Business Law
Data Engineering
Machine Learning
Deep Learning
Web3 Engineering
Work Experience
Dejen Technologies
Ethiopian Electric Utility
ATYM Engineering
Worked as a front-end engineer.
Projects
You work at Rossmann Pharmaceuticals as a Machine Learning Engineer. The finance team wants to forecast sales in all their stores across several cities six weeks ahead of time. Managers in individual stores rely on their years of experience as well as their personal judgment to forecast sales.
The data team identified factors such as promotions, competition, school and state holidays, seasonality, and locality as necessary for predicting the sales across the various stores. Your job is to build and serve an end-to-end product that delivers this prediction to analysts in the finance team.
You work at an AgriTech company that has a mix of domain experts, data scientists, and data engineers. As part of the data engineering team, you are tasked with producing an easy-to-use, reliable, and well-designed Python module that domain experts and data scientists can use to fetch, visualise, and transform publicly available satellite and LIDAR data. In particular, your code should interface with USGS 3DEP and fetch data using their API.
SmartAd provides an additional service called Brand Impact Optimiser (BIO), a lightweight questionnaire served with every campaign to determine the impact of the creative (the ad they design) on various upper-funnel metrics, including memorability and brand sentiment.
As a machine learning engineer at SmartAd, one of your tasks is to design a reliable hypothesis testing algorithm for the BIO service and to determine whether a recent advertising campaign resulted in a significant lift in brand awareness.
Your employer wants you to provide a report that analyses opportunities for growth and recommends whether TellCo is worth buying or selling. You will do this by analysing a telecommunication dataset that contains useful information about the customers and their activities on the network. You will deliver the insights you extract to your employer through an easy-to-use, web-based dashboard and a written report.