10 Academy profile site
Saba Agizew Woldeamanuel
Addis Ababa, Ethiopia
Addis Ababa University, Addis Ababa Institute of Technology (2011-2019)
BSc. (Civil Engineering)
Email: savaagizew2@gmail.com
Git, PostgreSQL, AWS
Python programming, pandas
SQL programming, MySQL
Data Engineering: ETL/ELT pipelines, Data Warehousing
Apache Airflow, dbt, Power BI
About me
A junior data engineer capable of working on end-to-end machine learning projects involving data preparation and processing, visual analytics, machine learning, deep learning model building, and model deployment. I structure and organize data for use in specific analytics applications, enabling businesses to make trustworthy decisions.
Education
Addis Ababa University, Addis Ababa Institute of Technology (2011-2019)
BSc. (Civil Engineering)
10 Academy (2021-present)
Data Science and Machine Learning Engineering
Work Experience
Data Entry
LonAdd Consultancy PLC, Addis Ababa, Ethiopia
LonAdd Consultancy PLC (http://lonadd.com/) entered a contract with Amhara Bank S.C to outsource data collection for one month, and I worked with the PLC as a data encoder.
I collected data, organized it into categories, and digitized each person's hard-copy records into soft copy.
Data Engineer
Green professional service P.L.C
Analyzed the dataset for missing values, duplicates, and outliers and handled them accordingly.
Created a new variable by multiplying two columns.
Created three data marts and established relationships between them using primary and foreign keys.
Loaded selected columns into the data warehouse for future use and visualized the three data marts.
Projects
Twitter Data Analysis
Analyzed my Twitter data for sentiment and built a Twitter sentiment analysis model. Loaded the data from a fork of 10xac/Twitter-Data-Analysis (github.com), extracted and tested it using Travis CI, and built a dashboard using MySQL.
Amharic Speech-to-Text Engine
Worked in a group of 7 to build an automatic speech recognition system for the Amharic language. Laid out the MLOps pipeline using CML and DVC, with GPU runners integrated into pull requests to allow easy training on an AWS server.
Data Engineering: Speech-to-Text Data Collection with Kafka, Airflow, and Spark
Worked in group 8 on speech-to-text data. The tool we built is meant to be deployed to post and receive text and audio files to and from a data lake, apply transformations in a distributed manner, and load the results into a warehouse in a format suitable for training a speech-to-text model.
SmartAd A/B Testing
Used A/B testing to check whether the ads the advertising company ran resulted in a significant lift in brand awareness. Comparing machine learning models with A/B testing gave me insight into which approach to use for which kind of problem.