Genet Shanko Dekebo
Addis Ababa, Ethiopia
University of GitHub (2020-2025)
Email: gdekebo2020@gmail.com
Tenser flow
Scikit-learn
Deep learning with python
ELT
Time series forecasting with python
Python
SQL
PostgreSQL
MySQL
Docker
DVC
MLflow
power BI
Tableau
Stream-lit
About me
Leading Machine Learning Engineer and Data Analyst with a Master’s degree in Computer Science, I possess extensive experience in data preprocessing, transformations, visualization, feature extraction, and machine learning modeling. I have participated in various project developments and have expertise in creating scalable end-to-end data pipelines. I am particularly passionate about working on projects in healthcare, education, and gender equity. My technical skills include proficiency in Python, PySpark, PostgreSQL, and Power BI, as well as a comprehensive understanding of various machine learning tools and algorithms.
Education
key courses
Deep learning
Machine learning
Design thinking ,
System thinking ,
Project management
MSc.(Computer Science)
key courses
Artificial Intelligence,
Natural Language Processing(NLP)
Image Processing
Algorithm Analysis and Design
Big Data
Research Method
key projects
Developing an offline and online multilingual mobile app to improve public awareness about COVID-19 to limit its Calamity Impact from 2019-2021
Data Engineering: text-to-speech data collection with Kafka, Airflow, and Spark
10 Academy (September 2022- November 2022)
Worked on real-world challenges in the domains of Data Engineering, ML, and Web 3 Engineering
Key Projects
Pharmaceutical Sales prediction, Natural Language Processing, Twitter Sentiment Analysis, and Causal Inference and ALgorand blockchain,
Work Experience
ACATECH Technology PLC (November /2021- 02/02/2023)
Responsibilities
data analysis and software development :using MySQL ,Python , power BI
working on translating business Requirement to operational dashboard.
Data analytics
Yeron Ride PLC (August 2019- September 2021)
Participating on the project of Yeron Ride as a data analyst and back-end developer .
Projects
IT professional survey Analysis
The project dashboard provides a comprehensive overview of the current status of IT professionals by analyzing several key factors. These factors include job preferences, job satisfaction in relation to work-life balance, contentment with current salary levels, and the perceived difficulty of tasks assigned by their employers. Additionally, the project delves into the programming language preferences of these professionals, offering insights into the technologies they favor. Data for this analysis was collected from a diverse group of 630 individuals worldwide, with an average age of 29 and a maximum age of 92. This broad demographic range allows for a nuanced understanding of the varying experiences and preferences within the IT profession.
The objective of this project was to design and build a robust, reliable, large-scale trading data pipeline, for both crypto and stock market trading, that can run various backtests and store various useful artifacts in a robust data warehouse system. As a development tools: Ariflow, Kafka, flask, React.js and MLflow.
Prompt-Engineering-In-context-learning-with-GPT-3-and-LLMs
The project focuses on Large Language Models (LLMs), which are pivotal in advancing natural language processing capabilities. For analysis purposes, two distinct types of datasets were utilized, allowing for a comprehensive examination of the models' performance and effectiveness. To facilitate development, several tools were employed, including NLTK for natural language processing tasks, and a machine learning framework specifically designed for prompt engineering. Python served as the primary programming language, while Streamlit was utilized to create an interactive dashboard that showcases various features and insights derived from the analysis. In addition, various feature extraction tools were integrated to enhance the data processing capabilities. Ultimately, the report provides valuable insights into fundamental concepts related to LLMs, offering a clearer understanding of their functionalities, applications, and the implications of their use in real-world scenarios.
Pharmaceutical Sales Prediction across multiple stores using Deep learning
The project involved a six-week collaboration with Rossmann Pharmaceuticals, focusing on sales predictions to enhance operational efficiency and inventory management. To achieve accurate forecasting, the project utilized four distinct store datasets, each providing valuable insights into sales trends and customer behavior. The analytical approach incorporated various tools and methodologies, including statistical modeling techniques to identify patterns within the data. Core data science libraries in Python, such as Pandas, NumPy, and Scikit-learn, were employed to facilitate data manipulation and analysis. Additionally, machine learning algorithms were implemented to develop predictive models, allowing for more precise sales forecasts. To streamline the development and deployment processes, MLOps practices were integrated, utilizing tools such as DVC (Data Version Control) for managing datasets and experiments, CML (Continuous Machine Learning) for automating the training and evaluation of models, and MLflow for tracking experiments and managing the machine learning lifecycle. This comprehensive approach not only aimed to improve sales predictions but also fostered a robust framework for ongoing model refinement and performance monitoring.