Lilongwe, Malawi
Malawi University of Science and Technology(2016-2020)
Programming
Python, SQL, C++, C#, Bash, HTML5, CSS
Operating Systems Windows, Unix/Linux, Mac os
Python Frameworks Scikit-Learn, TensorFlow
Technologies
Docker, VMware, AWS, Airflow, MLflow
Visualization Software
Streamlit, Tableau, Excel, Power BI, Redash, Superset
CI/CD
Travis CI, GitHub Actions, CML
Apache Tools
Kafka, Spark
Networks
WAN and LAN Protocols, Troubleshooting and Maintenance
Software Documentation
pdoc
About me
A data engineer and a Linux enthusiast with strong SQL and Python skills to work with data. I have experience in building data pipelines for ETL and ELT using Apache Kafka, Apache Spark and AWS.
Looking to use my Bachelor of Computer Systems and Security to build scalable data pipelines and data-related solutions at organizations. Self-driven, hardworking, fast learner and a good team player looking to make an impact using technology.
Work Experience
Write articles on Linux using wordpress
Optimize articles for SEO using Yoast SEO
Worked on Data Migration project
Assisted in turning PowerQuery transformations in Power BI to SQL queries in Microsoft SQL Server
Write 6 articles per week on Linux, Python, and GitHub using Markdown
Research on new Linux and Python topics
Editing Articles
Designing web applications UI using Bootstrap for clients
Designing web applications database using SqlDBM for clients
Using .NetCore (Microsoft framework) to develop the backend for client's web applications
Simulating system/network attacks to find potential vulnerabilities on client's network/systems
Check system reports to understand system state
Monitoring systems/networks to identify any abnormal activities
Installing and configuring anti-virus software for clients
Projects
Worked in a team of 7 people, to build a data pipeline that could generate Amharic audio file given Amharic text. The pipeline was built on AWS. Kakfa and Spark were used for scalability. Airflow was used as a scheduler to automate the data pipeline.
This project focused on cleaning raw twitter data and saving it to a csv file. The data in csv file was uploaded to MySQL database.
Worked on fixing bugs, writing unittest and integrating Travis CI to automatically run unittests once there is push to the GitHub repository.
Built a python package that fetches lidar data from USGS API and preprocess it using PDAL. Plotly was used to create a 3D visualization of the terrain from the data. The package went through UnitTest to make sure the code was functioning very well.
Analyzed the users' data of a Telecom company to identify business growth opportunities.
I used python to analyze the data and made business recommedations that would increase the company revenue.
Worked in a group of 7 people. All from different countries to build an Amharic(language in Ethiopia) speech to text model. Three architectures were used: simple Recurrent Neural Network, Convolutional Bidirectional Recurrent Neural Network and Residual Network. The Residual Network model performed better in making correct speech to text translation than the other two models.
A/B testing is a marketing strategy that puts two different versions of a website, advert, email, popup or landing page against each other to see which is most effective.
A/B testing for Smart AD used two methods: The classical way of A/B testing and A/B testing using machine learning. The algorithms that were used in A/B testing using machine learning are: Logistic Regression,DecisionTreesRegressor and XGBoost
Binaries are machine code for a computer to execute. Binary exploitation is the process of subverting a compiled application such that it violates some trust boundary in a way that is advantageous to you, the attacker.
This project used bash and python scripting to automate processes. GDB-peda was used for disassembling and objdump was used to identify functions/methods within the program.
A reverse shell is a type of session cyber attackers commonly use to open communication ports between their machines and the victim's. It is also one of the penetration testers' go-to methods.
The project was fully built using python to run on Linux machines. The reverse shell can run 'almost' any Linux command