10 Academy profile site

Emtinan Osman

Alexandria, Egypt

Scuola Internazionale Superiore di Studi Avanzati SISSA (2013-2017)

PhD (Theoretical Physics)

International Center for Theoretical Physics ICTP (2012-2013)

Postgraduate Diploma (HE physics)

University of Khartoum (2007-2012)

B.Sc (physics)

Email: emtinan.s.e.osman@gmail.com

Programming Languages

  • Python

  • SQL

  • C++

Data Engineering Tools

  • DBT

  • Apache Airflow

  • Apache Kafka

Automation Tools

  • Docker

  • CI/CD

  • DVC

  • MLflow

  • Unit Tesing

Frameworks

  • ETL

  • Flask

  • Streamlit


About me

I'm a data engineer in training, building and optimizing high speed, scalable and reliable data pipelines. I am also a scientist, with a PhD in theoretical physics and 8 years experience in research and problem solving. Proficient in python and SQL. I have experience in database management and using Apache Kafka, Airflow and Spark

Education

  • Scuola Internazionale Superiore di Studi Avanzati SISSA ( 2013-2017 )

PhD (Theoretical Physics)

  • Thesis Title: "On Conformal Field Theories in 4D"

  • Developed mathematical tools and advanced code to carry conformal bootstrap computations in 4 dimensions.

  • International Center for Theoretical Physics ICTP ( 2012-2013 )

Postgraduate Diploma (High Energy Physics)

  • Thesis Title: "On Charge Quantization"

  • Courses included: Quantum field theory, standard model, relativity and cosmology.

  • University of Khartoum ( 2007-2012 )

B.Sc (Physics)

  • Thesis Title: "Review of Electrical Methods of Rain Enhancement and Their Possible Application in Sudan"

  • Courses included: Statistical mechanics, quantum mechanics, mathematical methods, particle physics, principles of computer programming and renewable energy

Work Experience

  • Trainee

10 Academy B6 Training (Aug 2022- Present)

Data Engineering projects:

  • Text-to-speech data collection with Apache Kafka, Airflow, and Spark.

  • Data warehousing for city traffic department with MySQL, DBT, Airflow

  • Postdoc

University of Karlstad (Feb 2021- Jun 2021)

  • Project: Build ML model to identify areas with high conservation value within Sweden. Based on this work, I produced university course material on pandas for data analysis.

  • Postdoc

University of Uppsala (Oct 2017- Sep 2020)

  • Project: Exact Results in Gauge Theories. I provided computational building blocks and Mathematica codes to facilitate the analysis of physical systems at criticality (conformal field theory).

  • Advance Mathematica coding

Projects

Cryptocurrency trading engineering: A scalable back testing infrastructure and a reliable, large-scale trading data pipeline. A team effort to build a large-scale trading data pipeline, for both crypto and stock market trading that can run many backtests and store important artifact and results in a robust data warehouse system.


This was a team effort to build a data engineering pipeline that allows recording millions of Amharic and Swahili speakers reading digital texts in-app and on web platforms.
End-to-end ETL data pipeline that uses Apache Kafka, and Apache Airflow in order to receive user voice audio files, transform them and load them to a data warehouse that will later be used for text-to-speech conversion machine learning project