kARTIK THAKUR

SENIOR Data Engineer at LENDINGCLUB BANK

SKILLS

ACADEMIC BACKGROUND

Masters, COMPUTER SCIENCE

Aug 2018 - May 2020

Academic Standing: GPA 3.63

Relevant Coursework



Bachelors, ELECTRONICS AND COMMUNICATION

Aug 2011 - May 2015

Academic Standing: GPA 3.63


Relevant Coursework




Career Overview

SENIOR DATA ENGINEER, REMOTE USA

DEC 2021 - PRESENT
  • Designing and building data pipelines that integrate directly with LendingClub’s cross-functional teams, opening the door to new products and features.
  • Successfully migrated on-premises data infrastructure to the cloud, resulting in ~40% cost savings and improved scalability.
  • Spearheaded the development of a scalable data pipeline, handling 3 terabytes of data daily, reducing processing time by ~20%
  • Identified and implemented the internal process improvements- automating manual processes, optimizing data delivery, reducing Cloud cost, and re-designing infrastructure for greater scalability that reduced costs by ~30%.
  • Mentored offshore resources, fostering a culture of knowledge sharing and continuous improvement.


DATA PLATFORM ENGINEER, REMOTE USA

JUNE 2020 - DEC 2021
  • Collaborated with senior engineers in translating raw, technical data into actionable insights that influence major objectives.
  • Assisted in the Engineering of end-to-end data pipelines using Spark and Airflow, improving data processing efficiency by 40%.
  • Supported data lake on the Hadoop ecosystem for data analytics and capturing business insights from historical data.
  • Exposed and delivered aggregated/customer-centric data through reports, visualization products, and RESTful APIs.
  • Collaborated with the analytics team to design and implement data warehouses, enabling faster and more accurate reporting.

DATA ML Engineer, DARPA-JPL, Albany, New York

DEC 2018 - MARCH 2020

DATA ENGINEER, ACCENTURE/MICROSOFT, India

JUNE 2015 - JUNE 2018

Projects

GDELT Data Analysis: Data Pipelines, Big Data, EC2, S3, Amazon EMR, Spark


Obstacle detection for self-driven AI cars: (Q-Learning, Python, ANN, PyTorch) 


Food Images Recognition (Computer Vision, CNN, TensorFlow, Python, Food101 dataset


Email Structural Analysis (Textual Analysis, RNN, LSTM, Python, Quagga dataset


Twitter Sentiment Analysis: (PoS Tagging, Bag of words, Twitter textual data, SVM)


Categorial Spam Detection (TF-IDF, TensorFlow, Python, NLTK, Neural Networks) 


Live Scores Update Application (Sports Feed API, Java, JavaScript, Facebook API, Bootstrap)

 

Online Events/Movies Booking Application (Java, Spring boot, Thyme leaf, CSS, HTML)


Certifications

COMMUNITY INVOLVEMENT

Volunteer

Nov 2013 - Nov 2017

Volunteer

Jan 2014 - Jan 2018