Software Development Internship
May 2024 - Present
Developed full-stack software for satellite calibration (GSICS), now used by ISRO for satellite data processing and analysis.
Implemented parallel computing, reducing processing time by 83% while efficiently handling over 5 TB of satellite data.
Developing a PyQt-based GUI for real-time and long-term satellite data analysis, enhancing workflow efficiency.
Built the software in collaboration with ISRO scientists; it is live at ISRO’s Data Center (MODSAC).
Frontend: Python, PyQt. Backend: Python, NumPy, pandas, multiprocessing, Matplotlib, SciPy
Software Development Internship
January 2024 - April 2024
Engineered an AI-powered search engine for the DRDO eLibrary, integrated with real-time autocomplete search suggestions.
Engineered an AI-driven search summary feature using OpenAI API, improving search relevance and user engagement.
Implemented a real-time autocomplete search suggestion feature using Elasticsearch, indexing over 300,000 library resources.
Frontend: HTML, CSS, AJAX, jQuery. Backend: Python, Flask, OpenAI API, REST API. Database & Search: Elasticsearch
Machine Learning Internship
June 2023 - October 2023
Developed a language-identification ML model for 3 low-resource Indian languages (Bhojpuri, Maithili, Magahi) and used it to automate parallel-corpora formation.
Implemented a Random Forest classifier that achieved 99.61% accuracy, a 2% improvement over the previous model.
Automated parallel-corpora cleaning using Python scripts and the language-identification model, cutting the time required by 90%.
Tech Stack: Python, Random Forest Classifier, scikit-learn, pandas, NumPy, SciPy
In preparation: Aryan Joshi, Anil Kumar Singh, Ajit Singh. "Bridging the Linguistics Gap: Forming Parallel Corpora for Low Resource Indian Languages using Language-Identification Model."
I am co-authoring a research paper titled 'Bridging the Linguistics Gap: Forming Parallel Corpora,' which advances language-identification models for low-resource Indian languages. The paper is being written in LaTeX as a team effort and draws on Natural Language Processing, Machine Learning, and Computational Linguistics. Current work focuses on finalizing the content and preparing the paper for submission.