Hello, I am

SHREYA GAUTAM

Research Driven | ML/AI enthusiast | Exploring LLMs | Data Scientist

ABOUT

Hi there! I’m a final year Master’s student in Computer Science at Politecnico di Milano, Italy. I am engaged in projects around LLMs where my work spans the realms of Machine Learning, Natural Language Processing and Data Science. With hands-on experience building and experimenting with LLMs, I'm passionate about using data-driven methods to solve real-world problems.

I'm currently preparing my master’s thesis and actively exploring full-time opportunities and internships across Europe and globally, particularly in finance, R&D, and AI/ML-focused companies. My goal is to work at the intersection of intelligent systems and impactful industries—be it FinTech, Pharma, or AI research.

I bring a mix of academic depth, curiosity, and practical skill to the table—If you're looking for someone who’s motivated, adaptable, and ready to contribute from day one—let’s connect!

CONNECT WITH ME

PUBLICATIONS

SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

ACL SRW 2025

Counter Turing Test (CT2): AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index (ADI)

EMNLP 2024

FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering - ACL Anthology

ACL 2024

ML based IoT Framework for Diabetes Detection

Springer Wireless Personal Communications

EXPERIENCES

ACADEMIC RESEARCH

Politecnico di Milano

(Jan'24 - Aug'24)

• Evaluated the performance of ChatGPT-4, Gemini, LLaMA 3, and ChatGPT-3.5 in generating UML class diagrams from informal textual descriptions by developing optimized prompt engineering techniques that improved PlantUML code accuracy by 60%

• Quantified model performance using introduced metrics for subjectivity of class diagrams across 200 textual descriptions.

ACADEMIC RESEARCH

Artificial Intelligence Institute at University of South Carolina

(Sep'22 - July'23)

Published 3 research papers in ACL/EMNLP (including Outstanding Paper Award at EMNLP 2023):

• Deception Detection: Curated 266k+ dataset via GPT-3 prompt engineering; evaluated MPNet/ELECTRA/RoBERTa for mask infilling; developed visualization tools (heatmaps, matrices) Columbia, USA

• AI Text Detection: Pioneered perplexity/burstiness analysis framework for 1k texts; quantified detection thresholds for 8 LLMs (GPT, OPT, BERT, etc.)

• Fact Verification: Implemented AllenNLP for Semantic Role Labeling with > 95% accuracy on 390k+ Twitter, FakeNews datapoints

Research Intern

Birla Institute of Technology, Mesra

(Jan'23 - Jun'23)

• Conducted research on IoT enabled diabetes detection system using machine learning models, including EvalML, AutoML, and Artificial Neural Network algorithms.

• Implemented GridSearchCV, a hyperparameter tuning technique, to optimize the performance of the machine learning models. • Demonstrated the effectiveness of the Random Forest algorithm, achieving the highest accuracy in diabetes detection within the system.

MITACS Research Intern

Dalhousie University, Canada

(May'22 - Aug'22)

• Performed exploratory data analysis and applied ML models on CICIDS 2019; achieved 90% attack detection accuracy.

• Used PCA, correlation plots, and visualizations for interpretability and performance tuning.

Summer Research Training Internship

Birla Institute of Technology, Mesra

(Jan'22 - Apr'22)

• Performed Text Sentiment Analysis using Natural Language Processing and VADER which gives sentiment scores on some rules for the words used, achieving 95% accuracy in sentiment prediction.

• Utilized five Machine Learning Algorithms to predict the output of sentiment analysis results

• Generated WordCloud visualizations to observe the frequency of different words and created tables of the frequency of the top 5 words.

PROJECTS

Multi-Agent Financial Insights Platform

This project explores a modular multi-agent system for automated financial analysis. The system integrates five autonomous agents, including price forecasting, anomaly detection, sentiment analysis, peer benchmarking, and regulatory risk assessment, all coordinated through a central orchestrator and a coordinator agent.

Each agent performs domain-specific analysis, the orchestrator executes these agents in sequence, while the coordinator agent consolidates their outputs into a coherent, executive-level market brief.

A Streamlit dashboard presents forecasts, anomalies, sentiment, and peer comparisons in an interactive interface .
The result is a modular, end-to-end system that produces clear, actionable financial insights. [link]

Chatbot-RAG Instruct

As part of an academic NLP project, my team built an intelligent chatbot using the RAG-Instruct dataset.

This project aims to explore the full pipeline of modern NLP systems — from data preprocessing and analysis to training transformer-based models for answering diverse, information-rich queries. We applied techniques such as document clustering, Word2Vec embeddings and fine-tuning models and rerankers on question-document-answer triples. The result is a chatbot capable of understanding context-rich questions and generating relevant, informed answers by grounding responses in retrieved Wikipedia based documents. [link]

Predictive Models for Pest Incidence with Satellite and Weather Data Integration

This project aims to predict pest incidence in crops by integrating satellite imagery (Sentinel) and weather data. Collaborated with the NGO WOTR to process and clean 150,000+ datapoints, reducing noise and optimizing for binary classification (pest/no-pest). Achieved a 3% improvement in model accuracy by incorporating geospatial features, with XGBoost and Random Forest emerging as top-performing models. This project supported precision agriculture efforts in rural India by enabling early intervention and improving crop resilience.

LLM Powerplay: A Comparative Study of Large Language Models for UML Class Diagram Generation

In this paper, we conduct a comparative analysis of four leading large language models—GPT-4, GPT-3.5, Google Gemini, and Meta LLaMA—for automated UML class diagram generation from textual descriptions. Using a curated dataset and consistent evaluation metrics, we assess each model’s accuracy, clarity, and domain understanding. Our results reveal distinct strengths and weaknesses across the models, with ChatGPT 3.5 achieving the most reliable performance. This study offers practical guidance for leveraging LLMs in software engineering tasks. You can request to read the paper here.

Musafir (HackUPC, Barcelona)

At HackUPC 2024 in Barcelona, we developed Musafir, a Python library designed to enhance business travel for TravelPerk users. It simplifies logistics with features like fastest flight recommendations, tailored hotel stays, and networking opportunities. This project demonstrates skills in Python programming, API integration, data analysis, and user experience optimization. You can access our project here.

University Startup Challenge (Milan)

At the UPC Startup Challenge, we developed Griham, a platform designed to facilitate co-housing arrangements by connecting families, elderly individuals, couples, and singles with available space to those seeking shared accommodations. I created the entire website design from scratch using Figma. Griham offers affordable living options, fosters community interaction, and provides flexibility in housing arrangements. Tenants can reduce rent by contributing to household chores, creating a cooperative and shared responsibility environment. You can access the project here.

SKILLS

Programming Languages

Machine Learning / Deep Learning

Natural Language Processing

Tools

AWARDS

Grace Hopper Celebration India 2024 - Advancing Inclusion Scholarship

I was awarded the prestigious GHC India 2024 scholarship in recognition of my commitment to advancing diversity and inclusion in technology. Selected to attend Asia’s largest gathering of women in computing to engage with leading innovators and changemakers in tech.

EMNLP 2023 -- Oustanding Paper Award

As a Computer Science and Engineering graduate from BIT Mesra,India, I am honored to have received the Outstanding Paper Award at the EMNLP 2023 Conference in Singapore. My paper, titled "Counter Turing Test (CT2): AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index (ADI)" explores the complex challenges involved in detecting AI-generated text. You can access our award-winning paper here and find more information about the award here.

Google Sites

Report abuse