Nasim

 Sabetpour

I am a Ph.D. candidate in Computer Science at Iowa State University of Science and Technology- College of Liberal Arts and Sciences. My research interests include the principles and applications of Data Mining, Machine Learning and Natural Language Processing. I'm currently focusing on truth discovery in crowdsourcing at Data and Knowledge Mining Lab under Dr. Qi Li supervision.

I have earned my M. S. degree in Artificial Intelligence from Azad University of Tehran, Science and Research branch; Admitted benefiting from the privilege of being top graduate. 

Email: nasim@iastate.edu 

Research Interests 

Education

Doctor of Philosophy in Computer Science at Iowa State University, Ames, IA, US        (Fall 2017 - Expected graduation: May 2022)

Master of Science in Artificial Intelligence - Computer Engineering at Azad University of Tehran-Science and Research Branch, Tehran, Iran

Bachelor of Science in Software Engineering - Computer Engineering at Azad University of Tehran-North Branch, Tehran, Iran

Publications

Teaching Assistant

Internships

In this research, we present machine learning solutions as targeting strategies for the programs based on available alumni data (Behavioral, Analytics, Events, Gift history, RSVPs, etc.) in almost ten institutions. The Regression, Gaussian Naive Bayes, Random Forest, and Support Vector Machine algorithms are used and evaluated. The test results show that having been trained with enough samples, all four algorithms can distinguish donors from rejectors well, and more interested donors are identified more often than others.

The results show that in a practical scenario where the models are properly used as the targeting strategy, nearly 85% of new donors and more than 90% of new interested donors can be acquired. The test results show that all used algorithms can distinguish promising donors from non-promising donors. -Those who never upgrade their pledge. The best model produces an overall accuracy of 97% in the test dataset. 

Finding a set of reference points which can be used by the meteorologists in their daily process to tune the demand forecasting model output has always been a challenge. This research presents a model that can hopefully lead us to the final goal of minimizing the differences between the actual demand and the nearest weather demand model output. Here, we construct a model to find highly similar weathers using historical weather data having attributes (Date, Temperature, Humidity, Humidity Index, Dew Point, Cloud Coverage, etc.), and it has the potential to be expanded to weather forecast model in the future. Here, we settle to use the Humidity, Temperature, Cloud Coverage and Dew Point as our model inputs.

This study focuses on providing a search algorithm that sifts through available historical data and selects the most similar historical dates based on multidimensional weather features. The K-Nearest Neighbors’ algorithm is based around the simple idea of predicting unknown values by matching them with the most similar known values. Then, to construct our “Almost KNN model” (a method which is very similar to K-Nearest Neighbors), we first construct the feature set from raw data, then compute the distance based on different similarity measures of Euclidean, Manhattan, and Cosine, and at the end pick the top k closest distances. In terms of time complexity, one challenge we encountered with Almost-KNN method is computational complexity. The amount of computation required for computing and sorting pairwise distance grows exponentially.

In order to provide an efficient and fast search time, we applied a K-Dimensional tree (KD_Tree) to partition our data into a more efficient structure for multidimensional searching. It is a kind of binary search tree to do space partitioning for organizing points in k-dimensional space. The results show that the combination of almost-KNN and KD tree can provide a computationally efficient algorithm for nearest weather day matching.

Awards


Volunteer activities

GitHub
LinkedIn
Link
Link
Link
Link