The Credit Score Classification dataset, available on Kaggle, contains information necessary to build and evaluate models for predicting credit scores. This dataset is valuable for financial institutions and data scientists interested in understanding and predicting creditworthiness based on various features.
The dataset consists of the following features:
ID: Unique identifier for each entry.
Customer_ID: Unique identifier for each customer.
Month: The month of record.
Name: The name of the customer.
Age: Age of the customer.
Gender: Gender of the customer.
Occupation: Occupation of the customer.
Annual_Income: Annual income of the customer.
Monthly_Inhand_Salary: Monthly salary in hand.
Num_Bank_Accounts: Number of bank accounts held by the customer.
Num_Credit_Card: Number of credit cards owned by the customer.
Interest_Rate: Interest rate applicable on credit card.
Num_of_Loan: Number of loans taken by the customer.
Type_of_Loan: Type of loan taken.
Delay_from_due_date: Average number of days delayed in payment.
Num_of_Delayed_Payment: Number of delayed payments.
Changed_Credit_Limit: Credit limit change indicator.
Num_Credit_Inquiries: Number of credit inquiries.
Credit_Mix: Credit mix indicator.
Outstanding_Debt: Total outstanding debt.
Credit_Utilization_Ratio: Ratio of credit utilization.
Credit_History_Age: Age of credit history.
Payment_of_Min_Amount: Minimum amount paid indicator.
Total_EMI_per_month: Total EMI (Equated Monthly Installment) per month.
Amount_invested_monthly: Amount invested monthly.
Payment_Behaviour: Payment behavior.
Monthly_Balance: Monthly balance.
Credit_Score: The target label indicating the credit score, categorized into three classes: Poor, Standard, and Good.
This dataset can be used for various machine learning tasks, including but not limited to:
· Credit scoring
· Predictive modeling
It provides a comprehensive set of features that can help in understanding the factors influencing credit scores and building robust models for credit score prediction.
The dataset is sourced from Kaggle and can be accessed via the following link. For the purposes of our project, the file "train.csv" was the only dataset we needed on that page.