Since ground-truth human activities are not provided, our first step is to infer reasonable activity labels from plug-level power consumption data. The plug data for 2012-06-03 is shown below; the power consumption of 12 appliances is plotted in red against time from 0:00 to 24:00.
Our first goal is to find appliances whose power draw can represent human activities. Thresholds are used to decide the on/off states of each activity. For example, if the Dishwasher/Kettle power consumption exceeds 100 W, we say that kitchen work is ongoing.
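The thresholding step above can be sketched as follows; this is a minimal illustration, and the 100 W threshold is taken from the Dishwasher/Kettle example rather than from any shared project code:

```python
import numpy as np

def label_activity(power, threshold=100.0):
    """Label each reading as active (1) when plug-level power
    exceeds the threshold, else inactive (0). The 100 W default
    follows the Dishwasher/Kettle kitchen-work example."""
    power = np.asarray(power, dtype=float)
    return (power > threshold).astype(int)

# Example: a few plug readings for one appliance, in watts
readings = [2.0, 5.0, 850.0, 1200.0, 30.0]
print(label_activity(readings))  # -> [0 0 1 1 0]
```

In practice each appliance would get its own threshold, tuned by inspecting its daily power trace.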
In our project, we split the data set into a training set, a validation set, and a test set. The training and validation sets are used during training; once training is finished, we run the classifier against the test set to verify that its accuracy is sufficient. Specifically, we split the data so that 80% forms the training and validation set and 20% the test set. On the training and validation set we perform 10-fold cross-validation and observe statistics including the accuracy scores across the 10 folds and the feature importances. On the test set we apply the trained classifier and report the F1-score, confusion matrix, and accuracy score.
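The split-and-validate procedure can be sketched with scikit-learn; the synthetic data, random seeds, and default hyperparameters here are placeholders, not the project's actual pipeline:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score, train_test_split

# Synthetic stand-in for the extracted feature matrix and activity labels
X, y = make_classification(n_samples=1000, n_features=10, random_state=0)

# 80% for training + validation, 20% held out as the test set
X_trval, X_test, y_trval, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

clf = RandomForestClassifier(random_state=0)

# 10-fold cross-validation on the training/validation portion
scores = cross_val_score(clf, X_trval, y_trval, cv=10)
print(scores.mean(), scores.var())

# Fit on all training/validation data, then inspect feature importances
clf.fit(X_trval, y_trval)
print(clf.feature_importances_)
```

The held-out test set is only touched once, after cross-validation is complete.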
As in human occupancy prediction, we use only smart-meter power consumption, here to predict human activities; however, we evaluate results for two different sets of features.
In the "Using smart meters power consumption to predict human occupancy" section on the Human Occupancy Prediction page, we extracted the min, max, range, mean, standard deviation, correlation, and on/off statistics for each of the four power types, as well as sad12, sad13, and sad23 across the power phases. We reuse the same features here for comparison.
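The per-window summary statistics can be computed as below; this is a simplified sketch for a single power series, and the 10 W on/off threshold is an assumed value for illustration:

```python
import numpy as np

def window_stats(window, on_threshold=10.0):
    """Summary statistics for one time window of power readings:
    min, max, range, mean, standard deviation, and the fraction of
    samples in the 'on' state (threshold is an assumed example)."""
    w = np.asarray(window, dtype=float)
    return {
        "min": w.min(),
        "max": w.max(),
        "range": w.max() - w.min(),
        "mean": w.mean(),
        "std": w.std(),
        "onoff": float((w > on_threshold).mean()),
    }

feats = window_stats([0.0, 5.0, 120.0, 130.0, 4.0])
print(feats)
```

The full feature vector concatenates these statistics over the four power types, plus the cross-phase correlation and sad features.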
We scan through the data with a 15-minute time window, look up the power consumption of each sample, increment the count of the corresponding power bucket by 1, and use the bucket counts as features for human activity prediction.
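The bucketing step amounts to a histogram over each window; a minimal sketch, with the 0–2000 W range and 40 buckets taken from the example that follows:

```python
import numpy as np

def power_buckets(window, max_power=2000.0, n_buckets=40):
    """Count the readings of one 15-min window into equal-width
    power buckets; the counts are the feature vector.
    2000 W / 40 buckets = 50 W per bucket."""
    w = np.asarray(window, dtype=float)
    counts, _ = np.histogram(w, bins=n_buckets, range=(0.0, max_power))
    return counts

# 10 W and 45 W fall in bucket 0 (0-50 W), 60 W in bucket 1,
# 1500 W in bucket 30 (1500-1550 W)
features = power_buckets([10.0, 45.0, 60.0, 1500.0])
print(features[0], features[1], features[30])  # -> 2 1 1
```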
Here is an example. For the 2012-06-06 smart meter or plug data, the power consumption ranges from 0 to 2000 W and the time window is 15 minutes. We construct 40 power buckets, each covering 50 W. The bar charts below show the power distributions of six combinations of states: No activities, Dishwasher/Kettle+Laptop, Laptop, Entertainment+Laptop, Entertainment+Lamp+Laptop, and Lamp. From the bar charts we can see that power consumption tends to have a lower mean and standard deviation when the lamp is working, but a higher mean and standard deviation when the dishwasher/kettle is involved. Thus, with a proper classification method trained on these features, we expect to achieve good classification results.
Also, as in human occupancy prediction, to make sure the classifier does not overfit the data, we divide the data into training, validation, and test sets.
We have more than 23,000 samples this time. First we use 80% of the data to train the model and validate it with 10-fold cross-validation. Then we test the model on the remaining 20% of the data, the test set.
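The held-out evaluation that produces the F1-score, confusion matrix, and accuracy can be sketched as below, again on synthetic stand-in data with assumed seeds and defaults:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import (accuracy_score, classification_report,
                             confusion_matrix)
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the ~23,000 feature/label rows
X, y = make_classification(n_samples=2000, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2,
                                          random_state=0)

clf = RandomForestClassifier(random_state=0).fit(X_tr, y_tr)
y_pred = clf.predict(X_te)

print(accuracy_score(y_te, y_pred))
print(confusion_matrix(y_te, y_pred))
print(classification_report(y_te, y_pred))  # includes per-class F1
```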
The figure below shows the 10-fold cross-validation results for MLP, KNN, SVM, and Random Forest. Random Forest again gives the best prediction performance, with 85% average accuracy and a variance of 0.0006; KNN is the runner-up with 82% average accuracy. MLP achieves only 66% average accuracy. Although SVM reaches 63% average accuracy, the figure shows that it gives almost entirely incorrect results on one validation fold, which is unacceptable.
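Comparing the four classifiers under the same 10-fold protocol can be sketched as follows; the synthetic data and hyperparameters are placeholders, so the printed accuracies will not match the figures reported above:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, n_features=10, random_state=0)

classifiers = {
    "MLP": MLPClassifier(max_iter=1000, random_state=0),
    "KNN": KNeighborsClassifier(),
    "SVM": SVC(),
    "Random Forest": RandomForestClassifier(random_state=0),
}

# Report mean accuracy and variance across the 10 folds
for name, clf in classifiers.items():
    scores = cross_val_score(clf, X, y, cv=10)
    print(f"{name}: mean={scores.mean():.3f} var={scores.var():.4f}")
```

Inspecting the per-fold scores, not just the mean, is what exposes pathologies like the SVM fold noted above.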
We use a similar procedure to train and test the four classifiers for predicting human activities. Compared with the features used in the previous method, the features here relate more directly to human activities. We generate the power distribution features by slicing the power consumption range into 80 power slots so that the consumption of different appliances falls into different slots. This method gives prediction performance similar to the previous one.
We can see that Random Forest again gives the best prediction performance, with 80% average accuracy and a variance of 0.0006; KNN is the runner-up with 78% average accuracy. MLP and SVM both improve: MLP reaches 74% average accuracy and SVM 71%. Importantly, SVM no longer shows the problem observed with the previous feature set.
The two methods differ in how the features are obtained. The first uses the absolute value and variability of power consumption as features, while the second looks inside the power consumption and uses its distribution. The results show that the two methods yield similar prediction accuracy; in detail, the first method attains a higher best accuracy, but its predictions are less stable than those of the second.