Relationship between different movie ? How to define?
Based on "Rating History", assumption :
Using co-occurrence matrix
table 1 & table 2 : user rating movie history data <user : movie_id0 = score, movie_id1=score>
table 3 & table 4 : normalized co-occurrence matrix, normalize is becuase each movie has differnet rating feedback, need to balanced weight of contribution
table 5 : rating matrix per user
MapReduce task :
user0 : movie0=rating
user0 : movie1=rating
user10 : movie 5=rating
<user_id, movie0=rating, movie1=rating ......>
input : <user_id, movie0=rating, movie1=rating ......>
output : <movieA : movieB, count>
input : <movieA : movieB, count>
output : <movieB, movieA = count/total count>
input : user0 : movie0=rating data
output : <user_id, movie = rating>
input : rating matrix & co-occurrence matrix
output : rating matrix cell * co-occurrence matrix cell