Reward Optimizing Recommender Systems using Deep Learning and Fast Maximum Inner Product Search