Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations