Tendency Reinforcement Learning