Directed Exploration in Reinforcement Learning from Linear Temporal Logic