RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization