Observation Space: The Observation space for each agent is a 5×5 array that describes a list of 5 nearby vehicles by a set of features such as Position ( x, y) and Velocity (Vx, Vy) . For the purpose of Multi Agent Learning We concatenate the observation of the number of agents present into a tuple