BOSS: A Benchmark for Human Belief Prediction in Object-context Scenarios