Can Large Audio Language Models Understand Audio Well?

SSEU-Bench: Speech, Scene and Events Understanding Benchmark for LALMs