Legend:
Original Audio: Normal noisy input to the model, x = r ∗ (y + b)
Initial Model Output: Normal model behavior f(x), i.e., ŷ
Attacked Audio: Original audio plus a perturbation, x + δ
Attacked Model Output: f(x + δ)
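
As a rough illustration of the legend's notation, the sketch below builds the two audio signals from synthetic arrays. It assumes that ∗ denotes 1-D convolution, that y is a clean signal, b an additive noise term, and r an impulse response; the legend does not spell these out, so treat them as guesses. The function names `make_noisy_input` and `add_perturbation` are hypothetical, and the model f is left out entirely.

```python
import numpy as np


def make_noisy_input(y, b, r):
    """Return x = r * (y + b), reading * as 1-D convolution (an assumption).

    The result is truncated to len(y) so x lines up sample by sample with y.
    """
    return np.convolve(y + b, r, mode="full")[: len(y)]


def add_perturbation(x, delta):
    """Return the attacked audio x + delta."""
    return x + delta


# Synthetic stand-ins; none of these values come from the source.
rng = np.random.default_rng(0)
y = rng.standard_normal(16000)          # hypothetical clean signal
b = 0.1 * rng.standard_normal(16000)    # hypothetical additive noise
r = np.zeros(256)                       # hypothetical impulse response:
r[0], r[100] = 1.0, 0.3                 # direct path plus a single echo

x = make_noisy_input(y, b, r)               # "Original Audio"
delta = 0.01 * rng.standard_normal(x.shape)  # placeholder perturbation
x_adv = add_perturbation(x, delta)          # "Attacked Audio"
```

Here `delta` is just small random noise; an actual attack would presumably choose δ by some optimization against f, which this sketch does not attempt.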
[Figure: three examples, each with four panels: Original Audio, Initial Model Output, Attacked Audio, Attacked Model Output.]