Adversarial Attack and Defense for Commercial
Black-box Multilingual Speech Recognition Systems
2.1. Digital world attack (attack Aliyun API)
2.2. Digital world attack (attack Tencentyun API)
(1) Siri recognizes the sound as “播放音乐”, which means “play music” in English.
(2) Siri recognizes the sound as “打开QQ”, which means “open QQ” in English.
(3) Siri recognizes the sound as “打开QQ音乐”, which means “open QQ music” in English.
(4) Siri recognizes the sound as “我要听歌”, which means “I want to listen to songs” in English.
(5) Siri recognizes the sound as “支付宝付款”, which means “Alipay payment” in English.
(6) Siri recognizes the sound as “安装抖音”, which means “install TikTok” in English.
(7) Xiaoaitongxue recognizes the sound as “今天天气怎么样”, which means “what is the weather like today” in English.
(8) Xiaoaitongxue recognizes the sound as “把电视机打开”, which means “turn on the TV” in English.
(9) Xiaoaitongxue recognizes the sound as “打开浏览器”, which means “open browser” in English.
(10) Xiaoaitongxue recognizes the sound as “直接关机”, which means “shut down immediately” in English.
(11) Xiaoaitongxue recognizes the sound as “打开车灯”, which means “turn on the car lights” in English.
(12) Xiaoaitongxue recognizes the sound as “金融诈骗”, which means “financial fraud” in English.
(1) Siri recognizes the sound as “截屏”, which means “take a screenshot” in English.
(2) Siri recognizes the sound as “开始录音”, which means “start recording” in English.
(3) Siri recognizes the sound as “打开电视”, which means “turn on the TV” in English.
(4) Siri recognizes the sound as “安装快手”, which means “install Kuaishou” in English.
(5) Siri recognizes the sound as “打电话给小陈”, which means “call Xiao Chen” in English.