AlphaDog

the first attack that is no-box, and universal

Attacker needs no query, no access to decision, no information about AI model.
One single Attack Instance can attack a wide range of AI model!

1. AI Removal and Eye White/grey background adversarial examples: Biden or Obama? Cat or Dog? Speed limit 20mph or 75mph? Hand X-ray dislocated or not?

With/grey background it's Biden

With/grey background it's a cat

With/grey background it's 20mph

With/grey background it's a healthy hand

Removed Alpha it's Obama

Removed Alpha it's a dog

Removed Alpha it's 75mph

Removed Alpha it's dislocated hand

Advantages of ACA:

Strong No-box attack: No query!
Universal: No need to tailor attack image for a specific AI model!

Effortless: No need to reverse engineer anything about AI model!

ACA can attack:

1) online image classifiers, such as:

https://chat.openai.com/ (Chatgpt, Removal)

https://aws.amazon.com/rekognition/ (Amazon Rekognition api, Removal)

https://www.baidu.com/ (Baidu Image api, Removal)

https://portal.vision.cognitive.azure.com/demo/generic-object-detection (Microsoft Azure Vision, Removal)

https://imagerecognize.com/ (Removal)

https://www.imageidentify.com/ (Wolfram, Removal)

https://clarifai.com/ (Calrifai, Removal)

https://vision.aliyun.com/ (AliYun vision, Removal)

https://cloud.tencent.com/ (Tencent vision, Removal)

https://imagga.com/ (Imagga, Removal)

https://www.bard.com/ (Bard, Black Background)

https://www.google.com/ (Google Cloud Vision Api, White Background)

2) Local image classifiers, such as:

Yolo v3-v8 (Default Remove Alpha Channel),

Faster R-CNN (Default Remove Alpha Channel)

Alpha Channel Attack Demonstration:

We can attack most AI Model in a NO-BOX way!

Attack Image is colorful

Chatgpt sees 20mph as a 75mph

Attack Image is purely grayscale

Amazon Rekognition Vision sees Biden as Obama

(1) Attack Cloud-base online AI Model (chatgpt, baidu, Amazon Rekognition, Microsoft Azure Vision, Bard, Gemini Vision Pro, online Free Image Classifier, walfram, Clarifi, Imagga, LandingLens .etc):

1.Chatgpt reads 20mph as a 75mph

2.Chatgpt reads a cat as a dog

3.Chatgpt sees a healthy hand as dislocated hand

4.Amazon Rekognition reads Biden as Obama

5.Baidu reads a cat as a dog

6.Baidu reads a Biden as Obama

7.Online Classifier reads a cat as a dog

8.Online Classifier reads Biden as Obama

9. Bard reads city view as a dog

10.Wolfram reads a cat as a dog

11.Clarifi reads a cat as a dog

12.LandingLens sees cat as a cow (this image hides a cow in a cat)

13.Microsoft Azure Vision sees a cat as a dog

14.Tencent Vision sees a cat as a dog

15. AliYun vision sees cat as dog

16. Google Cloud Vision Api sees a dog as city view. Note that this must be done with browser in night mode with black background because Google Cloud Vision uses white background.

(2) Attack Local Image Recognition Model(Yolo, FasterR-CNN...):

YOLO v5 reads a Biden as Obama

YOLO recognize a cat as a dog

Faster R-CNN reads Biden as Obama

Faster R-CNN reads cat as dog

Weak no-box attack Query Image ------------>

it's transparent with RGB channel to be red. We use this image to query AI model to determine their Alpha blending strategy.

Here is the result:

2. AI/Eye white/black background adversarial example: (try it yourself!)

with white background : city view

with black background : dog

Drag to

Right!

---->

with white black background, it's city view.But with black background it's a dog. ( We cannot provide presidential candidate attack image in case of malicious usage)

Part of our 1000 dataset: 100 randomly generated Attack images.

Each image hides the image on left side of it,

for example

1st image: IEye=eagle, IAI=apple, 2nd image: IEye=dog, IAI=eagle

3rd image: IEye=flower, IAI=dog, 4th image: IEye=bird, IAI=flower