Source Product Images and Lifestyle Images from the provided set of Brand URLs. You will be assigned URLs each week and should aim to annotate all relevant products before moving to the next brand.
Do use: only retail brand URLs (e.g. Zara, Bose)
Do NOT use: Marketplace URLs (e.g. Amazon, Alibaba)
Can source the following combinations:
Product Image and Lifestyle Image (see definitions in point 3)
No product images and 2+ Lifestyle Images [NOTE: the 1st lifestyle image is annotated as a product image in this case]
The following combination is invalid (skip the product in this case):
2+ Product Images and No Lifestyle Images
From the Product URL, save the following types of images and mark the type (need to delineate between the two types of image):
The product image should be isolated
Color
Featuring either a transparent or solid color background
A neutral or plain background
Objects in the image
The image never include humans. Objects in an image could come in pairs/triplets/etc.
Object could be located not in focal point of an image
There should be no text or graphics on the image (Text or graphics printed on the product is OK)
No other artefacts present on image such as stripes, compression artefacts, marks, highlights
The image should not have an extreme aspect ratio (>4)
It should not depict the product in a real life environment or setting. It is merely an image designed to show the user a given product in a high-level of detail from a given angle, or series of angles through multiple product images
The whole product should be shown
Packaging
If the product has packaging, then show the product, not the packaging
Image Quality and Format
The image should be a minimum resolution of 768px
The image should be JPG, PNG
Max file size of 10MB
Surface textures are captured clearly
Lighting and Colors Requirements
The lighting of an image shouldn’t be too dark or bright that ends up distorting or hiding the original product’s details
Ensure the color of the object is accurate
Setting
An image that depicts the product in use within a real-life setting, showcasing its practical application and context.
This type of image helps to convey how the product fits into the daily life of the consumer, often featuring human interaction or a relatable environment.
Objects in the image
There should be no text or graphics on the image (Text or graphics printed on the product is OK)
Object could be located not in focal point of an image
Photo can feature multiple instances of a product
No other artefacts present on image such as stripes, compression artefacts, marks, highlights
The image should not have an extreme aspect ratio (>4)
The whole product should be shown
Extra additions to the image that wouldn’t be natural
Image Quality and Format
Minimum of 500x500px
JPG or PNG
Lighting and Color Requirements
The lighting of an image shouldn’t be too dark or bright that ends up distorting or hiding the original product’s details
Ensure the color of the object is accurate
List of product classifications:
Electronics
Home & Living
Beauty & Personal Care
Health & Fitness
Clothing Accessories
Fashion (Women's)
Fashion (Men's)
Fashion (Kids')
Toys & Games
Automotive
Books, Music & Movies
Office Supplies
Pet Supplies
Baby & Maternity
Tools & Home Improvement
Travel & Luggage
Arts, Crafts & Sewing
Musical Instruments
Event & Party Supplies
Industrial & Scientific
Collectibles & Memorabilia
Jewelry
Automobiles
Food and Beverages
Write two captions to describe the Product in the Product Image.
For the Short Description:
Emphasize simplicity.
Focus only on the key elements. What this object is, if something that comes in different shapes (i.e. perfume bottle), function, material, color
20-25 words maximum, but better to keep it 5-15 words
Avoid using too many details for the short description and focus only on the key attributes.
Example: A cordless vacuum with a light blue base and silver and gold handle
For the Long Description:
Include all the specific details: the color of the fan, the number of blades, the material of the blades, brand/manufacturer and any additional features such as the industrial cage around the lights.
Do not describe it in a way to sell it, describe it in a way to help a machine learning model to understand what it looks like. Only include details that can be gathered from looking at the image, do not include details that would only be learned from the product description.
Any positioning should be based on your perspective looking at the image e.g. left, right
Example: A cordless Dyson vacuum cleaner with a light blue vacuum base and silver and gold handle
Short: A red and gray coffee maker with a glass carafe.
Long: A Coleman coffee maker featuring a bright red top and handle, paired with a sleek gray body that has a perforated metal side. The glass carafe is marked with measurements and sits within a black base. The design includes a clear glass carafe with a red handle, and the entire machine has an industrial, rugged aesthetic typical of outdoor appliances.
Short: A red manual air pump with a black hose.
Long: A Coleman Dual Action QuickPump featuring a bright red cylindrical body with a black T-shaped handle at the top. The pump has bold white branding and text on the front, and it is equipped with a flexible black hose coiled at the base, designed for inflating various items.
Label the Lifestyle Image as either “In the wild” OR “Plain background”.
In the wild example (see dropdown)
An image that has a natural background setting
Plain background example (see dropdown)
“Plain background” (an image against a neutral colored background)
Add one caption to describe the Lifestyle Image. Only one caption in detail is required.
Identify and describe the main objects and their positions within the image (e.g. centered, top right corner, slightly off center to the left)
a. Positions i.e. “left” and “right” are based on your perspective looking at the image
Describe any prominent shapes and colors of these objects. Describe any textures or patterns.
Details of how this product exists and interacts with the rest of the objects in this image
Describe the background in the image.
Avoid any description speaking to the feeling of the image (e.g. "adding a touch of life to the composition.") or any other commentary that doesn’t help re create the image. (You aren’t trying to sell the product, you’re trying to describe how it looks for a machine learning model)
Pay special attention to ensuring the product image that's now in the lifestyle image is described in a way it can be reproduced with sufficient detail.
A Coleman Dual Action QuickPump in bright red is being used to inflate a pale green air mattress with a cream colored topper. One person is visible from the torso down dressed in burgundy and black flannel pants and a white t-shirt positioned in the left of the image is holding the hose of the pump steady as it inflates the air mattress. Another person visible from the legs down is dressed in bright red and flannel plants and black socks is stepping on either side of the air pump and holding onto the handles with both hands. They are set on a beige colored carpeted floor.
A white cylindrical container labeled "1UP Nutrition Pre-Workout" prominently placed on the left, on top of a marble slab with natural gray veining. Next to it, a stemless, rounded clear glass containing a blue liquid, slightly off-center to the right. Adjacent to the glass, a small pile of bright blue, round candies rests directly on the marble surface. The setting features a powder blue background that graduates to white at the base, matching the container's color scheme.
A white induction cooktop with five burners, viewed from above, centered within a granite countertop. Surrounding the cooktop are assorted cooking items: a stainless steel bowl of vegetables and fruit to the left, a knife set and two cooking oils to the upper right, and a metal sauté pan with red sliced peppers to the lower right. The setup is on a well-lit wooden surface, creating a contrast with the neutral tones of the granite.
Two purple, oval shaped perfume bottles with gold labels and caps are positioned on a reflective, slightly off-white surface. The bottle on the left stands vertically, while the one on the right lies horizontally with the cap off, revealing the spray nozzle. The perfume bottles have a rectangular gold logo in the center and the words "Eudora" and "Eu De Parfum" written in gold underneath it. The background consists of evenly spaced, vertical light beige and white curtains.
Indicate what type of camera shot was used in the Lifestyle Image only. Provide an Other free form field.
Close-Up (CU): Focuses on the subject closely with little around the frame.
Medium Shot (MS): Frames the subject from the waist up.
Long Shot (LS): Captures the full body of the subject within the environment.
Extreme Close-Up (ECU): Zooms in on a small detail of the subject.
Wide Shot (WS): Emphasizes the surroundings with the subject in the frame.
Low Angle Shot: Filmed from below towards the subject.
High Angle Shot: Filmed from above to towards the subject.
If the product URL includes a video, upload it and provide a caption.
Uploading a video is only required if the video is 10 seconds or less.
1. Use video downloader (e.g. this Chrome Extension)
2. Enter the URL of the product page that contains the video
3. Download the mp4 file and add it to the data set
4. Provide a caption for the video
5. Check "Does the product page contain videos?" checkbox
For videos that are more than 10 seconds, please just mark the checkbox that there is a video available for the product.
Example video and caption
A young woman with shoulder length, bright red hair with bangs is wearing a pair of gray Bose over ear headphones. She is walking down the street in a green hoodie and is shown from multiple angles wearing the headphones, smiling, and looking around at her surroundings.
The preference is to use ONE (1) PRODUCT IMAGE and ONE (1) LIFESTYLE IMAGE. Only use 2 lifestyle images when a product image is not available.
2. Install the Grammarly Chrome Extension and always use it while annotating to easily avoid spelling and grammar mistakes
3. IMPORTANT: Only describe what is visible in the image. The goal of the caption is to train the AI model on what it is seeing in the image, not to sell it the product. Do NOT include descriptions such as:
1. 🚫 ”Adds a stylish touch to any living room” 🚫
2. 🚫 ”Making it perfect for day to night wear” 🚫
3. 🚫 ”Necessary addition to your wardrobe” 🚫