20,000 Image caption data of human face includes multiple races under the age of 18, 1845 years old, 4660 years old, and over 60 years old; the collection scene is rich, including indoor scenes and outdoor scenes; the image content is rich, including wearing masks, glasses, wearing headphones, facial expressions, gestures, and adversarial examples. The language of the text description is English, which mainly describes the race, gender, age, shooting angle, lighting and diversity content, etc.
For more details, please refer to the link: https://www.nexdata.ai/datasets/llm/1286?source=Github
10,100 images
Asian, Caucasian, Black, Brown
male, female
under 18 years old, 1845 years old, 4660 years old, over 60 years old
including indoor scenes and outdoor scenes
different age groups, different collection environments, and different seasons
including wearing masks, adversarial samples, expression data, wearing glasses, wearing headphones, and multiple gestures
image format is .jpg, text format is .txt
English, Chinese
in principle, 30~60 words, usually 3-5 sentences
race, gender, age, shooting angle, lighting, diversity content
the proportion of correctly labeled images is not less than 97%
Commercial License