Skip to content

Nexdata-AI/11000-Image-Video-caption-data-of-human-action

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

11000-Image-Video-caption-data-of-human-action

Description

20,000 Image & Video caption data of human action contains 20,000 images and 10,000 videos of various human behaviors in different seasons and different shooting angles, including indoor scenes and outdoor scenes. The description language is English, mainly describing the gender, age, clothing, behavior description and body movements of the characters.

For more details, please refer to the link: https://www.nexdata.ai/datasets/llm/1289?source=Github

Data size

10,000 images, 1,000 videos

Race distribution

Caucasian, black

Gender distribution

male, female

Age distribution

from teenagers to old age, mainly young and middle-aged

Collection environment

including indoor scenes and outdoor scenes

Collection diversity

different age groups, different collection environments, different seasons, various shooting angles, and various human behaviors

Data format

image format is .jpg, video format is .mp4, text format is .txt

Description language

English, Chinese

Text length

in principle, 30~60 words, usually 3-5 sentences

Main description conten

gender, age, clothing, behavior description, body movements

Accuracy rate

the proportion of correctly labeled images is not less than 97%

Licensing Information

Commercial License