559,460 Videos - 50 Types of Dynamic Gesture Recognition Data
The dataset was collected in indoor and outdoor scenes (natural scenery, street views, squares, etc.). The subjects are Chinese males and females, ranging in age from teenagers to seniors. The data covers multiple scenes, 50 types of dynamic gestures, 5 photographic angles, multiple light conditions, and different photographic distances. The metadata labels the collection scene, season, time, angle, device, light condition, video format, resolution, and fps (frames per second), as well as the race, gender, and age of the subject. The data can be used in applications such as smart homes, audio equipment, and on-board systems.
Multiple scenes, 50 types of dynamic gestures, 5 photographic angles, multiple light conditions, different photographic distances
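The listing names the attributes labeled in the metadata but does not publish a schema. As a rough sketch of how such per-video metadata might be represented and filtered, the Python below assumes a flat record per clip; the class name, field names, and sample values are all hypothetical and are not taken from the dataset.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GestureClipMeta:
    # Hypothetical field names: the listing only says these attributes are labeled.
    scene: str
    season: str
    time: str
    angle: str
    device: str
    light_condition: str
    video_format: str
    resolution: str
    fps: float
    race: str
    gender: str
    age: int


def select_clips(clips: List[GestureClipMeta], **criteria) -> List[GestureClipMeta]:
    """Return the clips whose metadata equals every given field=value pair."""
    return [
        clip
        for clip in clips
        if all(getattr(clip, field) == value for field, value in criteria.items())
    ]


# Made-up records for illustration only; not real rows from the dataset.
example_clips = [
    GestureClipMeta("indoor", "summer", "day", "front", "cellphone",
                    "normal", "mp4", "1920x1080", 30.0, "Asian", "female", 24),
    GestureClipMeta("outdoor", "winter", "night", "side", "camera",
                    "dark", "mp4", "1280x720", 25.0, "Asian", "male", 61),
]

indoor_clips = select_clips(example_clips, scene="indoor")
```

Filtering on exact field values keeps the sketch simple; a real loader would need to follow whatever format the delivered metadata files actually use.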
40,002 Images – OCR Data of Internet Image
The images were collected from subtitles, advertisements, cellphone screenshots, comics, emoticons, posters, magazine covers, etc. The text is in Chinese, with a small amount of English. The internet images are annotated with line-level rectangular bounding boxes and text transcriptions; a small amount of the data uses column-level quadrilateral bounding boxes with transcriptions. The dataset can be used for OCR tasks on internet images.
OCR, multiple types of internet images
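The description mentions two annotation shapes, rectangles and quadrilaterals, each paired with a transcription. One way to handle both uniformly, sketched below in Python, is to store every region as four corner points and derive an axis-aligned rectangle when needed. The `TextRegion` type, the corner ordering, and the helper names are assumptions made for illustration; the listing does not specify the annotation file format.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class TextRegion:
    # Hypothetical representation: four (x, y) corners plus the transcribed text.
    points: List[Point]
    transcription: str


def rect_region(x_min: float, y_min: float, x_max: float, y_max: float,
                text: str) -> TextRegion:
    """Build a region from a rectangular box, corners clockwise from top-left."""
    corners = [(x_min, y_min), (x_max, y_min), (x_max, y_max), (x_min, y_max)]
    return TextRegion(corners, text)


def enclosing_rect(region: TextRegion) -> Tuple[float, float, float, float]:
    """Return the axis-aligned (x_min, y_min, x_max, y_max) box around a region."""
    xs = [x for x, _ in region.points]
    ys = [y for _, y in region.points]
    return (min(xs), min(ys), max(xs), max(ys))


# A horizontal text line as a rectangle, and a vertical column as a quadrilateral.
line_region = rect_region(10, 20, 110, 50, "example line")
column_region = TextRegion([(5, 0), (30, 4), (28, 90), (3, 86)], "example column")
```

Storing rectangles as four points means downstream code only has to handle one shape, at the cost of a little redundancy for the rectangular majority of the annotations.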