Entry: NTU RGB-D dataset
Definition

  1. Classifiers

  2. See also

  3. References


The NTU RGB-D (Nanyang Technological University's Red, Green, Blue and Depth information) dataset is a large dataset containing recordings of labelled human activities.[1] It consists of 56,880 action samples, each provided in four modalities: RGB videos, depth map sequences, 3D skeletal data, and infrared videos.

The dataset covers 60 labelled action classes: drink water, eat meal/snack, brushing teeth, brushing hair, drop, pickup, throw, sitting down, standing up (from sitting position), clapping, reading, writing, tear up paper, wear jacket, take off jacket, wear a shoe, take off a shoe, wear on glasses, take off glasses, put on a hat/cap, take off a hat/cap, cheer up, hand waving, kicking something, put something inside pocket / take out something from pocket, hopping (one foot jumping), jump up, make a phone call/answer phone, playing with phone/tablet, typing on a keyboard, pointing to something with finger, taking a selfie, check time (from watch), rub two hands together, nod head/bow, shake head, wipe face, salute, put the palms together, cross hands in front (say stop), sneeze/cough, staggering, falling, touch head (headache), touch chest (stomachache/heart pain), touch back (backache), touch neck (neckache), nausea or vomiting condition, use a fan (with hand or paper)/feeling warm, punching/slapping other person, kicking other person, pushing other person, pat on back of other person, point finger at the other person, hugging other person, giving something to other person, touch other person's pocket, handshaking, walking towards each other, and walking apart from each other.
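The sample structure described above can be sketched as a small record type. This is a minimal illustration, not the dataset's official loader: the class name `ActionSample`, its field names, and the idea of storing each modality as a file path are all assumptions, as is numbering the actions 1 to 60 in the order they are listed here. Only the two totals come from the text.

```python
from dataclasses import dataclass

NUM_ACTIONS = 60       # number of labelled actions stated in the text
NUM_SAMPLES = 56_880   # number of action samples stated in the text


@dataclass(frozen=True)
class ActionSample:
    """One action sample with its four modalities (hypothetical layout)."""
    action_id: int    # assumed 1-based index into the action list above
    rgb_video: str    # path to the RGB video (hypothetical field)
    depth_maps: str   # path to the depth map sequence (hypothetical field)
    skeleton: str     # path to the 3D skeletal data (hypothetical field)
    ir_video: str     # path to the infrared video (hypothetical field)

    def __post_init__(self):
        # Reject labels outside the 60 classes listed in the text.
        if not 1 <= self.action_id <= NUM_ACTIONS:
            raise ValueError(f"action_id must be in 1..{NUM_ACTIONS}")


def mean_samples_per_action() -> float:
    """Average number of samples per action class."""
    return NUM_SAMPLES / NUM_ACTIONS
```

With the totals above, `mean_samples_per_action()` evaluates to 948.0. This is only an average, and the text does not say whether the classes are evenly balanced.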

Classifiers

The following table lists some of the machine learning methods that have been evaluated on the dataset, with their reported accuracies, by type of classifier:

Type          | Paper                                                                              | Preprocessing / Description                       | Accuracy (%)
Deep Learning | Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points [2]   | Unconstrained attention mechanism over RGB stream | 86.6
Deep Learning | Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition [3]         | Arranging skeletal joints for tree-traversal      | 77.7
Deep Learning | Deep LSTM [1]                                                                      | None                                              | 67.3
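Because the table reports accuracy rather than error rate, the corresponding error rates can be derived as 100 minus the accuracy. The short sketch below does this for the three rows above. The variable and function names are illustrative, and the method names are shortened from the paper titles.

```python
# (method, accuracy in %) pairs copied from the table above
results = [
    ("Glimpse Clouds", 86.6),
    ("Spatio-Temporal LSTM with Trust Gates", 77.7),
    ("Deep LSTM", 67.3),
]


def error_rate(accuracy_pct: float) -> float:
    """Error rate in percent, rounded to one decimal like the table."""
    return round(100.0 - accuracy_pct, 1)


# Rank the methods from most to least accurate.
ranked = sorted(results, key=lambda row: row[1], reverse=True)

for name, acc in ranked:
    print(f"{name}: accuracy {acc:.1f}%, error {error_rate(acc):.1f}%")
```

The rounding step keeps floating-point noise (for example, 100 - 86.6 is not exactly 13.4 in binary arithmetic) out of the displayed values.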

See also

  • List of datasets for machine learning research

References

1. Shahroudy, Amir; Liu, Jun; Ng, Tian-Tsong; Wang, Gang (2016). "NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis". arXiv:1604.02808 [cs.CV].
2. Baradel, Fabien; Wolf, Christian; Mille, Julien; Taylor, Graham (2018). "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points". arXiv:1802.07898 [cs.CV].
3. Liu, Jun; Shahroudy, Amir; Xu, Dong; Wang, Gang (2016). "Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition". arXiv:1607.07043 [cs.CV].

Categories: Artificial intelligence; Datasets in computer vision
