“NTU RGB-D dataset”的意思、由来-开放百科全书

词条

NTU RGB-D dataset

释义

Classifiers
See also
References

{{Multiple issues|{{Underlinked|date=March 2017}}{{Orphan|date=March 2017}}
}}

The NTU RGB-D (Nanyang Technological University's Red Blue Green and Depth information) dataset is a large dataset containing recordings of labeled human activities

.^[1] This dataset consists of 56,880 action samples containing 4 different modalities (RGB videos, depth map sequences, 3D skeletal data, infrared videos) of data for each sample.

The dataset consists of 60 labelled actions. Specifically, drink water, eat meal/snack, brushing teeth, brushing hair, drop, pickup, throw, sitting down, standing up (from sitting position), clapping, reading, writing, tear up paper, wear jacket, take off jacket, wear a shoe, take off a shoe, wear on glasses, take off glasses, put on a hat/cap, take off a hat/cap, cheer up, hand waving, kicking something, put something inside pocket / take out something from pocket, hopping (one foot jumping), jump up, make a phone call/answer phone, playing with phone/tablet, typing on a keyboard, pointing to something with finger, taking a selfie, check time (from watch), rub two hands together, nod head/bow, shake head, wipe face, salute, put the palms together, cross hands in front (say stop), sneeze/cough, staggering, falling, touch head (headache), touch chest (stomachache/heart pain), touch back (backache), touch neck (neckache), nausea or vomiting condition, use a fan (with hand or paper)/feeling warm, punching/slapping other person, kicking other person, pushing other person, pat on back of other person, point finger at the other person, hugging other person, giving something to other person, touch other person's pocket, handshaking, walking towards each other and walking apart from each other.

Classifiers

This is a table of some of the machine learning methods used on the database and their error rates, by type of classifier:

Type	Paper	Preprocessing Description	Accuracy (%)
Deep Learning	Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points ^[2]	Unconstraint attention mechanism over RGB stream	86.6
Deep Learning	Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition ^[3]	Arranging skeletal joints for tree-traversal	77.7
Deep Learning	Deep LSTM ^[1]	None	67.3

References

1. ^¹{{cite arXiv |last= |first= |author-link= |eprint=1604.02808 |title= NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis|class= cs.CV|date= |last1= Shahroudy|first1= Amir|last2= Liu|first2= Jun|last3= Ng|first3= Tian-Tsong|last4= Wang|first4= Gang|year= 2016}}
2. ^{{cite arXiv |last= |first= |author-link= |eprint=1802.07898 |title= Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points|class= cs.CV|date= |last1= Baradel|first1= Fabien|last2= Wolf|first2= Christian|last3= Mille|first3= Julien|last4= Taylor|first4= Graham|year= 2018}}
3. ^{{cite arXiv |last= |first= |author-link= |eprint=1607.07043 |title= Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition|class= cs.CV|date= |last1= Liu|first1= Jun|last2= Shahroudy|first2= Amir|last3= Xu|first3= Dong|last4= Wang|first4= Gang|year= 2016}}

2 : Artificial intelligence|Datasets in computer vision

随便看

开放百科全书收录14589846条英语、德语、日语等多语种百科知识，基本涵盖了大多数领域的百科知识，是一部内容自由、开放的电子版国际百科全书。

Classifiers

See also

References