Home Gestures and Actions recognition
Post
Cancel

Gestures and Actions recognition

How to Recognize Gestures and Actions

We use the MPII model to recognize human gestures and actions. The MPII model was developed using a specific CNN architecture called VGG. The structure of VGG is:



As shown in the image above:

  • The white boxes : Convolutional layers.
  • The red boxes : Pooling layers.
  • Other boxes : Detection layers. The output layer will represent the number of objects. That is, if we use imagenet, the number of output layer channels will be 1000.
    However, since we use MPII which detects human points, the number of output layer units will be 15. Like:



Implementation


This post is licensed under CC BY 4.0 by the author.