Journal of Internet Computing and Services
ISSN 2287-1136 (Online) / ISSN 1598-0170 (Print)
http://jics.or.kr/

Real-time Human Pose Estimation using RGB-D images and Deep Learning


Beanbonyka Rim, Nak-Jun Sung, Jun Ma, Yoo-Joo Choi, Min Hong, Journal of Internet Computing and Services, Vol. 21, No. 3, pp. 113-121, Jun. 2020
DOI: 10.7472/jksii.2020.21.3.113
Keywords: Human Pose Estimation, Human Skeleton Tracking, Keypoint Localization, Deep Learning

Abstract

Human Pose Estimation (HPE), which localizes the joints of the human body, has high potential for high-level applications in computer vision. The main challenges of real-time HPE are occlusion, illumination change, and the diversity of pose appearance. Feeding a single RGB image into an HPE framework reduces computation cost, because it needs only a depth-independent device such as a common camera, webcam, or phone camera. However, HPE based on a single RGB image cannot solve the above challenges because of the inherent characteristics of color and texture. Depth information, on the other hand, lets an HPE framework detect human body parts in 3D coordinates and can help solve these challenges. However, depth information-based HPE requires a depth-dependent device, which imposes space constraints and is costly. In particular, its results are less reliable because it requires pose initialization and its frame tracking is less stable. This paper therefore proposes a new HPE method that is robust to self-occlusion. Many body parts can be occluded by other body parts, but this paper focuses only on head self-occlusion. The new method combines an RGB image-based HPE framework with a depth information-based HPE framework. We evaluated the proposed method with the COCO Object Keypoint Similarity library. By taking advantage of both the RGB image-based and the depth information-based methods, our RGB-D HPE method achieved an mAP of 0.903 and an mAR of 0.938. These results indicate that our method outperforms both RGB-based HPE and depth-based HPE.
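
For readers unfamiliar with the evaluation metric, the sketch below illustrates how Object Keypoint Similarity (OKS) is commonly defined in the COCO keypoint evaluation: each labeled keypoint contributes exp(-d^2 / (2 * s^2 * k^2)), where d is the pixel distance between prediction and ground truth, s^2 is the object area, and k = 2 * sigma is a per-keypoint constant. It is not the authors' evaluation code. The function name, the three-keypoint pose, and the sigma values are hypothetical and chosen only for illustration.

```python
import math


def object_keypoint_similarity(pred, truth, visibility, area, sigmas):
    """OKS between one predicted pose and one ground-truth pose.

    pred, truth : lists of (x, y) keypoint coordinates in pixels
    visibility  : ground-truth flags per keypoint (0 means unlabeled)
    area        : ground-truth object area in pixels^2 (the s^2 term)
    sigmas      : per-keypoint falloff constants (k_i = 2 * sigma_i)
    """
    total = 0.0
    labeled = 0
    for (px, py), (tx, ty), v, sigma in zip(pred, truth, visibility, sigmas):
        if v <= 0:
            continue  # unlabeled keypoints do not count toward the score
        d2 = (px - tx) ** 2 + (py - ty) ** 2  # squared pixel distance
        k = 2.0 * sigma
        total += math.exp(-d2 / (2.0 * area * k * k))
        labeled += 1
    return total / labeled if labeled else 0.0


# Hypothetical three-keypoint pose; sigma values are illustrative only.
truth = [(100.0, 50.0), (90.0, 80.0), (110.0, 80.0)]
pred = [(103.0, 54.0), (90.0, 80.0), (400.0, 400.0)]
visibility = [2, 2, 0]  # third keypoint is unlabeled and therefore ignored
sigmas = [0.025, 0.025, 0.025]

oks = object_keypoint_similarity(pred, truth, visibility, 10000.0, sigmas)
print(round(oks, 4))
```

mAP and mAR are then obtained by counting a prediction as correct when its OKS exceeds a threshold and averaging precision and recall over a range of thresholds.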


Cite this article
[APA Style]
Beanbonyka Rim, Nak-Jun Sung, Jun Ma, Yoo-Joo Choi, & Min Hong (2020). Real-time Human Pose Estimation using RGB-D images and Deep Learning. Journal of Internet Computing and Services, 21(3), 113-121. DOI: 10.7472/jksii.2020.21.3.113.

[IEEE Style]
B. Rim, N. Sung, J. Ma, Y. Choi and M. Hong, "Real-time Human Pose Estimation using RGB-D images and Deep Learning," Journal of Internet Computing and Services, vol. 21, no. 3, pp. 113-121, 2020. DOI: 10.7472/jksii.2020.21.3.113.

[ACM Style]
Beanbonyka Rim, Nak-Jun Sung, Jun Ma, Yoo-Joo Choi, and Min Hong. 2020. Real-time Human Pose Estimation using RGB-D images and Deep Learning. Journal of Internet Computing and Services, 21, 3, (2020), 113-121. DOI: 10.7472/jksii.2020.21.3.113.