Preprint Article, Version 1. This version is not peer-reviewed.

Human Pose Estimation for Yoga using VGG-19 and COCO Dataset

Version 1: Received: 13 August 2024 / Approved: 14 August 2024 / Online: 20 August 2024 (11:04:33 CEST)

How to cite: Shrestha, D.; Nepal, P.; Gautam, P.; Oli, P. Human Pose Estimation for Yoga using VGG-19 and COCO Dataset. Preprints 2024, 2024081107. https://doi.org/10.20944/preprints202408.1107.v1

Abstract

Human Pose Estimation (HPE) is a critical technology in computer vision, with applications ranging from healthcare to sports analysis. This project presents a method for detecting the 2D poses of multiple people in an image using a nonparametric representation known as Part Affinity Fields (PAFs). By leveraging the first 10 layers of the VGG-19 convolutional neural network and training on the COCO dataset, our model identifies and associates key points of the human body. The architecture employs a two-branch system that jointly learns part locations and their associations through sequential prediction. This enables the model to maintain real-time performance while achieving high accuracy, regardless of the number of people in the image. To enhance accessibility, we developed a mobile application using Flutter and TensorFlow Lite that performs real-time pose estimation via a mobile device’s front camera. The app provides immediate feedback on physical exercises and yoga poses, making it a useful tool for fitness enthusiasts and healthcare professionals. Visual outputs such as heatmaps and PAFs confirm that the model can localize and connect key points accurately. Despite potential challenges such as data quality and hyperparameter tuning, the results indicate that our approach is both reliable and practical for real-world deployment. This project not only advances the state of the art in HPE but also opens up possibilities for future enhancements, including integrating 3D pose estimation and applying the technology in augmented and virtual reality applications.
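
As an illustration of how PAFs can associate key points, the sketch below scores a candidate limb between two detected key points by averaging how well the field aligns with the segment joining them. This follows the general idea of the PAF formulation and is not taken from the authors' implementation; the function name paf_score, the grid layout paf[y][x], and the num_samples parameter are assumptions made for this example.

```python
import math


def paf_score(paf, a, b, num_samples=10):
    """Average alignment between a PAF and the segment from a to b.

    paf: 2D grid (list of rows) of (vx, vy) vectors, indexed as paf[y][x]
         (hypothetical layout chosen for this sketch).
    a, b: (x, y) candidate key point positions in grid coordinates.
    Returns a value in [-1, 1] when the field vectors have unit length or less.
    """
    if num_samples < 2:
        raise ValueError("num_samples must be at least 2")
    dx, dy = b[0] - a[0], b[1] - a[1]
    length = math.hypot(dx, dy)
    if length == 0:
        return 0.0  # coincident points define no limb direction
    ux, uy = dx / length, dy / length  # unit vector along the candidate limb
    total = 0.0
    for i in range(num_samples):
        t = i / (num_samples - 1)  # evenly spaced samples, endpoints included
        x = int(round(a[0] + t * dx))
        y = int(round(a[1] + t * dy))
        vx, vy = paf[y][x]
        total += vx * ux + vy * uy  # dot product of field and limb direction
    return total / num_samples


# Toy field in which every vector points in the +x direction.
field = [[(1.0, 0.0) for _ in range(5)] for _ in range(5)]
horizontal = paf_score(field, (0, 2), (4, 2))  # limb aligned with the field
vertical = paf_score(field, (2, 0), (2, 4))    # limb perpendicular to the field
```

A high score suggests that the two key points belong to the same limb of the same person, while a score near zero or below suggests they do not. In a multi-person setting, such scores would typically feed a matching step that picks the best pairing of candidates.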

Keywords

Human Pose Estimation (HPE); Convolutional Neural Network (CNN); VGG-19; Part Affinity Fields (PAFs); COCO Dataset; Real-Time Pose Detection

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning
