Aashish mishra. “Reinforcement Learning in Humanoid Robotics: Deploying Deep Policy Networks for Agile Motion Control”. International Journal on Recent and Innovation Trends in Computing and Communication 10, no. 12 (December 31, 2022): 521–528. Accessed September 23, 2025. https://ijritcc.org/index.php/ijritcc/article/view/11595.