Optimization of Actuation Configuration in Earthworm-Like Robots via Reinforcement Learning

doi:10.6052/1672-6553-2025-050

Volume 23, Issue 10, 2025, pp. 35-44. DOI:10.6052/1672-6553-2025-050

Abstract:

This study presents a reinforcement-learning-based intelligent configuration method for optimizing the actuation of multi-segment earthworm-like robots. First, a dynamic model of the multi-segment robotic system is established, and the actuator arrangement problem is formulated as a Markov decision process. A multi-discrete action space is designed, which significantly reduces computational cost. A reward function that combines locomotion speed with energy-consumption constraints is proposed to balance exploration and exploitation effectively. For actuator-limited conditions, an action masking mechanism enables efficient policy search under hard constraints. Key findings include: (1) midline-symmetric actuation yields optimal performance under full-drive conditions; (2) a “posterior-priority, centripetal-clustering” distribution pattern emerges under constrained actuation.
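
The abstract does not give the details of the action space, mask, or reward, so the following is only a minimal sketch of how action masking over a multi-discrete action space might look. It assumes one binary choice per segment (0 = passive, 1 = actuated), a hard limit of at most `budget` actuators, and a linear reward of the form speed minus a weighted energy term. The function names, the budget rule, the logits, and `energy_weight` are all hypothetical and are not taken from the paper.

```python
import math
import random


def action_mask(used, budget):
    """Valid choices for the next segment: [passive allowed, actuated allowed].

    Assumption: the hard constraint is "at most `budget` actuators in total".
    """
    return [True, used < budget]


def sample_masked(logits, mask, rng):
    """Sample from softmax(logits), giving masked-out choices zero probability."""
    masked = [x if ok else -math.inf for x, ok in zip(logits, mask)]
    top = max(masked)  # finite here, because "passive" is always allowed
    weights = [math.exp(x - top) for x in masked]  # exp(-inf) evaluates to 0.0
    return rng.choices(range(len(weights)), weights=weights)[0]


def sample_configuration(logits_per_segment, budget, rng):
    """Pick one choice per segment (a multi-discrete action) under the budget."""
    config = []
    for logits in logits_per_segment:
        mask = action_mask(sum(config), budget)
        config.append(sample_masked(logits, mask, rng))
    return config


def reward(speed, energy, energy_weight=0.1):
    """Hypothetical reward: locomotion speed penalized by energy consumption."""
    return speed - energy_weight * energy


rng = random.Random(0)
# Placeholder policy logits that favor actuation on every one of 8 segments.
logits = [[0.0, 2.0] for _ in range(8)]
config = sample_configuration(logits, budget=3, rng=rng)
print(config, "actuators used:", sum(config))
```

Because invalid choices are assigned zero probability before sampling, every sampled configuration satisfies the actuator limit by construction, so no infeasible configuration has to be simulated and penalized. In a full training loop the logits would come from the learned policy and the speed and energy values from the dynamic model.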

History

Received: April 23, 2025
Revised: May 13, 2025
Online: October 29, 2025
