2026-06-25 04:00 UTCOriginal source2 min readUpdated: 2026-06-25 08:06 UTC

RGB: RL Guided Whole-Body MPPI for Humanoid Control

This paper proposes RGB, an RL-guided whole-body MPPI framework that uses a pretrained RL policy as a sampling prior and MPPI for online correction, achieving robust and precise humanoid control without retraining. Simulations on a Unitree G1 humanoid demonstrate stable 280Hz control and improved precision over pure RL.

SourcearXiv RoboticsAuthor: Yunsoo Seo, Sol Choi, Euncheol Im, Myo Taeg Lim, Yisoo Lee

[2606.25123] RGB: RL Guided Whole-Body MPPI for Humanoid Control

[Submitted on 23 Jun 2026]

Title:RGB: RL Guided Whole-Body MPPI for Humanoid Control

View a PDF of the paper titled RGB: RL Guided Whole-Body MPPI for Humanoid Control, by Yunsoo Seo and 4 other authors

View PDF HTML (experimental)

Abstract:Humanoid robots require whole-body controllers that are both robust and precise in contact-rich environments. While deep reinforcement learning (RL) achieves robust stability, its behavior is tightly coupled to the training objective and command interface, making it difficult to add new feedback objectives without retraining. In this study, we propose an RL guided whole-body model predictive path integral (MPPI) framework that acts as an add-on feedback controller on top of a pretrained RL policy. Instead of using RL policy as the final controller, we use it as a sampling prior that biases MPPI rollouts toward dynamically feasible behaviors. Task objectives are specified through modular MPPI cost terms, and MPPI closes the loop by continuously correcting the RL prior online to satisfy these objectives without retraining the policy. Simulations on a 29-DoF Unitree G1 humanoid in MuJoCo demonstrate stable high-rate control (average 280~Hz). The proposed method improves task-level precision over a pure RL baseline under the same command interface. This is achieved by correcting systematic drift during straight walking and tracking additional whole-body reference signals imposed through the cost.

Comments: 7pages

Subjects:

Robotics (cs.RO)

Cite as: arXiv:2606.25123 [cs.RO]

(or arXiv:2606.25123v1 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2606.25123

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yunsoo Seo [view email] [v1] Tue, 23 Jun 2026 19:54:02 UTC (3,654 KB)

Full-text links:

Access Paper:

View a PDF of the paper titled RGB: RL Guided Whole-Body MPPI for Humanoid Control, by Yunsoo Seo and 4 other authors

View PDF

HTML (experimental)

TeX Source

view license

Current browse context:

cs.RO

new | recent | 2026-06

Change to browse by:

References & Citations

NASA ADS

Google Scholar

Semantic Scholar

Data provided by:

Bibliographic Tools

Bibliographic and Citation Tools

Bibliographic Explorer Toggle

Bibliographic Explorer (What is the Explorer?)

Connected Papers Toggle

Connected Papers (What is Connected Papers?)

Litmaps Toggle

Litmaps (What is Litmaps?)

scite.ai Toggle

scite Smart Citations (What are Smart Citations?)

Code, Data, Media

Code, Data and Media Associated with this Article

alphaXiv Toggle

alphaXiv (What is alphaXiv?)

Links to Code Toggle

CatalyzeX Code Finder for Papers (What is CatalyzeX?)

DagsHub Toggle

DagsHub (What is DagsHub?)

GotitPub Toggle

Gotit.pub (What is GotitPub?)

Huggingface Toggle

Hugging Face (What is Huggingface?)

ScienceCast Toggle

ScienceCast (What is ScienceCast?)

Demos

Replicate Toggle

Replicate (What is Replicate?)

Spaces Toggle

Hugging Face Spaces (What is Spaces?)

Spaces Toggle

TXYZ.AI (What is TXYZ.AI?)