2026-05-26 04:00 UTCOriginal source2 min readUpdated: 2026-06-30 13:03 UTC

AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust

This paper proposes a reinforcement learning framework that modulates a constant reference trajectory to perform compact, position-constrained quadrotor inversions while remaining compatible with traditional trajectory generation and tracking. In simulation, the method reduces position RMSE by 32% and settling time by 57% relative to the strongest optimization-based baseline. Hardware experiments demonstrate successful inversion across multiple yaw configurations with position RMSE below 0.35m.

SourcearXiv RoboticsAuthor: Gabriel Rodriguez, Henri Sayag, Abhishek Rathod, John Stecklein, Siddharth Saha, Christopher Barngrover, Wennie Tabib

[2605.24301] AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust

[Submitted on 23 May 2026]

Title:AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust

View a PDF of the paper titled AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust, by Gabriel Rodriguez and 6 other authors

View PDF HTML (experimental)

Abstract:Bidirectional thrust grants quadrotors a second equilibrium condition and increased control authority, expanding the envelope of possible aggressive maneuvers and enabling inverted flight, perching, and sensing. Prior geometric control approaches extend differential flatness through Hopf fibration-based attitude representations to support bidirectional thrust, but struggle with actuator saturation and motor reversal delay during inversions, requiring heuristic thrust posture scheduling and waypoint tuning. We propose a learning-based framework that modulates a constant reference trajectory to perform compact, position-constrained quadrotor inversions while remaining compatible with traditional trajectory generation and tracking across flight regimes. Separate policies are trained via reinforcement learning for nominal-to-inverted and inverted-to-nominal transitions. In JAX-based simulation, the proposed method achieves the lowest position deviation and settling time across all evaluated baselines, reducing position root mean square error (RMSE) by 32% and settling time by 57% relative to the strongest optimization-based baseline. Hardware experiments demonstrate successful inversion across multiple yaw configurations with position RMSE below 0.35m, and compatibility with downstream trajectory generation and control through circular flight in both regimes. Additionally, we provide an open-source implementation of the proposed framework.

Comments: 17 pages, 8 figures

Subjects:

Robotics (cs.RO)

Cite as: arXiv:2605.24301 [cs.RO]

(or arXiv:2605.24301v1 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2605.24301

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Gabriel Rodriguez [view email] [v1] Sat, 23 May 2026 00:02:22 UTC (8,408 KB)

Full-text links:

Access Paper:

View a PDF of the paper titled AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust, by Gabriel Rodriguez and 6 other authors

View PDF

HTML (experimental)

TeX Source

view license

Current browse context:

cs.RO

new | recent | 2026-05

Change to browse by:

References & Citations

NASA ADS

Google Scholar

Semantic Scholar

Data provided by:

Bibliographic Tools

Bibliographic and Citation Tools

Bibliographic Explorer Toggle

Bibliographic Explorer (What is the Explorer?)

Connected Papers Toggle

Connected Papers (What is Connected Papers?)

Litmaps Toggle

Litmaps (What is Litmaps?)

scite.ai Toggle

scite Smart Citations (What are Smart Citations?)

Code, Data, Media

Code, Data and Media Associated with this Article

alphaXiv Toggle

alphaXiv (What is alphaXiv?)

Links to Code Toggle

CatalyzeX Code Finder for Papers (What is CatalyzeX?)

DagsHub Toggle

DagsHub (What is DagsHub?)

GotitPub Toggle

Gotit.pub (What is GotitPub?)

Huggingface Toggle

Hugging Face (What is Huggingface?)

ScienceCast Toggle

ScienceCast (What is ScienceCast?)

Demos

Replicate Toggle

Replicate (What is Replicate?)

Spaces Toggle

Hugging Face Spaces (What is Spaces?)

Spaces Toggle

TXYZ.AI (What is TXYZ.AI?)