Abstract

Because training on real hardware is risky and sample-inefficient, controllers are often developed in simulation. However, minor differences between the simulation and the real world can cause a significant sim-to-real gap, which can reduce the effectiveness of the developed controller. In this paper, we examine a case study of transferring an octorotor reinforcement learning controller from simulation to the real world. First, we quantify the effectiveness of the real-world transfer by examining safety metrics. We find that although deviation increases noticeably (by around 100%) in real flights, this deviation may not be considered unsafe, as it remains within safety corridors wider than 2 m. Then, we estimate the densities of the measurement distributions and compare simulated and real measurements using the Jensen-Shannon divergence. From this, we show that the vehicle's orientation differs significantly between simulated and real flights. We attribute this to a different flight mode in real flights, in which the vehicle turns to face the next waypoint. We also find that the reinforcement learning controller's actions appear to correctly counteract disturbance forces. Next, we analyze the errors of a measurement autoencoder and a state transition model neural network applied to real data. These models further confirm the difference between simulated and real attitude control, and they allow the errors to be shown directly on the flight paths. Finally, we discuss important lessons learned in the sim-to-real transfer of our controller.
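As a rough illustration of the density comparison described in the abstract, the sketch below estimates two one-dimensional densities with shared-bin histograms and computes their Jensen-Shannon divergence. The abstract does not say which density estimator the authors use, so the histogram approach, the function name `js_divergence`, the `bins` parameter, and the synthetic yaw samples are all assumptions made for illustration.

```python
import numpy as np

def js_divergence(sim, real, bins=50):
    """Base-2 Jensen-Shannon divergence between two 1-D samples (0 = identical, 1 = disjoint).

    Densities are approximated with histograms over shared bin edges;
    this estimator is an assumption, not necessarily the paper's method.
    """
    sim = np.asarray(sim, dtype=float)
    real = np.asarray(real, dtype=float)
    # Use common bin edges so both histograms are defined on the same support.
    lo = min(sim.min(), real.min())
    hi = max(sim.max(), real.max())
    edges = np.linspace(lo, hi, bins + 1)
    p, _ = np.histogram(sim, bins=edges)
    q, _ = np.histogram(real, bins=edges)
    p = p / p.sum()
    q = q / q.sum()
    m = 0.5 * (p + q)  # mixture distribution

    def kl(a, b):
        mask = a > 0  # terms with a == 0 contribute 0; b = m is > 0 wherever a > 0
        return np.sum(a[mask] * np.log2(a[mask] / b[mask]))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Hypothetical yaw-angle samples (radians) standing in for logged measurements.
rng = np.random.default_rng(0)
yaw_sim = rng.normal(0.0, 0.1, size=1000)
yaw_real = rng.normal(0.5, 0.3, size=1000)
print(js_divergence(yaw_sim, yaw_real))
```

With base-2 logarithms the divergence is bounded between 0 and 1, which makes values for different measurement channels (for example position versus orientation) directly comparable.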

Keywords

Computer science, Optimization algorithm, Algorithm, Mathematical optimization, Mathematics

Publication Info

Year
2024
Type
preprint
Citations
11172
Access
Closed

Citation Metrics

OpenAlex
11172
Influential
0

Cite This

John Schulman, Filip Wolski, Prafulla Dhariwal et al. (2024). Quantifying the Sim-To-Real Gap in UAV Disturbance Rejection. Leibniz-Zentrum für Informatik (Schloss Dagstuhl). https://doi.org/10.4230/oasics.dx.2024.16

Identifiers

DOI
10.4230/oasics.dx.2024.16
