Q: What makes APEX different from existing motion tracking methods?
APEX builds on reward-based tracking methods (like DeepMimic), with several practical advantages: it avoids extensive hyperparameter tuning, is significantly more sample-efficient, and makes it straightforward to learn motions without including reference trajectories in the actor’s observations. This leads to more reliable sim-to-real transfer and allows controlled deviations from the reference when needed, as shown in the imitation-beyond-demonstration section. The cool part is that the action priors can be added with just a few lines of code to any existing reward-based motion-tracking approach.
Q: Can APEX be applied to other robots beyond quadrupeds?
Yes :) While our main experiments focus on quadrupedal locomotion, APEX is a general framework that can be applied to other robotic systems. We also have initial results on humanoid motions using BeyondMimic, and we will release the humanoid locomotion code soon.