I noticed that Host uses (current position + scaled action) as the target DoF position, whereas the standard approach (e.g., in the original Legged Gym) typically uses (default position + scaled action).
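To make the comparison concrete, here is a minimal sketch of the two conventions as I understand them. The function names, variable names, and numbers are all hypothetical and are not copied from either codebase; it only illustrates the difference in how the PD target is formed.

```python
# Hypothetical illustration of the two ways to turn a policy action into a
# PD target. Names and values are assumptions, not taken from Host or Legged Gym.

def target_from_default(default_dof_pos, action, action_scale):
    """Default-relative target: offset from a fixed nominal joint pose."""
    return [d + action_scale * a for d, a in zip(default_dof_pos, action)]


def target_from_current(dof_pos, action, action_scale):
    """Current-relative target: offset from the measured joint position."""
    return [q + action_scale * a for q, a in zip(dof_pos, action)]


# Made-up values for three joints (radians).
default_dof_pos = [0.0, 0.5, -1.5]   # fixed nominal pose
dof_pos = [0.25, 0.125, -0.5]        # joint positions measured this step
action = [1.0, -1.0, 0.5]            # raw policy output
action_scale = 0.25

default_target = target_from_default(default_dof_pos, action, action_scale)
current_target = target_from_current(dof_pos, action, action_scale)

print("default-relative target:", default_target)
print("current-relative target:", current_target)
```

With the same action, the first target depends only on the fixed default pose, while the second moves with wherever the joints currently are, so the action behaves more like a per-step position increment.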
Does the method used in Host offer significant advantages over the standard approach? Or was it chosen simply because it performed better on this specific task (e.g., getting up) in your experiments?