Online world modeling
enables real-world
Inverse Reinforcement Learning from Observation

meaning... NO
  • rewards
  • action supervision
  • pre-training
  • play data
  • failure examples
  • prior models
  • interventions
  • simulation

Only 15 observation-only demonstrations and < 40 minutes of real-world training from scratch

Training 100%