4. Contribution
• Value Iteration Networks (VIN)
• Model free training
• It does not require robot dynamics models.
• Generalized action prediction in new environments
• It can not work outside of training environments.
• Key approach
• Represents value-iteration planning by CNN
• Prediction of reward map and computation of sum
of future rewards.