Source code: Luigi’s Mansion
Reports:
Summary:
We’re using OpenAI’s Gym toolkit to implement a Double Q Learning Reinforcement Learning agent for Super Mario Bros. Our goal is to have Mario levels optimally based on different metrics/goals. The RL’s Reward driving Mario’s actions are based on Mario’s x position, the time he takes, and a death penalty.