Software engineer Peter Whidden trained AI reinforcement learning agents to play Pokemon Red with over 40,000 hours of simulated game time. The Pavlovian reinforcement model rewarded AI with points for leveling up creatures, exploring new areas and battling gym leaders.
The reinforcement learning agents started with no knowledge of the game whatsoever and can only press random buttons. Throughout the five years of simulated game time, it gathers data on what to do based on past experiences. Eventually, it masters the game to a point where defeating gym leaders becomes a regular occurrence. You can even run the pre-trained model interactively on your own machine running Python 3.10. More information here.
- Limited-edition design featuring elements inspired by Mario. Available while supplies last!
- Local co-op*, local wireless*, and online multiplayer**
- Detachable Joy-Con controllers

By default the game will terminate after 32K steps, or ~1 hour. You can increase this by adjusting the ep_length variable, but it will also use more memory,” said Whidden.