In its published study, the team wrote that adopting a built-in rewards system "significantly improved exploration in a number of hard games, including the infamously difficult Montezuma's Revenge." Okay, you might not find it "infamously difficult," but it's tough for an AI to plan for the traps that lie ahead, and the platformer has plenty of them.

You can read the team's paper if you want to know more about the technique, but the video below shows how the AI tackled the game. DeepMind, if you'll recall, is also behind AlphaGo, the program that bested South Korean Go grandmaster Lee Sedol in four games out of five.