Above: Most successful AI runs, character type (Wizard-Elf-Chaotic-Male, etc.) the level at which they died, and what killed them
Back in 2021, Meta (still Facebook at the time), sponsored an innovative challenge in AI programming: Create a bot which could win NetHack on its own. If you know the classic roguelike RPG, first created in the 80s, you know how hard that is. I know that from personal experience, because it was one of the very first games I obsessed over as a kid, and first inspired my interest in virtual worlds.
Though it was limited to ASCII graphics, the complexity and depth of the world made it seem real. Unlike say, a chess game, where each individual move is limited to a few dozen options, the moves in NetHack seem unlimited. I wrote about that complexity in one of my very first articles:
With every object, tool, weapon and creature imbued with a wealth of attributes, every situation has endless potential. The cockatrice, for example, could turn you into stone, but that is only the beginning. If you kill one, then pick it up with gloves, you can wield its body like a flail, instantly turning monsters to stone when you bash them with it. (Usenet wags dubbed this maneuver “wielding the rubber chicken.”) If you have a wand of Polymorph and also wear a Ring of Polymorph Control, you can actually turn yourself into a cockatrice, and explore the dungeon in that deadly form. You can even lay cockatrice eggs, too — usable as hand grenades of instant paralysis.
In NetHack, at any point, anything seems possible. Jean-Christophe Collet, a DevTeam member who discovered the game while working for a Parisian Unix company, says he was enthralled by “the sheer complexity of the situations you could get into, and the way that there was no ‘right way’ to get out of them.” Surrounded by Orcs, for example, you could incinerate most of them with your Wand of Lightning, but the blast would likely ricochet off the opposite wall and crisp you, too. You could wear your Ring of Conflict, which would magically compel the Orcs to start attacking each other instead — but then again, wearing it would probably also compel your pet Large Dog to attack you. You’d often get the eerie sense the game was anticipating you and all these uniquely intricate conundrums that no one could have possibly foreseen. Or could they?
AI developers at Oxford, NYU, and other top universities participated. So how'd they do? One of the participants recently shared the academic summaries with me, but the non-academic answer would be: