Odyssey is taking a different approach than many AI labs in the world modeling space. Odyssey has developed an AI model that lets users “interact” with streaming video. It designed a 360-degree, backpack-mounted camera system to capture real-world landscapes, which Odyssey thinks can serve as a basis for higher-quality models than models trained solely on publicly available data. The model generates and streams video frames every 40 milliseconds. Via basic controls, viewers can explore areas within a video, similar to a 3D-rendered video game. Powering this is a new world model, demonstrating capabilities like generating pixels that feel realistic, maintaining spatial consistency, learning actions from video, and outputting coherent video streams for 5 minutes or more.” World models could one day be used to create interactive media, such as games and movies, and run realistic simulations like training environments for robots. “Interactive video … opens the door to entirely new forms of entertainment, where stories can be generated and explored on demand, free from the constraints and costs of traditional production,” the company says. “Over time, we believe everything that is video today — entertainment, ads, education, training, travel, and more — will evolve into interactive video, all powered by Odyssey.” The model can currently stream video at up to 30 frames per second from clusters of Nvidia H100 GPUs at the cost of $1 to $2 per “user-hour.”