Combining next-token prediction and video diffusion in computer vision and robotics
Author: Alex Shipps|MIT CSAIL Publication Date:October 16, 2024A new method can train a neural network to sort corrupted data while anticipating next steps. It can make flexible plans for robots, generate high-quality video, and help AI agents navigate digital environments.