CVPR 2026 (Highlight)
Introducing TUNA, a family of native unified multimodal models
All videos have a resolution of 384×672 and a frame rate of 12 fps. Hover over each video to see the corresponding text prompt.
If you find our work helpful, please cite our paper: