NOVA Text-to-Video
Generate images and answer questions using text input
Comparing powerful zero-shot image classification models