Overview
Phenaki represents an AI model designed for the generation of videos directly from textual inputs, even spanning multiple minutes. It also offers the capability to generate videos based on a static image and a provided prompt. Notably, the video encoder-decoder proposed by Phenaki surpasses existing per-frame methods in the field in terms of both spatial-temporal quality and the number of tokens used per video.
Comments