Allegro is a new tool for generating videos from text | Turtles AI
Rhymes AI has released Allegro, an open-source model that generates videos from text prompts. The technology promises new opportunities for creators and researchers and aims to foster visual creativity.
Key points:
- Allegro turns text prompts into short, high-quality video clips.
- It supports 6-second videos at 15 frames per second and 720p resolution.
- It uses advanced video compression and video generation technologies.
- It is fully open source, encouraging community collaboration.
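To put the figures above in perspective, the following is a rough back-of-the-envelope sketch of how much raw data a single clip of that size represents. It assumes uncompressed 8-bit RGB frames and a nominal frame count of duration times frame rate; the helper name `clip_budget` is hypothetical and is not part of Allegro.

```python
def clip_budget(seconds, fps, height, width, channels=3, bytes_per_sample=1):
    """Return (frame_count, raw_bytes) for an uncompressed clip.

    Assumes 8-bit RGB by default; these defaults are illustrative only.
    """
    frames = seconds * fps
    raw_bytes = frames * height * width * channels * bytes_per_sample
    return frames, raw_bytes


# Figures taken from the key points: 6 seconds, 15 fps, 720x1280 pixels.
frames, raw_bytes = clip_budget(seconds=6, fps=15, height=720, width=1280)
print(f"{frames} frames, {raw_bytes / 2**20:.1f} MiB uncompressed")
# prints "90 frames, 237.3 MiB uncompressed"
```

Even a short clip amounts to hundreds of megabytes of raw pixels, which is why compression matters in a video generation pipeline.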
Allegro, the new model from Rhymes AI, is an advanced tool for generating videos from simple text descriptions. It creates high-quality 6-second clips at a resolution of 720x1280 pixels and a frame rate of 15 frames per second, which can be interpolated to 30 fps. Thanks to its versatility, Allegro lets users explore a range of content, from scenes with detailed human faces to dynamic depictions of animals in motion.
To achieve this level of quality, Rhymes AI implemented a series of technical processes for handling large-scale video data effectively. In particular, it had to develop a processing and filtering pipeline for raw videos, whose output was then structured to facilitate training of the model. Compressing the raw videos is another important step, made possible by a video Variational Autoencoder (VideoVAE). This system encodes videos into a spatio-temporal latent space while preserving the details needed for smooth video generation.
At the core of Allegro's capabilities is its diffusion transformer architecture, designed to generate high-quality, fluid video frames. The Diffusion Transformer uses attention mechanisms and positional embeddings to capture spatial and temporal relationships in the video data efficiently.
While the model already offers impressive features, Rhymes AI is working on future improvements, such as narrative video generation and the ability to control motion within clips. By making Allegro open source, Rhymes AI aims to make video creation more accessible, inviting the community to explore and further develop the technology.
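As a rough illustration of what encoding a clip into a spatio-temporal latent space means in terms of size, the sketch below computes the shape of a latent grid for a clip of the dimensions quoted above. The compression factors (4x in time, 8x in each spatial dimension) and the 4 latent channels are assumptions chosen for illustration; the article does not state them, and `latent_shape` is a hypothetical helper, not an Allegro API.

```python
from math import ceil


def latent_shape(frames, height, width, t_factor=4, s_factor=8, channels=4):
    """Shape (channels, T, H, W) of a latent grid.

    t_factor, s_factor and channels are assumed values, not confirmed
    figures for Allegro's VideoVAE.
    """
    return (
        channels,
        ceil(frames / t_factor),
        ceil(height / s_factor),
        ceil(width / s_factor),
    )


def element_count(shape):
    """Total number of values in a grid of the given shape."""
    total = 1
    for dim in shape:
        total *= dim
    return total


# Nominal 6 s x 15 fps = 90 RGB frames at 720x1280 pixels.
pixel_shape = (3, 90, 720, 1280)
shape = latent_shape(frames=90, height=720, width=1280)
ratio = element_count(pixel_shape) / element_count(shape)
print(f"latent grid {shape}, about {ratio:.0f}x fewer values than raw pixels")
```

Under these assumptions the diffusion model works on a grid far smaller than the raw video, which is the general motivation for generating in latent space instead of pixel space.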
An important step forward in the landscape of AI-generated video.