CM3leon is a multimodal generative model from Meta AI that can both create images from text and generate text from images. It combines the versatility of autoregressive models with low training cost and efficient inference.
Its training recipe is adapted from text-only language models: a retrieval-augmented pre-training stage followed by multitask supervised fine-tuning. CM3leon delivers strong text-to-image results while using significantly less training compute than earlier transformer-based methods.
It can produce sequences of text and images based on varying sequences of other visual and textual content, thereby extending the capabilities of prior models limited to text-to-image or image-to-text generation. The model has been instruction-tuned for both image and text generation, leading to notable improvements in areas like image captioning, visual question answering, text-driven editing, and conditional image generation.
CM3leon outperforms Google's text-to-image model, Parti, and attains a zero-shot Fréchet Inception Distance (FID) score of 4.88 on the MS-COCO benchmark (lower is better), a new state of the art for text-to-image generation at the time of release. It stands out in generating detailed objects and performing text-guided image manipulations.
It is adept at creating coherent visuals that follow the prompt, even under structural constraints. The model also handles compositional text prompts and answers questions about images. Even though it was trained on a smaller dataset, CM3leon's zero-shot performance rivals that of larger models trained on bigger datasets.
These results highlight the potential of retrieval augmentation and the effect of scaling strategies on autoregressive model performance. CM3leon's adaptability and strong performance make it a useful tool for diverse vision-language tasks.
Pros:
Efficient at producing images from text.
Multimodal capabilities.
Efficient at producing text from images.
Cons:
No API available.
Limited training dataset.
Potential for introducing bias.