Mistral unveils ‘Pixtral 12B,’ its inaugural multimodal artificial intelligence model
The innovative French artificial intelligence startup, Mistral, has introduced its maiden multimodal creation, the Pixtral 12B, capable of managing both textual content and visual graphics, as per the report by Techcrunch.
The innovative French artificial intelligence startup, Mistral, has introduced its maiden multimodal creation, the Pixtral 12B, capable of managing both textual content and visual graphics, as per the report by Techcrunch. With a staggering 12 billion parameters, this model is structured upon Mistral’s Nemo 12B text-oriented model. Pixtral 12B can provide responses to image-related queries using URLs or images encoded with base64, for instance, determining the number of instances of a specific object that are discernible.
Numerous creative AI (artificial intelligence) models have undergone partial training that involved copyrighted materials, which consequently triggered legal disputes initiated by the respective copyright holders. (The AI companies, on the other hand, argue that this approach should qualify under fair use principles.)
The specific image dataset leveraged by Mistral for the development of Pixtral 12B remains shrouded in mystery.
