News

Apple has revealed its latest development in artificial intelligence (AI) large language models (LLMs), introducing the MM1 family of multimodal models capable of interpreting both image and text data.
Mistral AI, a Paris-based artificial intelligence startup, today unveiled its latest advanced AI model capable of processing both images and text. The new model, called Pixtral 12B, employs about ...
The Latest AI Chatbots Can Handle Text, Images and Sound. Here’s How. New “multimodal” AI programs can do much more than respond to text; they also analyze images and chat aloud ...
Mistral AI released Pixtral Large, a 124-billion-parameter multimodal model designed for advanced image and text processing with a 1-billion-parameter vision encoder. Built on Mistral Large 2, it achi ...
The added multimodal input feature will generate text outputs, whether natural language, programming code, or something else, based on a wide variety of mixed text and image inputs.
Everyone's favorite chatbot can now see and hear and speak. On Monday, OpenAI announced new multimodal capabilities for ChatGPT. Users can now have voice conversations or share images with ChatGPT ...
As AI-generated images become more precise and accessible, GPT-4o represents a significant step forward in making text-to-image generation a mainstream tool for communication, creativity, and ...