Multimodal Model - Search News

10d

Google's latest on-device AI model is custom-made for your laptop

Google has released the Gemma 4 12B multimodal agentic AI model that's designed to run on consumer laptops without dedicated ...

Napster Launches NV2: A Real-Time Conversational Video Model That Democratizes Access To Multimodal Agents

Napster, a frontier AI company powering the next generation of embodied and agentic AI, today launched NV2 (Napster Video Model 2) , a real-time conversational video model. Available through ...

Ophthalmology Times

Reasoning prompts sharpen multimodal AI on bilingual ophthalmology exam questions

Asking multimodal large language models (LLMs) to reason step by step before answering improved both their accuracy and the ...

10d

What the Leaked ChatGPT 5.6 Model Reveals About OpenAI’s Next Move

Explore the latest June 2026 AI developments, including leaked GPT-5.6 benchmarks, Microsoft's new MAI Thinking One model, ...

Tech Times

Google Gemma 4 12B Brings Multimodal AI to 16GB Laptops, Free Under Apache 2.0

Google Gemma 4 12B, released June 3, is an open-weight multimodal model that processes text, images, audio, and video in a ...

26d

Google's newest Gemini Omni model can turn real videos into surreal fever dreams

Google's new Gemini Omni Flash video-to-video model lets you twist reality on camera, and it's coming to YouTube Shorts too.

TechCrunch

Mistral releases Pixtral 12B, its first multimodal model

French AI startup Mistral has released its first model that can process images as well as text. Called Pixtral 12B, the 12-billion-parameter model is about 24GB in size. Parameters roughly correspond ...

26d

Google unveils Gemini Omni 'any-to-any' AI model: what enterprises should know

The model marks Google's bid to collapse the multimodal generative stack — text-to-image, image-to-video, video-to-video, audio generation — into a single foundation model with a single editing ...

Forbes

The Rise Of The Multimodal LLM

This voice experience is generated by AI. Learn more. This voice experience is generated by AI. Learn more. Illustration of abstract stream. Artificial intelligence. Big data, technology, AI, data ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results