Microsoft is expanding its Phi-3 family of small language models with the introduction of Phi-3-vision. Unlike its siblings, Phi-3-vision isn’t just focused on text – it’s a multimodal model that can ...
Microsoft announced a new version of its small language model, Phi-3, which can look at images and tell you what’s in them. Phi-3-vision is a multimodal model — aka it can read both text and images — ...
Hugging Face Inc. today open-sourced SmolVLM-256M, a new vision language model with the lowest parameter count in its category. The algorithm’s small footprint allows it to run on devices such as ...
Liquid AI has released LFM2-VL, a new generation of vision-language foundation models designed for efficient deployment across a wide range of hardware — from smartphones and laptops to wearables and ...
If you would like the ability to run AI vision applications on your home computer you might be interested in a new language model called Moondream. Capable of processing what you say, what you write, ...
To accelerate and refine decision-making in a fast-paced, global marketplace, enterprises may deploy generative artificial ...
“Semiconductor lithography inspection requires reliable detection of small pattern defects such as bridge, burr, pinch, and contamination. In this study, we propose a two-stage vision-language ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results