Build reliable multimodal AI apps with text, voice, and vision using shared context, smart orchestration, routing, and ...
The company on Jan. 20 announced Lucidworks Data Enrichment, a new feature that uses multimodal generative AI to ...
Microsoft Corp. today expanded its Phi line of open-source language models with two new algorithms optimized for multimodal processing and hardware efficiency. The first addition is the text-only ...
French AI startup Mistral has dropped its first multimodal model, Pixtral 12B, capable of processing both images and text. The 12-billion-parameter model, built on Mistral’s existing text-based model ...
Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.
Microsoft has introduced a new AI model that, it says, can process speech, vision, and text locally on-device using less compute capacity than previous models. Innovation in generative artificial ...