Multimodal Model - Search News

Ai2 says its Molmo 2 multimodal AI model can do more with less data

Molmo 2 is an 8B-parameter model that surpasses the 72B-parameter Molmo in accuracy, temporal understanding, and pixel-level ...

14d

Gemini 3 Flash : Fast, Multimodal & Easy on Your Budget

Try Gemini 3.0 Flash via AI Studio and APIs, with up to 90% savings from context caching to cut costs on high-volume workloads.

EurekAlert!

Beyond bigger models: How efficient multimodal AI is redefining the future of intelligence

Multimodal large language models have shown powerful abilities to understand and reason across text and images, but their ...

Geeky Gadgets

AnyGPT any-to-any open source multimodal large language model (LLM)

AnyGPT is an innovative multimodal large language model (LLM) is capable of understanding and generating content across various data types, including speech, text, images, and music. This model is ...

techtimes

Meta Is Developing New Multimodal AI Model Chameleon to Rival OpenAI's GPT-4o

Following the recent AI offerings showdown between OpenAI and Google, Meta's AI researchers seem ready to join the contest with their own multimodal model. Multimodal AI models are evolved versions of ...

SD Times

Google unveils Gemini, a new multimodal AI model

Value stream management involves people in the organization to examine workflows and other processes to ensure they are deriving the maximum value from their efforts while eliminating waste — of ...

Phys.org

Multimodal machine learning model increases accuracy of catalyst screening

Identifying optimal catalyst materials for specific reactions is crucial to advance energy storage technologies and sustainable chemical processes. To screen catalysts, scientists must understand ...

SiliconANGLE

Microsoft releases new Phi models optimized for multimodal processing, efficiency

Microsoft Corp. today expanded its Phi line of open-source language models with two new algorithms optimized for multimodal processing and hardware efficiency. The first addition is the text-only ...

VentureBeat

Meta introduces Chameleon, a state-of-the-art multimodal model

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now As competition in the generative AI field ...

Computerworld

OpenAI announces new multimodal desktop GPT with new voice and vision capabilities

Chirag Dekate, a Gartner vice president analyst, said that while he was impressed with OpenAI’s multimodal large language model (LLM), the company was clearly playing catch-up to competitors, in ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results