Multimodal
[/ˌmʌltiˈmoʊdəl/]
adjective | AI & Technology | #ai #vision #audio #llm
Definitions
1
Describing AI systems that can process, and in some cases generate, multiple types of data (such as text, images, audio, and video) within a single model. Depending on the system, multimodal AI can answer questions about images, generate images from text, transcribe speech, and reason over several modalities together.
“The multimodal model analyzed the chart image and provided a written summary of the trends.”
by @mlresearcher, 1/1/1970