#audio
1 approved public term with this tag.
Multimodal
/ˌmʌltiˈmoʊdəl/ adjective
Describing AI systems that can process and generate multiple types of data, such as text, images, audio, and video, within a single unified model. Multimodal AI can answer questions about images, generate images from text, transcribe speech, and reason across several modalities at once.
Example: “The multimodal model analyzed the chart image and provided a written summary of the trends.”