#audio

1 approved public terms with this tag.

Multimodal

/ˌmʌltiˈmoʊdəl/adjective

AI & Technology

机器辅助翻译草稿 (Chinese) for "Multimodal": Describing AI systems capable of processing and generating multiple types of data — such as text, images, audio, and video — in a unified model. Multimodal AI can answer questions about images, generate images from text, transcribe speech, and reason across modalities simultaneously.

“示例草稿: The multimodal model analyzed the chart image and provided a written summary of the trends.”

作者 @dictionary_auto_translate