• GPU 算力方案
  • Cluster Engine
  • Application Platform
  • NVIDIA H200
  • NVIDIA GB200 NVL72
  • 解決方案
    
    GPU 算力租賃Cluster EngineInference EngineAI 應用開發平台
  • GPUs
    
    H200NVIDIA GB200 NVL72NVIDIA HGX™ B200
  • 定價
  • 關於
    
    關於我們部落格Discourse合作夥伴聯絡我們
  • 關於我們
  • 部落格
  • Discourse
  • 合作夥伴
  • 聯絡我們
  • 開始使用
繁體中文
繁體中文

English
日本語
한국어
繁體中文
一鍵啟用聯繫專家

Modality

Get startedfeatures

Related terms

A.I. (Artificial Intelligence)
BACK TO GLOSSARY

Modality refers to a distinct type or form of data that a system can perceive, process, and learn from. Each modality represents a different way of encoding information much like how humans use different senses (sight, hearing, touch, etc.) to understand the world.

Common Modalities in AI:

  • Text – Written or spoken language (e.g., emails, transcripts, books)
  • Images – Still visual content (e.g., photographs, X-rays, diagrams)
  • Audio – Sound data (e.g., speech, music, environmental noise)
  • Video – A sequence of images with associated audio over time
  • Sensor Data – Data from physical devices (e.g., accelerometers, temperature sensors)

Why It Matters:

Each modality provides unique and complementary information. For example:

  • A photo provides visual context.
  • Audio may convey tone and emotion.
  • Text can provide background or instructions.

By understanding the characteristics and strengths of each modality, AI systems can be designed to:

  • Make better predictions
  • Understand context more fully
  • Handle real-world complexity with more nuance

This is particularly important in multimodal learning, where models are built to integrate information across different modalities—for example, combining vision and language to describe an image or answer a question about it.

Example in Practice:

A virtual assistant might:

  • Hear your voice (audio modality)
  • Understand your words (text modality from speech-to-text)
  • Recognize an image you upload (image modality)
  • Respond with a mix of speech and on-screen text (output modalities)

‍

‍

訂閱 GMI Cloud 電子報

Empowering humanity's AI ambitions with instant GPU cloud access.

[email protected]

278 Castro St, Mountain View, CA 94041

  • GPU 算力租賃
  • Cluster Engine
  • AI 應用開發平台
  • 定價
  • AI 技術字彙索引
  • 關於我們
  • Blog
  • Partners
  • 人才招募
  • 聯絡我們

© 2024 版權所有。

隱私政策

使用條款