• GPU インスタンス
  • クラスターエンジン
  • Application Platform
  • NVIDIA H200
  • NVIDIA GB200 NVL72
  • ソリューション
    
    GPU 計算力レンタルCluster EngineInference EngineAI 開発プラットフォーム
  • GPUs
    
    H200NVIDIA GB200 NVL72NVIDIA HGX™ B200
  • 料金プラン
  • 会社情報
    
    会社情報リソースDiscourseパートナーお問い合わせ
  • 私たちについて
  • ブログ
  • Discourse
  • パートナー
  • お問い合わせ
  • さあ、始めましょう
日本語
日本語

English
日本語
한국어
繁體中文
今すぐ利用Contact Sales

Modality

Get startedfeatures

Related terms

No items found.
BACK TO GLOSSARY

Modality refers to a distinct type or form of data that a system can perceive, process, and learn from. Each modality represents a different way of encoding information much like how humans use different senses (sight, hearing, touch, etc.) to understand the world.

Common Modalities in AI:

  • Text – Written or spoken language (e.g., emails, transcripts, books)
  • Images – Still visual content (e.g., photographs, X-rays, diagrams)
  • Audio – Sound data (e.g., speech, music, environmental noise)
  • Video – A sequence of images with associated audio over time
  • Sensor Data – Data from physical devices (e.g., accelerometers, temperature sensors)

Why It Matters:

Each modality provides unique and complementary information. For example:

  • A photo provides visual context.
  • Audio may convey tone and emotion.
  • Text can provide background or instructions.

By understanding the characteristics and strengths of each modality, AI systems can be designed to:

  • Make better predictions
  • Understand context more fully
  • Handle real-world complexity with more nuance

This is particularly important in multimodal learning, where models are built to integrate information across different modalities—for example, combining vision and language to describe an image or answer a question about it.

Example in Practice:

A virtual assistant might:

  • Hear your voice (audio modality)
  • Understand your words (text modality from speech-to-text)
  • Recognize an image you upload (image modality)
  • Respond with a mix of speech and on-screen text (output modalities)

‍

‍

最新情報をメールでお届けします

GPU クラウドの即時アクセスで、
人類の AI への挑戦を加速する。

[email protected]

2860 Zanker Rd. Suite 100 San Jose, CA 95134

  • GPU 計算力レンタル
  • Cluster Engine
  • Inference Engine
  • 料金プラン
  • 用語集
  • 会社情報
  • Blog
  • パートナー
  • 採用情報
  • お問い合わせ

© 2024 無断転載を禁じます。

個人情報保護

利用規約