Qwen3 Coder 480B A35B Instruct FP8

Qwen3 Coder 480B A35B Instruct FP8 is a powerful code-focused model with 480B parameters, fine-tuned to follow instructions using A35B Instruct. It uses FP8 quantization for faster and more efficient inference, making it ideal for developer tools and AI coding assistants.
Model Library
Model Info

Provider

Qwen

Model Type

LLM

Context Length

256k

Serverless

Available

Fine-Tuning

Available

Pricing Per 1M Tokens Input/Output

$

3

/

$

8

GMI Cloud Features

Serverless API

Optimized for large models and data, the H200 delivers faster training and inference with ultra-high memory bandwidth
Learn More

Fine-Tuning

Optimized for large models and data, the H200 delivers faster training and inference with ultra-high memory bandwidth
Learn More

On-demand Deployments

Optimized for large models and data, the H200 delivers faster training and inference with ultra-high memory bandwidth
Learn More
Try Qwen3 Coder 480B A35B Instruct FP8 now.
Try this Model

Ready to build?

Explore powerful AI models and launch your project in just a few clicks.
Get Started