Qwen3's 235B-A22B and 32B models are now available to be used!
Aliquet morbi justo auctor cursus auctor aliquam. Neque elit blandit et quis tortor vel ut lectus morbi. Amet mus nunc rhoncus sit sagittis pellentesque eleifend lobortis commodo vestibulum hendrerit proin varius lorem ultrices quam velit sed consequat duis. Lectus condimentum maecenas adipiscing massa neque erat porttitor in adipiscing aliquam auctor aliquam eu phasellus egestas lectus hendrerit sit malesuada tincidunt quisque volutpat aliquet vitae lorem odio feugiat lectus sem purus.
Viverra mi ut nulla eu mattis in purus. Habitant donec mauris id consectetur. Tempus consequat ornare dui tortor feugiat cursus. Pellentesque massa molestie phasellus enim lobortis pellentesque sit ullamcorper purus. Elementum ante nunc quam pulvinar. Volutpat nibh dolor amet vitae feugiat varius augue justo elit. Vitae amet curabitur in sagittis arcu montes tortor. In enim pulvinar pharetra sagittis fermentum. Ultricies non eu faucibus praesent tristique dolor tellus bibendum. Cursus bibendum nunc enim.
Mattis quisque amet pharetra nisl congue nulla orci. Nibh commodo maecenas adipiscing adipiscing. Blandit ut odio urna arcu quam eleifend donec neque. Augue nisl arcu malesuada interdum risus lectus sed. Pulvinar aliquam morbi arcu commodo. Accumsan elementum elit vitae pellentesque sit. Nibh elementum morbi feugiat amet aliquet. Ultrices duis lobortis mauris nibh pellentesque mattis est maecenas. Tellus pellentesque vivamus massa purus arcu sagittis. Viverra consectetur praesent luctus faucibus phasellus integer fermentum mattis donec.
Commodo velit viverra neque aliquet tincidunt feugiat. Amet proin cras pharetra mauris leo. In vitae mattis sit fermentum. Maecenas nullam egestas lorem tincidunt eleifend est felis tincidunt. Etiam dictum consectetur blandit tortor vitae. Eget integer tortor in mattis velit ante purus ante.
“Lacus donec arcu amet diam vestibulum nunc nulla malesuada velit curabitur mauris tempus nunc curabitur dignig pharetra metus consequat.”
Commodo velit viverra neque aliquet tincidunt feugiat. Amet proin cras pharetra mauris leo. In vitae mattis sit fermentum. Maecenas nullam egestas lorem tincidunt eleifend est felis tincidunt. Etiam dictum consectetur blandit tortor vitae. Eget integer tortor in mattis velit ante purus ante.
Today we’re excited to announce that Qwen 3 32B and Qwen 3 235B are now available on GMI Cloud’s US-based inference clusters with global deployment support taking advantage of our datacenters around the globe.
Built by Alibaba’s Qwen team and open-sourced under the permissive Apache 2.0 license, Qwen 3 models represent a new leap forward in open LLM performance, flexibility, and multilingual accessibility. And now, for the first time, developers can deploy these models instantly on high-availability, low-latency infrastructure in the USA backed by GMI Cloud’s purpose-built AI stack.
The flagship Qwen 3 235B-A22B model boasts 235 billion total parameters (22B activated), and rivals the performance of models like Gemini 2.5 Pro and Grok-3 in STEM, coding, long-context tasks, and multilingual reasoning.
Meanwhile, the smaller Qwen 3 32B model offers elite performance at a lighter footprint and lower latency—ideal for production inference at scale.
Key innovations include:
Qwen 3's hybrid thinking, massive context length, and multilingual fluency create new opportunities for AI developers that simply weren't practical before:
Real-world use cases now within reach:
Amplifying what you can do with Qwen
Before Qwen 3, delivering scalable multilingual agents, reasoning engines, or cost-optimized AI applications meant stitching together multiple models or relying on proprietary platforms. Now, it’s open-source—and production-ready !—on GMI Cloud.
GMI Cloud is purpose-built for the AI workloads of today and tomorrow:
Whether you're running autonomous agents, building a multilingual co-pilot, or researching new AI behaviors, Qwen 3 is now just a few clicks away.
Ready to build agents, copilots, or next-gen AI products?
Spin up Qwen 3 32B and 235B today on GMI Cloud’s Inference Engine—with flexible scaling, API simplicity, and no surprises.
Read Qwen's blog announcement.
Build faster, think deeper—with Qwen 3 on GMI Cloud.
Give GMI Cloud a try and see for yourself if it's a good fit for AI needs.
Starting at
$4.39/GPU-hour
As low as
$2.50/GPU-hour