Qwen-Flash
The fastest and cheapest Qwen model, ideal for high-volume workloads.
Model Details
Provider: Alibaba Cloud
Model Type: Text
Context Window: 1,000,000 tokens
Pricing
Input (per 1M tokens): $0.02
Output (per 1M tokens): $0.22
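To make the per-million-token rates concrete, the following minimal Python sketch estimates the cost of a workload from the two listed prices. The function name `estimate_cost` and its parameters are hypothetical, introduced only for illustration. The sketch assumes flat per-token billing with no cache or batch discount, which the listing does not specify.

```python
from decimal import Decimal

# Prices copied from the listing (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.02")
OUTPUT_PRICE_PER_M = Decimal("0.22")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> Decimal:
    """Estimate USD cost for `requests` calls of the given size.

    Hypothetical helper: assumes flat per-token billing and ignores any
    cache or batch discounts, which are not stated in the listing.
    """
    if input_tokens < 0 or output_tokens < 0 or requests < 0:
        raise ValueError("token and request counts must be non-negative")
    per_request = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M
    return per_request * requests


if __name__ == "__main__":
    # One million classification calls, each with 500 input and 50 output tokens.
    cost = estimate_cost(500, 50, requests=1_000_000)
    print(f"Estimated cost: ${cost:.2f}")
```

Under these assumptions, output tokens cost eleven times as much as input tokens, so short-output tasks such as classification and extraction benefit most from this pricing.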
Capabilities
1. Ultra-fast, ultra-cheap
- Designed for mass-scale workloads.
- Well suited to rewriting, extraction, and classification.
2. Limited reasoning but great utility
- High throughput, low latency.
3. Optional thinking mode
- Can be enabled to add chain-of-thought reasoning when a request needs it.
4. Supports context caching and batch calls
- Both help keep costs low in high-volume system designs.