Qwen3-VL-Plus
Text-generation model with strong vision understanding, OCR, reasoning, and summaries.
Model Details
Provider
Alibaba Cloud
Model Type
vision
Context Window
262,144 tokens
Pricing
Input (1M)$0.40
Output (1M)$1.20
Capabilities
1. Advanced OCR and extraction
- Reads receipts, documents, product photos.
2. Visual reasoning
- Understands diagrams and logical layouts.
3. Thinking + non-thinking modes
- Supports chain-of-thought.
4. Large 262K context
- Great for multimodal RAG.