GPT 4.1 Nano
OpenAI
openai/gpt-4.1-nano
Ultra-fast, lowest cost
Context Window
200K
200,000 tokens
Max Output
33K
32,768 tokens
About this model
GPT-4.1 Nano is the fastest and cheapest model in the GPT-4.1 family. Designed for simple tasks like text classification, entity extraction, and format conversion. Ultra-low response latency makes it ideal for latency-sensitive real-time applications.
While reasoning capability is weaker than Mini and standard versions, accuracy on simple tasks remains high, at just 1/20 the cost of GPT-4.1.
Highlights
1/20 price of GPT-4.1
Ultra-low latency
High accuracy on simple tasks
200K context
Best For
Text classificationEntity extractionFormat conversionReal-time response
2025-04-14TransformerProprietary
Capabilities
ChatVisionCodepdftoolscache
Aliases
gpt-4.1-nanoPricing (per 1M tokens)
| Pricing (per 1M tokens) | / 1M tokens |
|---|---|
| Input / 1M | $0.10 |
| Output / 1M | $0.42 |
| Cache Read | $0.03 |
Final prices shown
Quick Start
main.py
from openai import OpenAI client = OpenAI( base_url="https://api.chuizi.ai/v1", api_key="ck-your-key-here", ) response = client.chat.completions.create( model="openai/gpt-4.1-nano", messages=[{"role": "user", "content": "Hello!"}], ) print(response.choices[0].message.content)