GPT-4.1 Nano

OpenAI
openai/gpt-4.1-nano

Ultra-fast, lowest cost

Context Window

200K

200,000 tokens

Max Output

33K

32,768 tokens

About this model

GPT-4.1 Nano is the fastest and cheapest model in the GPT-4.1 family. It is designed for simple tasks such as text classification, entity extraction, and format conversion, and its very low response latency suits latency-sensitive real-time applications.

Its reasoning capability is weaker than that of GPT-4.1 Mini and the standard GPT-4.1, but accuracy on simple tasks remains high, at just 1/20 the cost of GPT-4.1.

Highlights

1/20 the price of GPT-4.1
Ultra-low latency
High accuracy on simple tasks
200K context

Best For

Text classification
Entity extraction
Format conversion
Real-time response

Release date: 2025-04-14
Architecture: Transformer
License: Proprietary

Capabilities

Chat
Vision
Code
PDF
Tools
Cache

Aliases

gpt-4.1-nano

Pricing (per 1M tokens)

Input: $0.10
Output: $0.42
Cache Read: $0.03

Final prices shown
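
To make the per-token rates concrete, here is a minimal sketch of a cost estimator using the three prices listed above. The function name `estimate_cost` and the billing rule for cached tokens (cached input tokens are charged at the Cache Read rate instead of the Input rate) are assumptions for illustration, not documented behavior of this service.

```python
# Prices in USD per 1M tokens, copied from the pricing list on this page.
INPUT_PER_M = 0.10
OUTPUT_PER_M = 0.42
CACHE_READ_PER_M = 0.03


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached_tokens is the portion of input_tokens served from
    cache, billed at the cache-read rate instead of the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_tokens
    total_per_million = (
        uncached * INPUT_PER_M
        + cached_tokens * CACHE_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total_per_million / 1_000_000
```

For example, `estimate_cost(2_000, 50)` prices a short classification call with a 2,000-token prompt and a 50-token reply; multiply by the expected request volume to budget a workload.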

Quick Start

main.py
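# Requires the OpenAI Python SDK: pip install openai
from openai import OpenAI

# Point the SDK at this service's endpoint and authenticate with your key.
client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

# Send a single-turn chat request to GPT-4.1 Nano.
response = client.chat.completions.create(
    model="openai/gpt-4.1-nano",
    messages=[{"role": "user", "content": "Hello!"}],
)
# Print the text of the first (and only) completion.
print(response.choices[0].message.content)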
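
Since text classification is listed as a primary use case, the sketch below adapts the quick-start call to a single-label classifier. The label set, the prompt wording, and the helper names (`build_messages`, `parse_label`, `classify`) are hypothetical. `temperature` and `max_tokens` are standard Chat Completions parameters in the OpenAI SDK, but whether this endpoint honors them is an assumption.

```python
LABELS = ["billing", "technical", "other"]  # hypothetical label set


def build_messages(text: str) -> list[dict]:
    """Build a chat payload that asks for exactly one label."""
    system = (
        "Classify the user's message into exactly one of these labels: "
        + ", ".join(LABELS)
        + ". Reply with the label only."
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": text},
    ]


def parse_label(reply: str) -> str:
    """Normalize the model's reply; fall back to 'other' if unrecognized."""
    label = reply.strip().lower().rstrip(".")
    return label if label in LABELS else "other"


def classify(text: str) -> str:
    """Send one classification request (requires network access and a valid key)."""
    from openai import OpenAI  # imported here so the helpers work without the SDK

    client = OpenAI(
        base_url="https://api.chuizi.ai/v1",
        api_key="ck-your-key-here",
    )
    response = client.chat.completions.create(
        model="openai/gpt-4.1-nano",
        messages=build_messages(text),
        temperature=0,  # keep the output as repeatable as possible
        max_tokens=5,   # a single short label is all that is needed
    )
    return parse_label(response.choices[0].message.content or "")
```

Calling `classify("I was charged twice this month")` would send one request. Keeping the prompt construction and reply parsing in separate functions lets them be checked without touching the network.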

