GPT 4.1 Mini

OpenAI

openai/gpt-4.1-mini

Fast balanced, best value

Context Window

200K

200,000 tokens

Max Output

33K

32,768 tokens

About this model

GPT-4.1 Mini is a streamlined version of GPT-4.1 that significantly reduces cost and latency while maintaining core reasoning capabilities. The 200K context window remains unchanged, with input pricing at 1/5 of GPT-4.1.

Ideal for scenarios requiring fast responses with moderate reasoning depth, such as customer service bots, content summarization, and classification tasks.

Highlights

1/5 price of GPT-4.1

200K context

Low latency, high throughput

Vision support

Best For

Customer service botsContent summarizationClassification & labelingBatch processing

2025-04-14TransformerProprietary

Capabilities

ChatVisionCodepdftoolscache

Aliases

gpt-4.1-mini

Pricing (per 1M tokens)

Pricing (per 1M tokens)	/ 1M tokens
Input / 1M	$0.42
Output / 1M	$1.68
Cache Read	$0.10

Final prices shown

Quick Start

main.py

from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="openai/gpt-4.1-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

GPT 4.1 Mini

About this model

Highlights

Best For

Capabilities

Aliases

Pricing (per 1M tokens)

Quick Start

FAQ

Related Models

GPT 4.1

GPT 4.1 Nano

GPT 4o

GPT 4o Mini

GPT 4o Audio Preview

O3

GPT 4.1 Mini

About this model

Highlights

Best For

Capabilities

Aliases

Pricing (per 1M tokens)

Quick Start

FAQ

How do I get an API Key?

How does billing work?

What payment methods are supported?

Are there rate limits?

How is this different from the official OpenAI API?

Does vision (image input) work?

Are o3/o4-mini reasoning models supported?

Related Models

GPT 4.1

GPT 4.1 Nano

GPT 4o

GPT 4o Mini

GPT 4o Audio Preview

O3