Llama 4 Scout

About this model

Efficient Llama 4 variant with 512K context

This model supports up to 512K tokens of context. It includes native vision understanding for analyzing images and documents. It provides strong code generation and debugging capabilities.

Access it through Chuizi.AI with a single ck- API key — no separate Meta account needed.

Highlights

512K context window

33K max output

Native vision support

Strong code generation

Best For

Code generationRefactoringDebuggingDocumentation

2025-04-05

Capabilities

ChatVisionCodetools

Pricing (per 1M tokens)

Pricing (per 1M tokens)	/ 1M tokens
Input / 1M	$0.619
Output / 1M	$0.924

Final prices shown

Quick Start

main.py

from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="meta/llama-4-scout",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

Llama 4 Scout

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

Related Models

Llama 4 Maverick

Llama 3.3 70b

Llama 3.1 405b

Llama 3.1 70b

Llama 3.1 8b

Llama 3.2 90b

Llama 4 Scout

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

How do I get an API Key?

How does billing work?

What payment methods are supported?

Are there rate limits?

Are Llama models really that cheap?

Related Models

Llama 4 Maverick

Llama 3.3 70b

Llama 3.1 405b

Llama 3.1 70b

Llama 3.1 8b

Llama 3.2 90b