Chat with R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [Dee

No signup required — try it as a guest. 30,000 free tokens every day once you sign up.

Provider
DeepSeek
Model slug
deepseek/deepseek-r1-distill-llama-70b
Typical cost
Around 689–1,725 tokens per typical message. 15M Pro tokens buy roughly 8,695–21…
Availability
On Faceb.ai · chat + API

About R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techn

What it's good at

1

131,072-token context window — enough for long documents.

2

Extremely low per-token price compared to frontier models — good for high-volume workloads.

3

Hosted by DeepSeek — you can access it here alongside GPT-4o, Claude, Gemini and 100+ more on one plan.

4

Switch to any other model mid-conversation from the picker.

Pricing on Faceb.ai

Around 689–1,725 tokens per typical message. 15M Pro tokens buy roughly 8,695–21,770 messages.

Use R1 Distill Llama 70B from the API

OpenAI-compatible. Same Faceb.ai tokens cover chat and API. Drop-in replacement for the OpenAI SDK.

curl https://api.faceb.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-faceb-YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek/deepseek-r1-distill-llama-70b",
    "messages": [{"role": "user", "content": "Hello!"}],
    "stream": true
  }'
from openai import OpenAI

client = OpenAI(
    base_url="https://api.faceb.ai/v1",
    api_key="sk-faceb-YOUR_KEY",
)

stream = client.chat.completions.create(
    model="deepseek/deepseek-r1-distill-llama-70b",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True,
)
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="", flush=True)
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.faceb.ai/v1",
  apiKey: "sk-faceb-YOUR_KEY",
});

const stream = await client.chat.completions.create({
  model: "deepseek/deepseek-r1-distill-llama-70b",
  messages: [{ role: "user", content: "Hello!" }],
  stream: true,
});
for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content || "");
}

Also works for image generation (image-output model slugs return image_url content parts) and web search (add "web_search": true to the payload). Same endpoint, same wallet.

Full API docs → · Get an API key →

Frequently asked — R1 Distill Llama 70B

What is R1 Distill Llama 70B?

R1 Distill Llama 70B is a chat/completion model served by DeepSeek and accessed through Faceb.ai. Try it without signing up — guest users get a small pool of session tokens to experiment.

Is R1 Distill Llama 70B free on Faceb.ai?

You get 30,000 tokens free every day — usually enough for a handful of messages a day on this model. Need more? Pro is $14.99/month for 15M tokens, or top-ups from $5.

What's R1 Distill Llama 70B's context window?

131,072 tokens. Paste your source material in, no need to truncate.

Can I call R1 Distill Llama 70B from the API?

Yes. Any API key from /account/api/ works with model slug `deepseek/deepseek-r1-distill-llama-70b`. The OpenAI SDK works with base_url=https://api.faceb.ai/v1.

How does R1 Distill Llama 70B compare to GPT-4o or Claude?

Depends on the task. Faceb.ai lets you switch models per message — benchmark side-by-side by asking both the same prompt, which is more reliable than abstract comparisons.

How much does R1 Distill Llama 70B cost per message here?

Around 689–1,725 tokens per typical message. 15M Pro tokens buy roughly 8,695–21,770 messages.

Does Faceb.ai train on my R1 Distill Llama 70B prompts?

No. We contractually request that upstream providers not train on content routed through us. Your chat history lives only on your account.

Is R1 Distill Llama 70B good for coding?

It depends on the model size and training mix. For serious code work the developer favourites are Claude 3.5 Sonnet and DeepSeek V3; for quick edits, most capable models work fine.

Can I use R1 Distill Llama 70B on the API with the OpenAI SDK?

Yes — point your SDK at https://api.faceb.ai/v1 and use this model's slug as the model parameter. Everything else works as normal.

Does R1 Distill Llama 70B support image inputs?

Check the model catalog — multimodal models are marked in the picker. If it accepts images, you can drop screenshots and diagrams straight into the chat.

Can I switch from R1 Distill Llama 70B to another model mid-chat?

Yes — the picker is always at the top of the chat. Previous context carries over.

Will newer versions of R1 Distill Llama 70B show up here automatically?

Yes. Our catalog auto-fetches from the upstream aggregator, so provider updates and new versions appear in the picker as soon as they're available.

Or try a different model

Your Faceb.ai tokens work for every model — switch per message, no extra subscriptions.

Ready to chat?

One subscription covers every frontier model — switch between them per message. No extra API keys, no extra bills.

Start chatting with R1 Distill Llama 70B → Go Pro · $14.99/mo