Chat with Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, a

No signup required — try it as a guest. 30,000 free tokens every day once you sign up.

Provider
Google
Model slug
google/gemini-3.1-flash-lite
Typical cost
Around 975–2,437 tokens per typical message. 15M Pro tokens buy roughly 6,155–15…
Availability
On Faceb.ai · chat + API

About Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

What it's good at

1

Very large context window — 1,048,576 tokens (paste entire codebases or long PDFs).

2

Extremely low per-token price compared to frontier models — good for high-volume workloads.

3

Hosted by Google — you can access it here alongside GPT-4o, Claude, Gemini and 100+ more on one plan.

4

Switch to any other model mid-conversation from the picker.

Pricing on Faceb.ai

Around 975–2,437 tokens per typical message. 15M Pro tokens buy roughly 6,155–15,384 messages.

Use Gemini 3.1 Flash Lite from the API

OpenAI-compatible. Same Faceb.ai tokens cover chat and API. Drop-in replacement for the OpenAI SDK.

curl https://api.faceb.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-faceb-YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-3.1-flash-lite",
    "messages": [{"role": "user", "content": "Hello!"}],
    "stream": true
  }'
from openai import OpenAI

client = OpenAI(
    base_url="https://api.faceb.ai/v1",
    api_key="sk-faceb-YOUR_KEY",
)

stream = client.chat.completions.create(
    model="google/gemini-3.1-flash-lite",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True,
)
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="", flush=True)
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.faceb.ai/v1",
  apiKey: "sk-faceb-YOUR_KEY",
});

const stream = await client.chat.completions.create({
  model: "google/gemini-3.1-flash-lite",
  messages: [{ role: "user", content: "Hello!" }],
  stream: true,
});
for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content || "");
}

Also works for image generation (image-output model slugs return image_url content parts) and web search (add "web_search": true to the payload). Same endpoint, same wallet.

Full API docs → · Get an API key →

Frequently asked — Gemini 3.1 Flash Lite

What is Gemini 3.1 Flash Lite?

Gemini 3.1 Flash Lite is a chat/completion model served by Google and accessed through Faceb.ai. Try it without signing up — guest users get a small pool of session tokens to experiment.

Is Gemini 3.1 Flash Lite free on Faceb.ai?

You get 30,000 tokens free every day — usually enough for a handful of messages a day on this model. Need more? Pro is $14.99/month for 15M tokens, or top-ups from $5.

What's Gemini 3.1 Flash Lite's context window?

1,048,576 tokens. Paste your source material in, no need to truncate.

Can I call Gemini 3.1 Flash Lite from the API?

Yes. Any API key from /account/api/ works with model slug `google/gemini-3.1-flash-lite`. The OpenAI SDK works with base_url=https://api.faceb.ai/v1.

How does Gemini 3.1 Flash Lite compare to GPT-4o or Claude?

Depends on the task. Faceb.ai lets you switch models per message — benchmark side-by-side by asking both the same prompt, which is more reliable than abstract comparisons.

How much does Gemini 3.1 Flash Lite cost per message here?

Around 975–2,437 tokens per typical message. 15M Pro tokens buy roughly 6,155–15,384 messages.

Does Faceb.ai train on my Gemini 3.1 Flash Lite prompts?

No. We contractually request that upstream providers not train on content routed through us. Your chat history lives only on your account.

Is Gemini 3.1 Flash Lite good for coding?

It depends on the model size and training mix. For serious code work the developer favourites are Claude 3.5 Sonnet and DeepSeek V3; for quick edits, most capable models work fine.

Can I use Gemini 3.1 Flash Lite on the API with the OpenAI SDK?

Yes — point your SDK at https://api.faceb.ai/v1 and use this model's slug as the model parameter. Everything else works as normal.

Does Gemini 3.1 Flash Lite support image inputs?

Check the model catalog — multimodal models are marked in the picker. If it accepts images, you can drop screenshots and diagrams straight into the chat.

Can I switch from Gemini 3.1 Flash Lite to another model mid-chat?

Yes — the picker is always at the top of the chat. Previous context carries over.

Will newer versions of Gemini 3.1 Flash Lite show up here automatically?

Yes. Our catalog auto-fetches from the upstream aggregator, so provider updates and new versions appear in the picker as soon as they're available.

Or try a different model

Your Faceb.ai tokens work for every model — switch per message, no extra subscriptions.

Ready to chat?

One subscription covers every frontier model — switch between them per message. No extra API keys, no extra bills.

Start chatting with Gemini 3.1 Flash Lite → Go Pro · $14.99/mo