Question 1

What is Gemma 4?

Accepted Answer

Gemma 4 is Google DeepMind's latest family of lightweight, state-of-the-art open models. It's built from the same research and technology used to create Gemini, but released with open weights so you can download, inspect, fine-tune, and deploy the models on your own infrastructure.

Question 2

How is Gemma 4 different from Gemini?

Accepted Answer

Gemini is Google's closed, hosted flagship model, available through Google's APIs. Gemma 4 shares much of the underlying research but is released as open weights under a permissive license, so you can run it locally, fine-tune it for your own data, and deploy it without sending requests to Google.

Question 3

Is Gemma 4 free to use commercially?

Accepted Answer

Yes. Gemma 4 is released under the Gemma license, which permits both research and commercial use. You are free to build products, services, and businesses on top of Gemma 4 — including fine-tuned derivatives — subject to the license's responsible use policy.

Question 4

What hardware do I need to run it?

Accepted Answer

It depends on the size. Gemma 4 Nano (2B) runs comfortably on a modern laptop or phone. The 9B and 27B models run on a single high-end GPU such as an NVIDIA RTX 4090 or H100. The 70B Ultra model is best suited for multi-GPU servers or TPU pods. Quantized variants (GGUF, AWQ) reduce requirements further for on-device use.

Question 5

What are the "Thinking" variants?

Accepted Answer

Thinking variants are Gemma 4 models trained to reason step-by-step before producing a final answer. They trade a small amount of latency for substantially better performance on math, science, coding, and multi-step reasoning benchmarks — as seen in our AIME 2026 and GPQA Diamond results.

Question 6

Where can I download Gemma 4?

Accepted Answer

Gemma 4 is available through Google AI Studio, Vertex AI, Kaggle, Hugging Face, and Ollama. You can also run it directly with popular inference frameworks including llama.cpp, vLLM, and MLX for Apple Silicon.

Question 7

Can I fine-tune Gemma 4 on my own data?

Accepted Answer

Absolutely. Gemma 4 supports the full range of fine-tuning techniques: full supervised fine-tuning, LoRA, QLoRA, DPO, and RLHF. We provide reference training recipes and notebooks for all four model sizes to help you get started.

Question 8

How does Gemma 4 handle safety?

Accepted Answer

Every Gemma 4 release goes through extensive safety evaluation including red-teaming, bias testing, and responsible AI reviews. We publish detailed model cards for each variant and ship with a built-in responsible use policy. Because the weights are open, the broader research community can audit and improve safety as well.

Question 9

What's new compared to Gemma 3?

Accepted Answer

Gemma 4 introduces Thinking variants for step-by-step reasoning, a new sparse Mixture-of-Experts architecture (26B A4B), native multimodal input across the entire family, a 128K context window, and substantial gains on math, coding, and agentic benchmarks. Gemma 4 27B IT beats Gemma 3 27B IT by over 25 points on AIME 2026 and nearly triples its score on LiveCodeBench v6.

Question 10

Does Gemma 4 support tool use and function calling?

Accepted Answer

Yes. All instruction-tuned variants were trained with structured tool-use data and support function calling via a standard JSON schema. They can plan multi-step workflows, invoke external APIs, and recover from tool errors — as reflected in our τ2-bench retail scores.

Question 11

Which languages are supported?

Accepted Answer

Gemma 4 was trained on more than 140 languages with balanced representation across European, Asian, African, and Indic language families. Instruction tuning covers the top 40 languages with human-verified evaluations; the remaining languages benefit from strong transfer learning.

Question 12

How do I report a bug or request a feature?

Accepted Answer

The Gemma community lives on GitHub, the Google Developer forums, and the Hugging Face discussions board. Security-sensitive reports can be sent privately to the DeepMind responsible disclosure address listed in every model card.

Benchmark		Gemma 4 31B IT Thinking	Gemma 4 26B A4B IT Thinking	Gemma 4 E4B IT Thinking	Gemma 4 E2B IT Thinking	Gemma 3 27B IT
Arena AI (text) As of 4/2/26		1452	1441	—	—	1365
MMMLU Multilingual Q&A	No tools	85.2%	82.6%	69.4%	60.0%	67.6%
MMMU Pro Multimodal reasoning		76.9%	73.8%	52.6%	44.2%	49.7%
AIME 2026 Mathematics	No tools	89.2%	88.3%	42.5%	37.5%	20.8%
LiveCodeBench v6 Competitive coding problems		80.0%	77.1%	52.0%	44.0%	29.1%
GPQA Diamond Scientific knowledge	No tools	84.3%	82.3%	58.6%	43.4%	42.4%
τ2-bench Agentic tool use	Retail	86.4%	85.5%	57.5%	29.4%	6.6%

Meet Gemma 4

Open intelligence,
powered by Gemini research.

Trained on 14 trillion tokens

New Mixture-of-Experts routing

Natively multimodal

Open weights, open tools

Built for builders

Open weights

Multimodal reasoning

Runs anywhere

128K context

Responsible by design

140+ languages

One family. Four sizes.

Gemma 4 Nano

Gemma 4 Small

Gemma 4 Pro

Gemma 4 Ultra

Benchmark performance

How does Gemma 4 stack up?

Frequently asked questions

Start building with Gemma 4