Gemma 3 27B: Google's Multimodal Open Model That Surpasses Llama 3.3

Gemma 3 27B scores 67.5% on MMLU while adding native image understanding and 128k context - outperforming Llama 3.3 70B on MMLU at less than half the parameter count.

Mahmudul Haque Qudrati

CEO & ML Engineer

April 28, 2026

7 min read

// tags

#gemma-3#google#multimodal#open-source#128k-context

FIG. ART-29

7 min read

“

Gemma 3 27B: Google's Multimodal Open Model That Surpasses Llama 3.3

// reading plan

sections

440

words

min read

// Developer Tools

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

Open Code Review is an open-source CLI tool from Alibaba that uses AI to review code changes. It runs locally, supports multiple LLMs, and costs about $0.01 per review. Here's a practical breakdown.

4 min read

// Open Source AI

OpenCode vs Claude Code: Open-Source Agentic CLI Compared

Multimodal Image Understanding

from transformers import AutoProcessor, Gemma3ForConditionalGeneration
import torch
from PIL import Image

model_id = "google/gemma-3-27b-it"
processor = AutoProcessor.from_pretrained(model_id)
model = Gemma3ForConditionalGeneration.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

image = Image.open("product_photo.jpg")

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "image": image},
            {"type": "text", "text": "Describe all the product details visible in this image, including brand, model, color, and condition."},
        ],
    }
]

inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_tensors="pt",
).to(model.device)

output = model.generate(**inputs, max_new_tokens=512)
print(processor.decode(output[0], skip_special_tokens=True))

140+ Language Support

Gemma 3 was trained on a significantly more multilingual dataset than Gemma 2. While Llama 3.3 70B is primarily English with good multilingual coverage, Gemma 3 was explicitly designed for breadth - 140+ languages with quality emphasis on under-resourced languages.

This makes Gemma 3 relevant for deployments targeting South Asian, African, and Southeast Asian language markets where most frontier models have limited training data.

Free GPU Access via Kaggle

Kaggle provides free access to Nvidia T4 (15GB) and P100 (16GB) GPUs. For Gemma 3 27B, you need the quantized version (Q4_K_M at ~16GB), which fits on Kaggle's free tier:

# Kaggle notebook  -  free T4 GPU
!pip install -q bitsandbytes transformers accelerate

from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(load_in_4bit=True)
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-3-27b-it",
    quantization_config=quantization_config,
    device_map="auto",
)

KerasHub Integration

import keras_hub

gemma = keras_hub.models.Gemma3CausalLM.from_preset("gemma3_instruct_27b")
gemma.generate("Explain gradient descent in three sentences:", max_length=200)

KerasHub handles weight loading, tokenization, and generation in a few lines. It also supports JAX backends, enabling TPU inference for high-throughput serving.

Gemma 3 27B: Google's Multimodal Open Model That Surpasses Llama 3.3

Related Articles

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

The Gemma 3 Family

Benchmark Results

Multimodal Image Understanding

140+ Language Support

Free GPU Access via Kaggle

KerasHub Integration

Links

The workspace your team
actually needs

AI & ML insights, weekly

Mahmudul Haque Qudrati

OpenCode vs Claude Code: Open-Source Agentic CLI Compared

DeepSeek V4 Pro and Kimi K2.6 vs Claude Opus 4.8: Open Weights at Frontier Level

Model	MMLU	Parameters	Context
Gemma 3 27B	67.5%	27B	128k
Llama 3.3 70B	65.4%	70B	128k
Gemma 3 12B	62.4%	12B	128k
Qwen 2.5 32B	71.1%	32B	128k

Gemma 3 27B: Google's Multimodal Open Model That Surpasses Llama 3.3

Related Articles

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

The Gemma 3 Family

Benchmark Results

Multimodal Image Understanding

140+ Language Support

Free GPU Access via Kaggle

KerasHub Integration

Links

The workspace your teamactually needs

AI & ML insights, weekly

Mahmudul Haque Qudrati

OpenCode vs Claude Code: Open-Source Agentic CLI Compared

DeepSeek V4 Pro and Kimi K2.6 vs Claude Opus 4.8: Open Weights at Frontier Level

The workspace your team
actually needs