SOLAR 10.7B: How Depth Upscaling Makes a 10B Model Beat 30B Models

Upstage's SOLAR 10.7B uses depth upscaling - duplicating and fine-tuning Llama 2 layers - to create a model that outperforms 30B-class models on the HuggingFace leaderboard while remaining practical to serve.

Mahmudul Haque Qudrati

CEO & ML Engineer

April 14, 2026

7 min read

// tags

#solar#upstage#depth-upscaling#korean#open-source

FIG. ART-18

7 min read

“

SOLAR 10.7B: How Depth Upscaling Makes a 10B Model Beat 30B Models

// reading plan

sections

530

words

min read

// Developer Tools

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

Open Code Review is an open-source CLI tool from Alibaba that uses AI to review code changes. It runs locally, supports multiple LLMs, and costs about $0.01 per review. Here's a practical breakdown.

4 min read

// Machine Learning

ONNX: Export Any ML Model and Run It Anywhere

Korean and English Bilingual Strength

Upstage is a South Korean AI company, and SOLAR 10.7B was trained with strong Korean language data alongside English. This makes it notable among open-source models for Korean language tasks:

Korean MMLU: outperforms models twice its size that were not specifically trained for Korean
Korean instruction following: the instruct variant handles polite/formal Korean register correctly
Code-switching (Korean + English in same conversation): handled gracefully

Using the Instruct Variant

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_UPSTAGE_KEY",
    base_url="https://api.upstage.ai/v1/solar",
)

response = client.chat.completions.create(
    model="solar-1-mini-chat",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain the depth upscaling technique in simple terms."},
    ],
)
print(response.choices[0].message.content)

Self-Hosting with Ollama

ollama pull solar
ollama run solar

At 10.7B parameters, SOLAR runs comfortably on a machine with 16GB VRAM or 32GB unified memory (MacBook Pro M2). In Q4_K_M quantization it requires about 7GB, making it viable on consumer GPUs like the RTX 3080 10GB.

Apache 2.0 License

SOLAR 10.7B is licensed under Apache 2.0 - fully permissive for commercial use without attribution requirements or usage restrictions. This is an important distinction from Llama 2's custom license (which has user-count thresholds) and makes SOLAR suitable for building commercial products.

When to Choose SOLAR

You need a 10B model that punches above its weight class on English and Korean
You want Apache 2.0 commercial licensing without restrictions
You are running on hardware that fits 7 - 16GB VRAM
You want a model that demonstrates the depth upscaling technique for your own fine-tuning research

Benchmark	SOLAR 10.7B	Llama 2 70B	Mistral 7B
Average (4-task)	74.2	67.9	60.1
ARC	66.5	67.3	59.9
HellaSwag	88.1	87.3	81.3
MMLU	65.5	68.9	64.2
TruthfulQA	76.8	44.9	45.5

SOLAR 10.7B: How Depth Upscaling Makes a 10B Model Beat 30B Models

Related Articles

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

What Is Depth Upscaling?

HuggingFace Leaderboard Performance

Korean and English Bilingual Strength

Using the Instruct Variant

Self-Hosting with Ollama

Apache 2.0 License

When to Choose SOLAR

Links

The workspace your team
actually needs

AI & ML insights, weekly

Mahmudul Haque Qudrati

ONNX: Export Any ML Model and Run It Anywhere

Supervised Learning Explained: How Models Learn from Labeled Examples

SOLAR 10.7B: How Depth Upscaling Makes a 10B Model Beat 30B Models

Related Articles

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

What Is Depth Upscaling?

HuggingFace Leaderboard Performance

Korean and English Bilingual Strength

Using the Instruct Variant

Self-Hosting with Ollama

Apache 2.0 License

When to Choose SOLAR

Links

The workspace your teamactually needs

AI & ML insights, weekly

Mahmudul Haque Qudrati

ONNX: Export Any ML Model and Run It Anywhere

Supervised Learning Explained: How Models Learn from Labeled Examples

The workspace your team
actually needs