Haystack 2.0: The Production RAG Pipeline Framework by deepset

Haystack 2.0 from deepset offers a component-based pipeline architecture for building production RAG systems, with YAML-defined pipelines and a built-in evaluation framework.

Mahmudul Haque Qudrati

CEO & ML Engineer

April 1, 2026

8 min read

// tags

#haystack#rag#pipeline#deepset#production

FIG. ART-31

8 min read

“

Haystack 2.0: The Production RAG Pipeline Framework by deepset

// reading plan

sections

381

words

min read

// AI Agents

Building reliable agentic AI systems: A Practical Overview

A practical guide to building reliable agentic AI systems covering structured outputs, observability, fallbacks, and cost controls with real code examples.

4 min read

// Developer Tools

What is SpaceX Is Buying Cursor? A Practical Overview

What Is Haystack?

Haystack is an open-source NLP framework by deepset designed specifically for building production retrieval-augmented generation (RAG) pipelines. Version 2.0, released in 2024, was a complete redesign that made the framework more modular and production-ready.

While LangChain and LlamaIndex are broad frameworks covering many LLM use cases, Haystack is narrower and deeper: it's built around the retrieval pipeline as the core abstraction, and it shows in the quality of the retrieval-specific tooling.

Component-Based Architecture

Every Haystack pipeline is a directed acyclic graph of components. Each component has typed inputs and outputs, making pipelines composable and testable in isolation.

The four key component types:

DocumentStore - the vector database (Elasticsearch, OpenSearch, Weaviate, Qdrant, Chroma, in-memory)
Retriever - fetches relevant documents (BM25, dense, hybrid)
Ranker - re-ranks retrieved documents for relevance
Generator - the LLM that produces the final answer

from haystack import Pipeline
from haystack.components.retrievers.in_memory import InMemoryBM25Retriever
from haystack.components.generators import OpenAIGenerator
from haystack.components.builders import RAGPromptBuilder
from haystack.document_stores.in_memory import InMemoryDocumentStore

document_store = InMemoryDocumentStore()
# ... index documents ...

pipeline = Pipeline()
pipeline.add_component("retriever", InMemoryBM25Retriever(document_store=document_store))
pipeline.add_component("prompt_builder", RAGPromptBuilder(template=prompt_template))
pipeline.add_component("llm", OpenAIGenerator(model="gpt-4o-mini"))

pipeline.connect("retriever", "prompt_builder.documents")
pipeline.connect("prompt_builder", "llm")

result = pipeline.run({"retriever": {"query": "What is RAG?"}})

Haystack 2.0: The Production RAG Pipeline Framework by deepset

Related Articles

Building reliable agentic AI systems: A Practical Overview

What is SpaceX Is Buying Cursor? A Practical Overview

What Is Haystack?

Component-Based Architecture

YAML Pipeline Definition

Hybrid Retrieval

Evaluation Framework

deepset Cloud

Haystack vs LangChain vs LlamaIndex

Resources

The workspace your team
actually needs

AI & ML insights, weekly

Mahmudul Haque Qudrati

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

Haystack 2.0: The Production RAG Pipeline Framework by deepset

Related Articles

Building reliable agentic AI systems: A Practical Overview

What is SpaceX Is Buying Cursor? A Practical Overview

What Is Haystack?

Component-Based Architecture

YAML Pipeline Definition

Hybrid Retrieval

Evaluation Framework

deepset Cloud

Haystack vs LangChain vs LlamaIndex

Resources

The workspace your teamactually needs

AI & ML insights, weekly

Mahmudul Haque Qudrati

Open Code Review – An AI-powered code review CLI tool: A Practical Overview

The workspace your team
actually needs