GitHub - deepset-ai/haystack: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.


CI/CD
Docs
Package
Meta

Haystack is an end-to-end LLM framework that allows you to build applications powered by LLMs, Transformer models, vector search and more. Whether you want to perform retrieval-augmented generation (RAG), document search, question answering or answer generation, Haystack can orchestrate state-of-the-art embedding models and LLMs into pipelines to build end-to-end NLP applications and solve your use case.

Installation

The simplest way to get Haystack is via pip:

pip install haystack-ai

Install from the main branch to try the newest features:

pip install git+https://github.com/deepset-ai/haystack.git@main

Haystack supports multiple installation methods including Docker images. For a comprehensive guide please refer to the documentation.

Documentation

If you're new to the project, check out "What is Haystack?" then go through the "Get Started Guide" and build your first LLM application in a matter of minutes. Keep learning with the tutorials. For more advanced use cases, or just to get some inspiration, you can browse our Haystack recipes in the Cookbook.

At any given point, hit the documentation to learn more about Haystack, what can it do for you and the technology behind.

Features

Technology agnostic: Allow users the flexibility to decide what vendor or technology they want and make it easy to switch out any component for another. Haystack allows you to use and compare models available from OpenAI, Cohere and Hugging Face, as well as your own local models or models hosted on Azure, Bedrock and SageMaker.
Explicit: Make it transparent how different moving parts can “talk” to each other so it's easier to fit your tech stack and use case.
Flexible: Haystack provides all tooling in one place: database access, file conversion, cleaning, splitting, training, eval, inference, and more. And whenever custom behavior is desirable, it's easy to create custom components.
Extensible: Provide a uniform and easy way for the community and third parties to build their own components and foster an open ecosystem around Haystack.

Some examples of what you can do with Haystack:

Build retrieval augmented generation (RAG) by making use of one of the available vector databases and customizing your LLM interaction, the sky is the limit 🚀
Perform Question Answering in natural language to find granular answers in your documents.
Perform semantic search and retrieve documents according to meaning.
Build applications that can make complex decisions making to answer complex queries: such as systems that can resolve complex customer queries, do knowledge search on many disconnected resources and so on.
Scale to millions of docs using retrievers and production-scale components.
Use off-the-shelf models or fine-tune them to your data.
Use user feedback to evaluate, benchmark, and continuously improve your models.

Tip

Would you like to deploy and serve Haystack pipelines as REST APIs yourself? Hayhooks provides a simple way to wrap your pipelines with custom logic and expose them via HTTP endpoints, including OpenAI-compatible chat completion endpoints and compatibility with fully-featured chat interfaces like open-webui.

Haystack Enterprise: Best Practices and Expert Support

Get expert support from the Haystack team, build faster with enterprise-grade templates, and scale securely with deployment guides for cloud and on-prem environments - all with Haystack Enterprise. Read more about it our announcement post.

👉 Get Haystack Enterprise

deepset Studio: Your Development Environment for Haystack

Use deepset Studio to visually create, deploy, and test your Haystack pipelines. Learn more about it in our announcement post.

👉 Sign up!

Tip

Are you looking for a managed solution that benefits from Haystack? deepset AI Platform is our fully managed, end-to-end platform to integrate LLMs with your data, which uses Haystack for the LLM pipelines architecture.

Telemetry

Haystack collects anonymous usage statistics of pipeline components. We receive an event every time these components are initialized. This way, we know which components are most relevant to our community.

Read more about telemetry in Haystack or how you can opt out in Haystack docs.

🖖 Community

If you have a feature request or a bug report, feel free to open an issue in Github. We regularly check these and you can expect a quick response. If you'd like to discuss a topic, or get more general advice on how to make Haystack work for your project, you can start a thread in Github Discussions or our Discord channel. We also check 𝕏 (Twitter) and Stack Overflow.

Contributing to Haystack

We are very open to the community's contributions - be it a quick fix of a typo, or a completely new feature! You don't need to be a Haystack expert to provide meaningful improvements. To learn how to get started, check out our Contributor Guidelines first.

There are several ways you can contribute to Haystack:

Contribute to the main Haystack project
Contribute an integration on haystack-core-integrations

Tip

👉 Check out the full list of issues that are open to contributions

Who Uses Haystack

Here's a list of projects and companies using Haystack. Are you also using Haystack? Open a PR or tell us your story.

Tech & AI Innovators: Apple, Meta, Databricks, NVIDIA, PostHog
Public Sector: German Federal Ministry of Research, Technology, and Space (BMFTR), PD, Baden-Württemberg State
Enterprise & Telecom: Alcatel-Lucent, Intel, NOS Portugal, TELUS Agriculture & Consumer Goods
Aerospace & Hardware: Airbus, Infineon, LEGO
Media & Entertainment: Netflix, Comcast, Zeit Online, Rakuten
Legal & Publishing: Manz, Oxford University Press
Startups & Research: YPulse, BetterUp, Intel Labs

Name		Name	Last commit message	Last commit date
Latest commit History 4,348 Commits
.github		.github
docker		docker
docs-website		docs-website
docs		docs
e2e		e2e
examples		examples
haystack		haystack
releasenotes		releasenotes
test		test
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CITATION.cff		CITATION.cff
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
VERSION.txt		VERSION.txt
code_of_conduct.txt		code_of_conduct.txt
license-header.txt		license-header.txt
licenserc.toml		licenserc.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Uh oh!

Repository files navigation

Table of Contents

Installation

Documentation

Features

Haystack Enterprise: Best Practices and Expert Support

deepset Studio: Your Development Environment for Haystack

Telemetry

🖖 Community

Contributing to Haystack

Who Uses Haystack

About

Licenses found

Uh oh!

Releases 192

Used by 1.2k

Contributors 309

Languages

License

Licenses found

deepset-ai/haystack

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Installation

Documentation

Features

Haystack Enterprise: Best Practices and Expert Support

deepset Studio: Your Development Environment for Haystack

Telemetry

🖖 Community

Contributing to Haystack

Who Uses Haystack

About

Topics

Resources

License

Licenses found

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 192

Used by 1.2k

Contributors 309

Languages