Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate Components to Pipeline v2 #5265

Closed
25 of 28 tasks
julian-risch opened this issue Jul 4, 2023 · 1 comment
Closed
25 of 28 tasks

Migrate Components to Pipeline v2 #5265

julian-risch opened this issue Jul 4, 2023 · 1 comment

Comments

@julian-risch
Copy link
Member

julian-risch commented Jul 4, 2023

We are working on Haystack 2.0, with a major refactoring of pipelines and components.

Rationale

We need to prioritize the list of components and separately the list of document stores to migrate to pipelines v2. Most risky components and components essential to most pipelines should be migrated first. Let's also collect feedback on what components are most relevant to Sol (@sjrl) to enable them to give feedback early on based on real use cases. Let's also use telemetry data to see what components are most important to the community.

Use cases

List of the usecases to support, in priority order, with a list of the bare minimum components required for them to work. Note: every pipeline needs the components of all the pipelines above it in priority order in order to work.

Each "component type" links to another small epic where the specific component is broken down into a set of requirements, which might eventually be covered by one or more v2 components.

1. Document Search

Tasks

  1. 3 of 3
    2.x
  2. ignore-for-release-notes proposal showcase
    anakin87
  3. 4 of 4
    2.x
    anakin87
  4. 3 of 3
    2.x P1 epic
    silvanocerza

Note: Retrievers and Embedder's planning will follow the Docstores

2. Generative QA & Agent Pipelines

Tasks

  1. 10 of 13
    2.x epic
    anakin87
  2. 4 of 4
    2.x epic
    vblagoje

3. Extractive QA

Tasks

  1. 1 of 1
    2.x

4. Minimal Indexing

Tasks

  1. 2.x
    julian-risch
  2. bogdankostic
  3. 5 of 6
    2.x epic
    julian-risch

6. General Indexing

Tasks

  1. 1 of 2
    2.x good first issue hacktoberfest
    vblagoje
  2. 7 of 7
    2.x epic
    vblagoje
  3. 0 of 3
    2.x P3 epic

7. Advanced querying

Tasks

  1. 2.x epic

Agent Pipelines

Agent pipelines will need a bit of exploration to get right. I expect their main enabler to be the LLM component: any other unforeseen component that might be needed here will be prioritized accordingly.

Other

Tasks

  1. ZanSara
  2. ZanSara
  3. ZanSara
  4. ignore-for-release-notes proposal showcase
    anakin87
  5. 2.x
  6. 2.x
    silvanocerza
  7. 2.x
    julian-risch
  8. 3 of 3
    2.x
  9. 1 of 1
    2.x
  10. 2.x
  11. 1 of 1
    2.x
  12. 6 of 6
    2.x epic
    ZanSara

Developer relations efforts

  • Have initial demonstration and preview content on Pipelins v2: articles, demos and videos
  • Set up the Haystack website to house both v1 and v2 content: tutorials, Integrations, articles

Context

@julian-risch julian-risch changed the title Pipeline v2 Migrate Components to Pipeline v2 Jul 5, 2023
@masci
Copy link
Contributor

masci commented Oct 2, 2023

Unfinished items were added to the roadmap for Q4, closing this one as complete

@masci masci closed this as completed Oct 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Development

No branches or pull requests

4 participants