Pipelines tutorial #991

brandenchan · 2021-04-22T10:50:23Z

This PR adds a notebook and a .py tutorial that show what can be done with Haystack pipelines.

review-notebook-app · 2021-04-22T10:50:27Z

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

brandenchan · 2021-04-22T11:06:12Z

Note that the Colab notebook has not been fully tested yet.

tholor

Looking good. Added a few comments + requests for minor changes.

One bigger question: Any particular reason to not use a pipeline for indexing the data in the beginning? I think it would make the picture complete.

tholor · 2021-04-22T14:18:36Z

haystack/utils.py

@@ -29,6 +29,39 @@ def launch_es():
        time.sleep(15)


+def launch_milvus():
+    # Start a Milvus server
+    # You can start Elasticsearch on your local machine instance using Docker. If Docker is not readily available in


Update the comment here for Milvus / remove the "executable part"

brandenchan · Apr 26, 2021

What exactly do you mean by the "executable part"?

tholor · 2021-04-22T14:25:01Z

haystack/utils.py

+        pp.pprint(results)
+
+
+def print_answers_gen(results: dict):


I assume this is for generated answers? Can't we adjust print_answers() or adjust the result format from the generator so that we don't need a special method here? I think we should try to "standardize" the formats more in the next weeks. The more special helpers we add on top now, the more we'll have to refactor again soon...

brandenchan · Apr 26, 2021

I have merged this function into print_answers() in a rather hacky way, anticipating that Reader and Generator outputs will be standardized and that we can come up with a proper solution once we have decided on that format.
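For illustration only, here is a minimal sketch of how one helper could cope with both Reader-style and Generator-style results. It is not the code merged in this PR. The `details` levels, the field names (`answer`, `context`, `score`), and the helper names `select_answer_fields` and `print_answers_sketch` are assumptions made for the example, since the output formats had not been standardized at this point.

```python
import pprint
from typing import Any, Dict, List

# Assumed detail levels and field names; not taken from the Haystack codebase.
FIELDS_BY_DETAIL = {
    "minimal": ("answer", "context"),
    "medium": ("answer", "context", "score"),
}


def select_answer_fields(results: Dict[str, Any], details: str = "all") -> List[Dict[str, Any]]:
    """Return the answers, keeping only the fields requested by `details`.

    Fields that an answer does not have (for example, a generated answer
    without a "context") are skipped instead of raising a KeyError.
    """
    answers = results.get("answers", [])
    if details == "all":
        return [dict(answer) for answer in answers]
    keys = FIELDS_BY_DETAIL[details]
    return [{k: answer[k] for k in keys if k in answer} for answer in answers]


def print_answers_sketch(results: Dict[str, Any], details: str = "all") -> None:
    """Pretty-print the selected answer fields."""
    pprint.PrettyPrinter(indent=4).pprint(select_answer_fields(results, details))
```

Skipping missing keys is what lets a single function serve both result shapes. Once the formats are standardized, the lookup table could be replaced by a fixed schema.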

tutorials/Tutorial11_Pipelines.py

tutorials/Tutorial11_Pipelines.ipynb

brandenchan · 2021-04-29T15:31:15Z

> Looking good. Added a few comments + requests for minor changes.
>
> One bigger question: Any particular reason to not use a pipeline for indexing the data in the beginning? I think it would make the picture complete.

I am not very familiar with indexing pipelines yet, and I couldn't find a good example. Also, with the current tutorial setup, it might overcomplicate what is otherwise very simple indexing.

For the sake of getting this tutorial out there, I am going to move ahead with merging this. Adding an indexing pipeline is in the backlog (#992), and we can think of a good way to incorporate indexing pipelines when we revisit it.
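To make the idea of an indexing pipeline concrete, here is a toy sketch in plain Python. It does not use Haystack's API. `InMemoryStore`, `convert`, `split_into_passages`, and `run_indexing` are hypothetical names invented for this example, which only shows the general shape: each step transforms the data, and the last stage writes to a store.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Document = Dict[str, str]


@dataclass
class InMemoryStore:
    """Hypothetical stand-in for a document store."""
    documents: List[Document] = field(default_factory=list)

    def write_documents(self, docs: List[Document]) -> None:
        self.documents.extend(docs)


def convert(raw_texts: List[str]) -> List[Document]:
    """Wrap raw strings as documents (stand-in for a file converter)."""
    return [{"text": text} for text in raw_texts]


def split_into_passages(docs: List[Document], max_words: int = 5) -> List[Document]:
    """Split each document into passages of at most `max_words` words."""
    passages: List[Document] = []
    for doc in docs:
        words = doc["text"].split()
        for start in range(0, len(words), max_words):
            passages.append({"text": " ".join(words[start:start + max_words])})
    return passages


def run_indexing(raw_texts: List[str], store: InMemoryStore, steps: List[Callable]) -> int:
    """Run the steps in order, write the result to the store, return the count."""
    data = raw_texts
    for step in steps:
        data = step(data)
    store.write_documents(data)
    return len(data)
```

A call such as `run_indexing(texts, InMemoryStore(), [convert, split_into_passages])` mirrors the convert, preprocess, and write stages that an indexing pipeline would chain together.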

brandenchan added 3 commits April 19, 2021 18:20

b17960d Start Pipelines tutorial
6bb9337 Make Tutorial 11 run locally
51c7ab0 Add colab compatibility

brandenchan self-assigned this Apr 22, 2021

brandenchan requested a review from tholor April 22, 2021 10:50

brandenchan added 2 commits April 22, 2021 12:51

53d3422 Fix pip install
cd53f7a Add ES install from source

brandenchan added 2 commits April 22, 2021 15:35

8db5127 Add ES install from source
7cd85e3 Add pygraphviz installation

tholor requested changes Apr 22, 2021

tholor reviewed Apr 22, 2021

brandenchan added 3 commits April 26, 2021 17:59

299ff2f Incorporate reviewer feedback
68b7c3f Ensure print_answers() works for Generator output
3d6f664 Fix typo

brandenchan merged commit 9827b36 into master Apr 29, 2021

brandenchan deleted the pipelines_tutorial branch April 29, 2021 15:31

brandenchan mentioned this pull request May 3, 2021

Add Pipeline Tutorial #798

Closed
