New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
'docs_bulk'-functionality for ingest attachment-plugin ('pipeline_attachment') ? #253
Milestone
Comments
thanks for the detailed report, i'll take a look soon |
Great. |
So I make sure I understand:
|
To answer your questions:
|
Yes, it was the httr pkg back then Okay, just pushed a change, I think this should work for you now, if you're still interested. See the query param in docs_bulk |
@Aeilert 👆 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I have a question about pushing documents in bulk with the ingest attachment-plugin. This used to work by setting an additional parameter,
query = 'pipeline=attachment'
, indocs_bulk
(tested with version 0.8.4), but no longer seems to work with the current version of the package.When using
docs_bulk
with a pipeline like the one below, the data is pushed through to Elasticsearch, but not using the plugin. The result is a an index containing a base64-encodeddata
-field, and not a list offulltext
-fields like you would expect.This does not work
I could of course use
pipeline_attachment
but I have several thousands files and want to take advantage of the bulk API. Maybe this could be solved with adocs_bulk
wrapper forpipeline_attachment
? Or just a parameter to add 'pipeline=attachment' to thePOST
-statement (not sure why passingquery
-option tocrul
doesn't work)?As an example of the functionality I'm looking for I created a simple wrapper-function for
pipeline_attachment
. I'm not saying this should be the solution. It's just to illustrate the functionality.This does work
I'm using R 3.5.3. and Elasticsearch 7.0.0 w/ Docker. I have installed the ingest-attachment plugin. See below for other session info.
Session Info
Dockerfile
The text was updated successfully, but these errors were encountered: