Support feeding from Spark #9158

lesters · 2019-04-23T09:20:38Z

Today, the Hadoop integration tools for Vespa support Hadoop and Pig for feeding and querying Vespa. The Pig feeder is a thin wrapper around the Vespa HTTP client.

We should support feeding directly from Spark as well, to avoid Spark pipelines having to write to HDFS and run another Pig job for the actual feeding. Similarly to the Pig feeder, this could be implemented as a thin wrapper around the HTTP client.
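As a rough illustration of the "thin wrapper" idea, here is a minimal sketch of a per-partition feed function that could be handed to Spark. It assumes documents are written with a POST to Vespa's `/document/v1/{namespace}/{doctype}/docid/{id}` endpoint with a `{"fields": ...}` JSON body. The names `build_put_request`, `feed_partition`, `send`, and `id_field` are hypothetical and are not part of any existing Vespa or Spark API. The actual HTTP call is injected as `send`, so the sketch stays independent of any particular client.

```python
import json
from urllib.parse import quote


def build_put_request(endpoint, namespace, doctype, doc_id, fields):
    """Return (url, body) for writing one document via /document/v1.

    All path segments are URL-encoded so ids with spaces or slashes are safe.
    """
    url = "{}/document/v1/{}/{}/docid/{}".format(
        endpoint.rstrip("/"),
        quote(namespace, safe=""),
        quote(doctype, safe=""),
        quote(str(doc_id), safe=""),
    )
    body = json.dumps({"fields": fields})
    return url, body


def feed_partition(rows, send, endpoint, namespace, doctype, id_field="id"):
    """Feed every row (a dict) of one partition and return the number sent.

    `send(url, body)` is a hypothetical callable that performs the HTTP POST.
    The column named by `id_field` becomes the document id and is left out
    of the document fields (an assumption made for this sketch).
    """
    count = 0
    for row in rows:
        fields = {k: v for k, v in row.items() if k != id_field}
        url, body = build_put_request(
            endpoint, namespace, doctype, row[id_field], fields
        )
        send(url, body)
        count += 1
    return count
```

In PySpark this could presumably be wired up with something like `df.rdd.foreachPartition(lambda rows: feed_partition((r.asDict() for r in rows), send, endpoint, "ns", "doc"))`, so each executor feeds its own partition without first writing to HDFS. A real feeder would likely delegate to the Vespa feed client rather than issue plain HTTP calls, and would need retries and error reporting that this sketch omits.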

prasad-marne · 2023-10-09T06:25:17Z

@kkraune I don't see the Hadoop integration anymore. Do we want to have Spark support? I would be interested in taking it up.

kkraune · 2023-10-09T06:57:10Z

Hi, yes that would be a great addition! A good starting point is https://docs.vespa.ai/en/vespa-feed-client.html. Thanks!

prasad-marne · 2023-10-10T09:19:53Z

Great. I will spend some time investigating how we can design a sink in Spark.

tsafacjo · 2023-11-01T21:37:42Z

Can I take this issue?

kkraune · 2023-11-02T09:09:00Z

Sure, thanks for contributing! https://github.com/vespa-engine/vespa/blob/master/CONTRIBUTING.md is a good place to start.

lesters added the enhancement label Apr 23, 2019

frodelu added this to the later milestone Apr 24, 2019

frodelu added the good first issue label Apr 24, 2019

kkraune added the HackTogether https://yahoo.github.io/hacktogether/ label Mar 10, 2021

kkraune removed the HackTogether https://yahoo.github.io/hacktogether/ label Apr 21, 2021
