Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SPARK-938 - Openstack Swift object storage support #1010

Closed
wants to merge 14 commits into from
Closed

SPARK-938 - Openstack Swift object storage support #1010

wants to merge 14 commits into from

Conversation

gilv
Copy link

@gilv gilv commented Jun 8, 2014

This is very initial commit to initiate discussion about Spark and OpenStack Swift integration.
At this point I provided information how to setup standalone Spark cluster and connect it to Swift using Hadoop 2.3.0. Current patch can be used by all to experiment integration between Swift and Spark.

I am working to extend this patch and future patches will contain information how to configure Spark with previous Hadoop versions and how to configure Swift for other cluster deployment of Spark.

Will glad to hear all remarks / suggestion / comments from the community.

gilv added 2 commits June 8, 2014 10:23
This is initial documentation describing how to integrate Spark with
Swift. This commit contains documentation for stand alone cluster.
Next patches will contain details how to integrate Swift in other
deployment of Spark.
@AmplabJenkins
Copy link

Can one of the admins verify this patch?



<h2>Configuring Spark - stand alone cluster</h2>
You need to configure the compute-classpath.sh and add Hadoop classpath for
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the swift jar not included in hadoop-client? Is there a way to specify this through Maven dependencies rather than manually including the path?

gilv added 4 commits June 10, 2014 16:02
Documentation how to integrate Spark with Openstack Swift.
Conflicts:
	docs/openstack-integration.md
Conflicts:
	docs/openstack-integration.md
To configure Hadoop to work with Swift one need to modify core-sites.xml of Hadoop and
setup Swift FS.

<configuration>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this needed? Can we just put this in core-site.xml under conf? (Basically removing the configuring Hadoop section)

@gilv
Copy link
Author

gilv commented Jun 12, 2014

Removed all openstack dependencies from pom.xml
Added an example of Java job with parameters passed to the Swift driver using SparkContext.hadoopConfiguration

SPARK-938 - Openstack Swift object storage support
@@ -132,7 +132,7 @@
<codahale.metrics.version>3.0.0</codahale.metrics.version>
<avro.version>1.7.6</avro.version>
<jets3t.version>0.7.1</jets3t.version>

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove whitespace changes

@SparkQA
Copy link

SparkQA commented Sep 5, 2014

Can one of the admins verify this patch?

rxin added a commit to rxin/spark that referenced this pull request Sep 6, 2014
SPARK-938 - Openstack Swift object storage support
@rxin
Copy link
Contributor

rxin commented Sep 8, 2014

This is subsumed by #2298

@rxin
Copy link
Contributor

rxin commented Sep 8, 2014

@gilv can you close this pull request? We will merge #2298.

@asfgit asfgit closed this in eddfedd Sep 8, 2014
asfgit pushed a commit that referenced this pull request Sep 8, 2014
See compiled doc at
http://people.apache.org/~rxin/tmp/openstack-swift/_site/storage-openstack-swift.html

This is based on #1010. Closes #1010.

Author: Reynold Xin <rxin@apache.org>
Author: Gil Vernik <gilv@il.ibm.com>

Closes #2298 from rxin/openstack-swift and squashes the following commits:

ff4e394 [Reynold Xin] Two minor comments from Patrick.
279f6de [Reynold Xin] core-sites -> core-site
dfb8fea [Reynold Xin] Updated based on Gil's suggestion.
846f5cb [Reynold Xin] Added a link from overview page.
0447c9f [Reynold Xin] Removed sample code.
e9c3761 [Reynold Xin] Merge pull request #1010 from gilv/master
9233fef [Gil Vernik] Fixed typos
6994827 [Gil Vernik] Merge pull request #1 from rxin/openstack
ac0679e [Reynold Xin] Fixed an unclosed tr.
47ce99d [Reynold Xin] Merge branch 'master' into openstack
cca7192 [Gil Vernik] Removed white spases from pom.xml
99f095d [Reynold Xin] Pending openstack changes.
eb22295 [Reynold Xin] Merge pull request #1010 from gilv/master
39a9737 [Gil Vernik] Spark integration with Openstack Swift
c977658 [Gil Vernik] Merge branch 'master' of https://github.com/gilv/spark
2aba763 [Gil Vernik] Fix to docs/openstack-integration.md
9b625b5 [Gil Vernik] Merge branch 'master' of https://github.com/gilv/spark
eff538d [Gil Vernik] SPARK-938 - Openstack Swift object storage support
ce483d7 [Gil Vernik] SPARK-938 - Openstack Swift object storage support
b6c37ef [Gil Vernik] Openstack Swift support
(cherry picked from commit eddfedd)

Signed-off-by: Patrick Wendell <pwendell@gmail.com>
wangyum pushed a commit that referenced this pull request May 26, 2023
…ectory (#1010)

* [CARMEL-6076][Followup] Fix UT failure caused by compacting empty directory

dddd

tttt

* show path
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
5 participants