Panic while downloading siva files to HDFS #33

rporres · 2018-03-07T15:08:19Z

Testing with a build done using a binary from #32 (I could not connect to HDFS otherwise), pga tool is failing to get files to HDFS

# cat /siva.txt | ./pga-rafa get --verbose -i -o hdfs://hdfs-namenode-0.hdfs-namenode.default.svc.cluster.local:8020
downloading siva files by name from stdin
filter flags will be ignored
DEBU[0000] syncing http://pga.sourced.tech//siva/latest/4a/4a14cc02da0a9280538cd3f3242365601d72f241.siva to hdfs://hdfs-namenode-0.hdfs-namenode.default.svc.cluster.local:8020/siva/latest/4a/4a14cc02da0a9280538cd3f3242365601d72f241.siva
panic: runtime error: slice bounds out of range

goroutine 1 [running]:
github.com/src-d/datasets/PublicGitArchive/pga/cmd.downloadFilenames(0x86d4e0, 0xc4201ca080, 0x86d560, 0xc4201b8030, 0xc4201dc000, 0x1f, 0x1f, 0xa, 0x8000105, 0x0)
	/root/go/src/github.com/src-d/datasets/PublicGitArchive/pga/cmd/get.go:91 +0x263
github.com/src-d/datasets/PublicGitArchive/pga/cmd.glob..func1(0xa67920, 0xc4200b6340, 0x0, 0x4, 0x0, 0x0)
	/root/go/src/github.com/src-d/datasets/PublicGitArchive/pga/cmd/get.go:79 +0x3a8
github.com/src-d/datasets/PublicGitArchive/pga/vendor/github.com/spf13/cobra.(*Command).execute(0xa67920, 0xc4200b6300, 0x4, 0x4, 0xa67920, 0xc4200b6300)
	/root/go/src/github.com/src-d/datasets/PublicGitArchive/pga/vendor/github.com/spf13/cobra/command.go:698 +0x46d
github.com/src-d/datasets/PublicGitArchive/pga/vendor/github.com/spf13/cobra.(*Command).ExecuteC(0xa67d60, 0xc4200abf58, 0x73fc95, 0xc4201763c0)
	/root/go/src/github.com/src-d/datasets/PublicGitArchive/pga/vendor/github.com/spf13/cobra/command.go:783 +0x2e4
github.com/src-d/datasets/PublicGitArchive/pga/vendor/github.com/spf13/cobra.(*Command).Execute(0xa67d60, 0xc42002a0b8, 0x0)
	/root/go/src/github.com/src-d/datasets/PublicGitArchive/pga/vendor/github.com/spf13/cobra/command.go:736 +0x2b
github.com/src-d/datasets/PublicGitArchive/pga/cmd.Execute()
	/root/go/src/github.com/src-d/datasets/PublicGitArchive/pga/cmd/root.go:34 +0x2d
main.main()
	/root/go/src/github.com/src-d/datasets/PublicGitArchive/pga/main.go:8 +0x20

Find attached the contents of siva.txt

For the moment I'm using multitool to download to HDFS as it is not giving me issues

cc @vmarkovtsev

The text was updated successfully, but these errors were encountered:

campoy · 2018-03-09T23:07:56Z

do we have an hdfs server I can use for testing? or a guide explaining how to get an hdfs server running on docker?

eiso · 2018-03-10T00:51:46Z

This seems to be the same -i error I was having, not related to hdfs itself.

rporres · 2018-03-12T13:49:38Z

@campoy: You can deploy your own HDFS in minikube using https://github.com/apache-spark-on-k8s/kubernetes-HDFS/tree/master/charts

Alternatively you can access our pipeline HDFS server. Ping me to give you instructions.

jfontan · 2018-03-12T13:53:46Z

I use this to start HDFS + Spark locally:

https://github.com/jfontan/spark-docker-compose/blob/master/engine.md

campoy · 2018-03-14T21:21:21Z

HDFS on minikube sounds perfect, I'll give it a try

campoy · 2018-04-06T01:48:31Z

Took a long time, but I was able to finally connect to an HDFS server using Google Cloud Dataproc and run all my tests.

See #44

rporres assigned campoy Mar 7, 2018

This was referenced Mar 27, 2018

panic: runtime error: slice bounds out of range #41

Closed

Skip downloading file by name if it's empty #42

Closed

campoy mentioned this issue Apr 6, 2018

fix HDFS storage #44

Merged

campoy closed this as completed in #44 Apr 6, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Panic while downloading siva files to HDFS #33

Panic while downloading siva files to HDFS #33

rporres commented Mar 7, 2018 •

edited

Loading

campoy commented Mar 9, 2018

eiso commented Mar 10, 2018

rporres commented Mar 12, 2018

jfontan commented Mar 12, 2018

campoy commented Mar 14, 2018

campoy commented Apr 6, 2018

Panic while downloading siva files to HDFS #33

Panic while downloading siva files to HDFS #33

Comments

rporres commented Mar 7, 2018 • edited Loading

campoy commented Mar 9, 2018

eiso commented Mar 10, 2018

rporres commented Mar 12, 2018

jfontan commented Mar 12, 2018

campoy commented Mar 14, 2018

campoy commented Apr 6, 2018

rporres commented Mar 7, 2018 •

edited

Loading