Skip to content

Commit

Permalink
cleanup
Browse files Browse the repository at this point in the history
  • Loading branch information
LorenzBuehmann committed Feb 16, 2021
1 parent 34d2961 commit 6293296
Showing 1 changed file with 1 addition and 5 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -24,14 +24,10 @@ object LMDB_Pipeline {
.getOrCreate()
spark.sparkContext.setLogLevel("ERROR")

JenaSystem.init()
val confHadoop = org.apache.hadoop.mapreduce.Job.getInstance().getConfiguration
confHadoop.setBoolean("sansa.rdf.parser.skipinvalid", true)

/*
READ IN DATA
*/
val inputFilePath = "/Users/carstendraschner/Datasets/linkedmdb-latest-dump.nt.txt"
val inputFilePath = args(0)
// val df: DataFrame = spark.read.rdf(Lang.NTRIPLES)(inputFilePath).cache()
// val dataset = spark.rdf(Lang.NTRIPLES)(inputFilePath).toDS().cache()
val dataset = NTripleReader.load(
Expand Down

0 comments on commit 6293296

Please sign in to comment.