Spark 3.0 support #50

Closed · wey-gu opened this issue Jun 16, 2022 · 7 comments
Labels: help wanted (Community: does anyone want to work on it?), type/feature req (Type: feature request)

wey-gu (Contributor) commented Jun 16, 2022

Similar to vesoft-inc/nebula-exchange#41.

Update (April 2023): the Spark Connector now supports Spark 3.0.

@wey-gu added the help wanted label on Jun 16, 2022
Nicole00 (Contributor) commented Sep 6, 2022

It depends on the Spark Connector supporting Spark 3.0; that is not implemented yet, but it is on the schedule.

@Sophie-Xie added the type/feature req label on Nov 30, 2022
xiajingchun commented

@Nicole00 I noticed there is already a pull request for the connector to support Spark 3.0. Once that one is merged, is any further work needed here to run the algorithms on Spark 3?

porscheme commented

Any update on this?

We cannot use nebula-algorithm, since our Spark-Operator framework runs Spark 3.0.

Nicole00 (Contributor) commented

> Any update on this?
>
> We cannot use nebula-algorithm, since our Spark-Operator framework runs Spark 3.0.

For now you can work around it: pull the branch, run mvn install for nebula-spark-connector_3.0, and update the Spark Connector version referenced in nebula-algorithm.
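
A minimal sketch of the resulting pom.xml change in nebula-algorithm, assuming the locally installed connector keeps the com.vesoft groupId and a snapshot version (both coordinates are assumptions; check them against what mvn install actually put into your local repository):

    <!-- Hedged sketch: depend on the locally installed Spark 3.0 connector.
         The version below is a placeholder; use the version your local
         mvn install produced. -->
    <dependency>
        <groupId>com.vesoft</groupId>
        <artifactId>nebula-spark-connector_3.0</artifactId>
        <version>3.0-SNAPSHOT</version>
    </dependency>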

meet-alfie commented Nov 3, 2023

> Any update on this?
>
> We cannot use nebula-algorithm, since our Spark-Operator framework runs Spark 3.0.

I ran into the same problem. The Spark version used by our platform is 3.x (3.2.2).
My algorithm does not read Nebula data directly; it consumes the results of business queries against Nebula, which can be thought of as data containing only source vertices, destination vertices, and weights.
I extracted only the nebula client and nebula-spark-connector sources that the algorithm uses into my own project and depend on them directly, and algorithms such as PageRank run normally.

My main modifications are as follows:

  1. Source files
├── base
│   └── client
│       ├── meta_data
│       │   ├── FieldMetaData.java
│       │   └── FieldValueMetaData.java
│       ├── protocol
│       │   ├── ShortStack.java
│       │   ├── TCompactProtocol.java
│       │   ├── TException.java
│       │   ├── TField.java
│       │   ├── TList.java
│       │   ├── TMap.java
│       │   ├── TMessage.java
│       │   ├── TProtocol.java
│       │   ├── TProtocolException.java
│       │   ├── TProtocolFactory.java
│       │   ├── TSet.java
│       │   ├── TStruct.java
│       │   └── TTransportException.java
│       ├── schema
│       │   ├── IScheme.java
│       │   ├── SchemeFactory.java
│       │   └── StandardScheme.java
│       ├── thrift
│       │   └── TBase.java
│       └── transport
│           ├── TException.java
│           ├── TTransport.java
│           └── TTransportException.java
├── config
│   ├── AlgoConfig.scala
│   └── SparkConfigEntry.scala
├── examples
│   └── PageRankExample.scala
├── lib
│   └── PageRankAlgo.scala
├── reader
│   └── ReadData.scala
└── utils
    ├── DecodeUtil.scala
    └── NebulaUtil.scala

13 directories, 29 files
  2. pom.xml
    <properties>
        <maven.compiler.source>8</maven.compiler.source>
        <maven.compiler.target>8</maven.compiler.target>
        <scala.version>2.12</scala.version>
        <spark.version>3.2.2</spark.version>
        <lombok.version>1.18.28</lombok.version>
        <config.version>1.4.0</config.version>
        <scopt.version>3.7.1</scopt.version>
    </properties>

    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-core_${scala.version}</artifactId>
        <version>${spark.version}</version>
        <scope>provided</scope>
    </dependency>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-sql_${scala.version}</artifactId>
        <version>${spark.version}</version>
    </dependency>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-sql-kafka-0-10_${scala.version}</artifactId>
        <version>${spark.version}</version>
    </dependency>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-graphx_${scala.version}</artifactId>
        <version>${spark.version}</version>
    </dependency>
    <dependency>
        <groupId>com.typesafe</groupId>
        <artifactId>config</artifactId>
        <version>${config.version}</version>
    </dependency>
    <dependency>
        <groupId>com.github.scopt</groupId>
        <artifactId>scopt_${scala.version}</artifactId>
        <version>${scopt.version}</version>
    </dependency>

Just follow this approach: add the algorithm source code you need and update the dependencies for your own project. Hope it helps.
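
For concreteness, a minimal usage sketch in the spirit of the PageRankExample.scala listed above, assuming the extracted code keeps nebula-algorithm's PageRankAlgo.apply(spark, dataset, config, hasWeight) shape and its PageRankConfig(maxIter, resetProb). The import paths, column names, and parameter values are assumptions; adjust them against your own copy of the sources:

    // Hypothetical example; the import paths below match nebula-algorithm's
    // original packages and may differ in your relocated project.
    import com.vesoft.nebula.algorithm.config.PageRankConfig
    import com.vesoft.nebula.algorithm.lib.PageRankAlgo
    import org.apache.spark.sql.{DataFrame, SparkSession}

    object PageRankExample {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder()
          .appName("PageRankExample")
          .master("local[*]") // for local testing; drop when submitting to a cluster
          .getOrCreate()

        // Edge list produced by business queries against Nebula:
        // source vertex, destination vertex, weight.
        val edges: DataFrame = spark
          .createDataFrame(Seq((1L, 2L, 1.0), (2L, 3L, 2.0), (3L, 1L, 1.0)))
          .toDF("src", "dst", "weight")

        // Run PageRank through the extracted lib/PageRankAlgo.scala.
        // maxIter = 10 iterations, resetProb = 0.15 reset probability;
        // verify the config signature against your copy of the sources.
        val ranks = PageRankAlgo(spark, edges, PageRankConfig(10, 0.15), hasWeight = true)
        ranks.show()

        spark.stop()
      }
    }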

xin-hao-awx commented

Can we take this as a higher priority?
