This extension try provider a common node to execute python script.
- IBM SPSS Modeler v18
More information here: IBM Predictive Extensions
- Open Extension Hub in Modeler Client by: "Extensions" -> "Extension Hub"
- Select "Spark ML Feature TF-IDF"
- Click "OK" to start auto installation
- Open extension repository organization Github IBM Predictive Analytics
- Search and open repository, download corresponding *.mpe file.
- Start installation by: "Extensions" -> "Install Local Extension Bundle"
- Select *.mpe file and install it.
Example Code:
from pyspark.sql.functions import lit
from pyspark.sql.types import StructType, StructField, DoubleType
import sys
data = sys.argv['inputDataframe']
schema = sys.argv['inputSchema']
if sys.argv['schemaOnly']:
sys.argv['outputSchema'] = StructType(data.schema.fields +
[StructField("newConstant", DoubleType(), False) ])
else:
sys.argv['outputDataframe'] = data.withColumn('new', lit(1.0))
In next release, python model node will provider that able to produce a nugget node.
- Yu Wenpei (mail)