ISLab Scala Parallel Text Classification

More information comming soon.

Using

Install Engine Prediction Core

Install Prediction.io server:

$ bash -c "$(curl -s https://install.prediction.io/install.sh)"

Deploying

It takes 6 simple steps to deploy and use an engine:

Install and Run PredictionIO
Create an Engine by downloading an Engine Template
Generate an App ID and Access Key, if you are integrating PredictionIO with a new application
Collecting Data
Deploy the Engine as a Service
Use the Engine

Install Scala Parallel Text Classification:

git clone https://github.com/duyetdev/scala-parallel-textclassification islab-scala-parallel-textclassification
cd islab-scala-parallel-textclassification

Engine Commands

Start Prediction Server

pio-start-all

pio build Build the engine at the current directory.
pio train Kick off a training using an engine.
pio deploy Deploy an engine as an engine server. If no instance ID is specified, it will deploy the latest instance.

Import the data

pio import --appid <app_id> --input data/vnexpress-1000-import-able.json

Build, Train and deploy server

pio build
pio train
pio deploy

List server

pio app list # list all app

API Endpoint

Event server: http://localhost:7070/events.json?accessKey=xxxxxx

TextManipulationEngine

Look at the following tutorial for a Quick Start guide and implementation details.

Release Information

Version 2.2

Modified PreparedData to use MLLib hashing and tf-idf implementations.

Version 2.1

Fixed dot product implementation in the predict methods to work with batch predict method for evaluation.

Version 2.0

Included three different data sets: e-mail spam, 20 newsgroups, and the rotten tomatoes semantic analysis set. Includes Multinomial Logistic Regression algorithm for text classification.

Version 1.2

Fixed import script bug occuring with Python 2.

Version 1.1 Changes

Changed data import Python script to pull straight from the 20 newsgroups page.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
data		data
project		project
src/main/scala/com/islab		src/main/scala/com/islab
tools		tools
.gitignore		.gitignore
README.md		README.md
build.sbt		build.sbt
engine.json		engine.json
pio-runtime-jars.iml		pio-runtime-jars.iml
template.json		template.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ISLab Scala Parallel Text Classification

Using

Install Engine Prediction Core

Deploying

Engine Commands

API Endpoint

TextManipulationEngine

Release Information

Version 2.2

Version 2.1

Version 2.0

Version 1.2

Version 1.1 Changes

About

Uh oh!

Releases

Packages

Uh oh!

Languages

duyet/scala-parallel-textclassification

Folders and files

Latest commit

History

Repository files navigation

ISLab Scala Parallel Text Classification

Using

Install Engine Prediction Core

Deploying

Engine Commands

API Endpoint

TextManipulationEngine

Release Information

Version 2.2

Version 2.1

Version 2.0

Version 1.2

Version 1.1 Changes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages