This code gets connected to Solr DB created for Sparkler Crawled Data to do further data extraction, classification, filtering and insights generation using various Machine Learning models. The ML models are capable of using keywords list from user, extract features from URL content, and classify (score) output and update Solr parameter accordin…
Switch branches/tags
Nothing to show
Clone or download
Pull request Compare This branch is 8 commits ahead of ahmadika:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
Extraction
Keywords
Post-Processing-Scripts
README.md

README.md

Apache Sparkler Post Processing using Machine Learning

This code gets connected to Solr DB created for Sparkler Crawled Data to do further data extraction, classification, filtering and insights generation using various Machine Learning models.

The ML models are capable of using keywords list from user, extract features from URL content, and classify (score) output and update Solr parameter accordingly.

Apache Sparkler Link: https://github.com/USCDataScience/sparkler