Skip to content
🍊 πŸ“„ Text Mining add-on for Orange3
Branch: master
Clone or download
ajdapretnar Merge pull request #419 from robertcv/fix/stanford_pos
[FIX] OWPreprocess: fix Stanford tagger loading
Latest commit 18f7052 Mar 25, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.github Github: Add issue and PR templates Mar 30, 2017
.travis Fix PyQt version to 5.11 Feb 18, 2019
doc Updated documentation Mar 19, 2019
orangecontrib Merge pull request #419 from robertcv/fix/stanford_pos Mar 25, 2019
scripts
.coveragerc Setting up Codecov.io Sep 4, 2015
.gitignore setup.py: Write version.py, improve versioning Sep 15, 2016
.travis.yml Fix PyQt version to 5.11 Feb 18, 2019
CHANGELOG.md Update CHANGELOG.md Sep 11, 2018
LICENSE License AGPLv3 Feb 2, 2017
MANIFEST.in add MANIFEST.in Jun 24, 2016
README.md Remove requirements-opt.txt from Travis & Readme.md Dec 4, 2017
README.pypi readme: Updated text Mar 31, 2016
SERVERS.txt SERVERS: Add http:// Oct 5, 2016
codecov.yml Added codecov.yml May 13, 2016
conda_environment.yml Doc: Install PyQt5 with Anaconda instead of pip Apr 4, 2017
requirements-doc.txt Separate requirements file for Readthedocs Aug 8, 2018
requirements.txt Test suit updated Dec 10, 2018
setup.cfg Modifications before registring with pypi. Jun 13, 2015
setup.py Test suit updated Dec 10, 2018

README.md

Orange3 Text

Build Status codecov Documentation Status

Orange3 Text extends Orange3, a data mining software package, with common functionality for text mining. It provides access to publicly available data, like NY Times, Twitter, Wikipedia and PubMed. Furthermore, it provides tools for preprocessing, constructing vector spaces (like bag-of-words, topic modeling, and similarity hashing) and visualizations like word cloud end geo map. All features can be combined with powerful data mining techniques from the Orange data mining framework.

Anaconda installation

The easiest way to install Orange3-Text is with Anaconda distribution. Download Anaconda for your OS (Python version 3.5). In your Anaconda Prompt first add conda-forge to your channels:

conda config --add channels conda-forge

Then install Orange3-Text

conda install orange3-text

Run

python -m Orange.canvas

to open Orange and check if everything is installed properly.

Installation from source

To install the add-on from source

# Clone the repository and move into it
git clone https://github.com/biolab/orange3-text.git
cd orange3-text

# Install the dependencies:
pip install -r requirements.txt

# Finally install Orange3-Text in editable/development mode.
pip install -e .

To register this add-on with Orange, but keep the code in the development directory (do not copy it to Python's site-packages directory), run

python setup.py develop

Windows setup for biopython library

If you're not using Anaconda distribution, you can manually install biopython library before installing the add-on. First, download the compiler Visual Studio and run the setup with:

python setup.py build_ext --inplace --compiler=msvc install

Usage

After the installation, the widgets from this add-on are registered with Orange. To run Orange from the terminal, use

python3 -m Orange.canvas

or

orange-canvas

The new widgets are in the toolbox bar under Text Mining section.

You can’t perform that action at this time.