Deep Sentence

Deep Sentence is a deep learning-based engine that summarizes texts from multiple sources into a single short summary.

Setup

Requirements

Python 3.5
psycopg2 requirements

The scraper module also relies on html-extractor-miniserver, which is available at http://extractor.deepsentence.com

Installing dependencies

Set up a new virtualenv if you want, then run

make

Configuration

Copy .env.example to .env, and modify the variables to your needs.
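As an illustration of how a script could read the resulting .env file, here is a minimal standard-library sketch. It assumes the file uses plain KEY=VALUE lines with optional # comments; the helper names parse_env_lines and load_env are hypothetical and are not taken from this repository, which may load its configuration differently.

```python
import os


def parse_env_lines(lines):
    """Parse KEY=VALUE lines, skipping blanks and # comments (hypothetical helper)."""
    values = {}
    for raw in lines:
        line = raw.strip()
        # Ignore empty lines, comments, and anything that is not an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, so values may themselves contain "=".
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_env(path=".env"):
    """Copy values from a .env file into os.environ without overwriting existing ones."""
    with open(path, encoding="utf-8") as handle:
        for key, value in parse_env_lines(handle).items():
            os.environ.setdefault(key, value)
```

Calling load_env() once at startup would make the variables available through os.environ. Using setdefault means values already exported in the shell take precedence over the file.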

Scraper

Usage

To start the scraper, run

scrapy crawl line_news

If you want a shell to experiment with the responses, run

scrapy shell ARTICLE_URL --spider=line_news

Learning

Dependencies

To run the learning step, you will first need to download the word embeddings for word2vec. You can get them at the following URL: http://www.cl.ecei.tohoku.ac.jp/~m-suzuki/jawiki_vector/entity_vector.tar.bz2

Or you can use make download_models to download them for you.
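If you prefer to fetch the embeddings from Python, the manual download can be sketched roughly as follows. This is not necessarily what make download_models does; the function names and the destination directory are assumptions made for illustration.

```python
import os
import tarfile
import urllib.request

# URL given above for the word2vec embeddings.
EMBEDDINGS_URL = (
    "http://www.cl.ecei.tohoku.ac.jp/~m-suzuki/jawiki_vector/entity_vector.tar.bz2"
)


def download(url, archive_path):
    """Download url to archive_path unless the file already exists."""
    if not os.path.exists(archive_path):
        urllib.request.urlretrieve(url, archive_path)
    return archive_path


def extract_archive(archive_path, dest_dir):
    """Extract a .tar.bz2 archive into dest_dir and return the member names."""
    os.makedirs(dest_dir, exist_ok=True)
    with tarfile.open(archive_path, "r:bz2") as archive:
        names = archive.getnames()
        archive.extractall(dest_dir)
    return names
```

For example, extract_archive(download(EMBEDDINGS_URL, "entity_vector.tar.bz2"), "models") would place the files under a models directory. That directory name is an assumption, so check the Makefile for the location the project actually expects.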

Webapp

The web application lives in deep_sentence/webapp.

Requirements

NodeJS >= 4
yarn (recommended)
foreman (recommended)

Usage

To install dependencies, run make prepare_web. You can then start the application by running make dev_webapp. If you do not have foreman, you can start the app with make debug_app and start webpack (in another shell) with make webpack_watch.

Guidelines

Adding dependencies

Run

make write_dependencies

to regenerate requirements.txt. Please be sure to run this from a clean environment, and only add needed dependencies.

DB setup

You can access the database as follows

psql -h public-db.claudetech.com -p 5433 -U deep_sentence

To use it from Python, set DATABASE_URL to the following value

postgres://deep_sentence:PASSWORD@public-db.claudetech.com:5433/deep_sentence
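To show how the parts of this URL map onto connection settings, here is a small sketch that splits it with the standard library. The helper name connection_params is hypothetical; the project itself may pass DATABASE_URL to its database layer directly.

```python
from urllib.parse import urlparse


def connection_params(database_url):
    """Split a postgres:// URL into keyword arguments for a database driver."""
    parsed = urlparse(database_url)
    return {
        "dbname": parsed.path.lstrip("/"),  # path component is "/<database>"
        "user": parsed.username,
        "password": parsed.password,
        "host": parsed.hostname,
        "port": parsed.port,  # parsed as an int, or None if absent
    }


# PASSWORD is the placeholder from the URL above, not a real credential.
params = connection_params(
    "postgres://deep_sentence:PASSWORD@public-db.claudetech.com:5433/deep_sentence"
)
print(params["host"], params["port"], params["dbname"])
```

The resulting dictionary uses the keyword names accepted by psycopg2.connect, so psycopg2.connect(**params) should work once psycopg2 is installed and PASSWORD is replaced with the real password.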

Deployment

See deployment/README.md for more information about how to set up a node.
