The goal of this project is to generate semantically coherent articles in Polish. The project is divided into two parts:
- a parallel web scraper that gathers training data
- statistical and neural-network approaches to generating articles
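This README does not say which statistical model is used. As an illustration only, a word-level Markov chain is one common statistical way to generate text: it records which words follow each word in the training data, then walks those transitions at random. The sketch below is in Python and is not the project's code; the function names `build_chain` and `generate` and the toy corpus are hypothetical.

```python
import random
from collections import defaultdict


def build_chain(sentences):
    """Map each word to the list of words observed right after it."""
    chain = defaultdict(list)
    for sentence in sentences:
        words = sentence.split()
        for current, following in zip(words, words[1:]):
            chain[current].append(following)
    return chain


def generate(chain, start, max_words=12, rng=None):
    """Walk the chain from `start`, picking each next word at random."""
    rng = rng or random.Random()
    words = [start]
    # Stop at the length limit or when the last word has no known successor.
    while len(words) < max_words and chain.get(words[-1]):
        words.append(rng.choice(chain[words[-1]]))
    return " ".join(words)


# Hypothetical toy corpus; the real training data comes from the scraper.
corpus = [
    "w miami miałem spotkania",
    "miałem spotkania w warszawie",
    "spotkania w miami były długie",
]
chain = build_chain(corpus)
print(generate(chain, "w", rng=random.Random(0)))
```

Because each step looks only at the previous word, the output is locally fluent but drifts in meaning, which is the kind of effect visible in generated sentences that splice unrelated phrases together.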
Some of the results so far:
- "w miami miałem spotkania a międzyspotkaniami pochodziłem po wodzie w kranie"
- "pigułka odbiera kobiecie ochotę na mielone wsurowej szynce parmeńskiej z jajkiem"
- "seksistowski horror w uk zabito nawet 4700 dziewczynek ze względu na ciężar historii pomiędzy niemcami a rosją"
Database: PostgreSQL
You need to have PostgreSQL installed. When creating the user, use:
- username: "root"
- password: "root"
For Arch Linux, follow this.
Then create the database:
sudo -iu postgres
psql
create database bsgenerator;
And finally, run setupdb.sql from this directory.
In the sbt console:
run