softmarshmallow/inked-engine

inked engine 🤖🤖

| A toolkit for news analysis.

Main features

  • news data indexing

  • news data processing

  • provides an API for the service server

The engine receives new news data from Inked-news-crawler, then indexes and pre-processes it. It analyzes the information requested by the service server and passes the results back to the service server, which in turn serves the news information to clients.

News data model

  • tags: { company: [], namedEntities: [], keywords: [] }
  • content
  • origin
  • title
  • time
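
The model above could be written down as a pair of Python dataclasses. This is only a sketch: the field names come from the list above, while the class names `News` and `Tags`, the field types, and the sample values are assumptions, not the engine's actual schema.

```python
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone
from typing import List


@dataclass
class Tags:
    # The three tag groups named in the data model; element type is assumed.
    company: List[str] = field(default_factory=list)
    namedEntities: List[str] = field(default_factory=list)
    keywords: List[str] = field(default_factory=list)


@dataclass
class News:
    title: str
    content: str
    origin: str      # assumed: name or URL of the source
    time: datetime   # assumed: publication time
    tags: Tags = field(default_factory=Tags)


# Hypothetical sample record, for illustration only.
article = News(
    title="Example headline",
    content="Example body text.",
    origin="https://example.com/news/1",
    time=datetime(2020, 1, 1, tzinfo=timezone.utc),
)
record = asdict(article)  # nested dict with the same keys as the model
```

Each record gets its own empty tag lists, so tags added to one article do not leak into another.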

How to install virtualenv:

Install pip first

sudo apt-get install python3-pip

Then install virtualenv using pip3

sudo pip3 install virtualenv

Now create a virtual environment

virtualenv venv

KoNLPy setup

Installation guide: http://konlpy.org/en/v0.4.4/install/

sudo apt-get install g++ openjdk-8-jdk

bash <(curl -s https://raw.githubusercontent.com/konlpy/konlpy/master/scripts/mecab.sh)

start the engine server

daphne server.asgi:application

supervisor ctrl

Restart the server

sudo supervisorctl restart asgi_daphne

IMPORTANT: seed credential files

The following two files are listed in .gitignore, so they are not part of the repository. You have to provide them manually to run this project:

server/settings/production.py
credentials/db-connection.json
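
A small pre-flight check can catch a missing seed file before the server starts. This is a minimal sketch: the two paths are the ones named above, while the helper name `missing_seed_files` and the idea of running it from the project root are assumptions.

```python
from pathlib import Path

# Paths taken from the README; they are relative to the project root.
SEED_FILES = (
    "server/settings/production.py",
    "credentials/db-connection.json",
)


def missing_seed_files(project_root="."):
    """Return the seed files that do not exist under project_root."""
    root = Path(project_root)
    return [name for name in SEED_FILES if not (root / name).is_file()]
```

An empty list means both files are in place. The check only tests for existence; it does not validate the contents of either file.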

modules

  • duplicate news checker ✅
  • spam news detector 🚫
  • word2vec ✅ (wiki) 🚫 (news)
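
The README does not say how the duplicate news checker works. One common approach, shown here purely as an illustration and not as the engine's method, is to compare the word shingles of two articles with Jaccard similarity. All function names and the 0.8 threshold are assumptions, and whitespace splitting stands in for a proper Korean tokenizer.

```python
def shingles(text, size=3):
    """Return the set of word n-grams (shingles) of the given size."""
    words = text.lower().split()
    if len(words) < size:
        # Short texts yield a single shingle holding all of their words.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def is_duplicate(text_a, text_b, threshold=0.8):
    """Treat two articles as duplicates when their shingle sets overlap enough."""
    return jaccard(shingles(text_a), shingles(text_b)) >= threshold
```

A lower threshold catches lightly edited reprints at the cost of more false positives, so the value would need tuning on real news data.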

used by

developed by

developed by softmarshmallow

About

🤖 natural language processing out of the box
