Analysis of the research groups identified by Colciencias across the country. Creation of a mathematical model that allows UTP's Vice-Rectorate for Research, Innovation and Extension to make better decisions about investing resources and effort in UTP's research groups.
- Python 3.7
- pip for Python3
- virtualenv
- Node.js >= 8.X.X
- yarn or npm (package manager for Node.js)
- MongoDB
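The requirements above can be checked from a shell before continuing. This is only a minimal sketch and not part of the project: 'missing_tools' is a hypothetical helper, and the executable names ('python3', 'pip3', 'virtualenv', 'node', 'mongod') are assumptions about how these tools are installed on your system.

```shell
# Hypothetical helper: print the name of every command in the
# argument list that cannot be found on PATH.
missing_tools() {
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "$tool"
    fi
  done
  return 0
}

# Assumed executable names for the requirements listed above.
# Nothing is printed when every tool is found.
missing_tools python3 pip3 virtualenv node mongod
```

Since either yarn or npm is enough, those two can be checked separately with 'missing_tools yarn' or 'missing_tools npm'.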
Inside '/grupvis' there are two important subfolders: 'grupvis/script', which contains everything related to scraping the Colciencias page, and 'grupvis/app', which contains the web server and the React client.
Go into the 'grupvis/script' folder and create the virtual environment:
virtualenv -p /usr/bin/python3 env/
Activate the environment at the start of every new session. Before installing or using anything, make sure env is active:
source env/bin/activate
The command above is for the bash shell. There is also an activation script for the fish shell:
source env/bin/activate.fish
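One way to confirm that env is active before installing anything is to look at the 'VIRTUAL_ENV' variable, which the virtualenv activation scripts set. The sketch below is for bash or another POSIX shell (not fish), and 'env_is_active' is a hypothetical helper name.

```shell
# Hypothetical helper: succeeds when a virtual environment is active,
# i.e. when the activation script has set VIRTUAL_ENV.
env_is_active() {
  [ -n "${VIRTUAL_ENV:-}" ]
}

if env_is_active; then
  echo "env is active: $VIRTUAL_ENV"
else
  echo "env is not active; run: source env/bin/activate"
fi
```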
pip3 install -r top-requirements.txt
And that's all.
This time, go into the 'grupvis/app' folder.
Just run:
yarn
Then go into the 'client' folder and run the same command. That's all.
- Download the data and save it in the 'datasets/' folder
- Run Jupyter Notebook:
jupyter notebook
To scrape the groups' information, first make sure the MongoDB service is running, then go into the './script/scienti/scienti' folder and run:
scrapy crawl research_groups
To redirect the log messages to a file, use the following command:
scrapy crawl research_groups --logfile filename.log
This saves the scraped data into the MongoDB database.
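The crawl steps above could be wrapped in a small script that refuses to start when MongoDB is not reachable and writes each run to its own log file. This is a sketch under assumptions, not part of the project: the helper names are hypothetical, MongoDB is assumed to listen on its default port 27017 on localhost, and the 'nc' (netcat) utility with its '-z' option is assumed to be installed.

```shell
# Hypothetical helper: succeeds if something accepts TCP connections
# on the given host and port (defaults: localhost, 27017).
# Assumes a netcat that supports the -z (scan only) option.
mongo_reachable() {
  nc -z "${1:-localhost}" "${2:-27017}" >/dev/null 2>&1
}

# Hypothetical helper: build a timestamped log file name of the form
# research_groups-YYYYMMDD-HHMMSS.log
crawl_log_name() {
  echo "research_groups-$(date +%Y%m%d-%H%M%S).log"
}

# Run the crawl only when MongoDB looks reachable.
# Call this from the './script/scienti/scienti' folder with env active.
run_crawl() {
  if mongo_reachable; then
    scrapy crawl research_groups --logfile "$(crawl_log_name)"
  else
    echo "MongoDB does not seem to be running on localhost:27017" >&2
    return 1
  fi
}
```

After loading these functions into the current shell, 'run_crawl' starts the spider. A port check only shows that something is listening on 27017, so it is a quick sanity check rather than proof that the database is healthy.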
- TODO