CBIS - A Data Integration Project about Cannabis

Purpose

The purpose of this data integration project is to combine several sources of information about cannabis strains in one place. Cannabis strains are sold under trade names; we retrieve the list of names from Wikipedia (https://en.wikipedia.org/wiki/List_of_names_for_cannabis_strains), where it is split into three main types: Indica strains, Sativa strains and Hybrid strains. For each strain retrieved from the Wikipedia list, we provide a full card. The information on each card includes:

Time of use;
What is;
% of Indica and % of Sativa;
Effects;
Fragrance;
Flavours;
Adverse reactions;
Medical use;
Growing info;
Growing video;
Video review;
...

Currently, there are more than 500 strains.
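
To make the card structure more concrete, here is a minimal sketch of what one strain record might look like as a JavaScript object. The field names, the `STRAIN_TYPES` constant and the `makeEmptyCard` helper are illustrative assumptions derived from the list above; they are not the project's actual schema.

```javascript
// Hypothetical shape of a strain card. Field names are assumptions
// based on the README's list, not the project's real schema.
const STRAIN_TYPES = ['Indica', 'Sativa', 'Hybrid'];

function makeEmptyCard(name, type) {
  // The Wikipedia list groups strains into exactly these three types.
  if (!STRAIN_TYPES.includes(type)) {
    throw new Error(`Unknown strain type: ${type}`);
  }
  return {
    name,
    type,
    timeOfUse: null,
    description: null, // the "What is" text
    indicaPercent: null,
    sativaPercent: null,
    effects: [],
    fragrance: [],
    flavours: [],
    adverseReactions: [],
    medicalUse: [],
    growingInfo: null,
    growingVideoUrl: null,
    videoReviewUrl: null,
  };
}

// "Example Strain" is a placeholder name, not a real entry.
const card = makeEmptyCard('Example Strain', 'Hybrid');
console.log(card.name, card.type);
```

Starting from an empty card like this would let each source fill in only the fields it knows about.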

Video presentation

Sources

At the moment, we use the following sources for data retrieval:

URL: Wikipedia. TECHNIQUE: Web scraping. DATA RETRIEVED: List of all cannabis strains, grouped by type. VOLATILITY: None.
URL: I Love Growing Marijuana. TECHNIQUE: Web scraping. DATA RETRIEVED: General information about the strain; effects, fragrance, flavours, adverse reactions, medical use, growing and flowering time information; video review. VOLATILITY: Monthly.
URL: Leafly. TECHNIQUE: Web scraping. DATA RETRIEVED: The "What is" description of the strain, flavours, places where it is popular, info pictures, similar strains. VOLATILITY: Monthly.
URL: WikiLeaf. TECHNIQUE: Web scraping. DATA RETRIEVED: Time of use of the strain, user reviews, small info pictures. VOLATILITY: Monthly.
URL: World Wide Marijuana Seeds. TECHNIQUE: Web scraping. DATA RETRIEVED: Price of the strain. VOLATILITY: High.
URL: YouTube. TECHNIQUE: API wrapper. DATA RETRIEVED: A video about growing the strain. VOLATILITY: High.
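
Since each source contributes only part of a strain's card, the partial records have to be merged at some point. The sketch below shows one possible way to do that; the `mergeRecords` function, its precedence rule (earlier sources win for single values, list values are combined) and the sample data are assumptions for illustration, since the README does not describe the project's actual merge logic.

```javascript
// Merge partial records from several sources into one card.
// Assumed rule: the first source to provide a scalar field wins;
// array fields are combined without duplicates.
function mergeRecords(partials) {
  const card = {};
  for (const partial of partials) {
    for (const [key, value] of Object.entries(partial)) {
      // Skip fields a source could not provide.
      if (value === null || value === undefined) continue;
      if (!(key in card)) {
        card[key] = Array.isArray(value) ? [...new Set(value)] : value;
      } else if (Array.isArray(card[key]) && Array.isArray(value)) {
        card[key] = [...new Set([...card[key], ...value])];
      }
      // Otherwise keep the value from the earlier source.
    }
  }
  return card;
}

// Placeholder records standing in for three different sources.
const merged = mergeRecords([
  { name: 'Example Strain', flavours: ['citrus'], effects: ['relaxed'] },
  { name: 'example strain', flavours: ['citrus', 'pine'], description: 'Sample text.' },
  { price: null },
]);
console.log(merged);
```

Ordering the input by how much a source is trusted is a simple way to resolve conflicts, although a real integration would probably also need to match strain names that are spelled differently across sites.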

Architecture

How to install

Download and install MongoDB.
Download and install Node.js.
Clone the repository from GitHub.
In the "config" folder, the file "db.json" contains a default configuration for a MongoDB instance on localhost. If your configuration is different, edit the fields.
Open a terminal (on Windows, for example, Windows PowerShell), go to the project root, and type:

$ npm install

After the installation, go to the "init" folder in the terminal and type:

$ node initDatabase.js

Wait around 5 minutes, until you see "Init has been completed!". This step only needs to be run once: it stores all the strains retrieved from Wikipedia in the database, so you do not need to repeat it afterwards.

Go back to the project root in the terminal and type:

$ node app.js

You can now open the project in your browser (by default at the URL http://localhost:8080/cbis/index).
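
As a rough illustration of the "db.json" step above, the following sketch turns a database configuration object into a MongoDB connection string. The field names `host`, `port` and `name`, the example values and the `buildMongoUri` helper are hypothetical; check the actual "db.json" in the repository for the fields it really uses.

```javascript
// Hypothetical contents of config/db.json; the real field names may differ.
const exampleDbConfig = { host: 'localhost', port: 27017, name: 'cbis' };

// Build a standard "mongodb://host:port/database" connection string.
function buildMongoUri({ host, port, name }) {
  if (!host || !name) {
    throw new Error('db config needs "host" and "name"');
  }
  // 27017 is MongoDB's default port.
  return `mongodb://${host}:${port || 27017}/${encodeURIComponent(name)}`;
}

console.log(buildMongoUri(exampleDbConfig)); // mongodb://localhost:27017/cbis
```

If MongoDB runs on another machine or port, only the configuration values would need to change, not the code that consumes them.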

OPTIONAL: in the "config" folder there is a file called "api.json"; set its "auth" field to your YouTube API key. This is optional because the project also runs without it, but you will not see an embedded YouTube video on the individual strain page.
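
The sketch below illustrates why the key is optional: without it, no search request can be built and the page simply has no video to embed. It builds a YouTube Data API v3 search URL directly rather than going through the API wrapper the project uses, which the README does not name. The function names and the search phrase are assumptions; only the "auth" field comes from the text above.

```javascript
// Build a YouTube Data API v3 search URL for a strain's growing video.
// Returns null when no API key is configured, so the caller can skip the video.
function growingVideoSearchUrl(apiConfig, strainName) {
  if (!apiConfig || !apiConfig.auth) return null;
  const url = new URL('https://www.googleapis.com/youtube/v3/search');
  url.searchParams.set('part', 'snippet');
  url.searchParams.set('type', 'video');
  url.searchParams.set('maxResults', '1');
  // The search phrase is an assumption, not the project's actual query.
  url.searchParams.set('q', `${strainName} growing`);
  url.searchParams.set('key', apiConfig.auth);
  return url.toString();
}

// Turn a video id from a search result into an embeddable player URL.
function embedUrl(videoId) {
  return `https://www.youtube.com/embed/${encodeURIComponent(videoId)}`;
}

// "YOUR_API_KEY" and "Example Strain" are placeholders.
console.log(growingVideoSearchUrl({ auth: 'YOUR_API_KEY' }, 'Example Strain'));
console.log(growingVideoSearchUrl({ auth: '' }, 'Example Strain')); // null
```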
