covid_Graphs4Good

In this repo, I store all the notebooks and script about the Covid data analysis proposed by Kaggle (see CORD-19 NLP Challenge)

My first idea was to build a search engine using Neo4j. This idea was originally thought for a search engine for arxiv papers but due to the recent events and the Kaggle's challange I decided to change my plans. The data used in this first part are in metadata.csv file here.

On kaggle competition page there are already a lot of interesting kernels which are really good to know and learn.

Here is a short description on how I've structured this repo at the moment.

In the main page you find the following:

a ReadMe obviously
a notebook file with the main "result"
a folder Model where I store all the devoleped models
a folder Notebook where I store all the developed notebooks

Next Steps

In this section, I list what I plan to do next. I have not planned a priority list at the moment:

Test higher NUM_TOPIC
Test connector to PowerBI
Include a similarity analisys based on Topics' distribution
Add data from arxiv
Test some graph analysis on Neo4j of course

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Model		Model
Notebooks		Notebooks
.gitignore		.gitignore
Covid_Kaggle_n1.ipynb		Covid_Kaggle_n1.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

covid_Graphs4Good

Next Steps

About

Releases

Packages

Languages

radema/covid_Graphs4Good

Folders and files

Latest commit

History

Repository files navigation

covid_Graphs4Good

Next Steps

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages