This project explores the CORD-19 dataset, focusing on the metadata.csv file, which contains metadata for COVID-19 research papers.
Frameworks_Assignment/
│── data/ # Input data files
│── notebooks/ # Jupyter notebooks
│── app/ # Streamlit app
│── reports/ # Report outputs
│── requirements.txt # Dependencies
│── run.sh # Script to run the app
│── README.md # Project overview
│── .gitignore # Ignore unnecessary files
- Clone the repo and place metadata.csv inside the data/ folder (optional, large file).
- Run the notebook for analysis:
jupyter notebook notebooks/cord19_analysis.ipynb
- Start the Streamlit app:
./run.sh
- Python 3.8+
- pandas, matplotlib, seaborn, streamlit, wordcloud
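
Before launching the notebook or the app, it can help to confirm that the requirements listed above are actually available. The following is a minimal sketch, not part of the repository: the helper names `missing_packages` and `python_ok` are hypothetical, and the check only tests whether each package can be located, not which version is installed.

```python
import sys
from importlib.util import find_spec

# Package names taken from the requirements list above.
REQUIRED = ["pandas", "matplotlib", "seaborn", "streamlit", "wordcloud"]


def missing_packages(names):
    """Return the names that cannot be located on the current import path."""
    return [name for name in names if find_spec(name) is None]


def python_ok(minimum=(3, 8)):
    """Return True if the running interpreter is at least `minimum`."""
    return sys.version_info >= minimum


if not python_ok():
    print("Python 3.8 or newer is required.")

missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
    print("Try: pip install -r requirements.txt")
else:
    print("All required packages were found.")
```

Anything reported as missing can usually be installed with `pip install -r requirements.txt`, assuming requirements.txt lists the same packages.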
- For quick tests, use the provided metadata_sample.csv.
- For full analysis, download the complete metadata.csv from Kaggle.
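
The choice between the sample and the full file can be automated. The sketch below prefers the full metadata.csv and falls back to metadata_sample.csv when the full file is absent. It assumes both files live in data/ (the location of the sample file is not stated here), and the function names `pick_metadata_file` and `load_metadata` are hypothetical rather than taken from the project code.

```python
from pathlib import Path


def pick_metadata_file(data_dir="data"):
    """Return the full metadata file if present, otherwise the sample file.

    Assumption: both files are stored in the same folder.
    """
    data_dir = Path(data_dir)
    for name in ("metadata.csv", "metadata_sample.csv"):
        candidate = data_dir / name
        if candidate.is_file():
            return candidate
    raise FileNotFoundError(
        f"Neither metadata.csv nor metadata_sample.csv found in {data_dir}"
    )


def load_metadata(data_dir="data", nrows=None):
    """Load the chosen file into a DataFrame; `nrows` limits rows for quick tests."""
    # Imported here so that picking a path does not itself require pandas.
    import pandas as pd

    # low_memory=False reads the file in one pass, which avoids
    # mixed-type column warnings on large CSV files.
    return pd.read_csv(pick_metadata_file(data_dir), nrows=nrows, low_memory=False)
```

For a quick look at the data, something like `load_metadata(nrows=1000).head()` in the notebook reads only the first thousand rows of whichever file is available.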