
Victorbjay/python


🦠 Frameworks_Assignment – CORD-19 Metadata Explorer

📌 Overview

This project provides a reproducible workflow for analyzing the CORD-19 metadata.csv file.
It includes:

  • A Jupyter Notebook for interactive exploration
  • A Python script for automated analysis
  • A Streamlit dashboard for interactive visualization

⚠️ Note: The full CORD-19 dataset is very large. For this project, download only metadata.csv from the CORD-19 Research Challenge page on Kaggle and place it in the data/ folder as data/metadata.csv.


📂 Project Structure

  • requirements.txt → Python dependencies
  • notebooks/analysis.ipynb → Jupyter Notebook (exploration & cleaning)
  • analysis.py → Script version of the analysis
  • streamlit_app/app.py → Streamlit web application
  • data/metadata.csv → Dataset (not included in the repo; must be downloaded separately)

🚀 Quick Start

  1. Install dependencies

    pip install -r requirements.txt

  2. Download the dataset

     Download the dataset and place metadata.csv into the data/ folder.

  3. Run the analysis

    jupyter notebook notebooks/analysis.ipynb
    # or
    python analysis.py

  4. Launch the Streamlit app

    streamlit run streamlit_app/app.py

πŸ“Notes

Both the notebook and script include error handling for missing files/columns.
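The kind of error handling described above can be sketched as follows. This is a minimal illustration, not the code in analysis.py: the function name `load_metadata` is hypothetical, the required column names (`title`, `publish_time`, `journal`) are an assumption about what the analysis needs, and the standard-library `csv` module stands in for whatever loader the project actually uses.

```python
import csv
from pathlib import Path

# Assumed column names; the README does not list the columns the analysis uses.
REQUIRED_COLUMNS = {"title", "publish_time", "journal"}


def load_metadata(path="data/metadata.csv", required=REQUIRED_COLUMNS):
    """Return the CSV rows as a list of dicts, or raise a descriptive error.

    Hypothetical helper: fails early if the file or an expected column is missing.
    """
    csv_path = Path(path)
    if not csv_path.is_file():
        raise FileNotFoundError(
            f"{csv_path} not found; download metadata.csv and place it in data/"
        )
    with csv_path.open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        # fieldnames is None for an empty file, so fall back to an empty list.
        missing = set(required) - set(reader.fieldnames or [])
        if missing:
            raise ValueError(f"missing expected columns: {sorted(missing)}")
        # Note: this reads every row into memory at once.
        return list(reader)
```

Checking for the file and the columns up front means a misplaced or partial download produces a readable message rather than a failure deep inside the analysis.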

The Streamlit app supports filtering by publication year and provides:

📈 Number of publications over time

📊 Top journals chart

📝 Word frequency insights from titles

👀 A preview of the dataset
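The aggregations behind these views can be sketched with the standard library alone. This is an illustrative sketch, not the code in streamlit_app/app.py: all function names are hypothetical, rows are assumed to be dicts with `title`, `publish_time`, and `journal` keys, `publish_time` is assumed to begin with a four-digit year, and the stop-word list is a small placeholder.

```python
import re
from collections import Counter

# Placeholder stop-word list; a real analysis would likely use a fuller one.
STOPWORDS = {"the", "and", "for", "with", "from", "that", "this"}


def publication_year(row):
    """Return the leading four-digit year of publish_time, or None."""
    match = re.match(r"(\d{4})", row.get("publish_time") or "")
    return int(match.group(1)) if match else None


def filter_by_year(rows, start, end):
    """Keep rows whose publication year lies in [start, end]."""
    kept = []
    for row in rows:
        year = publication_year(row)
        if year is not None and start <= year <= end:
            kept.append(row)
    return kept


def publications_per_year(rows):
    """Count publications per year, sorted by year."""
    counts = Counter()
    for row in rows:
        year = publication_year(row)
        if year is not None:
            counts[year] += 1
    return dict(sorted(counts.items()))


def top_journals(rows, n=10):
    """Return the n most frequent non-empty journal names with counts."""
    return Counter(r["journal"] for r in rows if r.get("journal")).most_common(n)


def title_word_frequencies(rows, n=20):
    """Return the n most frequent title words, skipping short and stop words."""
    words = Counter()
    for row in rows:
        for word in re.findall(r"[a-z]+", (row.get("title") or "").lower()):
            if len(word) > 2 and word not in STOPWORDS:
                words[word] += 1
    return words.most_common(n)
```

In a Streamlit app, results like these could be passed to widgets such as `st.line_chart` and `st.bar_chart` after filtering with a year-range slider; how the actual app wires this up is not shown in this README.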
