Skip to content

kidanen/Python-Framework-Assignment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CORD-19 Data Explorer

Project Overview

This project explores the CORD-19 dataset, which contains metadata of COVID-19 research papers. The goal is to perform a complete data science workflow, including data loading, cleaning, analysis, visualization, and building an interactive application using Streamlit.

By completing this project, I gained hands-on experience with real-world data, learning how to handle missing values, extract insights, visualize trends, and create interactive dashboards.


Dataset

  • File used: metadata.csv from the CORD-19 dataset
  • Source: CORD-19 Dataset
  • Key columns:
    • title: Title of the research paper
    • abstract: Abstract text of the paper
    • publish_time: Publication date
    • journal: Journal name
    • source_x: Dataset source

Project Steps

Part 1: Data Loading and Basic Exploration

  1. Download the dataset and place metadata.csv in your project folder.
  2. Load the dataset using pandas:
import pandas as pd

df = pd.read_csv('metadata.csv', low_memory=False)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages