# Data Download

Download the datasets used for COVID-19 aspect-based sentiment analysis (ABSA) and emotion analysis.

**Authors:** Marko Haralović, Onat Akca, Salih Eren Yücetürk

## COVIDSenti Dataset

90,000 COVID-19 tweets with sentiment labels (positive, negative, neutral).

**Source:** https://github.com/usmaann/COVIDSenti

In [None]:
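import os
import requests
import pandas as pd

# Create the target directory if it does not already exist
os.makedirs('data/COVIDSenti', exist_ok=True)

# Download the CSV; the timeout keeps the cell from hanging on a stalled connection
url = 'https://raw.githubusercontent.com/usmaann/COVIDSenti/master/data/Final_data.csv'
response = requests.get(url, timeout=60)

if response.status_code == 200:
    with open('data/COVIDSenti/COVIDSenti.csv', 'wb') as f:
        f.write(response.content)
    print("Downloaded COVIDSenti dataset")

    # Verify that the file parses and inspect its structure
    df = pd.read_csv('data/COVIDSenti/COVIDSenti.csv')
    print(f"Shape: {df.shape}")
    print(f"Columns: {df.columns.tolist()}")
    # Guard against a missing 'label' column instead of raising a KeyError
    if 'label' in df.columns:
        print("\nLabel distribution:")
        print(df['label'].value_counts())
    else:
        print("\nNo 'label' column found; check the column names above")
else:
    print(f"Failed to download: {response.status_code}")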
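
The label check above depends on the CSV having a `label` column. As a minimal sketch that does not require pandas, the helper below reads the header with the standard `csv` module and raises a descriptive error when the column is absent. The name `label_counts` is hypothetical (it is not part of the original notebook), and UTF-8 encoding is an assumption about the file.

```python
import csv
from collections import Counter


def label_counts(csv_path, label_col="label"):
    """Count the values in `label_col` of a CSV file (hypothetical helper)."""
    # Assumption: the file is UTF-8 encoded with a header row
    with open(csv_path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        # fieldnames is None for an empty file, so fall back to an empty list
        columns = reader.fieldnames or []
        if label_col not in columns:
            raise KeyError(
                f"Column {label_col!r} not found; available columns: {columns}"
            )
        # The Counter is built while the file is still open
        return Counter(row[label_col] for row in reader)
```

Calling `label_counts('data/COVIDSenti/COVIDSenti.csv')` after the download should return one count per sentiment class, provided the column name matches.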

## METS-CoV Dataset

Medical entity and targeted sentiment annotations on COVID-19 tweets.

**Source:** https://github.com/YLab-Open/METS-CoV

In [None]:
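# Remove any leftover temporary clone from an earlier, interrupted run
!rm -rf data/METS-CoV-temp

# Shallow-clone the repository (only the latest commit is needed)
!git clone --depth 1 https://github.com/YLab-Open/METS-CoV.git data/METS-CoV-temp

# Move the data files into place and discard the rest of the clone
!mkdir -p data/METS-CoV
!mv data/METS-CoV-temp/data/* data/METS-CoV/
!rm -rf data/METS-CoV-temp

# List the result; an empty listing means the clone or the move failed
print("Finished METS-CoV download step; contents:")
!ls -lh data/METS-CoV/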
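
The shell commands in this cell assume a Unix-like shell (`mkdir -p`, `mv`, `rm -rf`). On other platforms, the move step could be sketched with the standard library instead. The helper name `move_contents` is hypothetical, and the sketch assumes the repository has already been cloned and that the destination holds no entries with the same names.

```python
import shutil
from pathlib import Path


def move_contents(src_dir, dest_dir):
    """Move every entry in src_dir into dest_dir and return the moved names."""
    src, dest = Path(src_dir), Path(dest_dir)
    if not src.is_dir():
        raise FileNotFoundError(f"Source directory not found: {src}")
    dest.mkdir(parents=True, exist_ok=True)  # same effect as `mkdir -p`
    moved = []
    # Unlike the shell glob `*`, iterdir() also returns hidden entries
    for entry in sorted(src.iterdir()):
        # shutil.move handles both files and subdirectories
        shutil.move(str(entry), str(dest / entry.name))
        moved.append(entry.name)
    return moved
```

With the paths from this cell, that would be `move_contents('data/METS-CoV-temp/data', 'data/METS-CoV')` followed by `shutil.rmtree('data/METS-CoV-temp')` to discard the rest of the clone.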

## Verify Downloads

In [None]:
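import os

datasets = {
    'COVIDSenti': 'data/COVIDSenti/COVIDSenti.csv',
    'METS-CoV': 'data/METS-CoV'
}

for name, path in datasets.items():
    if os.path.isdir(path):
        # A directory counts only if it is non-empty (a failed clone leaves it empty)
        present = len(os.listdir(path)) > 0
    else:
        # A file counts only if it exists and is not zero bytes
        present = os.path.isfile(path) and os.path.getsize(path) > 0

    if present:
        print(f"[OK] {name} downloaded")
    else:
        print(f"[MISSING] {name}")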
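
Beyond a present/missing check, it can help to see how much data actually arrived. The following is a minimal sketch, assuming the paths used above; the helper name `dataset_summary` is hypothetical and not part of the original notebook.

```python
import os


def dataset_summary(path):
    """Return (file_count, total_bytes) for a file or a directory tree."""
    if os.path.isfile(path):
        return 1, os.path.getsize(path)
    file_count, total_bytes = 0, 0
    # os.walk yields nothing for a missing path, so the result is (0, 0)
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_count += 1
            total_bytes += os.path.getsize(os.path.join(root, name))
    return file_count, total_bytes
```

For example, `dataset_summary('data/METS-CoV')` returning `(0, 0)` would indicate that the clone or move step did not produce any files.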