Make input.sh compatible with all OSes (re-write input.sh using python) #26

ilyasst · 2020-06-19T17:43:56Z

Is your feature request related to a problem? Please describe.
Currently, input.sh works for Ubuntu (it might work on MacOS if SVN is available but I did not test it), however it can definitely not be used for Windows.

Describe the solution you'd like
input.sh could be written in python which would make it possible to execute it using any OS as long as the python environment is properly setup.

The text was updated successfully, but these errors were encountered:

lisphilar · 2020-06-20T06:35:26Z

Dear @ilyasst ,
Thank you very much for your proposal and pull request!
input.py is very useful and the script was successfully marged to master branch!

lisphilar · 2020-06-20T07:33:48Z

Dear @ilyasst ,
As the next step, I plan to create a Python class CovsirPhy.DataLoader. This will download the datasets automatically and show the citations of the datasets.

For the users who are not Kaggers,

The number of cases (Global): directory download JHU data
The number of cases in Japan: will be discussed in Add example dataset to this repository #17
Total population: will be discussed in Automatic downloading of dataset: total population #29
OxCGRT: GitHub repository as the previous versions

(Kaggle users can download them manually with input.py.)

I will create a pull request for "1. The number of cases" later.

lisphilar · 2020-06-24T16:11:39Z

Dear @ilyasst ,
covsirphy.cleaning.data_loader.DataLoader was created for automatic data downloading of JHU/Japan/OxcGRT data. (Data loader of population dataset is pending now. #29 )

Please kindly comfirm it with the default branch. (Version 2.2.5)
Example codes are as follows.

import covsirphy as cs
# Set the directory to save the datasets
data_loader = cs.DataLoader("input")
# JHU dataset
jhu_data = data_loader.jhu()
print(jhu_data.citation)
jhu_data.cleaned()
# The number of cases in Japan
japan_data = data_loader.japan()
print(japan_data.citation)
jhu_data.replace(japan_data)
ncov_df = jhu_data.cleaned()
# OxCGRT dataset
oxcgrt_data = data_loader.oxcgrt()
print(oxcgrt_data.citation)
oxcgrt_df = oxcgrt_data.cleaned()
jpn_oxcgrt_df = oxcgrt_data.subset(iso3="JPN")

input.py was also updated.

ilyasst · 2020-06-24T22:29:12Z

I have pulled the code from master and followed the For developers guide. I had no problems with the installation, I was also able to download the JHU dataset and OxCGRT datasets with the method you provided above with no problems.

I was also able to use input.py to download all the datasets only when the kaggle.json file was stored in ~/.kaggle. It is not possible to simply put the kaggle.json file in the same folder as input.py because a modification in f837386 . It is necessary to set the OS environment variable "KAGGLE_CONFIG_DIR" before loading KaggleApi library otherwise it will fail to detect the kaggle.json file.

I will submit a PR for this shortly.

lisphilar · 2020-06-25T14:41:27Z

Dear @ilyasst ,
Thank you for your pull request. I merged it.

However, I don't recommend keeping kaggle.json in your working directory for a security reason. It may cause leak of your API keys accidentally.
I plan to stop using Kaggle API because we can replace Kaggle datasets (secondary data) with datasets provided by primary sources. Kagglers can import Kaggle datasets to their Kaggle notebooks with GUI and load the datasets with local_file argument.

data_loader = DataLoader(directory=None)
jhu_data = data_loader.jhu(local_file="kaggle/input/novel-corona-virus-2019-dataset/covid_19_data.csv")
japan_data = data_loader.japan(local_file="kaggle/input/covid19-dataset-in-japan/covid_jpn_total.csv")

Currenly, OxCGRT dataset in Kaggle is provided as EXCEL file. We need to convert it to CSV file.
https://www.kaggle.com/paultimothymooney/oxford-covid19-government-response-tracker

The difference of primary/Kaggle datasets will be adjust using covsirphy.cleaning sub-module.
What do you think about this?

I will add DataLoader.population method and update README.md within several days.

lisphilar · 2020-06-27T15:20:23Z

Dear @ilyasst ,
Please confirm that data loader of population dataset was included with the default branch.

import covsirphy as cs
# Set the directory to save the datasets
data_loader = cs.DataLoader("input")
# Population in each country
population_data = data_loader.population()

README.md was also updated.
Thank you.

lisphilar · 2020-06-30T12:28:54Z

Because this change was applied, I will close this issue. Thank you.

ilyasst added the enhancement New feature or request label Jun 19, 2020

ilyasst self-assigned this Jun 19, 2020

ilyasst mentioned this issue Jun 19, 2020

An alternative method (input.py) can be used to download the datasets… #27

Merged

lisphilar mentioned this issue Jun 20, 2020

add: kaggle.json to .gitignore #28

Merged

lisphilar mentioned this issue Jun 20, 2020

Automatic downloading of dataset: total population #29

Closed

lisphilar mentioned this issue Jun 20, 2020

Automatic downloading of dataset: JHU data #30

Merged

lisphilar added this to the Release CovsirPhy v2.3 milestone Jun 20, 2020

This was referenced Jun 20, 2020

Add COVID-19 dataset in Japan #31

Merged

ModuleNotFoundError: No module named 'better_exceptions' in installation #32

Closed

Issue26 #37

Merged

lisphilar referenced this issue Jun 24, 2020

add: data loader for OxcGRT datset

a6ccc65

lisphilar referenced this issue Jun 24, 2020

update: input files not using data loader

7c0d3e9

lisphilar referenced this issue Jun 24, 2020

update: example codes

510caaa

ilyasst mentioned this issue Jun 24, 2020

KAGGLE_CONFIG_DIR must be setup before loading KaggleApi otherwise ./… #39

Merged

lisphilar mentioned this issue Jun 27, 2020

Issue26 #41

Merged

lisphilar closed this as completed Jun 30, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make input.sh compatible with all OSes (re-write input.sh using python) #26

Make input.sh compatible with all OSes (re-write input.sh using python) #26

ilyasst commented Jun 19, 2020

lisphilar commented Jun 20, 2020

lisphilar commented Jun 20, 2020 •

edited

lisphilar commented Jun 24, 2020 •

edited

ilyasst commented Jun 24, 2020

lisphilar commented Jun 25, 2020

lisphilar commented Jun 27, 2020

lisphilar commented Jun 30, 2020

Make input.sh compatible with all OSes (re-write input.sh using python) #26

Make input.sh compatible with all OSes (re-write input.sh using python) #26

Comments

ilyasst commented Jun 19, 2020

lisphilar commented Jun 20, 2020

lisphilar commented Jun 20, 2020 • edited

lisphilar commented Jun 24, 2020 • edited

ilyasst commented Jun 24, 2020

lisphilar commented Jun 25, 2020

lisphilar commented Jun 27, 2020

lisphilar commented Jun 30, 2020

lisphilar commented Jun 20, 2020 •

edited

lisphilar commented Jun 24, 2020 •

edited