# Loading Local Data

This script shows how to load data from a local .csv file using our modules.

## Set configuration

The /doc path contains a file named configure.json. The methods implemented in our modules use this file to load configuration values such as the model type, number of nodes, training epochs, and test-set size in days. The path for fitted models and the remote model path are configured there too.

In [1]:
# Example: load the local configure.json and print its contents.
import json

# Open the file in a context manager so it is closed after reading.
with open('../doc/configure.json', 'r') as configure_json:
    print(json.dumps(json.load(configure_json), indent=4))

{
    "ncovid": "ML COVID-19 configure file",
    "author": "NatalNet NCovid",
    "published_at": 2021,
    "folder_configs": {
        "docs_path": "../doc/",
        "data_path": "../dbs/",
        "model_path": "fitted_model/",
        "model_path_remote": "https://",
        "glossary_file": "glossary.json"
    },
    "model_configs": {
        "type_used": "Artificial",
        "is_predicting": "False",
        "Artificial": {
            "model": "lstm",
            "nodes": 300,
            "epochs": 100,
            "dropout": 0.1,
            "batch_size": 64,
            "earlystop": 30,
            "is_output_in_input": "True",
            "data_configs": {
                "is_accumulated_values": "False",
                "is_apply_moving_average": "True",
                "window_size": 7,
                "data_test_size_in_days": 35,
                "type_norm": ""
            },
            "Autoregressive": {
                "model": "arima",
                "p": 1,
    

To load this set of configurations, import the configs_manner module (configs_manner.py).

In [2]:
# If this script runs from another folder, adjust the path below so it points to the /src folder.
import sys
sys.path.append("../src")

import configs_manner

# Print some of the configuration variables.
print("Model infos: \n", configs_manner.model_infos)
print("\n")
print("Models path: \n", configs_manner.model_path)
print("\n")
print("Data path: \n", configs_manner.data_path)

Model infos: 
 {'model_nodes': 300, 'model_epochs': 100, 'model_dropout': 0.1, 'model_batch_size': 64, 'model_earlystop': 30, 'model_is_output_in_input': True, 'data_is_accumulated_values': False, 'data_is_apply_moving_average': True, 'data_window_size': 7, 'data_test_size_in_days': 35, 'data_type_norm': ''}


Models path: 
 ../dbs/fitted_model/


Data path: 
 ../dbs/


To change any path parameter, edit its value in the configure.json file.
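
The JSON file stores flags such as `is_output_in_input` as the strings "True" and "False", while the printed `model_infos` shows Python booleans, and the printed models path is the data path followed by the model path. A minimal sketch of how such a conversion could be done is shown below. The helpers `to_bool` and `flatten_model_configs` are hypothetical and written only for illustration; they are not the actual `configs_manner` implementation, and the embedded configuration is a trimmed copy of the file shown above.

```python
import json

# A trimmed copy of the configuration shown above (only the keys used here).
CONFIG_TEXT = """
{
    "folder_configs": {"data_path": "../dbs/", "model_path": "fitted_model/"},
    "model_configs": {
        "type_used": "Artificial",
        "Artificial": {
            "model": "lstm",
            "nodes": 300,
            "is_output_in_input": "True",
            "data_configs": {"is_accumulated_values": "False", "window_size": 7}
        }
    }
}
"""


def to_bool(value):
    """Turn the strings "True"/"False" into booleans; leave other values alone."""
    if value == "True":
        return True
    if value == "False":
        return False
    return value


def flatten_model_configs(config):
    """Hypothetical helper: build a flat dict with model_/data_ prefixed keys."""
    model = config["model_configs"][config["model_configs"]["type_used"]]
    infos = {}
    for key, value in model.items():
        if key == "data_configs":
            for data_key, data_value in value.items():
                infos["data_" + data_key] = to_bool(data_value)
        elif key != "model":
            infos["model_" + key] = to_bool(value)
    return infos


config = json.loads(CONFIG_TEXT)
model_infos = flatten_model_configs(config)
folders = config["folder_configs"]
# The models path is the data path followed by the model sub-folder.
model_path = folders["data_path"] + folders["model_path"]

print(model_infos)
print(model_path)
```

Converting the flags once, at load time, means later code can test them with a plain `if` instead of comparing against the string "True".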

## Create data

In our modules, almost all procedures use classes and their methods. To build a data constructor, first import the data_manner module (data_manner.py).

In [3]:
# Import data_manner (assumes src/ is your working directory or is on sys.path).
import data_manner

Then create a new DataConstructor instance.

In [4]:
# creating the DataConstructor instance
data_constructor = data_manner.DataConstructor()

## Collect local data

Once the data constructor has been created, call its collect_dataframe() method, passing the full path to the .csv file (the local data path followed by the file name).

In [5]:
# collect data from the local path.
collected_data = data_constructor.collect_dataframe(configs_manner.data_path + "df_araraquara.csv")
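
Plain string concatenation works here because the data path ends with a slash. As a hedged alternative, the path can be built with `os.path.join` and checked before loading. The helper `build_csv_path` below is hypothetical and not part of `data_manner`; the folder value is copied from the data path printed earlier.

```python
import os


def build_csv_path(data_path, file_name):
    """Hypothetical helper: join folder and file name, requiring a .csv name."""
    if not file_name.lower().endswith(".csv"):
        raise ValueError("expected a .csv file name, got: " + file_name)
    return os.path.join(data_path, file_name)


# "../dbs/" is the data path printed earlier in this notebook.
csv_path = build_csv_path("../dbs/", "df_araraquara.csv")
print(csv_path)

# Checking that the file exists gives a clearer message than a failed load.
if not os.path.isfile(csv_path):
    print("File not found:", csv_path)
```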

The collected data is an N-feature vector, where each feature is itself a vector with one entry per day.

In [6]:
print("Feature 0 (confirmed cases): length ", len(collected_data[0]))
print("Feature 1 (confirmed deaths): length ", len(collected_data[1]))

Feature 0 (confirmed cases): length  430
Feature 1 (confirmed deaths): length  430
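
To make this layout concrete, the sketch below builds the same kind of structure from a tiny in-memory .csv using only the standard library. The column names and values are invented for illustration, and `columns_as_features` is a hypothetical helper; it is not how `collect_dataframe()` is implemented.

```python
import csv
import io

# Invented sample: one row per day, one column per feature.
SAMPLE_CSV = """date,cases,deaths
2021-01-01,10,1
2021-01-02,12,0
2021-01-03,9,2
"""


def columns_as_features(csv_text, feature_names):
    """Return one list per feature, each holding one value per day."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return [[float(row[name]) for row in rows] for name in feature_names]


features = columns_as_features(SAMPLE_CSV, ["cases", "deaths"])

print("Number of features:", len(features))
print("Feature 0 length:", len(features[0]))
print("Feature 1 length:", len(features[1]))
```

Each inner list has as many entries as there are days in the file, which is why both features above report the same length.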
