Skip to content

Commit

Permalink
Chapter6-CodeFix
Browse files Browse the repository at this point in the history
  • Loading branch information
tatsath committed Jul 26, 2020
1 parent 117a295 commit 42d11ca
Show file tree
Hide file tree
Showing 10 changed files with 600,043 additions and 22 deletions.
5 changes: 0 additions & 5 deletions .gitattributes
Original file line number Diff line number Diff line change
@@ -1,5 +0,0 @@
*.psd filter=lfs diff=lfs merge=lfs -text
Chapter[[:space:]]6[[:space:]]-[[:space:]]Sup.[[:space:]]Learning[[:space:]]-[[:space:]]Classification[[:space:]]models/CaseStudy1[[:space:]]-[[:space:]]Fraud[[:space:]]Detection/creditcard.csv filter=lfs diff=lfs merge=lfs -text
Chapter[[:space:]]6[[:space:]]-[[:space:]]Sup.[[:space:]]Learning[[:space:]]-[[:space:]]Classification[[:space:]]models/CaseStudy2[[:space:]]-[[:space:]]Loan[[:space:]]Default[[:space:]]Probability/LoansData.csv.gz filter=lfs diff=lfs merge=lfs -text
Chapter[[:space:]]6[[:space:]]-[[:space:]]Sup.[[:space:]]Learning[[:space:]]-[[:space:]]Classification[[:space:]]models/CaseStudy3[[:space:]]-[[:space:]]Bitcoin[[:space:]]Trading[[:space:]]Strategy/BitstampData.csv filter=lfs diff=lfs merge=lfs -text

Original file line number Diff line number Diff line change
Expand Up @@ -80,11 +80,19 @@
},
{
"cell_type": "code",
"execution_count": 148,
"execution_count": 1,
"metadata": {
"_cell_guid": "5d8fee34-f454-2642-8b06-ed719f0317e1"
},
"outputs": [],
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"Using TensorFlow backend.\n"
]
}
],
"source": [
"# Load libraries\n",
"import numpy as np\n",
Expand Down Expand Up @@ -125,16 +133,26 @@
"## 2.2. Loading the Data"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"We load the data in this step.\n",
"\n",
"#### <font color='red'>Note : Due to limit in the github for the data size, a sample of the data has been loaded in the jupyter notebook repository of this book. However, all the subsequent results in this jupyter notebook is with actual data (144MB) under https://www.kaggle.com/mlg-ulb/creditcardfraud. You should load the full data in case you want to reproduce the results. </font> "
]
},
{
"cell_type": "code",
"execution_count": 52,
"execution_count": 2,
"metadata": {
"_cell_guid": "787e35f7-bf9e-0969-8d13-a54fa87f3519"
},
"outputs": [],
"source": [
"# load dataset\n",
"dataset = read_csv('creditcard.csv')"
"dataset = read_csv('creditcard_sample.csv')\n",
"#dataset = read_csv('creditcard.csv') #Load this for the actual data."
]
},
{
Expand Down

This file was deleted.

Large diffs are not rendered by default.

Original file line number Diff line number Diff line change
Expand Up @@ -68,7 +68,7 @@
"after you’ve missed payments for several months. The predicted variable takes value 1\n",
"in case of charge-off and 0 otherwise.\n",
"\n",
"This case study aims to analyze data for loans through 2007-2017Q3 from Lending Club available on Kaggle (https://www.kaggle.com/przemekblo/lending-club-loan-classification/data). Dataset contains over 887 thousand observations and 150 variables among which one is describing the loan status. "
"This case study aims to analyze data for loans through 2007-2017Q3 from Lending Club available on Kaggle. Dataset contains over 887 thousand observations and 150 variables among which one is describing the loan status. "
]
},
{
Expand Down Expand Up @@ -142,17 +142,27 @@
"## 2.2. Loading the Data"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"We load the data in this step.\n",
"\n",
"#### <font color='red'>Note : Due to limit in the github for the data size, a sample of the data has been loaded in the jupyter notebook repository of this book. However, all the subsequent results in this jupyter notebook is with actual data (~1GB) under https://www.kaggle.com/mlfinancebook/lending-club-loans-data. You should load the full data in case you want to reproduce the results. </font> "
]
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": 2,
"metadata": {
"_cell_guid": "787e35f7-bf9e-0969-8d13-a54fa87f3519",
"scrolled": true
},
"outputs": [],
"source": [
"# load dataset\n",
"loans = pd.read_csv('LoansData.csv.gz', compression='gzip', low_memory=True)"
"loans = pd.read_csv('LoansData_sample.csv.gz', compression='gzip', encoding='utf-8')\n",
"#loans = pd.read_csv('LoansData.csv.gz', compression='gzip', low_memory=True) #Use this for the actual data"
]
},
{
Expand Down

This file was deleted.

Binary file not shown.
Original file line number Diff line number Diff line change
Expand Up @@ -137,6 +137,13 @@
"## 2.2. Loading the Data"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"#### <font color='red'>Note : Due to limit in the github for the data size, a sample of the data has been loaded in the jupyter notebook repository of this book. However, all the subsequent results in this jupyter notebook is with actual data (~150MB) under https://www.kaggle.com/mlfinancebook/bitstamp-bicoin-minutes-data. You should load the full data in case you want to reproduce the results. </font> "
]
},
{
"cell_type": "code",
"execution_count": 2,
Expand All @@ -146,7 +153,7 @@
"outputs": [],
"source": [
"# load dataset\n",
"dataset = pd.read_csv('BitstampData.csv')"
"dataset = pd.read_csv('BitstampData_sample.csv')"
]
},
{
Expand Down

This file was deleted.

Loading

0 comments on commit 42d11ca

Please sign in to comment.