GitHub - UCB-stat-159-s22/hw07-Group11: hw07-hw07-group11 created by GitHub Classroom

Welcome to HW 7

Credit Account Default Status and Characteristics

Also available on GitHub Pages: https://ucb-stat-159-s22.github.io/hw07-Group11/

Note: This repository is public. The Credit Card Fraud Detection Dataset is from Kaggle. The EDA and logistic regression model were developed by Joe, Isaac, and Uma as a homework assignment for the Spring 2022 installment of UC Berkeley's Stat 159/259 course, Reproducible and Collaborative Data Science.

In this study, we look at credit risk and how default status varies across groups of people with different characteristics. We examine relationships among selected variables, such as how the default rate varies by gender or income class, and we also build and train a simple logistic regression model to predict whether a client is likely to default. We chose this topic because we believe it is an important question in real-world finance, and we want to find out how well we can predict whether a client will actually default. The entire analysis is contained in main.ipynb, with computation details in details.ipynb.
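For readers new to the technique, here is a minimal sketch of what a logistic regression classifier does. It is not the code from details.ipynb: it uses only the Python standard library, fits by plain batch gradient descent, and runs on a tiny made-up dataset. The two feature names, the data values, and the helper names (fit_logistic, predict_proba) are all assumptions chosen for illustration.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Fit weights and a bias by batch gradient descent on the mean log-loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            # Prediction error for one row: predicted probability minus label.
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict_proba(w, b, x):
    """Return the modeled probability that a client with features x defaults."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


# Made-up toy data (NOT the Kaggle dataset). Hypothetical features:
# [credit utilization ratio, scaled count of late payments]; label 1 = default.
X = [[0.1, 0.0], [0.2, 0.0], [0.3, 0.2], [0.4, 0.1],
     [0.6, 0.5], [0.7, 0.6], [0.8, 0.9], [0.9, 0.8]]
y = [0, 0, 0, 0, 1, 1, 1, 1]

w, b = fit_logistic(X, y)
low_risk = predict_proba(w, b, [0.15, 0.0])
high_risk = predict_proba(w, b, [0.85, 0.9])
print("low-risk client:", round(low_risk, 3))
print("high-risk client:", round(high_risk, 3))
```

A predicted probability above 0.5 would typically be read as "likely to default", although the threshold is a modeling choice. In practice a library implementation would usually be preferable to a hand-written loop like this one.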

Makefile

The following are the available make commands:

env: Creates the environment and configures it by activating it and installing ipykernel into it
hw7_tools: Installs the hw07 tools package
all: Executes and generates outputs
clean: Removes all figures from the /output directory

Some helpful commands:

Install the packages: pip install . or make hw7_tools

Test the packages: pytest hw7_tools

Some helpful tips:

After make clean, rerun details.ipynb or make all to regenerate the data and outputs needed for main.ipynb

Detailed computations for data processing, plotting, modeling, and serialization are in details.ipynb. If you want to experiment with our results (e.g. by adjusting the data or parameters), work in details.ipynb rather than main.ipynb, since main.ipynb loads all of its outputs instead of computing them.
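The split between the two notebooks follows a compute-once, load-later pattern, which might be sketched as below. This is an illustration only: the file name model_summary.json, the result values, and the helper names are hypothetical, and JSON is used here just to keep the sketch dependency-free. The serialization format actually used in details.ipynb may differ.

```python
import json
import tempfile
from pathlib import Path


def save_results(results, path):
    """'details' step: write computed results to disk."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(results, indent=2))


def load_results(path):
    """'main' step: read precomputed results back without recomputing them."""
    return json.loads(path.read_text())


# A temporary directory stands in for the repository's output directory.
with tempfile.TemporaryDirectory() as tmp:
    out_file = Path(tmp) / "output" / "model_summary.json"  # hypothetical name
    computed = {"n_rows": 8, "default_rate": 0.5}  # placeholder values
    save_results(computed, out_file)
    loaded = load_results(out_file)

print(loaded)
```

Under this pattern, deleting the saved files (as make clean does for the figures) means the loading side has nothing to read until the computing side is run again.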

License

The project is released under the BSD 3-clause License.

