Skip to content

- tarball handler py ###########/ |||||||||| - shell script3.sh ########## |||||||||||||||||| - Unsupervised learning Clustering kmeans and hierarchical ##### |||||||||||||||| - topics: | 20-topics | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4

License

Notifications You must be signed in to change notification settings

CoderSales/unsupervised-learning-clustering

Repository files navigation

unsupervised-learning-clustering

Key elements in this repository:

Setup:

  1. References
  2. script3.sh - activates virtual environment (or source scripy3.sh)
  3. .bashrc - for virtual environment
    • in .venv folder
  4. Python Select Interpreter
  5. pip install ipykernel
  6. pip install jupyter
  7. sh installer.sh
  8. python.exe -m pip install --upgrade pip
  9. pip install notebook
  10. pip install pandas
  11. python -m pip install -U pip
  12. python -m pip install -U matplotlib
  13. pip install seaborn
  14. pip install -U scikit-learn
  15. pip install openpyxl
  16. pip install nb-black
  17. pip install xlwings
  18. xlwings addin install
  19. pip install natsort

Save setup

  1. pip freeze > requirements.txt

Load setup

  1. pip install -r requirements.txt

Analysis:

  1. notebooks/K-Means.ipynb
  2. data/technical_support_data-2.csv

Tarball Data Extraction:

  1. python tarball-handler.py

    Add to gitignore:

    1. custom components
    1.1 large files
    1.1.1 tarball

    TCGA-PANCAN-HiSeq-801x20531.tar.gz

    1.1.2 large data from tarball

    data/gene_data/TCGA-PANCAN-HiSeq-801x20531/TCGA-PANCAN-HiSeq-801x20531/data.csv data/gene_data/TCGA-PANCAN-HiSeq-801x20531/TCGA-PANCAN-HiSeq-801x20531/labels.csv

Note on data files and Large Data sets on GitHub:

Add data in own commit in case of 50 MB GitHub warning

update to .gitignore:

*.json
!spec/*.json

adapted to:

!*/ProcessedData.xlsx

git ignore all files of a certain type, except those in a specific subfolder

About

- tarball handler py ###########/ |||||||||| - shell script3.sh ########## |||||||||||||||||| - Unsupervised learning Clustering kmeans and hierarchical ##### |||||||||||||||| - topics: | 20-topics | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages