
Clustering methods for galaxy morphology classification

Classification of galaxies.

Status: in progress

Description

Welcome to the Galaxy Hackathon project! This project includes the following tutorials:

  • Tutorial 1 : Data Preprocessing
  • Tutorial 2 : Automatic Feature Extraction/Engineering
  • Tutorial 3 : Manual Feature Extraction/Engineering
  • Tutorial 4 : Data Visualisation
  • Tutorial 5 : Galaxy classification/clustering

Data

The GalaxyMNIST dataset is used for these tutorials. It contains 10,000 images of galaxies (either 3x64x64 or 3x224x224), labelled by Galaxy Zoo volunteers as belonging to one of four morphology classes:

  1. smooth and round
  2. smooth and cigar-shaped
  3. edge-on-disk
  4. unbarred spiral
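
As a rough illustration of the data layout described above, the sketch below counts examples per class in a batch of channel-first images. The integer encoding 0 to 3 (following the order of the list above), the `class_counts` helper, and the placeholder arrays are assumptions made for this example; they are not taken from the dataset's own loader.

```python
import numpy as np

# Class names in the order listed above; the 0-3 integer encoding is an assumption.
CLASS_NAMES = (
    "smooth and round",
    "smooth and cigar-shaped",
    "edge-on-disk",
    "unbarred spiral",
)


def class_counts(labels):
    """Count how many examples fall into each of the four classes."""
    counts = np.bincount(np.asarray(labels), minlength=len(CLASS_NAMES))
    return dict(zip(CLASS_NAMES, counts.tolist()))


# Placeholder batch standing in for real data: 8 images of shape 3x64x64.
images = np.zeros((8, 3, 64, 64), dtype=np.uint8)
labels = np.array([0, 1, 2, 3, 0, 1, 2, 3])

channels, height, width = images.shape[1:]
counts = class_counts(labels)
```

Checking the per-class counts before training is a quick way to confirm that the classes are balanced.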

Hackathon Task

After completing the tutorials, teams must use supervised learning to classify the four galaxy morphology classes. Work together to develop a pipeline that outperforms the one presented here. Justify your choices with the available literature, or by experimenting with and tuning different methods to find the best approach. Do not tune your methods on the test set, as this constitutes data leakage. Instead, hold out a validation set from the training set and tune on that. A 20:80 validation-to-training split should be sufficient.
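
One possible way to hold out a validation set is sketched below, assuming the training labels are available as an integer array. The function name `stratified_train_val_split`, the seed, and the placeholder labels are hypothetical and not part of the tutorials. The split is stratified so that each class keeps the same proportion in both subsets.

```python
import numpy as np


def stratified_train_val_split(labels, val_fraction=0.2, seed=0):
    """Return (train_idx, val_idx), preserving per-class proportions."""
    labels = np.asarray(labels)
    rng = np.random.default_rng(seed)
    train_parts, val_parts = [], []
    for cls in np.unique(labels):
        cls_idx = np.flatnonzero(labels == cls)  # positions of this class
        rng.shuffle(cls_idx)                     # shuffle within the class
        n_val = int(round(val_fraction * len(cls_idx)))
        val_parts.append(cls_idx[:n_val])
        train_parts.append(cls_idx[n_val:])
    train_idx = np.sort(np.concatenate(train_parts))
    val_idx = np.sort(np.concatenate(val_parts))
    return train_idx, val_idx


# Placeholder labels standing in for the real training labels:
# 4 classes with 250 examples each (hypothetical sizes).
y_train_full = np.repeat(np.arange(4), 250)

train_idx, val_idx = stratified_train_val_split(
    y_train_full, val_fraction=0.2, seed=42
)
```

The returned index arrays can then be used to slice both the image array and the label array. The test set stays untouched until the final evaluation. If scikit-learn is installed, `sklearn.model_selection.train_test_split` with `test_size=0.2` and `stratify=` set to the labels does the same job.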

Prerequisites

All the libraries/dependencies necessary to run the tutorials are listed in the requirements.txt file.

Installation

pip install -r requirements.txt

Would you like to clone this repository? Feel free!

git clone https://github.com/Hack4Dev/galaxy_CV.git

Then make sure the Python libraries listed in requirements.txt are installed before running the tutorials.

New to GitHub?

The easiest way to get all of the lecture and tutorial material is to clone this repository. To do this, you need git installed on your laptop. If you're working on Linux, you can install git using apt (you might need to use sudo):

apt install git

You can then clone the repository by typing:

git clone https://github.com/Hack4Dev/galaxy_CV.git

To update your clone if changes are made, use:

cd galaxy_CV/
git pull

Original research work:

E. Fielding, C. N. Nyirenda and M. Vaccari, "The Classification of Optical Galaxy Morphology Using Unsupervised Learning Techniques," 2022 International Conference on Electrical, Computer and Energy Technologies (ICECET), 2022, pp. 1-6, doi: 10.1109/ICECET55527.2022.9872611.

Data used

Walmsley, Mike, et al. "Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunteers and deep learning for 314 000 galaxies." Monthly Notices of the Royal Astronomical Society 509.3 (2022): 3966-3988, doi:10.48550/arXiv.2102.08414.
