Binary-Classification-using-CNN-and-Data-parallelism-with-MPI

Binary data classification using TensorFlow and Keras in Python. Data parallelism with MPI is used for data augmentation, model training and testing.

Steps to use the program

Download or clone the repository.
Navigate to the repository folder on your PC and create five separate folders named "logs", "models", "dog_cat", "data" and "augmented". The model will be saved in the "models" folder and the logs in the "logs" folder.
Download the dataset and place the images in the "dog_cat" folder, in subfolders named after their labels. For example, inside "dog_cat" create a folder called "dog", inside it create another folder also named "dog", and put all the downloaded dog images there. NOTE: The data augmentation program works properly only if this layout is followed.
Once the "dog_cat" folder contains all the data files, go to the "augmented" folder and create one folder per label, for example one named "dog" and one named "cat". These label names must match the names used in the "dog_cat" folder. NOTE: The data in the "augmented" folder is what is used as the main data for training and testing the model.
Run the augmentation program, changing the number of iterations to generate the desired amount of data.
Run the data_separation.py file to separate the data and achieve data parallelism. The total data is separated into a number of folders equal to the number of processes (the -np value) given in the MPI execution command. Each data subfolder will then hold a separate class of data. NOTE: Currently the classification works for only 4 data folders, so run the separation with 4 processes. If you change this number, you are expected to make the necessary changes in the pet_classifier_prl.py file.
Finally, run the classifier program after setting the desired number of iterations.
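
The folder setup in the steps above can also be scripted. The following is a minimal sketch, not part of the repository: the helper name create_layout is hypothetical, the labels "dog" and "cat" are taken from the examples above, and the "data" folder is simply created empty because the steps do not say what goes inside it beforehand.

```python
from pathlib import Path

# Top-level folders named in the setup steps.
TOP_LEVEL = ("logs", "models", "dog_cat", "data", "augmented")


def create_layout(root, labels=("dog", "cat")):
    """Create the folder layout described in the steps (hypothetical helper)."""
    root = Path(root)
    for name in TOP_LEVEL:
        (root / name).mkdir(parents=True, exist_ok=True)
    for label in labels:
        # Raw images go in dog_cat/<label>/<label>/ (nested folder, same name).
        (root / "dog_cat" / label / label).mkdir(parents=True, exist_ok=True)
        # Augmented images go in augmented/<label>/.
        (root / "augmented" / label).mkdir(parents=True, exist_ok=True)
    # Return every directory created, relative to root, for inspection.
    return sorted(
        p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_dir()
    )
```

Calling create_layout(".") from the repository folder creates any missing folders and leaves existing ones untouched.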

Commands to run the program

mpiexec -np 4 python data_separation.py
mpiexec -np 5 python pet_classifier_prl.py
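
How data_separation.py divides the data is not shown in this README. As an illustration only, one common way to give each MPI process its own share of a file list is a round-robin split. The sketch below assumes that approach; the function name shard and the file names are hypothetical. With mpi4py, each process would typically pick its part using the rank returned by MPI.COMM_WORLD.Get_rank().

```python
def shard(items, n_parts):
    """Split items into n_parts lists, round-robin (sizes differ by at most one)."""
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    # Sort first so every process computes the same split independently.
    ordered = sorted(items)
    return [ordered[i::n_parts] for i in range(n_parts)]


# Hypothetical file names, split across 4 parts as in the -np 4 command.
files = [f"dog_{i}.jpg" for i in range(10)]
parts = shard(files, 4)
for rank, part in enumerate(parts):
    print(rank, part)
```

Every file lands in exactly one part, so no image is processed twice and none is skipped.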

Sujith013/Binary-Classification-using-Machine-Learning-and-Data-parallelism

Folders and files

Latest commit

History

Repository files navigation

Binary-Classification-using-CNN-and-Data-parallelism-with-MPI

Steps to use the program

Commands to run the program

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages