Cassini-chris/Clustering-App


Clustering-App running on Google Cloud App Engine

Topic modeling (a form of unsupervised learning) using non-negative matrix factorization (NMF)

Please check out:

Clustering_Pic

In this project I explore topic modeling via non-negative matrix factorization (NMF). The Clustering App is an application that identifies clusters by running topic modeling over an uploaded CSV file. The application is powered by Google App Engine (GAE) and runs on Python 3.9. Although topic modeling is not new, the idea of this project is to highlight cloud technologies, predominantly GCP services, that make clustering tangible and accessible for everyone. To build and run the application, I only used:

  • Google App Engine
  • Flask
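To make the NMF step less of a black box, here is a minimal, dependency-free sketch of the idea. It is not the code from main.py: the helper names (`transpose`, `matmul`, `scale`, `nmf`), the toy word-count matrix, and the choice of Lee-Seung multiplicative updates are all assumptions made for illustration. A real application would more likely rely on a library implementation. The sketch factors a documents-by-terms matrix V into non-negative matrices W (documents by topics) and H (topics by terms), then assigns each document to its strongest topic.

```python
import random

def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    b_cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols] for row in a]

def scale(m, num, den, eps=1e-9):
    """Element-wise multiplicative update: m * num / (den + eps)."""
    return [[m_ij * n_ij / (d_ij + eps) for m_ij, n_ij, d_ij in zip(mr, nr, dr)]
            for mr, nr, dr in zip(m, num, den)]

def nmf(v, rank, iterations=500, seed=0):
    """Approximate non-negative V (docs x terms) as W @ H with W, H >= 0."""
    rng = random.Random(seed)
    # Strictly positive random start so no entry is stuck at zero.
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in v]
    h = [[rng.random() + 0.1 for _ in v[0]] for _ in range(rank)]
    for _ in range(iterations):
        wt = transpose(w)
        h = scale(h, matmul(wt, v), matmul(matmul(wt, w), h))   # H <- H * (W^T V) / (W^T W H)
        ht = transpose(h)
        w = scale(w, matmul(v, ht), matmul(w, matmul(h, ht)))   # W <- W * (V H^T) / (W H H^T)
    return w, h

# Hypothetical toy data: rows are documents, columns are word counts per term.
terms = ["cloud", "engine", "deploy", "topic", "cluster", "matrix"]
v = [
    [3, 2, 1, 0, 0, 0],
    [2, 3, 2, 0, 0, 0],
    [0, 0, 0, 3, 2, 2],
    [0, 0, 0, 2, 3, 1],
]

w, h = nmf(v, rank=2)

# Each document's cluster is the topic with the largest weight in its row of W.
labels = [row.index(max(row)) for row in w]

# The highest-weighted terms in each row of H describe that topic.
top_terms = [sorted(terms, key=lambda t, k=k: -h[k][terms.index(t)])[:3]
             for k in range(2)]
print(labels)
print(top_terms)
```

Which topic ends up with index 0 or 1 depends on the random start, so only the grouping of documents is meaningful, not the label numbers themselves.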

TL;DR: Machine learning does not have to be a black box. Applications that use ML generally consist of different building blocks. My motivation for building, deploying, and open-sourcing applications is purely educational. I also want to offer a simple guide on how to run ML applications in the cloud and make them accessible to the wider public. In this repo you will find the entire code for performing clustering over a CSV file.

The repo contains the following:

  • app.yaml (to specify the Python runtime)
  • cron.yaml (to keep the Google App Engine instance from shutting down)
  • main.py (the Python application with the NMF model)
  • requirements.txt (to specify the dependencies)
  • A static folder (that holds the CSS, images, and favicons used)
  • A template folder (that holds the app interface)
  • A dummy_data folder (that holds an example CSV file)
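Before a CSV file can be clustered, its text has to be turned into a numeric matrix. The sketch below shows one way this could be done using only the standard library. The column layout of the example file in dummy_data is not described here, so the `text` column name, the `csv_to_term_matrix` helper, and the sample rows are assumptions for illustration rather than what main.py actually does.

```python
import csv
import io
import re
from collections import Counter

def csv_to_term_matrix(csv_text, text_column="text"):
    """Return (vocabulary, matrix), where matrix[i][j] counts term j in CSV row i.

    `text_column` is an assumed column name; adjust it to match the real file.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    # One word-count Counter per row; missing or empty cells become empty documents.
    docs = [Counter(re.findall(r"[a-z]+", (row.get(text_column) or "").lower()))
            for row in reader]
    vocabulary = sorted(set().union(*docs))
    # Counter returns 0 for terms a document does not contain.
    matrix = [[doc[term] for term in vocabulary] for doc in docs]
    return vocabulary, matrix

# Hypothetical sample standing in for an uploaded file.
sample = "id,text\n1,Cloud deploy cloud\n2,Topic model\n"
vocabulary, matrix = csv_to_term_matrix(sample)
print(vocabulary)  # ['cloud', 'deploy', 'model', 'topic']
print(matrix)      # [[2, 1, 0, 0], [0, 0, 1, 1]]
```

The resulting matrix has one row per CSV row and one column per term, which is the documents-by-terms shape that an NMF model expects as input.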

Disclaimer: This is not an officially supported Google product.
