GitHub - sbendimerad/ml_deployment: Deploying ml model with flask/streamlit/heroku

Text Classification for StackOverflow Tags

The purpose of this project is to create an algorithm capable of taking textual data as input, specifically questions from the StackOverflow forum, and predicting the most relevant tag to automate and streamline question indexing on the site.

The algorithm used in this project consists of a pipeline with two main steps. It leverages the CountVectorizer for feature extraction and employs Logistic Regression based on the OneVsRestClassifier approach for multi-label classification. The pre-trained model is available in the file trained_bow_logreg.joblib.

The model is deployed on a web application using Flask, Streamlit, and Heroku.

This repository includes the following files:

flask_app.py: Our API
preprocess.py: Contains all functions to prepare the data
viz_app.py: Contains information about the Streamlit web app's interface

To use the algorithm, follow these steps:

Download the code from this repository (either using git clone or direct download).
Open a command line interface and navigate to the downloaded folder.
Create a virtual environment using conda or venv (optional but recommended).
Run the command pip install -r requirements.txt to install all the necessary modules for the algorithm to work.
Run the command python flask_app.py and wait for the server to launch.
In a new terminal tab, still within the repository folder, run the command streamlit run viz_app.py. A window should open in your browser.

If everything works correctly, you should see an input bar prompting you to enter text. When you input a computer-related question, the algorithm will recommend relevant tags based on the topic you are addressing. If the algorithm is unsure of what to recommend, it will let you know.

Don't forget to deploy everything on Heroku!

Enjoy using the application!

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
flask_app.py		flask_app.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
setup.sh		setup.sh
top_10_tags.txt		top_10_tags.txt
trained_bow_logreg.joblib		trained_bow_logreg.joblib
viz_app.py		viz_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Classification for StackOverflow Tags

About

Releases

Packages

Contributors 3

Languages

sbendimerad/ml_deployment

Folders and files

Latest commit

History

Repository files navigation

Text Classification for StackOverflow Tags

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages