Custom BERT Model for Text Classification

Model Description

This is a custom BERT model fine-tuned for text classification. It was trained on a subset of a publicly available dataset and classifies text into three classes.

The dataset contains sentence pairs (a premise and a hypothesis) and is intended for natural language understanding tasks such as inference classification: predicting the relationship between the premise and the hypothesis. Our task is to classify each "premise" + "hypothesis" pair according to the corresponding label.

Project Objective: Develop a deep learning model capable of predicting the relationship between the premise and the hypothesis in each sentence pair.

Task: Classification. Predict the relationship between the premise and the hypothesis; the classes correspond to the possible relationships "Contradiction", "Entailment", and "Neutral".
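
For illustration, one record and its label might look like the following. The record and the label-to-id mapping shown here are hypothetical, not taken from the dataset's documentation:

# Hypothetical record; the label ids below are assumed for illustration.
label_names = {0: "Entailment", 1: "Neutral", 2: "Contradiction"}
example = {
    "premise": "A man is playing a guitar on stage.",
    "hypothesis": "A musician is performing.",
    "label": 0,  # Entailment under the assumed mapping
}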

Training Details

  • Architecture: BERT Base Multilingual Cased
  • Training data: Custom dataset
  • Preprocessing: Tokenized using BERT's tokenizer, with a max sequence length of 80.
  • Fine-tuning: The model was trained for 1 epoch with a learning rate of 2e-5, using the AdamW optimizer and cross-entropy loss (a fine-tuning sketch follows this list).
  • Evaluation Metrics: Accuracy on a held-out validation set.
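
The bullets above translate into a short fine-tuning loop. The following is a minimal sketch consistent with those details, not the exact training script; train_pairs is a placeholder iterable of (premise, hypothesis, label) tuples:

import torch
from transformers import AutoTokenizer, BertForSequenceClassification

# Start from the pretrained multilingual checkpoint with a 3-class head.
tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-multilingual-cased", num_labels=3
)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

model.train()
for premise, hypothesis, label in train_pairs:  # placeholder dataset
    inputs = tokenizer(
        premise, hypothesis,
        truncation=True, padding="max_length", max_length=80,
        return_tensors="pt",
    )
    # BertForSequenceClassification computes cross-entropy loss internally
    # when labels are passed.
    loss = model(**inputs, labels=torch.tensor([label])).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()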

How to Use

Dependencies

  • Transformers 4.x
  • Torch 1.x
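
Both can be installed with pip, for example:

pip install "transformers>=4,<5" "torch>=1,<2"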

Code Snippet

For classification:

import torch
import torch.nn as nn
from transformers import AutoTokenizer, BertModel

tokenizer = AutoTokenizer.from_pretrained("billfass/multilingual_bert_model_classiffication")
model = BertModel.from_pretrained("billfass/multilingual_bert_model_classiffication")
model.eval()

# Classification head mapping the 768-dim [CLS] embedding to the 3 classes.
# Note: a freshly initialized layer produces random predictions; its weights
# must be loaded from the fine-tuned checkpoint for meaningful labels.
classifier = nn.Linear(768, 3)

def prediction(premise, hypothesis):
    inputs = tokenizer(premise, hypothesis, return_tensors="pt",
                       truncation=True, max_length=80)
    with torch.no_grad():
        output = model(**inputs)
        # Use the [CLS] token embedding as the sentence-pair representation.
        cls_embedding = output.last_hidden_state[:, 0, :]
        logits = classifier(cls_embedding)
    label_pred = torch.argmax(logits, dim=1).item()

    return {
        "label": label_pred
    }

print(prediction("Hello world", "I'm here"))

Test the model using Gradio

Launch the demo.ipynb notebook to test the model with Gradio.
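
The notebook wraps the prediction function above in a small Gradio interface, roughly along these lines (a minimal sketch; it assumes prediction from the code snippet is in scope):

import gradio as gr

# Minimal UI around the prediction() function defined above.
demo = gr.Interface(
    fn=prediction,
    inputs=[gr.Textbox(label="Premise"), gr.Textbox(label="Hypothesis")],
    outputs="json",
)
demo.launch()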

Test the model using FastAPI

Launch the main.py file to test the model with FastAPI.
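
main.py serves the model over HTTP. A minimal sketch of such an app is shown below; the route name and query parameters are assumptions, not necessarily those used in main.py:

from fastapi import FastAPI

app = FastAPI()

# Hypothetical route; the actual path in main.py may differ.
@app.get("/predict")
def predict(premise: str, hypothesis: str):
    # Reuses the prediction() function from the code snippet above.
    return prediction(premise, hypothesis)

Run it with uvicorn main:app.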

Limitations and Bias

  • Trained on a specific dataset, so it may not generalize well to other kinds of text.
  • Uses multilingual cased BERT, so it's not optimized for any specific language.

Authors

Acknowledgments

Special thanks to Hugging Face for providing the Transformers library that made this project possible.

