Repository summary

This repository is used to make and understand concepts of NLP through multiple examples/models and data processing method.

All methods used will be written in a document when I'll consider to have got enough information to make a useful note. (This will likely be after the implementation of tranformers networks)

Specifications

To clarify, my ressources to run the code come from the shadow computer most used for cloud gaming, here's the specs though:

Intel Xeon CPU E5-2678
12 GB RAM
Nvidia Quadro P5000

Branches

main branch is used as a meddle of multiples models/methods
tranformer branch is used exclusively for transformer network and to improve these

Current Performance

Main Branch

The main metric is F1 Score

Using a simple LSTM with only a coarse usage of "CountVectorizer": Train: ~ 90%+ | Eval: ~70% Using LSTM with Embedded layer and glove pretrained weights: Train: ~ 95%+ | Eval: ~40%

Might come back on these later (After transformer built)

Tranformer Branch

WIP

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
__pycache__		__pycache__
dataset		dataset
images		images
.gitignore		.gitignore
classes.py		classes.py
main.py		main.py
notebook.ipynb		notebook.ipynb
readme.md		readme.md
textProcessing.ipynb		textProcessing.ipynb
textfn.py		textfn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Repository summary

Specifications

Branches

Current Performance

Main Branch

Tranformer Branch

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Repository summary

Specifications

Branches

Current Performance

Main Branch

Tranformer Branch

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages