text_classification

Tools used in this project

hydra: Manage configuration files - article
pdoc: Automatically create an API documentation for your project
pre-commit plugins: Automate code reviewing formatting
Poetry: Dependency management - article

Project Structure

.
├── config                      
│   ├── main.yaml                   # Main configuration file
│   ├── model                       # Configurations for training model
│   │   └── model1.yaml             # Second variation of parameters to train model
│   └── process                     # Configurations for processing data
│       └── process1.yaml           # Second variation of parameters to process data
├── data            
│   ├── final                       # data after training the model
│   ├── processed                   # data after processing
│   └── raw                         # raw data
├── docs                            # documentation for your project
├── .gitignore                      # ignore files that cannot commit to Git
├── Makefile                        # store useful commands to set up the environment
├── models                          # store models
├── notebooks                       # store notebooks
├── .pre-commit-config.yaml         # configurations for pre-commit
├── pyproject.toml                  # dependencies for poetry
├── README.md                       # describe your project
├── requirements.txt                # This contains the requirements file
└── src                             # store source code
    ├── __init__.py                 # make src a Python module 
    ├── process.py                  # process data before training model
    ├── train_model.py              # train model
    └── utils.py                    # store helper functions

Set up the environment

Install Poetry
Activate the virtual environment:

poetry shell

Install dependencies:

To install all dependencies from pyproject.toml, run:

poetry install

To install only production dependencies, run:

poetry install --only main

To install a new package, run:

poetry add <package-name>

View and alter configurations

To view the configurations associated with a Pythons script, run the following command:

python src/process.py --help

Output:

process is powered by Hydra.

  == Configuration groups ==
  Compose your configuration from those groups (group=option)

model: model1
process: process1


  == Config ==
  Override anything in the config (foo.bar=value)

process:
  use_columns: sentence
  batch_size: 16
model:
  name: Logistic regression
  parameters:
    steps: 200
data:
  raw:
    train: ../data/raw/train.parquet
    val: ../data/raw/val.parquet

  processed:
    train: ../data/processed/train.parquet
    val: ../data/processed/val.parquet

  final: ../data/final/metrics.csv

To alter the configurations associated with a Python script from the command line, run the following:

python src/process.py data.raw=sample2.csv

Auto-generate API documentation

To auto-generate API document for your project, run:

make docs_save

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

text_classification

Tools used in this project

Project Structure

Set up the environment

View and alter configurations

Auto-generate API documentation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
config		config
data		data
docs		docs
models		models
notebooks		notebooks
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

text_classification

Tools used in this project

Project Structure

Set up the environment

View and alter configurations

Auto-generate API documentation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages