Skip to content

Commit

Permalink
Merge 4b77b52 into 0354dcf
Browse files Browse the repository at this point in the history
  • Loading branch information
Ghostvv committed Aug 29, 2019
2 parents 0354dcf + 4b77b52 commit e008897
Showing 1 changed file with 19 additions and 0 deletions.
19 changes: 19 additions & 0 deletions docs/nlu/choosing-a-pipeline.rst
Original file line number Diff line number Diff line change
Expand Up @@ -68,6 +68,25 @@ We do not recommend that you use these; mitie support is likely to be deprecated
You need to provide enough data for both intents and entities.


Class imbalance
---------------

In order to mitigate class imbalance problem, we introduced ``balanced`` batching strategy for
``supervised_embeddings`` pipeline. This algorithm ensures that all classes are represented
at least in subsequent batches, still mimicking the fact that some classes are more frequent than others.
It is turned on by default. In order to turn it off and use classic batching strategy pass
``batch_strategy: sequence`` in config file.

.. code-block:: yaml
language: "en"
pipeline:
- name: "CountVectorsFeaturizer"
- name: "EmbeddingIntentClassifier"
batch_strategy: sequence
Multiple Intents
----------------

Expand Down

0 comments on commit e008897

Please sign in to comment.