Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add normalization option to Chinese characters (using OpenCC) and separate symbols from merging #473

Closed
wants to merge 26 commits into from

Commits on Oct 14, 2020

  1. update

    ecchochan committed Oct 14, 2020
    Configuration menu
    Copy the full SHA
    3dbe3eb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0d941d1 View commit details
    Browse the repository at this point in the history

Commits on Oct 19, 2020

  1. Configuration menu
    Copy the full SHA
    fb24cd7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d14f12b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    50d6b28 View commit details
    Browse the repository at this point in the history
  4. Formatting

    ecchochan committed Oct 19, 2020
    Configuration menu
    Copy the full SHA
    86795b6 View commit details
    Browse the repository at this point in the history
  5. Formatting

    ecchochan committed Oct 19, 2020
    Configuration menu
    Copy the full SHA
    924a0c7 View commit details
    Browse the repository at this point in the history

Commits on Oct 20, 2020

  1. Configuration menu
    Copy the full SHA
    ef94c76 View commit details
    Browse the repository at this point in the history
  2. Formatting :S

    ecchochan committed Oct 20, 2020
    Configuration menu
    Copy the full SHA
    440a7c5 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    f1bc628 View commit details
    Browse the repository at this point in the history
  4. Formatting :S

    ecchochan committed Oct 20, 2020
    Configuration menu
    Copy the full SHA
    cfc986e View commit details
    Browse the repository at this point in the history

Commits on Feb 21, 2021

  1. Configuration menu
    Copy the full SHA
    ecde2ff View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b28a602 View commit details
    Browse the repository at this point in the history
  3. Merge commit '8f03d6ddc1f5d503160aac9082a94ed0006aca43' into zh-norm-3

    # Conflicts:
    #	bindings/node/lib/bindings/normalizers.js
    #	bindings/node/native/src/normalizers.rs
    ecchochan committed Feb 21, 2021
    Configuration menu
    Copy the full SHA
    37296f4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    cb2b4fe View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    8e1bb7f View commit details
    Browse the repository at this point in the history
  6. no message

    ecchochan committed Feb 21, 2021
    Configuration menu
    Copy the full SHA
    e9c7db2 View commit details
    Browse the repository at this point in the history
  7. Merge commit '558f2d87795ffc9d9786f1e923398e3eebe14187' into zh-norm-3

    # Conflicts:
    #	docs/source/_static/js/custom.js
    #	docs/source/api/python.inc
    #	docs/source/components.rst
    #	docs/source/conf.py
    #	docs/source/index.rst
    #	docs/source/quicktour.rst
    ecchochan committed Feb 21, 2021
    Configuration menu
    Copy the full SHA
    54342ec View commit details
    Browse the repository at this point in the history
  8. Merge commit 'dc60d4fc0c940c7c24962aec996150cd9708430f' into zh-norm-3

    # Conflicts:
    #	bindings/python/Cargo.lock
    #	bindings/python/py_src/tokenizers/normalizers/__init__.pyi
    ecchochan committed Feb 21, 2021
    Configuration menu
    Copy the full SHA
    4d2152c View commit details
    Browse the repository at this point in the history

Commits on Feb 22, 2021

  1. Merge commit '6e364cb685858dab4d19a8ac79176588053c8c0e' into zh-norm-3

    # Conflicts:
    #	bindings/python/src/normalizers.rs
    #	bindings/python/tests/bindings/test_normalizers.py
    #	tokenizers/src/normalizers/bert.rs
    ecchochan committed Feb 22, 2021
    Configuration menu
    Copy the full SHA
    a6c0f65 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6aaeb79 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    0ad10e1 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    3d06542 View commit details
    Browse the repository at this point in the history
  5. fix for linting

    ecchochan committed Feb 22, 2021
    Configuration menu
    Copy the full SHA
    b105bf3 View commit details
    Browse the repository at this point in the history
  6. formatting

    ecchochan committed Feb 22, 2021
    Configuration menu
    Copy the full SHA
    c8152a9 View commit details
    Browse the repository at this point in the history

Commits on Nov 11, 2021

  1. Update setup.py

    ecchochan committed Nov 11, 2021
    Configuration menu
    Copy the full SHA
    0f8370c View commit details
    Browse the repository at this point in the history