Unary estimator for detecting names and transforming to gender #445

Merged 64 commits on Jan 14, 2020

Commits on Dec 5, 2019

  1. Re-added unary estimator code and started porting logic to Algebird monoids instead of custom accumulators
    MWYang committed Dec 5, 2019
    2acf3fc

Commits on Dec 6, 2019

  1. b55c31e
  2. f443952
  3. 557ef39
  4. Added HLL back to monoid accumulator and code now compiles correctly; Still need to fix HLL serialization in Spark issue
    MWYang committed Dec 6, 2019
    b6728ec
  5. Fixed HLL in NameDetectStats not serializing correctly; Now need to fix printing bug for NameDetectStats
    MWYang committed Dec 6, 2019
    ff1b2ef
  6. 33b77c9
  7. Fixed guard stat calculation computing moments of number of tokens instead of moments of text length; Still need to fix no moments higher than the 1st being calculated
    MWYang committed Dec 6, 2019
    4caec75
  8. b60dc4a

Commits on Dec 7, 2019

  1. 469111b
  2. e5f169e

Commits on Dec 9, 2019

  1. b701612
  2. Added honorific detection
    MWYang committed Dec 9, 2019
    8342dae
  3. 2e0e85a
  4. b079a27
  5. a1197a7
  6. Updated documentation
    MWYang committed Dec 9, 2019
    19bad0b
  7. 3172bde
  8. Added flag for ignoring nulls
    MWYang committed Dec 9, 2019
    0d82eef
  9. 7812d12
  10. 345508f

Commits on Dec 10, 2019

  1. a80e382
  2. f7817d9
  3. cf3eff0
  4. b928a49
  5. Created metadata case class per PR review; Added tests for metadata; Cleaned up test code
    MWYang committed Dec 10, 2019
    048c084
  6. Added test for name threshold
    MWYang committed Dec 10, 2019
    5d30e79
  7. 78c321c
  8. a597d21

Commits on Dec 11, 2019

  1. 997a132
  2. 6b8a039
  3. 1250559
  4. Fixed tests sometimes failing because they were not using the same name dictionary as NameDetectUtils; Removed now unnecessary change to RandomText test helper
    MWYang committed Dec 11, 2019
    2d804ee
  5. 5336e71

Commits on Dec 12, 2019

  1. 464fe52
  2. bff4d1b
  3. Removed type parameter from NameDetectFun because of later conflict with SmartTextMapVectorizer
    MWYang committed Dec 12, 2019
    639af3d

Commits on Dec 13, 2019

  1. Removed Pythonic i.e. not Scala-ic index thing and added separate case for checking the last token instead, per PR comment
    MWYang committed Dec 13, 2019
    440068c
  2. e6136f9
  3. 283d76e
  4. 80d81ed
  5. Fixed serialization of GenderDetectStrategy, per PR recommendation to test with local workflow
    MWYang committed Dec 13, 2019
    e003eb2
  6. Fixed missing plus sign in OpPipelineStageReaderWriter causing double serialization test to fail
    MWYang committed Dec 13, 2019
    b51e14b

Commits on Dec 14, 2019

  1. 393275c
  2. ad9574c

Commits on Dec 18, 2019

  1. 5d7716b

Commits on Dec 20, 2019

  1. eaad6f8
  2. 2bea311
  3. 77f91a2
  4. b6b385a

Commits on Dec 21, 2019

  1. 8ed02bd

Commits on Jan 6, 2020

  1. 799eb58
  2. ca69ed8
  3. b82440a
  4. d747c92
  5. 7d01d70

Commits on Jan 7, 2020

  1. b4209c6
  2. 23d7a57

Commits on Jan 8, 2020

  1. b9522de
  2. Incorporated PR comments (using enumeratum for NameStats map keys/values and proper camel casing for guard check params)
    MWYang committed Jan 8, 2020
    e4e3ddd

Commits on Jan 9, 2020

  1. Incorporated PR comments (renamed GenderStrings to GenderValues and removed BooleanStrings enum)
    MWYang committed Jan 9, 2020
    8b02dff
  2. Removed plural names from NameStats enums and factored out method in identifyGender to hopefully reduce complexity
    MWYang committed Jan 9, 2020
    8844ef5

Commits on Jan 10, 2020

  1. 0cc10e0

Commits on Jan 14, 2020

  1. Removed emptiness check
    MWYang committed Jan 14, 2020
    4fea007