-
Notifications
You must be signed in to change notification settings - Fork 393
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorporate name detection into SmartTextVectorizer #456
Commits on Dec 5, 2019
-
Re-added unary estimator code and started porting logic to Algebird m…
…onoids instead of custom accumulators
Configuration menu - View commit details
-
Copy full SHA for 2acf3fc - Browse repository at this point
Copy the full SHA 2acf3fcView commit details
Commits on Dec 6, 2019
-
Configuration menu - View commit details
-
Copy full SHA for b55c31e - Browse repository at this point
Copy the full SHA b55c31eView commit details -
Configuration menu - View commit details
-
Copy full SHA for f443952 - Browse repository at this point
Copy the full SHA f443952View commit details -
Configuration menu - View commit details
-
Copy full SHA for 557ef39 - Browse repository at this point
Copy the full SHA 557ef39View commit details -
Added HLL back to monoid accumulator and code now compiles correctly;…
… Still need to fix HLL serialization in Spark issue
Configuration menu - View commit details
-
Copy full SHA for b6728ec - Browse repository at this point
Copy the full SHA b6728ecView commit details -
Fixed HLL in NameDetectStats not serializing correctly; Now need to f…
…ix printing bug for NameDetectStats
Configuration menu - View commit details
-
Copy full SHA for ff1b2ef - Browse repository at this point
Copy the full SHA ff1b2efView commit details -
Configuration menu - View commit details
-
Copy full SHA for 33b77c9 - Browse repository at this point
Copy the full SHA 33b77c9View commit details -
Fixed guard stat calculation computing moments of number of tokens in…
…stead of moments of text length; Still need to fix no moments higher than the 1st being calculated
Configuration menu - View commit details
-
Copy full SHA for 4caec75 - Browse repository at this point
Copy the full SHA 4caec75View commit details -
Fixed moments calculation and fixed divide by zero error when list of…
… tokens is empty
Configuration menu - View commit details
-
Copy full SHA for b60dc4a - Browse repository at this point
Copy the full SHA b60dc4aView commit details
Commits on Dec 7, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 469111b - Browse repository at this point
Copy the full SHA 469111bView commit details -
Configuration menu - View commit details
-
Copy full SHA for e5f169e - Browse repository at this point
Copy the full SHA e5f169eView commit details
Commits on Dec 9, 2019
-
Configuration menu - View commit details
-
Copy full SHA for b701612 - Browse repository at this point
Copy the full SHA b701612View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8342dae - Browse repository at this point
Copy the full SHA 8342daeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2e0e85a - Browse repository at this point
Copy the full SHA 2e0e85aView commit details -
Configuration menu - View commit details
-
Copy full SHA for b079a27 - Browse repository at this point
Copy the full SHA b079a27View commit details -
Configuration menu - View commit details
-
Copy full SHA for a1197a7 - Browse repository at this point
Copy the full SHA a1197a7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 19bad0b - Browse repository at this point
Copy the full SHA 19bad0bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3172bde - Browse repository at this point
Copy the full SHA 3172bdeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0d82eef - Browse repository at this point
Copy the full SHA 0d82eefView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7812d12 - Browse repository at this point
Copy the full SHA 7812d12View commit details -
Configuration menu - View commit details
-
Copy full SHA for 345508f - Browse repository at this point
Copy the full SHA 345508fView commit details
Commits on Dec 10, 2019
-
Configuration menu - View commit details
-
Copy full SHA for a80e382 - Browse repository at this point
Copy the full SHA a80e382View commit details -
Configuration menu - View commit details
-
Copy full SHA for f7817d9 - Browse repository at this point
Copy the full SHA f7817d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for cf3eff0 - Browse repository at this point
Copy the full SHA cf3eff0View commit details -
Configuration menu - View commit details
-
Copy full SHA for eaaa23a - Browse repository at this point
Copy the full SHA eaaa23aView commit details -
Made small changes based on PR comments (updated inline comment and i…
…mport statement)
Configuration menu - View commit details
-
Copy full SHA for b928a49 - Browse repository at this point
Copy the full SHA b928a49View commit details -
Created metadata case class per PR review; Added tests for metadata; …
…Cleaned up test code
Configuration menu - View commit details
-
Copy full SHA for 048c084 - Browse repository at this point
Copy the full SHA 048c084View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d30e79 - Browse repository at this point
Copy the full SHA 5d30e79View commit details -
Configuration menu - View commit details
-
Copy full SHA for 78c321c - Browse repository at this point
Copy the full SHA 78c321cView commit details -
Configuration menu - View commit details
-
Copy full SHA for a597d21 - Browse repository at this point
Copy the full SHA a597d21View commit details -
Configuration menu - View commit details
-
Copy full SHA for 12b3eae - Browse repository at this point
Copy the full SHA 12b3eaeView commit details
Commits on Dec 11, 2019
-
Started porting over name detection code before wanting to try and si…
…mplify the shared code even further
Configuration menu - View commit details
-
Copy full SHA for e299677 - Browse repository at this point
Copy the full SHA e299677View commit details -
Configuration menu - View commit details
-
Copy full SHA for 997a132 - Browse repository at this point
Copy the full SHA 997a132View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6b8a039 - Browse repository at this point
Copy the full SHA 6b8a039View commit details -
Added default dictionaries to NameDetectUtils object (for lazy and pe…
…rsistent loading)
Configuration menu - View commit details
-
Copy full SHA for 1250559 - Browse repository at this point
Copy the full SHA 1250559View commit details -
Fixed tests sometimes failing because they were not using the same na…
…me dictionary as NameDetectUtils; Removed now unncessary change to RandomText test helper
Configuration menu - View commit details
-
Copy full SHA for 2d804ee - Browse repository at this point
Copy the full SHA 2d804eeView commit details -
Configuration menu - View commit details
-
Copy full SHA for f4ecea1 - Browse repository at this point
Copy the full SHA f4ecea1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6c34935 - Browse repository at this point
Copy the full SHA 6c34935View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9427cd3 - Browse repository at this point
Copy the full SHA 9427cd3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 11f5bf3 - Browse repository at this point
Copy the full SHA 11f5bf3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 38bff0b - Browse repository at this point
Copy the full SHA 38bff0bView commit details -
Configuration menu - View commit details
-
Copy full SHA for e823ecb - Browse repository at this point
Copy the full SHA e823ecbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5f4f592 - Browse repository at this point
Copy the full SHA 5f4f592View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5336e71 - Browse repository at this point
Copy the full SHA 5336e71View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9e9c149 - Browse repository at this point
Copy the full SHA 9e9c149View commit details
Commits on Dec 12, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 141f1af - Browse repository at this point
Copy the full SHA 141f1afView commit details -
Configuration menu - View commit details
-
Copy full SHA for 33f6809 - Browse repository at this point
Copy the full SHA 33f6809View commit details -
Configuration menu - View commit details
-
Copy full SHA for 464fe52 - Browse repository at this point
Copy the full SHA 464fe52View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6cc90de - Browse repository at this point
Copy the full SHA 6cc90deView commit details -
Started to make Changes to SmartTextMapVectorizer but ran into proble…
…m with utils and types
Configuration menu - View commit details
-
Copy full SHA for 87c568d - Browse repository at this point
Copy the full SHA 87c568dView commit details -
Configuration menu - View commit details
-
Copy full SHA for bff4d1b - Browse repository at this point
Copy the full SHA bff4d1bView commit details -
Removed type parameter from NameDetectFun because of later conflict w…
…ith SmartTextMapVectorizer
Configuration menu - View commit details
-
Copy full SHA for 639af3d - Browse repository at this point
Copy the full SHA 639af3dView commit details -
Configuration menu - View commit details
-
Copy full SHA for a8b4423 - Browse repository at this point
Copy the full SHA a8b4423View commit details
Commits on Dec 13, 2019
-
Configuration menu - View commit details
-
Copy full SHA for e7795dc - Browse repository at this point
Copy the full SHA e7795dcView commit details -
Removed Pythonic i.e. not Scala-ic index thing and added separate cas…
…e for checkign the last token instead, per PR comment
Configuration menu - View commit details
-
Copy full SHA for 440068c - Browse repository at this point
Copy the full SHA 440068cView commit details -
Configuration menu - View commit details
-
Copy full SHA for e6136f9 - Browse repository at this point
Copy the full SHA e6136f9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 283d76e - Browse repository at this point
Copy the full SHA 283d76eView commit details -
Removed usage of broadcast variables in transformer b/c it does not s…
…erialize correctly
Configuration menu - View commit details
-
Copy full SHA for 80d81ed - Browse repository at this point
Copy the full SHA 80d81edView commit details -
Fixed serialization of GenderDetectStrategy, per PR recommendation to…
… test with local workflow
Configuration menu - View commit details
-
Copy full SHA for e003eb2 - Browse repository at this point
Copy the full SHA e003eb2View commit details -
Configuration menu - View commit details
-
Copy full SHA for ee6c24b - Browse repository at this point
Copy the full SHA ee6c24bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2fc0c3d - Browse repository at this point
Copy the full SHA 2fc0c3dView commit details -
Fixed missing plus sign in OpPipelineStageReaderWriter causing double…
… serialization test to fail
Configuration menu - View commit details
-
Copy full SHA for b51e14b - Browse repository at this point
Copy the full SHA b51e14bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 634b664 - Browse repository at this point
Copy the full SHA 634b664View commit details -
Cleaned up utils file by moving all implicit definitions to NameDetec…
…tStats object and started integrating name detection in SmartTextMapVectorizer
Configuration menu - View commit details
-
Copy full SHA for 03431e9 - Browse repository at this point
Copy the full SHA 03431e9View commit details
Commits on Dec 14, 2019
-
Configuration menu - View commit details
-
Copy full SHA for cde7551 - Browse repository at this point
Copy the full SHA cde7551View commit details -
Configuration menu - View commit details
-
Copy full SHA for 393275c - Browse repository at this point
Copy the full SHA 393275cView commit details -
Tidied up monoid definition for NameDetectStats after figuring out ho…
…w to use Algebird
Configuration menu - View commit details
-
Copy full SHA for ad9574c - Browse repository at this point
Copy the full SHA ad9574cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 72a1d48 - Browse repository at this point
Copy the full SHA 72a1d48View commit details
Commits on Dec 16, 2019
-
Configuration menu - View commit details
-
Copy full SHA for a8d91de - Browse repository at this point
Copy the full SHA a8d91deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0639af5 - Browse repository at this point
Copy the full SHA 0639af5View commit details
Commits on Dec 17, 2019
-
Updated tests based on my new correct understanding that Text.empty =…
…> null, not empty string
Configuration menu - View commit details
-
Copy full SHA for 4f316e5 - Browse repository at this point
Copy the full SHA 4f316e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 714dcdc - Browse repository at this point
Copy the full SHA 714dcdcView commit details
Commits on Dec 18, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 27a6ce6 - Browse repository at this point
Copy the full SHA 27a6ce6View commit details -
Changed SensitiveFeatureInformation.Name to log gender detection stra…
…tegies and started on passing first sensitive feature test
Configuration menu - View commit details
-
Copy full SHA for 9e7bfd2 - Browse repository at this point
Copy the full SHA 9e7bfd2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d7716b - Browse repository at this point
Copy the full SHA 5d7716bView commit details -
Configuration menu - View commit details
-
Copy full SHA for bc9bc63 - Browse repository at this point
Copy the full SHA bc9bc63View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7c218e9 - Browse repository at this point
Copy the full SHA 7c218e9View commit details
Commits on Dec 19, 2019
-
Passed first metadata test for SmartTextVectorizer; Started to re-wor…
…k SensitiveFeatureInformation for map feature types
Configuration menu - View commit details
-
Copy full SHA for 500d16e - Browse repository at this point
Copy the full SHA 500d16eView commit details
Commits on Dec 20, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 08af16b - Browse repository at this point
Copy the full SHA 08af16bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 573820c - Browse repository at this point
Copy the full SHA 573820cView commit details -
Configuration menu - View commit details
-
Copy full SHA for eb5ff9b - Browse repository at this point
Copy the full SHA eb5ff9bView commit details -
Configuration menu - View commit details
-
Copy full SHA for eaad6f8 - Browse repository at this point
Copy the full SHA eaad6f8View commit details -
Merge branch 'my/unary-detect-names' of https://github.com/MWYang/Tra…
…nsmogrifAI into my/unary-detect-names
Configuration menu - View commit details
-
Copy full SHA for 2bea311 - Browse repository at this point
Copy the full SHA 2bea311View commit details -
Small fixes (better Scala code, more safe, better patterns) from Matt…
…hew's PR comments
Configuration menu - View commit details
-
Copy full SHA for 77f91a2 - Browse repository at this point
Copy the full SHA 77f91a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for b6b385a - Browse repository at this point
Copy the full SHA b6b385aView commit details
Commits on Dec 21, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 8ed02bd - Browse repository at this point
Copy the full SHA 8ed02bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for e92101a - Browse repository at this point
Copy the full SHA e92101aView commit details
Commits on Jan 6, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 799eb58 - Browse repository at this point
Copy the full SHA 799eb58View commit details -
Configuration menu - View commit details
-
Copy full SHA for ca69ed8 - Browse repository at this point
Copy the full SHA ca69ed8View commit details -
Merge branch 'my/unary-detect-names' of https://github.com/MWYang/Tra…
…nsmogrifAI into my/unary-detect-names
Configuration menu - View commit details
-
Copy full SHA for b82440a - Browse repository at this point
Copy the full SHA b82440aView commit details -
Configuration menu - View commit details
-
Copy full SHA for d747c92 - Browse repository at this point
Copy the full SHA d747c92View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7d01d70 - Browse repository at this point
Copy the full SHA 7d01d70View commit details
Commits on Jan 7, 2020
-
Configuration menu - View commit details
-
Copy full SHA for b4209c6 - Browse repository at this point
Copy the full SHA b4209c6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 23d7a57 - Browse repository at this point
Copy the full SHA 23d7a57View commit details -
Configuration menu - View commit details
-
Copy full SHA for d86f2d6 - Browse repository at this point
Copy the full SHA d86f2d6View commit details
Commits on Jan 8, 2020
-
Configuration menu - View commit details
-
Copy full SHA for eea3a3c - Browse repository at this point
Copy the full SHA eea3a3cView commit details -
Configuration menu - View commit details
-
Copy full SHA for b9522de - Browse repository at this point
Copy the full SHA b9522deView commit details -
Incorporated PR comments (using enumeratum for NameStats map keys/val…
…ues and proper camel casing for guard check params)
Configuration menu - View commit details
-
Copy full SHA for e4e3ddd - Browse repository at this point
Copy the full SHA e4e3dddView commit details -
Configuration menu - View commit details
-
Copy full SHA for f551543 - Browse repository at this point
Copy the full SHA f551543View commit details -
Configuration menu - View commit details
-
Copy full SHA for a9d95a1 - Browse repository at this point
Copy the full SHA a9d95a1View commit details -
Made all tests pass - Debuging wasn't being enabled due to non-intuit…
…ve execution order for ScalaTest
Configuration menu - View commit details
-
Copy full SHA for 30476c8 - Browse repository at this point
Copy the full SHA 30476c8View commit details -
Fixed test to show that the output for SmartTextVectorizer is the sam…
…e with or without name entries
Configuration menu - View commit details
-
Copy full SHA for aa680b8 - Browse repository at this point
Copy the full SHA aa680b8View commit details
Commits on Jan 9, 2020
-
Configuration menu - View commit details
-
Copy full SHA for a57aa29 - Browse repository at this point
Copy the full SHA a57aa29View commit details -
Configuration menu - View commit details
-
Copy full SHA for b9120a5 - Browse repository at this point
Copy the full SHA b9120a5View commit details -
Configuration menu - View commit details
-
Copy full SHA for be2047d - Browse repository at this point
Copy the full SHA be2047dView commit details -
Incorporated PR comments (renamed GenderStrings to GenderValues and r…
…emoved BooleanStrings enum
Configuration menu - View commit details
-
Copy full SHA for 8b02dff - Browse repository at this point
Copy the full SHA 8b02dffView commit details -
Removed plural names from NameStats enums and factored out method in …
…identifyGender to hopefully reduce complexity
Configuration menu - View commit details
-
Copy full SHA for 8844ef5 - Browse repository at this point
Copy the full SHA 8844ef5View commit details
Commits on Jan 10, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 0cc10e0 - Browse repository at this point
Copy the full SHA 0cc10e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for a0d97c7 - Browse repository at this point
Copy the full SHA a0d97c7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3ad8ce1 - Browse repository at this point
Copy the full SHA 3ad8ce1View commit details -
Configuration menu - View commit details
-
Copy full SHA for b00775b - Browse repository at this point
Copy the full SHA b00775bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 03ae4d0 - Browse repository at this point
Copy the full SHA 03ae4d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9759747 - Browse repository at this point
Copy the full SHA 9759747View commit details
Commits on Jan 13, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 63ad17a - Browse repository at this point
Copy the full SHA 63ad17aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 89fa865 - Browse repository at this point
Copy the full SHA 89fa865View commit details -
Configuration menu - View commit details
-
Copy full SHA for 10644d8 - Browse repository at this point
Copy the full SHA 10644d8View commit details
Commits on Jan 14, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 4fea007 - Browse repository at this point
Copy the full SHA 4fea007View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8f7b125 - Browse repository at this point
Copy the full SHA 8f7b125View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9317250 - Browse repository at this point
Copy the full SHA 9317250View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6424463 - Browse repository at this point
Copy the full SHA 6424463View commit details -
Configuration menu - View commit details
-
Copy full SHA for a107db1 - Browse repository at this point
Copy the full SHA a107db1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 32f29ce - Browse repository at this point
Copy the full SHA 32f29ceView commit details -
Fixed SensitiveFeatureInformation tests failing due to not changing t…
…he to/fromMetadata tests
Configuration menu - View commit details
-
Copy full SHA for acb873e - Browse repository at this point
Copy the full SHA acb873eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 973aabb - Browse repository at this point
Copy the full SHA 973aabbView commit details -
1
Configuration menu - View commit details
-
Copy full SHA for 3ae735c - Browse repository at this point
Copy the full SHA 3ae735cView commit details
Commits on Jan 15, 2020
-
Configuration menu - View commit details
-
Copy full SHA for bef2b22 - Browse repository at this point
Copy the full SHA bef2b22View commit details -
Fixed failing test by making default behavior of SmartTextVectorizer …
…and SmartTextMapVectorizer the same as previous behavior - don't ignore any features
Configuration menu - View commit details
-
Copy full SHA for 069031c - Browse repository at this point
Copy the full SHA 069031cView commit details
Commits on Jan 21, 2020
-
Configuration menu - View commit details
-
Copy full SHA for d8f7f21 - Browse repository at this point
Copy the full SHA d8f7f21View commit details -
Configuration menu - View commit details
-
Copy full SHA for e13c4d1 - Browse repository at this point
Copy the full SHA e13c4d1View commit details -
Configuration menu - View commit details
-
Copy full SHA for fe57c89 - Browse repository at this point
Copy the full SHA fe57c89View commit details -
Configuration menu - View commit details
-
Copy full SHA for 458af4a - Browse repository at this point
Copy the full SHA 458af4aView commit details
Commits on Jan 22, 2020
-
Made all tests pass after merge (Ignore in STMapV didn't handle empty…
… hashFeatures case)
Configuration menu - View commit details
-
Copy full SHA for 40cb64b - Browse repository at this point
Copy the full SHA 40cb64bView commit details -
Configuration menu - View commit details
-
Copy full SHA for fde5c8d - Browse repository at this point
Copy the full SHA fde5c8dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2572332 - Browse repository at this point
Copy the full SHA 2572332View commit details
Commits on Jan 24, 2020
-
Configuration menu - View commit details
-
Copy full SHA for a8504da - Browse repository at this point
Copy the full SHA a8504daView commit details -
Configuration menu - View commit details
-
Copy full SHA for eac0e05 - Browse repository at this point
Copy the full SHA eac0e05View commit details -
Configuration menu - View commit details
-
Copy full SHA for 26f30fb - Browse repository at this point
Copy the full SHA 26f30fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for f66d896 - Browse repository at this point
Copy the full SHA f66d896View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9a3f5af - Browse repository at this point
Copy the full SHA 9a3f5afView commit details -
Configuration menu - View commit details
-
Copy full SHA for bd7b90d - Browse repository at this point
Copy the full SHA bd7b90dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5c37c26 - Browse repository at this point
Copy the full SHA 5c37c26View commit details
Commits on Jan 29, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 9344e5f - Browse repository at this point
Copy the full SHA 9344e5fView commit details