Add Bert Language Modeling example #21818

yeandy · 2022-06-13T12:01:43Z

Add Bert Language Modeling example


yeandy · 2022-06-13T12:01:56Z

R: @AnandInguva @tvalentyn

yeandy · 2022-06-13T12:02:19Z

Run Python 3.8 PostCommit

sdks/python/apache_beam/examples/inference/pytorch_bert.py

yeandy · 2022-06-13T15:37:10Z

Postcommits and unit tests pass locally. PTAL @tvalentyn

asf-ci · 2022-06-13T17:56:11Z

Can one of the admins verify this patch?

codecov · 2022-06-13T18:20:24Z

Codecov Report

Merging #21818 (ac12207) into master (d2fb942) will decrease coverage by 0.07%.
The diff coverage is 0.00%.

@@            Coverage Diff             @@
##           master   #21818      +/-   ##
==========================================
- Coverage   74.15%   74.07%   -0.08%     
==========================================
  Files         698      699       +1     
  Lines       92417    92504      +87     
==========================================
- Hits        68530    68526       -4     
- Misses      22636    22727      +91     
  Partials     1251     1251

| Flag | Coverage Δ | |
|---|---|---|
| python | `83.65% <0.00%> (-0.12%)` | ⬇️ |

| Impacted Files | Coverage Δ | |
|---|---|---|
| ...am/examples/inference/pytorch_language_modeling.py | `0.00% <0.00%> (ø)` | |
| .../python/apache_beam/testing/test_stream_service.py | `88.09% <0.00%> (-4.77%)` | ⬇️ |
| sdks/python/apache_beam/utils/interactive_utils.py | `95.12% <0.00%> (-2.44%)` | ⬇️ |
| ...n/apache_beam/ml/gcp/recommendations_ai_test_it.py | `73.46% <0.00%> (-2.05%)` | ⬇️ |
| .../python/apache_beam/transforms/periodicsequence.py | `96.77% <0.00%> (-1.59%)` | ⬇️ |
| sdks/python/apache_beam/io/source_test_utils.py | `88.01% <0.00%> (-1.39%)` | ⬇️ |
| ...che_beam/runners/interactive/interactive_runner.py | `90.06% <0.00%> (-1.33%)` | ⬇️ |
| ...eam/runners/portability/fn_api_runner/execution.py | `92.44% <0.00%> (-0.65%)` | ⬇️ |
| sdks/python/apache_beam/transforms/combiners.py | `93.05% <0.00%> (-0.39%)` | ⬇️ |
| sdks/python/apache_beam/pipeline.py | `91.80% <0.00%> (ø)` | |
| ... and 45 more | | |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

sdks/python/apache_beam/examples/inference/pytorch_bert.py

yeandy · 2022-06-14T12:48:37Z

Changed the dataset to a custom file with my own sentences. If we get the OK for the model, then I think this example should be good.
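
For anyone following along, here is a rough sketch of how a plain-text file of sentences could be turned into masked-language-modeling inputs. This is an illustration under assumptions, not the code in this PR: the helper names `mask_last_word` and `load_masked_sentences` are hypothetical, and masking the final word of each sentence is just one simple way to build a prompt for a BERT-style model.

```python
import string
from typing import Iterable, List, Tuple

MASK_TOKEN = "[MASK]"  # mask token used by standard BERT tokenizers


def mask_last_word(sentence: str, mask_token: str = MASK_TOKEN) -> Tuple[str, str]:
    """Hypothetical helper: replace the final word with the mask token.

    Returns (masked_sentence, original_last_word).
    """
    stripped = sentence.strip()
    # Peel off trailing punctuation so "Paris." masks "Paris" and keeps ".".
    core = stripped.rstrip(string.punctuation)
    trailing = stripped[len(core):]
    words = core.split()
    if not words:
        raise ValueError("sentence has no words to mask")
    label = words[-1]
    masked = " ".join(words[:-1] + [mask_token]) + trailing
    return masked, label


def load_masked_sentences(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Hypothetical helper: mask every non-blank line, e.g. from an open file."""
    return [mask_last_word(line) for line in lines if line.strip()]
```

In a pipeline, the masked sentence would then be tokenized and passed to the model, with the held-out word available for comparing against the prediction.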

sdks/python/apache_beam/examples/inference/pytorch_bert.py