Simplify Machine Translation demo by using Trainer API #10895

nickyfantasy · 2018-05-24T01:28:25Z

No description provided.

jetfuel · 2018-05-24T02:09:10Z

python/paddle/fluid/tests/book/high-level-api/machine_translation/test_machine_translation.py

+    context = encoder(is_sparse)
+    translation_ids, translation_scores = decoder_decode(context, is_sparse)
+
+    exe = Executor(place)


We should move away from using executor, right?

Yeah, I think we should not expose executor. Probably we can write decode_main() similar to the train() method above?

The decode and train in this example are not compatible with each other, we cannot provide the save model from train and use it in infer

We cannot use trainer.train either because during decode, it is not using optimizer or backward pass, it is doing a beam search

Yeah, and I think the GPU implementation of beam search is still missing?

jetfuel · 2018-05-24T02:10:22Z

python/paddle/fluid/tests/book/high-level-api/machine_translation/test_machine_translation.py

+
+        src_word_data = to_lodtensor(map(lambda x: x[0], data), place)
+
+        result_ids, result_scores = exe.run(


I don't see any Inferencer. We should use the high level api.

Or will there be any sub-sequence PR to add the infer part?

Discussed with Nicky and Jeff. We could add some simple test to translate a sample sentence later.

I will talk to Longfei to see if there is any solution to not expose executor, will try to update in next PR

Simplify Machine Translation demo by using Trainer API

b0868af

nickyfantasy requested review from jetfuel, jacquesqiao, daming-lu and sidgoyal78 May 24, 2018 01:28

nickyfantasy mentioned this pull request May 24, 2018

Add machine translation book with High Level Trainer API #10763

Closed

jetfuel reviewed May 24, 2018

View reviewed changes

Merge branch 'develop' into high_level_api_machine_translation

5b9d09d

daming-lu approved these changes May 24, 2018

View reviewed changes

nickyfantasy merged commit d4c2164 into PaddlePaddle:develop May 24, 2018

Provide feedback


		src_word_data = to_lodtensor(map(lambda x: x[0], data), place)

		result_ids, result_scores = exe.run(