Skip to content
This repository has been archived by the owner on Jun 10, 2021. It is now read-only.

CTranslate does not work with Deep bidirectional encoders #43

Open
fdalvi opened this issue Nov 25, 2018 · 2 comments
Open

CTranslate does not work with Deep bidirectional encoders #43

fdalvi opened this issue Nov 25, 2018 · 2 comments

Comments

@fdalvi
Copy link
Contributor

fdalvi commented Nov 25, 2018

CTranslate silently quits if the model being loaded was trained with the -encoder_type dbrnn option. I tried to run cli/translate under gdb, but it did not give additional information, other than loading the model and silently exiting. A model trained with similar data and options, but -encoder_type brnn works fine.

Is this expected since Deep bidirectional encoders were introduced later in the lifetime of OpenNMT, and can one expect this to be implemented in CTranslate any time soon?

@guillaumekln
Copy link
Collaborator

The supported features are documented in the README:

https://github.com/OpenNMT/CTranslate#supported-features

and dbrnn is not one of them.

There is no plan to support this encoder. Did you get better results with dbrnn compared to brnn?

@fdalvi
Copy link
Contributor Author

fdalvi commented Nov 27, 2018

Ah, sorry I missed that!

As for results, yes, we do consistently get better results with dbrnn compared to brnn (we train models involving Arabic, English & German)

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants