Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evaluation script for task 1 of WMT2016 QE shared task. #34

Merged
merged 1 commit into from
Jun 8, 2016

Conversation

kepler
Copy link
Contributor

@kepler kepler commented Jun 7, 2016

  • Includes a much more efficient DeltaAvg method (the Perl version takes 16s, while this Python version takes less than 1s).
  • Accepts files in either the tab-separated submission format or just with HTER scores.
  • Various predicted/submitted files can be passed at once, and the script will print ordered results for both scoring and ranking metrics.

* Includes a much more efficient DeltaAvg method (the Perl version takes 16s, while this Python version takes less than 1s).
* Accepts files in either the tab-separated submission format or just with HTER scores.
* Various predicted/submitted files can be passed at once, and the script will print ordered results for both scoring and ranking metrics.
@varvara-l
Copy link
Contributor

Thanks for the contribution!
I'm afraid though that this script is a bit irrelevant for this particular project -- Marmot is targeted at subsentence-level QE (task 2 of WMT16), not sentence-level QE.
I think this script should be more relevant for QuEst++: https://github.com/ghpaetzold/questplusplus

@kepler
Copy link
Contributor Author

kepler commented Jun 8, 2016

No problem. I had sent the script as a gist to Lucia and Chris just for you
guys to think about using it in next year's shared task. Chris then
suggested a pull request to Marmot.

Either way, I'll leave the script here:
https://gist.github.com/kepler/6043a41ed8f3ed0be1e68c5942b99734

A qua, 8/06/2016, 09:52, varvara-l notifications@github.com escreveu:

Thanks for the contribution!
I'm afraid though that this script is a bit irrelevant for this particular
project -- Marmot is targeted at subsentence-level QE (task 2 of WMT16),
not sentence-level QE.
I think this script should be more relevant for QuEst++:
https://github.com/ghpaetzold/questplusplus


You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
#34 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/AAEgHXDg7FR59RhPRYOQv07jdClpzWUOks5qJoKwgaJpZM4IwZS1
.

@chrishokamp
Copy link
Contributor

I'm merging because our original ideas was to support all levels of qe, and
the evaluation script itself doesn't break any other code.
On Jun 8, 2016 2:04 AM, "Fabio Natanael Kepler" notifications@github.com
wrote:

No problem. I had sent the script as a gist to Lucia and Chris just for you
guys to think about using it in next year's shared task. Chris then
suggested a pull request to Marmot.

Either way, I'll leave the script here:
https://gist.github.com/kepler/6043a41ed8f3ed0be1e68c5942b99734

A qua, 8/06/2016, 09:52, varvara-l notifications@github.com escreveu:

Thanks for the contribution!
I'm afraid though that this script is a bit irrelevant for this
particular
project -- Marmot is targeted at subsentence-level QE (task 2 of WMT16),
not sentence-level QE.
I think this script should be more relevant for QuEst++:
https://github.com/ghpaetzold/questplusplus


You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
#34 (comment), or
mute
the thread
<
https://github.com/notifications/unsubscribe/AAEgHXDg7FR59RhPRYOQv07jdClpzWUOks5qJoKwgaJpZM4IwZS1

.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
#34 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/ABicP8LBwijwbaRhDdP9yas729Lp_Vk4ks5qJoWIgaJpZM4IwZS1
.

@chrishokamp chrishokamp merged commit 0da1cc7 into qe-team:master Jun 8, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants