Code-switching test set for the NLMap corpus [1], as described in our CoNLL 2017 paper, "Multilingual Semantic Parsing and Code-switching".
If you use the dataset, please cite the paper.
@InProceedings{duong-EtAl:2017,
author = {Duong, Long and Afshar, Hadi and Estival, Dominique and Pink, Glen and Cohen, Phillip and Johnson, Mark},
title = {Multilingual Semantic Parsing and Code-switching},
booktitle = {Proceedings of the 2017 Conference on Computational Natural Language Learning (CoNLL 2017)},
month = {Aug},
year = {2017},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
}
The dataset file nlmaps.test.cs contains 880 lines of English-German code-switching data.
Each line corresponds to a logical form in the test section of the original NLMap corpus.
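As a minimal sketch of how the file might be read, the Python snippet below loads the utterances, checks the line count stated above, and pairs each utterance with a logical form. The function names are hypothetical, and two things are assumptions not stated in this README: that the file is UTF-8 encoded, and that line i of nlmaps.test.cs matches the i-th logical form of the original test section. The logical forms must be loaded separately from the original corpus.

```python
EXPECTED_LINES = 880  # line count stated in this README


def load_code_switching_test(path, expected=EXPECTED_LINES):
    """Read one utterance per line (hypothetical helper).

    Assumes UTF-8 encoding. Pass expected=None to skip the count check.
    """
    with open(path, encoding="utf-8") as handle:
        lines = [line.rstrip("\n") for line in handle]
    if expected is not None and len(lines) != expected:
        raise ValueError(f"expected {expected} lines, found {len(lines)}")
    return lines


def pair_with_logical_forms(utterances, logical_forms):
    """Pair utterances with logical forms by position (hypothetical helper).

    Assumes both sequences are in the same order, which this README
    does not state explicitly.
    """
    if len(utterances) != len(logical_forms):
        raise ValueError("utterances and logical forms differ in length")
    return list(zip(utterances, logical_forms))
```

With the file in the working directory, load_code_switching_test("nlmaps.test.cs") would return the 880 utterances as a list of strings, or raise ValueError if the count differs.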
[1] Carolin Haas and Stefan Riezler (2016). A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2016), San Diego, CA.