This is the README file for the CCG Induction, models, CCGbank Simplification, etc. described in:
Y. Bisk and J. Hockenmaier, “Simple Robust Grammar Induction with Combinatory Categorial Grammars,” in Proceedings of the Twenty-Sixth Conference on Artificial Intelligence (AAAI-12), Toronto, Canada, July 2012, pp. 1643–1649.
Y. Bisk and J. Hockenmaier, “An HDP Model for Inducing Combinatory Categorial Grammars,” Transactions of the Association for Computational Linguistics, pp. 75–88, 2013.
Y. Bisk and J. Hockenmaier, “Probing the linguistic strengths and limitations of unsupervised grammar induction,” in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Beijing, China, July 2015.
Please don't hesitate to contact me or file bugs. I have tried to clean up the code, but may have introduced bugs or deleted something important in the process. Thanks!
Check out the code
git clone https://github.com/ybisk/CCG-Induction.git
Compiling Maven code
- Download and install Maven
- Move into the git repo:
- Build a jar file:
mvn package [-DskipTests]
All the classes (and files under resources) are in
Running the code
java -jar target/CCGInduction-1.0-jar-with-dependencies.jar
For training a parser/HMM:
java -jar CCGInduction.jar config.txt
For evaluation:
java -cp CCGInduction.jar CCGInduction.evaluation.PARGDependencies
java -cp CCGInduction.jar CCGInduction.evaluation.CoNLLDependencies
java -cp CCGInduction.jar CCGInduction.evaluation.SupertagAccuracy
java -cp CCGInduction.jar CCGInduction.evaluation.TagEvaluator
For utilities:
java -cp CCGInduction.jar CCGInduction.utils.JSONFormat
java -cp CCGInduction.jar CCGInduction.utils.ConvertFromAUTO
java -cp CCGInduction.jar CCGInduction.utils.ComputeLexicon
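Since every evaluation and utility command above has the same shape (java -cp, the jar, a class name), a small shell helper can cut down on typing. The following is only a sketch: ccg_tool, CCG_JAR, and CCG_DRY_RUN are hypothetical names that are not part of this repository. It also assumes that CCGInduction.jar in the commands above is the jar that Maven writes to target/CCGInduction-1.0-jar-with-dependencies.jar. Arguments are passed through untouched, because each class's expected arguments are not listed here.

```shell
# Sketch only: ccg_tool, CCG_JAR and CCG_DRY_RUN are hypothetical names,
# not something shipped with the repository.

# Assumed jar location (the path used in "Running the code");
# override by exporting CCG_JAR before sourcing this file.
CCG_JAR="${CCG_JAR:-target/CCGInduction-1.0-jar-with-dependencies.jar}"

# Usage: ccg_tool <fully.qualified.Class> [args...]
ccg_tool() {
  if [ "$#" -lt 1 ]; then
    echo "usage: ccg_tool <class> [args...]" >&2
    return 2
  fi
  class="$1"
  shift
  if [ -n "${CCG_DRY_RUN:-}" ]; then
    # Print the command instead of running it.
    echo java -cp "$CCG_JAR" "$class" "$@"
  else
    java -cp "$CCG_JAR" "$class" "$@"
  fi
}
```

After sourcing the file, ccg_tool CCGInduction.evaluation.SupertagAccuracy followed by whatever arguments that class expects runs the same command as the explicit java -cp line. Setting CCG_DRY_RUN=1 first prints the command so the jar path can be checked before anything is executed.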