Collocation Extraction

Linguistic approaches to the definition

Firth J. R. (1957) A synopsis of linguistic theory, 1930-1955 // J. R. Firth et al. Studies in Linguistic Analysis. Special volume of the Philological Society. Oxford: Blackwell. P. 1-32.

Jackson H. (1995) Words and their Meaning. — London and New York, Longman

Fillmore C.J. (1988) The Mechanisms of “Construction Grammar” // Proceedings of the Fourteenth Annual Meeting of the Berkeley Linguistics Society. Berkeley, CA

Bybee J.L. (2007) Frequency of use and the organization of language. Oxford; New York: Oxford University Press

Hanks P. (2013) Lexical analysis: norms and exploitations. Cambridge, Mass: The MIT Press

Goldberg A. (2006) Constructions at Work: The Nature of Generalization in Language

Statistical methods

Overview from the textbook Foundations of Statistical Natural Language Processing

Dagan I., Lee L., Pereira F.C.N. (1999) Similarity-Based Models of Word Cooccurrence Probabilities. Machine Learning. Vol. 34, No. 1-3. Springer

Gries S.Th., Stefanowitsch A. (2004) Extending collostructional analysis: A corpus-based perspective on 'alternations'. International Journal of Corpus Linguistics

Overview and comparison of collocation metrics

Topical website by Stefan Evert

List of association measures

NLTK tutorial

On syntactic features

Akinina Y.S., Kuznetsov I.O., Toldova S.Y. (2013) The impact of syntactic structure on verb-noun collocation extraction // Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference "Dialogue" (Vol. 29, pp. 2-17).

Carlini R., Codina-Filba J., Wanner L. Improving Collocation Correction by Ranking Suggestions Using Linguistic Knowledge // Proceedings of the third workshop on NLP for computer-assisted language learning. Uppsala, Sweden: LiU Electronic Press, 2014.

Vector-based methods

Baroni M., Bernardi R., Zamparelli R. (2014) Frege in Space: A Program for Compositional Distributional Semantics

Biemann C., Giesbrecht E. (2011) Distributional Semantics and Compositionality 2011: Shared Task Description and Results // Proceedings of the Workshop on Distributional Semantics and Compositionality. Portland, Oregon, USA: Association for Computational Linguistics.

Erk K., Pado S. (2008) A Structured Vector Space Model for Word Meaning in Context // Proceedings of the Conference on Empirical Methods in Natural Language Processing 2008

Kochmar E., Briscoe T. (2014) Detecting Learner Errors in the Choice of Content Words Using Compositional Distributional Semantics // Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers.

Mikolov T., Le Q.V., Sutskever I. (2013) Exploiting Similarities among Languages for Machine Translation.

Rodríguez-Fernández S., Espinosa-Anke L., Carlini R., Wanner L. (2016) Semantics-driven recognition of collocations using word embeddings // Proceedings of the 2016 Annual Meeting of the Association for Computational Linguistics (ACL), Berlin, Germany.

Russian Collocation Extraction Based on Word Embeddings

implementation

Espinosa-Anke L., Wanner L., Schockaert S. (2019) Collocation Classification with Unsupervised Relation Vectors. https://aclanthology.org/P19-1576.pdf

Resources

SketchEngine

RusVectores

COCA Project

CoCoCo

Dictionaries based on the Russian National Corpus (НКРЯ)