This course focuses on the analysis of plain text, syntactic and semantic structures, ontologies and taxonomies, as well as their applications in computational linguistics. For text analysis, regular expressions and n-gram models are discussed. For syntactic analysis, phrase and dependency structures are discussed. For semantic analysis, predicate argument structures and abstract meaning representation are discussed. Computational lexicons such as Treebank, PropBank, WordNet, and FrameNet as well as advanced topics such as clustering algorithms, distributional semantics, and computational grammars are also discussed. Homework assignments may involve big data processing using cloud computing.
Prerequisites: CS 171 (CS and LING students) or QTM 220 (QTM students).