Open Corpus Workbench with TEITOK Docker compose file
-
Updated
May 30, 2019 - Dockerfile
Open Corpus Workbench with TEITOK Docker compose file
Corpus linguistics final project for the course COMM 313: Computational Text Analysis at the University of Pennsylvania. Aims to determine how the anti-vaccination movement has evolved on social media before and during the COVID-19 pandemic.
Easy Text Annotator
(Ongoing module in development) Getting Wikipedia articles parsed content. Created for getting text corpuses data fast and easy. But can be freely used for other purpuses too
The recordings of marwari speech by Bharti, the speaker of it. It Includes setences of all kinds using translation method and narrations of health care and lifecycle.
Corpus for linguistic study of natural gas pipeline debates.
Treebanks modified from PROIEL and Perseus.
A module to quickly create Corpus objects containing TTR, tokenized sentences, lexical density, class frequencies and more.
A tool for determinating distances between multimodal annotations.
Paper that Lena Baunaz and I are working on as part of my SNSF-funded 'Focus in diachrony' research project at the University of Cambridge, UK.
All scripts needed to exploit French corpus and create the associated database for the CODIM Project.
Heuristics and cognitive biases in public discourse on climate changes - lingustic data analysis
Repository for the MA Digital Text Analysis thesis.
A Shiny app for visualizing Multi-Dimensional Analysis results
Code for my Master's thesis
This repository hosts BonyadAI, a Persian question answering AI Model. We developed an initial web crawler and scraper to gather the dataset. The second phase involved building a machine learning model based on word embeddings and NLP techniques. This AI model operates end-to-end, receiving user voice input and providing responses in Persian voice.
codes to perform semantic network analysis on multiple concepts (defined as multiple words-set, i.e. dictionaries) across multiple texts with R
Add a description, image, and links to the corpus-linguistics topic page so that developers can more easily learn about it.
To associate your repository with the corpus-linguistics topic, visit your repo's landing page and select "manage topics."