Skip to content

RichardLitt/lrl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lrl

For script work concerning low resource languages. This does not include visualisations, Semantic Web data, or other random scripts that can be found in my other repositories.

facebook-scraper

The Facebook scripts in here are for non-automatically harvesting data from Facebook groups, using manual AJAX querying and saving the source from the browser. It is not an automatic data collection scheme, nor a scraper, which makes it legal (afaik). A paper based is currently in progress.

maltese-dict

I have developed a GUI and terminal-side dictionary program based on word lists I have access to; one from the internet, and a cleaned-up copy available via METASHARE on a CC BY-NC-SA license. I will presumably keep working on this throughout my time in Malta.

maltese-*

In development. These are for courses at the University of Malta. One is a stemmer, based on the NLTK stemmers (Snowball, ISRI). One is a broken plural noun morphological analyser, based on previous work by Farrugia. The other is a chunker and basic code switching identifier, based on the work done in facebook-scraper and on Fabri's theoretical research on Maltese compounds.

About

For work concerning low resource languages.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published