Skip to content
Access to lexical databases
HTML R TeX Python Awk Perl
Branch: master
Clone or download
Type Name Latest commit message Commit time
Failed to load latest commit information.
apps started modifiting datafetcher Dec 16, 2019
datasets-info added Boris readmes and json Dec 18, 2019
documents correction doc expressions regulieres Apr 21, 2019
scripts added morphalou, some json files and scripts Oct 1, 2019
.gitignore added subtlex-us-corpus.json and concordancer Rmd May 1, 2019
LICENSE.txt corrected erroe in name .Rdata -> .RData Mar 30, 2019 started to simply fetcher Nov 23, 2019 added Jessica app for openlexicon Nov 18, 2019
README.txt clean README May 1, 2019 add to TODO Nov 18, 2019
_config.yml sur le server Apr 5, 2019

OpenLexicon: Easy access to lexical databases

At OpenLexicon, you will find:

  1. a directory of lexical databases
  2. scripts to query, manipulate or create lexical databases.
  3. apps to create dynamic graphical interfaces to the lexical databases. Some of the apps provide the infrastructure of where many of lexical databases are hosted and can be queried on-line --- check out, for example,
  4. documents of interest related to the processing of lexical resources.

Downloading a database to work locally (on your computer)

You can download datasets directly from the links provided in the README files. Then you would open them in Excel, in R with readr.read_csv or in Python with pandas.read_csv.

However, in R, there is a much better way: the R dataset fetcher.

It provides several advantages:

  • it avoids having to specify the location of the dataset on the web
  • it will always point to the latest version of a dataset if it has been updated
  • it provides a caching mechanism: the dataset will be downloaded only if necessary, otherwise a local copy will be used.
  • it checks the sumfile of the dataset to make sure that you have the correct version.

For example, to download the table of Lexique383:

    lexique383 <- get_lexique383()

For maintainers


Everybody is warmly encouraged to contribute, by adding new databases (To add a new dataset, you must create a .json file in datasets-info/_json. Check out How-to-install-a-new-dataset), an app, or a script.

First, check out Lexique's google-group at!forum/lexiqueorg. You can then contact and privately, or, much better, fork the openlexicon repository (see How to fork a repo) and issue a pull request (see Creating a pull request from a fork).


Check out the TODO list


Most databases have associated publications listed in their respective README files. They must be cited in any derivative work! The same goes for some of the scripts (read their documentation for instructions).

If needed, you can cite this repository as:


The files in this repository are distributed under a CC BY-SA 4.0 LICENSE (see

Time-stamp: <2019-10-05 09:48:32>

You can’t perform that action at this time.