Xposition

Xposition (URL forthcoming) is a multilingual database of the semantics of adpositions (prepositions, postpositions) and case markers. Semantic information is primarily categorized in terms of coarse-grained supersenses. The database is designed to support corpus annotation.

Xposition is the successor to PrepWiki, which was limited to English prepositions and used an earlier inventory of supersenses.

Languages

Initial languages:

English
Hebrew (Modern)
Hindi
Korean

If you would like to contribute to these or other languages, let us know.

Installation

Note: Xposition requires Python 3.6+.

We recommend you create a new conda environment for Xposition:

conda create --name Xposition python=3.7
conda activate Xposition

Once you've cloned the git repository, you'll need to install the django-wiki and django-categories libraries (and their dependencies):

git clone git@github.com:nert-nlp/Xposition.git --recursive
cd Xposition
pip install -r requirements.txt
# make sure submodules were cloned
git submodule update

Configuration

Create the file xposition/xp/settings/unversioned.py, which contains a variable SECRET_KEY. Do not commit this file to git.

echo 'SECRET_KEY = "..."' > xposition/xp/settings/unversioned.py

2a. If you are configuring Xposition for development, make sure that the end of xposition/xp/settings/local.py has the .dev import uncommented, and the .base import commented, like this:

from .dev import *
#from .base import *

2b. If you are configuring Xposition for production, make sure that the end of xposition/xp/settings/local.py has the .base import uncommented, and the .dev import commented, like this:

#from .dev import *
from .base import *

Run the migration to update the server's SQL schema:

python xposition/manage.py migrate

(optional) Instead of using a blank database, you may place a copy of the production database in xposition/xp/db/prepopulated.db. Ask Nathan for a backup. If the backup is a SQL dump instead of a raw database file, you may reconsistute the database like below. Note that some backups end in a ROLLBACK; for some reason--check to see if it's there and replace it with an END instead.

sqlite3 prepopulated.db < prod_dump.sql

Running

You should now be able to run the server:

python xposition/manage.py runserver

Note: If you are running on Windows, you might get an error about wiki.plugins.categories. This is because a Unix symlink is not interpreted propertly by Windows. In the xposition directory, remove wiki and replace it with a Windows symlink: mklink /D wiki ..\src\wiki.

Login

Django admin:

Username: admin
Password: admin

Testing

This project can be run directly with the manage.py script, provided that you have checked out the root of the Git repository.

It comes with a prepopulated SQLite database.

Importing a new Corpus

We use the library django-import-export for loading new models when there are too many to create by hand. Developers can follow the following procedure to load a new corpus into the database. Make sure that you have validated your corpus using the conllulex validator.

Sentence and Document IDs

Before import, make sure that the way doc_id is set is appropriate for your corpus. If it is not, then either temporarily change the import script or change your document IDs.

Performing the Import

On the webpage, click the language you are working with and then click metadata (You may first need to create the Language object if it doesn't exist. In this case, click metadata on the homepage). Click Create a Corpus and fill out the form.
The corpus you want to import must be in the STREUSLE json format. Place it in the directory <Xposition>/xposition/scripts.
Modify the top of the file new_corpus.py so that the constants LANGUAGE, CORPUS, etc. are correct.
Go to the xposition directory: cd <Xposition>/xposition
Import new supersenses and adpositions:
- Run python manage.py shell and then type from scripts.generate_basic_files import main; main(). This will create json files for all supersenses and adpositions in the corpus and it will place them in <Xposition>/xposition/scripts/<corpus><version>_files.
- You can then import new supersenses and adpositions through the admin interface on the webpage at <homepage_url>/admin, by clicking Supersense revisions or Adposition revisions, Import, and then choose the corresponding file from <Xposition>/xposition/scripts/<corpus><version>_files.
Import new construals:
- Run python manage.py shell and then type from scripts.generate_construal_file import main; main(). This will create a json file for all construals in the corpus.
- You can then import new construals through the admin interface on the webpage at <homepage_url>/admin, by clicking Construals, Import, and then choose the corresponding file from <Xposition>/xposition/scripts/<corpus><version>_files.
- Note: Construals depend on supersenses, so if there are any supersenses missing from the database, you may get a 'Missing Supersense' error.
Import new usages:
- Run python manage.py shell and then type from scripts.generate_usage_file import main; main(). This will create a json file for all usages in the corpus.
- You can then import new usages through the admin interface on the webpage at <homepage_url>/admin, by clicking Usage revisions, Import, and then choose the corresponding file from <Xposition>/xposition/scripts/<corpus><version>_files.
- Note: Usages depend on construals and adpositions, so if there are any construals or adpositions missing from the database, you may get a 'Missing' error.
Import new sentences and ptoken_annotations:
- Run python manage.py shell and then type from scripts.generate_corpus_files import main; main(). This will create a tsv file for all CorpusSentences and PTokenAnnotations in the corpus.
- You can then import them through the admin interface on the webpage at <homepage_url>/admin, by clicking Corpus sentences or Adposition token annotations, Import, and then choose the corresponding file from <Xposition>/xposition/scripts/<corpus><version>_files. Adposition token annotations will be divided into multiple files labelled ptoken_annotations0.tsv, ptoken_annotations1.tsv, etc.
- Note: PTokenAnnotations depend on usages and adpositions, so if there are any usages or adpositions missing from the database, you may get a 'Missing' error.
If you are importing a new version of an existing Corpus, please mark any older versions of the Corpus as deprecated, so that annotations from older versions will be hidden by default. Go to the corpus page and in the table titled "Metadata", click the edit button and check the box for "Is this a deprecated version of an existing Corpus".

Credits

Project leader: Nathan Schneider (Georgetown)
Xposition software development: Max Kim, Joseph Ledford, Austin Blodgett, Luke Gessler, Nitin Venkateswaran (Georgetown)
Other collaborators/contributors: TODO/see PrepWiki

Name		Name	Last commit message	Last commit date
Latest commit History 3,429 Commits
.circleci		.circleci
.tx		.tx
docs		docs
src/wiki		src/wiki
testproject		testproject
tests		tests
xposition		xposition
.coveragerc		.coveragerc
.editorconfig		.editorconfig
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
COPYING		COPYING
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
README.rst		README.rst
SUPPORT.rst		SUPPORT.rst
pytest.ini		pytest.ini
requirements.txt		requirements.txt
requirements_readthedocs.txt		requirements_readthedocs.txt
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Xposition

Languages

Installation

Configuration

Running

Login

Testing

Importing a new Corpus

Sentence and Document IDs

Performing the Import

Credits

About

Releases 1

Packages

Contributors 91

Languages

License

nert-nlp/Xposition

Folders and files

Latest commit

History

Repository files navigation

Xposition

Languages

Installation

Configuration

Running

Login

Testing

Importing a new Corpus

Sentence and Document IDs

Performing the Import

Credits

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 91

Languages

Packages