Skip to content

Commit

Permalink
Add Dacoref to the library documentation
Browse files Browse the repository at this point in the history
  • Loading branch information
ophelielacroix committed Nov 23, 2020
1 parent 5dd0e63 commit 069e1fb
Show file tree
Hide file tree
Showing 3 changed files with 21 additions and 6 deletions.
13 changes: 10 additions & 3 deletions danlp/datasets/dacoref.py
Expand Up @@ -6,7 +6,14 @@

class Dacoref:
"""
This Danish coreference annotation contains parts of the Copenhagen Dependency Treebank, It was originally annotatated as part of the Copenhagen Dependency Treebank (CDT) project but never finished. This resource extens the annotation by using different mapping techniques and by augmenting with Qcodes from Wiktionary. Read more about it in the danlp docs.
This Danish coreference annotation contains parts of the Copenhagen Dependency Treebank.
It was originally annotated as part of the Copenhagen Dependency Treebank (CDT) project but never finished.
This resource extends the annotation by using different mapping techniques and by augmenting with Qcodes from Wiktionary.
Read more about it in the danlp docs.
:param str cache_dir: the directory for storing cached models
:param bool verbose: `True` to increase verbosity
"""

def __init__(self, cache_dir: str = DEFAULT_CACHE_DIR):
Expand All @@ -15,8 +22,8 @@ def __init__(self, cache_dir: str = DEFAULT_CACHE_DIR):

def load_as_conllu(self, predefined_splits: bool = False):
"""
:param predefined_splits: Boolean
:return A single parsed conllu list
:param bool predefined_splits: Boolean
:return: A single parsed conllu list
or a list of train, dev, test split parsed conllu list
depending on predefined_split
"""
Expand Down
6 changes: 3 additions & 3 deletions docs/docs/datasets.md
Expand Up @@ -49,9 +49,9 @@ The dataset can also be downloaded directly in CoNLL-U format.

### Dacoref

This Danish coreference annotation contains parts of the Copenhagen Dependency Treebank (Kromann and Lynge (2004), It was originally annotated as part of the Copenhagen Dependency Treebank (CDT) project but never finished. This resource extends the annotation by using different mapping techniques and by augmenting with Qcodes from Wiktionary. This works is conducted by Maria Jung Barrett. Read more about it in the in the dedicated [dacoref docs](https://github.com/alexandrainst/danlp/master/docs/docs/dacoref_docs.md).
This Danish coreference annotation contains parts of the Copenhagen Dependency Treebank (Kromann and Lynge, 2004), It was originally annotated as part of the Copenhagen Dependency Treebank (CDT) project but never finished. This resource extends the annotation by using different mapping techniques and by augmenting with Qcodes from Wiktionary. This work is conducted by Maria Jung Barrett. Read more about it in the dedicated [dacoref docs](dacoref_docs.md).

The dataset can be load with the DaNLP package:
The dataset can be loaded with the DaNLP package:

```python
from danlp.datasets import Dacoref
Expand All @@ -60,7 +60,7 @@ dacoref = Dacoref()
corpus = dacoref.load_as_conllu(predefined_splits=True)
```

The dataset can also be download directly:
The dataset can also be downloaded directly:

[Download dacoref](http://danlp-downloads.alexandra.dk/datasets/dacoref.zip)

Expand Down
8 changes: 8 additions & 0 deletions docs/library/datasets.rst
Expand Up @@ -9,6 +9,14 @@ Danish Dependency Treebank
:show-inheritance:


Dacoref
--------------------------

.. automodule:: danlp.datasets.dacoref
:members:
:show-inheritance:


Sentiment datasets
------------------

Expand Down

0 comments on commit 069e1fb

Please sign in to comment.