Skip to content
Switch branches/tags

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time

PyPI PyPI License

LUCI: Linguistic Uncertainty Classifier Interface

Pronounced: [ˈlusi]


A Python implementation of a classifier for linguistic uncertainty, based on the work described in Vincze et al.[1]:

Vincze, V. (2015). Uncertainty detection in natural language texts (Doctoral dissertation, szte).

Consult the Wiki for further details.


If you have questions regarding this API, please contact (Benjamin Meyers) or (Nuthan Munaiah).

For questions regarding the annotated dataset or the theory behind the uncertainty classifier, please contact (György Szarvas), (Richárd Farkas), and/or (Veronika Vincze).


There has been no collaboration between Vincze et al. and the developers of this codebase.


[A] This feature is present in the reverse-engineered dataset, but is not described within Vincze et al.[1]

[1] Vincze, V. (2015). Uncertainty detection in natural language texts (Doctoral dissertation, szte).

[2] Vincze, V., Szarvas, G., Farkas, R., Móra, G., & Csirik, J. (2008). The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes. BMC bioinformatics, 9(11), S9.

[3] Saurí, R., & Pustejovsky, J. (2009). FactBank: a corpus annotated with event factuality. Language resources and evaluation, 43(3), 227.

[4] Farkas, R., Vincze, V., Móra, G., Csirik, J., & Szarvas, G. (2010, July). The CoNLL-2010 shared task: learning to detect hedges and their scope in natural language text. In Proceedings of the Fourteenth Conference on Computational Natural Language Learning---Shared Task (pp. 1-12). Association for Computational Linguistics.