POIROT

Phenomena Explanation from Text 🕵️‍♂️

Source code and dataset for "Phenomena Explanation from Text: Unsupervised Learning of Interpretable and Statistically Significant Knowledge".

Abstract: Learning knowledge from text is becoming increasingly important as the amount of unstructured content on the Web rapidly grows. Despite recent breakthroughs in natural language understanding, the explanation of phenomena from textual documents is still a difficult and poorly addressed problem. Additionally, current NLP solutions often require labeled data, are domain-dependent, and based on black box models. In this paper, we introduce POIROT, a new descriptive text mining methodology for phenomena explanation. POIROT is designed to provide accurate and interpretable results in unsupervised settings, quantifying them based on their statistical significance. We evaluated POIROT on a medical case study, with the aim of learning the "voice of patients" from short social posts. Taking Esophageal Achalasia as a reference, we automatically derived scientific correlations with an F1 score of about 79% and built useful explanations on the patients' point of view on topics such as symptoms, treatments, drugs, and foods.

Other related papers

DATA 2020, Learning Interpretable and Statistically Significant Knowledge from Unlabeled Corpora of Social Text Messages: A Novel Methodology of Descriptive Text Mining. Best Paper Award.
KDIR 2020, Unsupervised Descriptive Text Mining for Knowledge Graph Learning.

Replicability

All the instructions to fully replicate the results reported in our papers are in the notebook POIROT.ipynb.

Contacts

Please contact Giacomo Frisoni (giacomo.frisoni[at]unibo.it) or Gianluca Moro (gianluca.moro[at]unibo.it).
For help or issues using POIROT, please write to us.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data/it		data/it
notebooks		notebooks
source		source
.gitignore		.gitignore
LICENSE		LICENSE
POIROT.Rproj		POIROT.Rproj
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data/it

data/it

notebooks

notebooks

source

source

.gitignore

.gitignore

LICENSE

LICENSE

POIROT.Rproj

POIROT.Rproj

README.md

README.md

Repository files navigation

POIROT

Phenomena Explanation from Text 🕵️‍♂️

Other related papers

Replicability

Contacts

About

Releases

Packages

Contributors 2

Languages

License

disi-unibo-nlp/POIROT

Folders and files

Latest commit

History

Repository files navigation

POIROT

Phenomena Explanation from Text 🕵️‍♂️

Other related papers

Replicability

Contacts

About

Topics

Resources

License

Stars

Watchers

Forks

Languages