This is the main repository that contains the supplementary materials related to my doctoral dissertation.
The unprecedented rate of scientific publications is a major threat to the productivity of knowledge workers, who rely on scrutinizing the latest scientific discoveries for their daily tasks. In this dissertation, we introduce a Personal Research Agent that can help scientists in discovering, reading and learning from scientific documents, primarily in the computer science domain. We demonstrate how a confluence of techniques from the Natural Language Processing and Semantic Web domains can construct a knowledge base of semantically-rich, inter-connected graph of scholarly artifacts, effectively transforming scientific literature from written content in isolation, into a queryable web of knowledge, suitable for machine interpretation.
- datasets contains the data used in the evaluation chapter of the dissertation
- knowledgebase contains the populated knowledge bases that resulted from the experiments in the dissertation
- pipelines contains the GATE text mining pipelines
- recommendation contains the supplementary materials related to evaluation of our semantic vector-based recommendation of scientific articles
- utilities contains tools and scripts frequently used in my research
All the files included in this repository are licensed under GNU Lesser General Public License version 3 (LGPL v3).