
All materials produced as part of our involvement in the FutureTDM project.

All materials in this repository were produced within FutureTDM (The Future of Text and Data Mining), an EU Horizon 2020 research project with the participation of Open Knowledge International and ContentMine.


The main outcomes are:

  • three tutorials on specific use cases of text and data mining techniques
  • one workshop
  • one presentation of the outcomes at a conference

All content and data are licensed under the Creative Commons Attribution 4.0 International License. All code is licensed under the MIT License.



To do text and data mining with the ContentMine software, you need to do two things:

  1. Install the ContentMine software. Find out more in
  2. Learn about text and data mining. As preparation, we recommend having a look at the resources list in


We developed three use cases to show the power of text and data mining with our software.

Zika Virus

Use text and data mining to get an overview of the research on the Zika virus. How has the research field evolved over the last decades? Which authors and journals contributed most, and how are they connected? Dive into the data and publications to get a better understanding of the state of the research, and have a look at the species mentioned.

Go to the Zika Tutorial.


(soon to come...)

Systematic Literature Review (Train the Trainees for Librarians)

Filter and find relevant publications to support a systematic review of your research question, in a fully open and reproducible way.

Go to the Systematic Literature Review Tutorial.


FutureTDM Workshop II at Brussels

Date: 29th of March 2017

Location: EU Parliament, Brussels

Go to the documentation.

Workshop at ELPUB 2017 Conference

Date: 6th of June 2017

Location: 21st ELPUB Conference at Limassol, Cyprus

Go to the documentation.

Presentation at FutureTDM Symposium at Salzburg

Date: 13th of June 2017

Location: University of Applied Sciences in Salzburg, Austria

Go to the documentation.


All content is openly licensed under the Creative Commons Attribution 4.0 license, unless otherwise stated.

All source code is free software: you can redistribute it and/or modify it under the terms of the MIT License. Visit to learn more about the MIT License.


In the spirit of free software, everyone is encouraged to help improve the content created and curated here.

Here are some ways you can contribute:

  • by reporting bugs
  • by suggesting new sections
  • by translating to a new language
  • by writing or editing documentation
  • by analyzing the data
  • by visualizing the data
  • by writing code (no pull request is too small: fix typos in the user interface, add code comments, clean up inconsistent whitespace)
  • by refactoring code
  • by closing issues
  • by reviewing pull requests
  • by enriching the data with other data sources

When you are ready, submit a pull request.

Submitting an Issue

We use the GitHub issue tracker to track bugs and features. Before submitting a bug report or feature request, check to make sure it hasn't already been submitted. When submitting a bug report, please try to provide a screenshot that demonstrates the problem.



