Skip to content

Version 1.1.0 – Implementation of requirements

Choose a tag to compare

@MiiNeLoC0 MiiNeLoC0 released this 26 Feb 14:30
· 3 commits to main since this release

In this new version, the following features have been implemented:

  1. Word Cloud from Abstracts
    • Grobid processes the PDFs and generates XML with structured content.
    • The program is capable of extracting abstracts from the <abstract> section of the XML.
    • It generates a word cloud based on the extracted abstracts.
    • It counts the figures within the documents and creates a bar chart.
    • It saves the extracted links in a CSV file.
  2. Bar Chart for Number of Images
    • A visualization has been added that shows the number of images per article.
  3. Link Extraction
    • A list of all links found in each article is generated.