A simple WARC extractor that extract HTML from WARC!
-
Updated
Oct 16, 2017 - Java
A simple WARC extractor that extract HTML from WARC!
A tool to extract courses' average GPA from UW-Madison grade reports
This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment based on threshold values.
These are methods of Computing a table Index from a key.
Java library for making it easy to extract/read resources on the classpath.
News extraction / archival desktop app.
Lightweight Java-based entity extraction engine
Tools, probes and libraries used in iObserve to monitor and analyze software, as well as to plan and execute its modification (MAPE-K loop)
Fork off https://github.com/DevBoost/JaMoPP
SIARDexcerpt is a Java-based application that searches and extracts individual records of SIARD files. (outdated)
EXTRACT makes it easy to extract and deliver of your geodata
DBpedia Open Text Extraction Challenge - a never ending knowledge acquisition spiral
Java implementation of Rapid Automatic Keyword Extraction Algorithm
Incremental crawling capabilities for Apache Tika. Crawl content out of e.g. file systems, http(s) sources (webcrawling) imap(s) servers or your own arbitrary data sources. LeechCrawler offers additional Tika parsers providing these crawling capabilities.
Earth Science Knowledge Graph - An Automatic Approach to Building Earth Science Knowledge Graph to Improve Data Discovery
Add a description, image, and links to the extraction topic page so that developers can more easily learn about it.
To associate your repository with the extraction topic, visit your repo's landing page and select "manage topics."