extract text from any document. no muss. no fuss.
A reworked version of the https://www.readability.com/ parsing library (https://mercury.postlight.com/ is now the maintained alternative)
A fork of Dragnet that also extracts the author, headline, date, and keywords from context, with built-in metadata extraction, all in one package
Easily access song lyrics from Genius in a tibble.
A practical guide to topic mining and interactive visualizations
Explorador da Constituição (Constitution Explorer): the Federal Constitution and its Amendments, made accessible to the world of Data Science
Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities
Detecting whether a given tweet in the dataset carries negative emotion
IPO Investment via Text Mining.
Beginner's Introduction to Text Mining: An App Store Reviews Exercise
Website for "Awesome Learning to Hash" https://learning2hash.github.io
This workshop introduces participants to Learning Analytics (LA) and provides a brief overview of LA methodologies, literature, applications, and ethical issues as they relate to STEM education.
Various text analytics tutorials
Multilabel classification of legal documents (EUR-Lex)
Twitter Sentiment Analyzer
A literature review for constructing and using knowledge graphs in a biomedical setting.
Repository for FlexiTerm: a software tool to automatically recognise multi-word terms in text documents.
Repository for the website of the book (GitHub hosting support)
👉 Visual XML Schema differencing tool 👈 ➡️ Static site generator ⬅️
A reproducible platform to improve paper search and selection in OnePetro