Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
-
Updated
Jun 7, 2024 - Python
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
✨ This is the official Punctual Letters repository ✨ The app for people with ADHD who love to read ✨
Generating Summaries with Controllable Readability Levels (EMNLP 2023)
A Python library for calculating a large variety of metrics from text
📝 python package to calculate readability statistics of a text object - paragraphs, sentences, articles.
Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing.
Extract clean(er), readable text from web pages via Mercury Web Parser.
🌐 Translation plugin (multi-engine, fast, flexible) for SublimeText 3 & 4, works without API keys, works in China
PyYAML-based module to produce a bit more pretty and readable YAML-serialized data
The Learning Made Easy is a Streamlit application designed to simplify PDF content, meant for K-12 education.
📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more
Plain Russian Language / Понятный (простой) русский язык.
Optimizing Readability Using Genetic Algorithms
HSK Character Profiler is a Python tool that analyzes Chinese character proficiency and text readability based on HSK lists, with customizable settings. Developed as part of a Master's thesis in Computational Linguistics.
This repository contains the code and data for BasahaCorpus paper accepted for EMNLP 2023 (Main).
How readable is your text? Provide a text input and get its grade level. Scientifically validated.
Easily create semantic search based LLM applications
Soft access of nested data for more readable Python code
base: Data Augmentation using Pre-trained Transformer Models
Add a description, image, and links to the readability topic page so that developers can more easily learn about it.
To associate your repository with the readability topic, visit your repo's landing page and select "manage topics."