html-parser

Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modification, and formatting. Also XPath.

python html parser formatter tree dom tags attributes filter html-parser create getelementbyid dom-tree getelementsbyclassname getelementsbyname getelementsbytagname

Updated Jul 5, 2023
Python

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

html pdf ocr table-of-contents excel html-parser docx documents doc scanned-documents txt document-analysis odt pdf-parser table-recognition docx-parser document-content-extraction logical-structure-extraction

Updated Jul 23, 2024
Python

iamareebjamal / ctengg-api

Star

Unofficial REST API for ctengg.amu.ac.in

python appengine university rest-api restful api-server html-parser attendance results

Updated Sep 27, 2017
Python

MichaelE919 / ncaa-stats-webscraper

Star

Python webscraping module for NCAA Basketball Stats

python3 requests html-parser webscraping openpyxl beautifulsoup4

Updated Dec 8, 2022
Python

viur-framework / html5

Star

A Python library for HTML5 web apps in Pyodide.

python html library framework html5 html-parser viur pyjs pyodide

Updated Nov 5, 2022
Python

iamareebjamal / get_results

Star

Python Script to download results of whole class/branch by providing attendance Excel file.

python-script html-parser attendance result-analysis student-information

Updated Sep 17, 2017
Python

sihaelov / harser

Star

Easy way for HTML parsing and building XPath

python html parser html-parser xpath

Updated Jul 6, 2022
Python

Bystroushaak / pyDHTMLParser

Sponsor

Star

Lightweight HTML/XML parser for quick and dirty web scraping.

python parser library html-parser parsing-library

Updated Oct 21, 2022
Python

menggatot / youtube-watch-history-to-csv

Star

This project allows you to convert your YouTube watch history HTML file from Google Takeout into a CSV file that can be used by the universalscrobbler.com to Scrobble manually in bulk.

scrobble youtube csv lastfm youtube-dl html-parser google-takeout yt-dlp youtube-watch-history universalscrobbler