Skip to content

Latest commit

 

History

History
120 lines (102 loc) · 5.85 KB

text-parsing.md

File metadata and controls

120 lines (102 loc) · 5.85 KB

Bookmarks tagged [text-parsing]

https://github.com/kennethreitz/tablib

A module for Tabular Datasets in XLS, CSV, JSON, YAML.


https://openpyxl.readthedocs.io/en/stable/

A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.


https://github.com/pyexcel/pyexcel

Providing one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.


https://github.com/python-openxml/python-docx

Reads, queries and modifies Microsoft Word 2007/2008 docx files.


https://github.com/scanny/python-pptx

Python library for creating and updating PowerPoint (.pptx) files.


https://github.com/unoconv/unoconv

Convert between any document format supported by LibreOffice/OpenOffice.


https://github.com/jmcnamara/XlsxWriter

A Python module for creating Excel .xlsx files.


https://github.com/ZoomerAnalytics/xlwings

A BSD-licensed library that makes it easy to call Python from Excel and vice versa.


https://github.com/python-excel/xlwt

Writing and reading data and formatting information from Excel files.


https://github.com/euske/pdfminer

A tool for extracting information from PDF documents.


https://github.com/mstamy2/PyPDF2

A library capable of splitting, merging and transforming PDF pages.


https://www.reportlab.com/opensource/

Allowing Rapid creation of rich PDF documents.


https://github.com/lepture/mistune

Fastest and full featured pure Python parsers of Markdown.


https://github.com/waylan/Python-Markdown

A Python implementation of John Gruber’s Markdown.


http://pyyaml.org/

YAML implementations for Python.


https://github.com/wireservice/csvkit

Utilities for converting to and working with CSV.


https://github.com/mitsuhiko/unp

A command line tool that can unpack archives easily.